Multidimensional single-cell analysis requires approaches to visualize complex data in intuitive 2D graphs. In this regard, t-distributed stochastic neighboring embedding (tSNE) is the most popular algorithm for single-cell RNA sequencing and cytometry by time-of-flight (CyTOF), but its application to polychromatic flow cytometry, including the recently developed 30-parameter platform, is still under investigation. We identified differential distribution of background values between samples, generated by either background calculation or spreading error (SE), as a major source of variability in polychromatic flow cytometry data representation by tSNE, ultimately resulting in the identification of erroneous heterogeneity among cell populations. Biexponential transformation of raw data and limiting SE during panel development dramatically improved data visualization. These aspects must be taken into consideration when using computational approaches as discovery tools in large sets of samples from independent experiments or immunomonitoring in clinical trials.
Keywords: CD8; T cell; high-dimensional data; polychromatic flow cytometry; single cell; tSNE.
© 2018 The Authors. Cytometry Part A published by Wiley Periodicals, Inc. on behalf of International Society for Advancement of Cytometry.