The accurate quantification of tumor-infiltrating immune cells turns crucial to uncover their role in tumor immune escape, to determine patient prognosis and to predict response to immune checkpoint blockade. Current state-of-the-art methods that quantify immune cells from tumor biopsies using gene expression data apply computational deconvolution methods that present multicollinearity and estimation errors resulting in the overestimation or underestimation of the diversity of infiltrating immune cells and their quantity. To overcome such limitations, we developed MIXTURE, a new ν-support vector regression-based noise constrained recursive feature selection algorithm based on validated immune cell molecular signatures. MIXTURE provides increased robustness to cell type identification and proportion estimation, outperforms the current methods, and is available to the wider scientific community. We applied MIXTURE to transcriptomic data from tumor biopsies and found relevant novel associations between the components of the immune infiltrate and molecular subtypes, tumor driver biomarkers, tumor mutational burden, microsatellite instability, intratumor heterogeneity, cytolytic score, programmed cell death ligand 1 expression, patients' survival and response to anti-cytotoxic T-lymphocyte-associated antigen 4 and anti-programmed cell death protein 1 immunotherapy.
Keywords: RNA sequencing; cancer; deconvolution; digital cytometry; immune infiltrate; immunotherapy.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.