FloReMi: Flow density survival regression using minimal feature redundancy

Cytometry A. 2016 Jan;89(1):22-9. doi: 10.1002/cyto.a.22734. Epub 2015 Aug 4.

Abstract

Advances in flow cytometry bioinformatics have resulted in a wide variety of clustering, classification and visualization techniques. To objectively evaluate the performance of such methods, common benchmarks such as the FlowCAP initiative have proven to be of great value. In this work, we report on a novel method, FloReMi, which was developed to tackle the most recent FlowCAP IV challenge. This challenge was formulated as a survival modeling problem, where participants were expected to design a model to predict the time until progression to AIDS for HIV patients. It is known that variability in progression rate cannot be fully predicted by simple CD4(+) T cell counts. However, it is hypothesized that the immunopathogenesis established early in HIV already indicates the course of future disease. Adequately estimating the progression rate of HIV patients is crucial in their treatment. Using an automated pipeline to preprocess the data, and subsequently identify and select informative cell subsets, a survival regression method based on random survival forests was built, which obtained the best results of all submitted approaches to the FlowCAP IV challenge.

Keywords: feature selection; machine learning; polychromatic flow cytometry; survival time prediction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acquired Immunodeficiency Syndrome / diagnosis
  • Acquired Immunodeficiency Syndrome / mortality
  • Acquired Immunodeficiency Syndrome / pathology*
  • Algorithms
  • Benchmarking*
  • Cluster Analysis
  • Computational Biology / methods*
  • Data Interpretation, Statistical
  • Disease Progression*
  • Flow Cytometry / methods*
  • HIV Seropositivity
  • Humans
  • Machine Learning
  • Regression Analysis
  • Staining and Labeling
  • T-Lymphocytes / cytology