Prospective Assessment of Virtual Screening Heuristics Derived Using a Novel Fusion Score

SLAS Discov. 2017 Sep;22(8):995-1006. doi: 10.1177/2472555217706058. Epub 2017 Apr 20.

Abstract

High-throughput screening (HTS) is a widespread method in early drug discovery for identifying promising chemical matter that modulates a target or phenotype of interest. Because HTS campaigns involve screening millions of compounds, it is often desirable to initiate screening with a subset of the full collection. Subsequently, virtual screening methods prioritize likely active compounds in the remaining collection in an iterative process. With this approach, orthogonal virtual screening methods are often applied, necessitating the prioritization of hits from different approaches. Here, we introduce a novel method of fusing these prioritizations and benchmark it prospectively on 17 screening campaigns using virtual screening methods in three descriptor spaces. We found that the fusion approach retrieves 15% to 65% more active chemical series than any single machine-learning method and that appropriately weighting contributions of similarity and machine-learning scoring techniques can increase enrichment by 1% to 19%. We also use fusion scoring to evaluate the tradeoff between screening more chemical matter initially in lieu of replicate samples to prevent false-positives and find that the former option leads to the retrieval of more active chemical series. These results represent guidelines that can increase the rate of identification of promising active compounds in future iterative screens.

Keywords: chemoinformatics; computational chemistry; statistical analyses; ultra-high-throughput screening.

MeSH terms

  • Drug Evaluation, Preclinical*
  • Heuristics*
  • Machine Learning
  • User-Computer Interface*