A simple strategy for mitigating the effect of data variability on the identification of active chemotypes from high-throughput screening data

J Biomol Screen. 2007 Mar;12(2):276-84. doi: 10.1177/1087057106297826. Epub 2007 Feb 1.

Abstract

Among the several goals of a high-throughput screening campaign is the identification of as many active chemotypes as possible for further evaluation. Often, however, the number of concentration response curves (e.g., IC(50)s or K(i)s) that can be collected following a primary screen is limited by practical constraints such as protein supply, screening workload, and so forth. One possible approach to this dilemma is to cluster the hits from the primary screen and sample only a few compounds from each cluster. This introduces the question as to how many compounds must be selected from a cluster to ensure that an active compound is identified, if it exists at all. This article seeks to address this question using a Monte Carlo simulation in which the dependence of the success of sampling is directly linked to screening data variability. Furthermore, the authors demonstrate that the use of replicated compounds in the screening collection can easily assess this variability and provide a priori guidance to the screener and chemist as to the extent of sampling required to maximize chemotype identification during the triage process. The individual steps of the Monte Carlo simulation provide insight into the correspondence between the percentage inhibition and eventual IC(50) curves.

Publication types

  • Comparative Study

MeSH terms

  • Adenosine Triphosphate / metabolism
  • Biocompatible Materials / chemistry
  • Biotinylation
  • Cluster Analysis
  • Computer Simulation
  • Coumarins / metabolism
  • Drug Evaluation, Preclinical / methods*
  • Fluorescein
  • Fluorescence Resonance Energy Transfer
  • Fluorescent Dyes
  • Inhibitory Concentration 50
  • Monte Carlo Method
  • Polystyrenes / chemistry
  • Protein Kinases / analysis*
  • Receptor Protein-Tyrosine Kinases / analysis*
  • Receptor Protein-Tyrosine Kinases / antagonists & inhibitors
  • Receptors, G-Protein-Coupled / analysis*
  • Receptors, G-Protein-Coupled / antagonists & inhibitors
  • Sampling Studies
  • Scintillation Counting / methods
  • Software Design
  • Spectrophotometry
  • Wheat Germ Agglutinins / chemistry

Substances

  • Biocompatible Materials
  • Coumarins
  • Fluorescent Dyes
  • Polystyrenes
  • Receptors, G-Protein-Coupled
  • Wheat Germ Agglutinins
  • Adenosine Triphosphate
  • coumarin
  • Protein Kinases
  • Receptor Protein-Tyrosine Kinases
  • Fluorescein