Comparison of bioactive chemical space networks generated using substructure- and fingerprint-based measures of molecular similarity

J Comput Aided Mol Des. 2015 Jul;29(7):595-608. doi: 10.1007/s10822-015-9852-5. Epub 2015 Jun 7.

Abstract

Chemical space networks (CSNs) have recently been introduced as a conceptual alternative to coordinate-based representations of chemical space. CSNs were initially designed as threshold networks using the Tanimoto coefficient as a continuous similarity measure. The analysis of CSNs generated from sets of bioactive compounds revealed that many statistical properties were strongly dependent on their edge density. While it was difficult to compare CSNs at pre-defined similarity threshold values, CSNs with constant edge density were directly comparable. In the current study, alternative CSN representations were constructed by applying the matched molecular pair (MMP) formalism as a substructure-based similarity criterion. For more than 150 compound activity classes, MMP-based CSNs (MMP-CSNs) were compared to corresponding threshold CSNs (THR-CSNs) at a constant edge density by applying different parameters from network science, measures of community structure distributions, and indicators of structure-activity relationship (SAR) information content. MMP-CSNs were found to be an attractive alternative to THR-CSNs, yielding low edge densities and well-resolved topologies. MMP-CSNs and corresponding THR-CSNs often had similar topology and closely corresponding community structures, although there was only limited overlap in similarity relationships. The homophily principle from network science was shown to affect MMP-CSNs and THR-CSNs in different ways, despite the presence of conserved topological features. Moreover, activity cliff distributions in alternative CSN designs markedly differed, which has important implications for SAR analysis.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cluster Analysis
  • Computer Graphics
  • Matrix Metalloproteinase 13 / chemistry
  • Matrix Metalloproteinase 13 / metabolism
  • Models, Chemical*
  • Models, Molecular
  • Models, Statistical
  • Receptor, Bradykinin B1 / chemistry
  • Receptor, Bradykinin B1 / metabolism
  • Structure-Activity Relationship*

Substances

  • Receptor, Bradykinin B1
  • Matrix Metalloproteinase 13