Multi-Descriptor Read Across (MuDRA): A Simple and Transparent Approach for Developing Accurate Quantitative Structure-Activity Relationship Models

Vinicius M Alves; Alexander Golbraikh; Stephen J Capuzzi; Kammy Liu; Wai In Lam; Daniel Robert Korn; Diane Pozefsky; Carolina Horta Andrade; Eugene N Muratov; Alexander Tropsha

doi:10.1021/acs.jcim.8b00124

J Chem Inf Model. 2018 Jun 25;58(6):1214-1223. doi: 10.1021/acs.jcim.8b00124. Epub 2018 Jun 13.

Affiliations

¹ Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States.
² Laboratory for Molecular Modeling and Design, Department of Pharmacy, Federal University of Goias, Goiania, GO 74605-170, Brazil.
³ Department of Computer Science, University of North Carolina, Chapel Hill, North Carolina 27599, United States.
⁴ Department of Chemical Technology, Odessa National Polytechnic University, Odessa, 65000, Ukraine.

Abstract

Multiple approaches to quantitative structure-activity relationship (QSAR) modeling using various statistical or machine learning techniques and different types of chemical descriptors have been developed over the years. Models are often used in consensus to make more accurate predictions, at the expense of model interpretability. We propose a simple, fast, and reliable method termed Multi-Descriptor Read Across (MuDRA) for developing models that are both accurate and interpretable. The method is conceptually related to the well-known k-nearest neighbors (kNN) approach but uses different types of chemical descriptors simultaneously for similarity assessment. To benchmark the new method, we built MuDRA models for six different end points (Ames mutagenicity, aquatic toxicity, hepatotoxicity, hERG liability, skin sensitization, and endocrine disruption) and compared the results with those generated with conventional consensus QSAR modeling. We find that models built with MuDRA show consistently high external accuracy, similar to that of conventional QSAR models. However, MuDRA models excel in terms of transparency, interpretability, and computational efficiency. We posit that, due to its methodological simplicity and reliable predictive accuracy, MuDRA provides a powerful alternative to much more complex consensus QSAR modeling. MuDRA is implemented and freely available at the Chembench web portal (https://chembench.mml.unc.edu/mudra).
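
The abstract describes the method only at a conceptual level, so the following is a minimal sketch of the general idea (nearest-neighbor read-across over several descriptor types at once), not the published MuDRA algorithm. The Tanimoto similarity, the similarity-weighted averaging, the function names, the descriptor names `fp_a` and `fp_b`, and the toy data are all assumptions made for illustration.

```python
def tanimoto(a, b):
    """Tanimoto similarity between two sets of 'on' feature indices."""
    if not a and not b:
        return 0.0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def multi_descriptor_read_across(query, training, k=1):
    """Hypothetical multi-descriptor nearest-neighbor prediction.

    query:    dict mapping descriptor name -> set of feature indices
    training: list of (descriptor dict, numeric label) pairs
    For every descriptor type, the k most similar training compounds are
    selected; the prediction is the similarity-weighted mean of the labels
    of all selected neighbors. The neighbors are returned as well, so the
    basis of each prediction can be inspected.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    neighbors = []
    for name, query_fp in query.items():
        scored = [
            (tanimoto(query_fp, fps[name]), idx)
            for idx, (fps, _) in enumerate(training)
        ]
        # Highest similarity first; ties broken by training index.
        scored.sort(key=lambda pair: (-pair[0], pair[1]))
        for sim, idx in scored[:k]:
            weighted_sum += sim * training[idx][1]
            weight_total += sim
            neighbors.append((name, idx, sim))
    score = weighted_sum / weight_total if weight_total > 0 else None
    return score, neighbors


# Toy data (invented): two descriptor types per compound, binary labels.
training = [
    ({"fp_a": {1, 2, 3}, "fp_b": {10, 11}}, 1),
    ({"fp_a": {1, 2, 4}, "fp_b": {20, 21}}, 0),
    ({"fp_a": {7, 8}, "fp_b": {10, 11, 12}}, 1),
]
query = {"fp_a": {1, 2, 4, 5}, "fp_b": {10, 11, 12}}

score, neighbors = multi_descriptor_read_across(query, training, k=1)
print(score)
for name, idx, sim in neighbors:
    print(name, "-> training compound", idx, "similarity", round(sim, 3))
```

Returning the selected neighbors alongside the score illustrates the transparency argument: each prediction can be traced back to specific training compounds in each descriptor space.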

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Databases, Factual
Humans
Internet
Models, Biological
Mutagens / toxicity
Quantitative Structure-Activity Relationship*
Software
Toxicity Tests

Substances

Mutagens

Grants and funding

U01 CA207160/CA/NCI NIH HHS/United States