Structure-based chemical ontology improves chemometric prediction of antibacterial essential oils

Sci Rep. 2024 Jul 1;14(1):15014. doi: 10.1038/s41598-024-65882-9.

Abstract

Plants are valuable resources for drug discovery as they produce diverse bioactive compounds. However, the chemical diversity makes it difficult to predict the biological activity of plant extracts via conventional chemometric methods. In this research, we propose a new computational model that integrates chemical composition data with structure-based chemical ontology. For a model validation, two training datasets were prepared from literature on antibacterial essential oils to classify active/inactive oils. Random forest classifiers constructed from the data showed improved prediction performance in both test datasets. Prior feature selection using hierarchical information criterion further improved the performance. Furthermore, an antibacterial assay using a standard strain of Staphylococcus aureus revealed that the classifier correctly predicted the activity of commercially available oils with an accuracy of 83% (= 10/12). The results of this study indicate that machine learning of chemical composition data integrated with chemical ontology can be a highly efficient approach for exploring bioactive plant extracts.

Keywords: Antibacterial activity; Chemical ontology; Chemometrics; Essential oil; Machine learning.

MeSH terms

  • Anti-Bacterial Agents* / chemistry
  • Anti-Bacterial Agents* / pharmacology
  • Chemometrics / methods
  • Machine Learning
  • Microbial Sensitivity Tests
  • Oils, Volatile* / chemistry
  • Oils, Volatile* / pharmacology
  • Plant Extracts / chemistry
  • Plant Extracts / pharmacology
  • Staphylococcus aureus* / drug effects

Substances

  • Oils, Volatile
  • Anti-Bacterial Agents
  • Plant Extracts