Multimodal Fusion-Based Lightweight Model for Enhanced Generalization in Drug-Target Interaction Prediction

Jonghyun Lee; Dokyoon Kim; Dae Won Jun; Yun Kim

doi:10.1021/acs.jcim.4c01397

Multimodal Fusion-Based Lightweight Model for Enhanced Generalization in Drug-Target Interaction Prediction

J Chem Inf Model. 2024 Dec 3. doi: 10.1021/acs.jcim.4c01397. Online ahead of print.

Authors

Jonghyun Lee¹, Dokyoon Kim¹, Dae Won Jun^{2

3}, Yun Kim⁴

Affiliations

¹ Institute of Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States.
² Department of Medical and Digital Engineering, Hanyang University College of Engineering, Seoul 04763, Republic of Korea.
³ Department of Internal Medicine, Hanyang University College of Medicine, Seoul 04763, Republic of Korea.
⁴ College of Pharmacy, Daegu Catholic University, Gyeongsan 38430, Republic of Korea.

PMID: 39626073
DOI: 10.1021/acs.jcim.4c01397

Abstract

Predicting drug-target interactions (DTIs) with precision is a crucial challenge in the quest for efficient and cost-effective drug discovery. Existing DTI prediction models often require significant computational resources because of the intricate and exceptionally lengthy protein target sequences. This study introduces MMF-DTI, a lightweight model that uses multimodal fusion, to improve the generalizability of DTI predictions without sacrificing computational efficiency. The MMF-DTI model combines four distinct modalities: molecular sequence, molecular properties, target sequence, and target function description. This approach is noteworthy because it is the first to use natural language-based target function descriptions in predicting DTIs. The effectiveness of MMF-DTI has been confirmed through benchmark data sets, demonstrating its comparable performance in terms of generalizability, especially in scenarios with limited information about the drug or target. Remarkably, MMF-DTI accomplishes this using only half of the parameters and 17% of the VRAM compared with previous state-of-the-art models. This allows it to function even in constrained computational environments. The combination of performance and efficiency highlights the potential of multimodal data fusion in improving the overall applicability of models, providing promising opportunities for future drug discovery endeavors.