Accurate refinement of docked protein complexes using evolutionary information and deep learning

Bahar Akbal-Delibas; Roshanak Farhoodi; Marc Pomplun; Nurit Haspel

doi:10.1142/S0219720016420026

Accurate refinement of docked protein complexes using evolutionary information and deep learning

J Bioinform Comput Biol. 2016 Jun;14(3):1642002. doi: 10.1142/S0219720016420026. Epub 2015 Nov 24.

Authors

Bahar Akbal-Delibas¹, Roshanak Farhoodi¹, Marc Pomplun¹, Nurit Haspel¹

Affiliation

¹ 1 Department of Computer Science, University of Massachusetts Boston, 100 Morrissey Boulevard, Boston, MA 02125, USA.

PMID: 26846813
DOI: 10.1142/S0219720016420026

Abstract

One of the major challenges for protein docking methods is to accurately discriminate native-like structures from false positives. Docking methods are often inaccurate and the results have to be refined and re-ranked to obtain native-like complexes and remove outliers. In a previous work, we introduced AccuRefiner, a machine learning based tool for refining protein-protein complexes. Given a docked complex, the refinement tool produces a small set of refined versions of the input complex, with lower root-mean-square-deviation (RMSD) of atomic positions with respect to the native structure. The method employs a unique ranking tool that accurately predicts the RMSD of docked complexes with respect to the native structure. In this work, we use a deep learning network with a similar set of features and five layers. We show that a properly trained deep learning network can accurately predict the RMSD of a docked complex with 1.40 Å error margin on average, by approximating the complex relationship between a wide set of scoring function terms and the RMSD of a docked structure. The network was trained on 35000 unbound docking complexes generated by RosettaDock. We tested our method on 25 different putative docked complexes produced also by RosettaDock for five proteins that were not included in the training data. The results demonstrate that the high accuracy of the ranking tool enables AccuRefiner to consistently choose the refinement candidates with lower RMSD values compared to the coarsely docked input structures.

Keywords: Protein docking; deep learning neural networks; ranking and scoring functions.

Publication types

Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Databases, Protein
Molecular Docking Simulation / methods*
Neural Networks, Computer
Protein Conformation
Proteins / chemistry*
Proteins / metabolism

Substances

Proteins

Grants and funding

CCF-1421871/National Science Foundation/International