论文标题
DRLCOMPLEX:使用深钢筋学习重建蛋白质第四纪结构
DRLComplex: Reconstruction of protein quaternary structures using deep reinforcement learning
论文作者
论文摘要
预测的链间残留物残留触点可用于从头开始构建蛋白质复合物的第四纪结构。但是,仅开发了少量方法来使用预测的链间接触来重建蛋白质四级结构。在这里,我们提出了一种基于代理的自学习方法,基于深钢筋学习(DRLCOMPLEX),以使用链间接触作为距离约束来构建蛋白质复杂结构。我们使用真实和预测的InterChain触点作为输入,在两个标准数据集(即CASP-CAPRI同二聚体和STD_32 Heterodimer数据集)上对两个标准数据集(即CASP-CAPRI同型二聚体和STD_32 Heterodimer数据集)进行了严格测试。利用真实的接触作为输入,DRLCOMPLEX分别在两个数据集上分别达到0.9895和0.9881和低平均接口RMSD(I_RMSD)和0.92的平均TM得分。当使用预测的触点时,该方法分别达到同二聚体和异二聚体的TM分别为0.73和0.76。我们的实验发现,重建的第四纪结构的准确性取决于接触预测的准确性。与其他用于重建链间接触的第四纪结构的优化方法相比,DRLCOMPLEX的性能类似于先进的梯度下降方法,并且比马尔可夫链蒙特卡洛模拟方法和基于模拟退火的方法更好,从而验证了Drlcomplex对蛋白质复合物的Quaternary Rectruction的有效性。
Predicted inter-chain residue-residue contacts can be used to build the quaternary structure of protein complexes from scratch. However, only a small number of methods have been developed to reconstruct protein quaternary structures using predicted inter-chain contacts. Here, we present an agent-based self-learning method based on deep reinforcement learning (DRLComplex) to build protein complex structures using inter-chain contacts as distance constraints. We rigorously tested DRLComplex on two standard datasets of homodimeric and heterodimeric protein complexes (i.e., the CASP-CAPRI homodimer and Std_32 heterodimer datasets) using both true and predicted interchain contacts as inputs. Utilizing true contacts as input, DRLComplex achieved high average TM-scores of 0.9895 and 0.9881 and a low average interface RMSD (I_RMSD) of 0.2197 and 0.92 on the two datasets, respectively. When predicted contacts are used, the method achieves TM-scores of 0.73 and 0.76 for homodimers and heterodimers, respectively. Our experiments find that the accuracy of reconstructed quaternary structures depends on the accuracy of the contact predictions. Compared to other optimization methods for reconstructing quaternary structures from inter-chain contacts, DRLComplex performs similar to an advanced gradient descent method and better than a Markov Chain Monte Carlo simulation method and a simulated annealing-based method, validating the effectiveness of DRLComplex for quaternary reconstruction of protein complexes.