Journal of Frontiers of Computer Science and Technology ›› 2013, Vol. 7 ›› Issue (3): 227-235.DOI: 10.3778/j.issn.1673-9418.1208014

Previous Articles     Next Articles

Fast Calculation of Protein Molecular Field and Its Application on Protein Recognition Research

ZHANG Fan1,2,3, WANG Zhangye2+, WU Tao1, PENG Qunsheng2   

  1. 1. Research Center of Soft Matter, Zhejiang University, Hangzhou 310007, China
    2. State Key Lab of CAD&CG, Zhejiang University, Hangzhou 310007, China
    3. College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China
  • Online:2013-03-01 Published:2013-03-05

分子场快速计算及在蛋白质识别研究中的应用

张  繁1,2,3,王章野2+,吴  韬1,彭群生2   

  1. 1. 浙江大学 软物质科学研究中心,杭州 310007
    2. 浙江大学 CAD&CG国家重点实验室,杭州 310007
    3. 浙江工业大学 计算机科学与技术学院,杭州 310023

Abstract: It has been found that key area research during protein recognition process is typically important for proclaiming biological phenomena essence, improving drug design efficiency and decreasing cost and shortening industrial cycle of new drug design. For high complexity of protein macromolecule structure, common computer systems are not capable for continuously tracking coherence change of structure and function during protein recognition. This paper sets up a heterogeneous cluster based on GPU/CPU as a uniform biological computing environment, proposes particular data structures and algorithms to deal with large biological data, and reconfigures Kd-tree based on GPU and data access modes to improve parallel calculation performance. Thus it becomes feasible to calculate time-varying protein molecular filed fast enough to do further analysis on key area for protein recognition. The corresponding biochemistry experiment results show that the method can be used as a new effective and intuitive tool for protein interaction research.

Key words: biological computation, first principle, parallel calculation, molecular docking, protein recognition

摘要: 蛋白质识别关键区域的研究对揭示生命现象的本质规律,提高药物设计效率,降低新药物开发的成本和周期有重大的应用价值。但由于蛋白质大分子结构的高度复杂性,一般的计算机系统难以对蛋白质识别过程中结构与功能的连续性变化实现快速动态分析。设计并实现了一种基于GPU/CPU异构的集群系统,根据生物计算的特点对异构集群进行数据结构和算法设计,建立起基于GPU的Kd-tree构造和访问的高效算法,以提高系统并行计算的性能。在此基础上对蛋白质分子场的动态时变序列进行快速计算,对结果进行分类,能快速高效地确定出蛋白质的相互作用关键区域。该方法得到了相应的生化实验结果验证,为揭示蛋白质作用机制提供了一种高效直观的新方法。

关键词: 生物计算, 第一性原理, 并行计算, 分子对接, 蛋白质识别