Bioinfo Chem
System biology and Infochemistry | Online ISSN 3071-4826
1
Citations
25.5k
Views
32
Articles
RESEARCH ARTICLE (Open Access)
Prediction of Protein–Metal Ion-Binding Sites Using Sequence Homology and Machine-Learning Methods
Zihan Tian 1, Cao Wei 1, Yutaka Moriwaki 1, Tohru Terada 1, Shugo Nakamura 1, Kazuya Sumikoshi 1, Fang Chun 1, and Kentaro Shimizu 1*
Bioinfo Chem 1 (1) 025-036 https://doi.org/10.25163/abc.11208022130119
Submitted: 21 July 2019 Revised: 22 August 2019 Published: 06 September 2019
Abstract
Metal ions are essential for metalloproteins to perform their catalytic or structural functions. To understand their role in protein function, it is important to identify metal ion-binding sites. Because experimental identification is labor-intensive and time-consuming, computational methods are expected to be used in the prediction of protein–metal ion-binding sites. A range of computational methods have been proposed to predict metal ion-binding sites from protein sequences. In this study, we implemented two methods of predicting metal ion-binding sites for Ca2+, Co2+, Cu2+, Cu+, Fe3+, Fe2+, Hg2+, Mg2+, Mn2+, Ni2+, and Zn2+ from amino acid sequences. One is a homology-based method, and the other is a machine-learning method. The homology-based method predicts the binding sites from homologous sequences obtained by a protein–protein basic local alignment search tool (BLASTP) search. The machine-learning method uses a support vector machine with three protein sequence features. Our results showed that the homology-based method achieved an accuracy of 0.9905 and a specificity of 0.9978, while the machine-learning method showed balanced performance with regard to accuracy, sensitivity, and specificity. Especially, the sensitivity of the machine-learning method was 0.8239, and many metal ion-binding sites were predicted only by the machine-learning method.
Keywords: protein, metal ion, binding site prediction, machine learning, homology search
References
PMid:9254694 PMCid:PMC146917
PMid:10592235 PMCid:PMC102472
PMid:23861030
PMid:11120676
PMid:23060610 PMCid:PMC3516142
PMid:17847089
PMid:26365239 PMCid:PMC4737176
PMid:30977960
PMid:17703195 PMCid:PMC2757439
PMid:11848828
PMid:16101285
PMid:19131089
PMid:29307143 PMCid:PMC5769865
PMid:15912584
PMid:22723976 PMCid:PMC3377655
PMid:18554898
PMid:17280606 PMCid:PMC1800866
PMid:16927295
PMid:16006526 PMCid:PMC1177371
PMid:29241411
PMid:31046040
PMid:14964382
Recommended articles
Fairness-Aware AI for Healthcare Supply Chain Equity: Integrating Social Determinants of Health, Causal Modeling, and Geospatial Optimization
Early Prediction of Postprandial Glycemic Response in Gestational Diabetes Using Continuous Glucose Monitoring and Gradient Boosting Models
Comparative Growth Dynamics, Carcass Yield, and Meat Quality in Fast- and Slow-Growing Broiler Strains: A Review
14
Save
Save
0
Citation
Citation
2967
View
View
5
Share
Share