Submit Manuscript  

Article Details


Predicting Thermophilic Proteins by Machine Learning

Author(s):

Xian-Fang Wang*, Peng Gao, Yi-Feng Liu, Hong-Fei Li and Fan Lu  

Abstract:


Background´╝ÜThermophilic proteins can maintain good activity under high temperature, so it is important to study thermophilic proteins for the thermal stability of proteins.

Objective: In order to solve the problem of low precision and low efficiency in predicting thermophilic proteins, a prediction method based on feature fusion and machine learning was proposed in this paper.

Method: For the selected thermophilic data sets, firstly, the thermophilic protein sequence was characterized based on feature fusion by the combination of g-gap dipeptide, entropy density and autocorrelation coefficient. Then, kernel principal component analysis (KPCA) was used to reduce the dimension of the expressed protein sequence features in order to reduce training time and improve efficiency. Finally, the classification model was designed by using classification algorithm.

Results: A variety of classification algorithms were used to train and test on the selected thermophilic dataset. By comparison, the accuracy of the support vector machine (SVM) under the jackknife method was over 92%. The combination of other evaluation indicators also proved that the SVM performance was the best.

Conclusion: Because of choosing an effectively feature representation method and a robust classifier, the proposed method is suitable for predicting thermophilic proteins and is superior to most reported methods.

Keywords:

Ginsenoside Rb1, antioxidant, hydroxyl radical, HOCl, superoxide anion, cell free system, DNA plasmid, thermophilic proteins, feature fusion, g-gap, entropy density, autocorrelation coefficient, KPCA, machine learning

Affiliation:

School of Computer and Information Engineering, Henan Normal University, Henan, School of Computer and Information Engineering, Henan Normal University, Henan, School of Computer and Information Engineering, Henan Normal University, Henan, School of Computer and Information Engineering, Henan Normal University, Henan, School of Computer and Information Engineering, Henan Normal University, Henan



Full Text Inquiry