
Article Details


Identification of DNA-Binding Proteins via Hypergraph Based Laplacian Support Vector Machine

Author(s):

Yuqing Qian, Hao Meng, Weizhong Lu, Zhijun Liao*, Yijie Ding* and Hongjie Wu*   Pages 1-10 (10)

Abstract:


Background: The identification of DNA-binding proteins (DBPs) is an important research field. Experimental methods for detecting DBPs are time-consuming and labor-intensive.

Objective: Several machine learning methods have been proposed for large-scale DBP identification, but their predictive accuracy is insufficient. Our aim is to develop a sequence-based machine learning model to predict DBPs.

Methods: We extracted six types of features (NMBAC, GE, MCD, PSSM-AB, PSSM-DWT, and PsePSSM) from protein sequences. We used Multiple Kernel Learning based on the Hilbert-Schmidt Independence Criterion (MKL-HSIC) to estimate the optimal kernel. We then constructed a hypergraph model to describe the relationship between labeled and unlabeled samples. Finally, a Laplacian Support Vector Machine (LapSVM) was employed to train the predictive model. The method was tested on the PDB186, PDB1075, PDB2272, and PDB14189 data sets.
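The hypergraph step can be illustrated with a minimal sketch. The abstract does not say how hyperedges are built or weighted, so the code below assumes a k-nearest-neighbour construction (each sample forms one hyperedge with its k nearest neighbours), unit hyperedge weights, and the standard normalized hypergraph Laplacian L = I - Dv^(-1/2) H W De^(-1) H^T Dv^(-1/2). The function name `knn_hypergraph_laplacian` and the random input are hypothetical and are not taken from the paper.

```python
import numpy as np


def knn_hypergraph_laplacian(X, k=3):
    """Normalized hypergraph Laplacian of a k-NN hypergraph (illustrative sketch)."""
    n = X.shape[0]
    # Pairwise squared Euclidean distances between all samples.
    sq = np.sum(X ** 2, axis=1)
    dist = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)

    # Incidence matrix H (vertices x hyperedges): hyperedge j contains
    # sample j and its k nearest neighbours (assumed construction).
    H = np.zeros((n, n))
    for j in range(n):
        order = np.argsort(dist[j])
        neighbours = [i for i in order if i != j][:k]
        H[neighbours, j] = 1.0
        H[j, j] = 1.0

    w = np.ones(n)        # unit hyperedge weights (assumption)
    dv = H @ w            # vertex degrees
    de = H.sum(axis=0)    # hyperedge degrees (k + 1 for every hyperedge)

    dv_inv_sqrt = np.diag(1.0 / np.sqrt(dv))
    theta = dv_inv_sqrt @ H @ np.diag(w / de) @ H.T @ dv_inv_sqrt
    return np.eye(n) - theta


# Toy usage: 8 random "samples" with 5 features each, labeled or not.
rng = np.random.default_rng(0)
X = rng.normal(size=(8, 5))
L = knn_hypergraph_laplacian(X, k=3)
```

In a LapSVM-style model, a Laplacian of this kind typically enters the objective as a smoothness penalty f^T L f over both labeled and unlabeled samples; how the paper combines it with the MKL-HSIC kernel is not specified in the abstract.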

Results: Compared with other methods, our model achieved the best results on the benchmark data sets.

Conclusion: Accuracies of 87.1% and 74.2% were achieved on PDB186 (the independent test set for PDB1075) and PDB2272 (the independent test set for PDB14189), respectively.

Keywords:

DNA-binding proteins, feature extraction, Laplacian support vector machine, multiple kernel learning, hypergraph learning, PDB.

Affiliation:

School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou; School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou; School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou; Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fujian Medical University, Fuzhou; Yangtze Delta Region Institute, University of Electronic Science and Technology of China, Quzhou; School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou


