Publication | Open Access
Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods
48
Citations
61
References
2018
Year
Structural BioinformaticsMolecular BiologyGene RecognitionSupport Vector MachineBiostatisticsPhage BiologyProteomicsProtein ModelingProtein Structure PredictionPhage Virion ProteinFunctional GenomicsBioinformaticsPhage Virion ProteinsStructural BiologyProtein BioinformaticsNatural SciencesComputational BiologyProtein EngineeringMicrobiologySystems BiologyMedicine
Accurate identification of phage virion protein is not only a key step for understanding the function of the phage virion protein but also helpful for further understanding the lysis mechanism of the bacterial cell. Since traditional experimental methods are time-consuming and costly for identifying phage virion proteins, it is extremely urgent to apply machine learning methods to accurately and efficiently identify phage virion proteins. In this work, a support vector machine (SVM) based method was proposed by mixing multiple sets of optimal g-gap dipeptide compositions. The analysis of variance (ANOVA) and the minimal-redundancy-maximal-relevance (mRMR) with an increment feature selection (IFS) were applied to single out the optimal feature set. In the five-fold cross-validation test, the proposed method achieved an overall accuracy of 87.95%. We believe that the proposed method will become an efficient and powerful method for scientists concerning phage virion proteins.
| Year | Citations | |
|---|---|---|
Page 1
Page 1