Publication | Open Access
Prediction of Ubiquitination Sites by Using the Composition of k-Spaced Amino Acid Pairs
215
Citations
30
References
2011
Year
Structural BioinformaticsBiomolecular Structure PredictionMolecular BiologySupport Vector MachineProtein FoldingBioanalysisUbiquitination SitesProteomicsProtein ChemistryBiochemistryProtein ModelingProtein Structure PredictionFunctional GenomicsBioinformaticsProtein BioinformaticsStructural BiologyYeast Ubiquitination SitesNatural SciencesComputational BiologyProtein EvolutionProtein EngineeringSystems BiologyMedicine
As one of the most important reversible protein post-translation modifications, ubiquitination has been reported to be involved in lots of biological processes and closely implicated with various diseases. To fully decipher the molecular mechanisms of ubiquitination-related biological processes, an initial but crucial step is the recognition of ubiquitylated substrates and the corresponding ubiquitination sites. Here, a new bioinformatics tool named CKSAAP_UbSite was developed to predict ubiquitination sites from protein sequences. With the assistance of Support Vector Machine (SVM), the highlight of CKSAAP_UbSite is to employ the composition of k-spaced amino acid pairs surrounding a query site (i.e. any lysine in a query sequence) as input. When trained and tested in the dataset of yeast ubiquitination sites (Radivojac et al, Proteins, 2010, 78: 365-380), a 100-fold cross-validation on a 1∶1 ratio of positive and negative samples revealed that the accuracy and MCC of CKSAAP_UbSite reached 73.40% and 0.4694, respectively. The proposed CKSAAP_UbSite has also been intensively benchmarked to exhibit better performance than some existing predictors, suggesting that it can be served as a useful tool to the community. Currently, CKSAAP_UbSite is freely accessible at http://protein.cau.edu.cn/cksaap_ubsite/. Moreover, we also found that the sequence patterns around ubiquitination sites are not conserved across different species. To ensure a reasonable prediction performance, the application of the current CKSAAP_UbSite should be limited to the proteome of yeast.
| Year | Citations | |
|---|---|---|
Page 1
Page 1