Publication | Open Access
Protein subcellular location prediction
376
Citations
18
References
1999
Year
EngineeringBiomolecular Structure PredictionSubcellular LocalizationMolecular BiologySignal RecognitionPrediction AlgorithmSubcellular LocationProteomicsBiochemistryProtein ModelingProtein Structure PredictionBioinformaticsProtein BioinformaticsBiologyNatural SciencesComputational BiologyQuery ProteinCellular BiochemistrySystems Biology
Protein function depends on its subcellular location. The study investigates whether a bioinformatic approach can accelerate subcellular localization prediction for newly sequenced proteins. Proteins were grouped into 12 subcellular compartments and a covariant discriminant algorithm using amino‑acid composition was developed to predict their locations. The algorithm achieved higher prediction accuracy than existing methods and is expected to aid functional annotation and drug target prioritization of novel proteins.
The function of a protein is closely correlated with its subcellular location. With the rapid increase in new protein sequences entering into data banks, we are confronted with a challenge: is it possible to utilize a bioinformatic approach to help expedite the determination of protein subcellular locations? To explore this problem, proteins were classified, according to their subcellular locations, into the following 12 groups: (1) chloroplast, (2) cytoplasm, (3) cytoskeleton, (4) endoplasmic reticulum, (5) extracell, (6) Golgi apparatus, (7) lysosome, (8) mitochondria, (9) nucleus, (10) peroxisome, (11) plasma membrane and (12) vacuole. Based on the classification scheme that has covered almost all the organelles and subcellular compartments in an animal or plant cell, a covariant discriminant algorithm was proposed to predict the subcellular location of a query protein according to its amino acid composition. Results obtained through self-consistency, jackknife and independent dataset tests indicated that the rates of correct prediction by the current algorithm are significantly higher than those by the existing methods. It is anticipated that the classification scheme and concept and also the prediction algorithm can expedite the functionality determination of new proteins, which can also be of use in the prioritization of genes and proteins identified by genomic efforts as potential molecular targets for drug design.
| Year | Citations | |
|---|---|---|
Page 1
Page 1