Publication | Open Access
PredAlgo: A New Subcellular Localization Prediction Tool Dedicated to Green Algae
323
Citations
63
References
2012
Year
Prime ModelEngineeringSubcellular LocalizationMolecular BiologyAforementioned OrganellesUnicellular OrganismIntracellular LocalizationProteomicsPhotosynthesisAlgal BiologyBioinformaticsFunctional GenomicsBiologyCell OrganelleComputational BiologyOrganelle BiogenesisPhycologySystems BiologyMedicinePlant PhysiologyGreen Algae
Chlamydomonas reinhardtii is a key model for studying photosynthetic cell compartments, yet existing sequence‑based localization tools designed for land plants misclassify many algal proteins, underscoring the need for algae‑specific predictors. We developed PredAlgo to assign nuclear‑encoded proteins in green algae to mitochondrion, chloroplast, or secretory pathway. PredAlgo uses a neural network trained on curated C. reinhardtii proteins that scans N‑terminal 19‑residue windows to score cleavable targeting sequences, then infers localization and predicts cleavage sites.
The unicellular green alga Chlamydomonas reinhardtii is a prime model for deciphering processes occurring in the intracellular compartments of the photosynthetic cell. Organelle-specific proteomic studies have started to delineate its various subproteomes, but sequence-based prediction software is necessary to assign proteins subcellular localizations at whole genome scale. Unfortunately, existing tools are oriented toward land plants and tend to mispredict the localization of nuclear-encoded algal proteins, predicting many chloroplast proteins as mitochondrion targeted. We thus developed a new tool called PredAlgo that predicts intracellular localization of those proteins to one of three intracellular compartments in green algae: the mitochondrion, the chloroplast, and the secretory pathway. At its core, a neural network, trained using carefully curated sets of C. reinhardtii proteins, divides the N-terminal sequence into overlapping 19-residue windows and scores the probability that they belong to a cleavable targeting sequence for one of the aforementioned organelles. A targeting prediction is then deduced for the protein, and a likely cleavage site is predicted based on the shape of the scoring function along the N-terminal sequence. When assessed on an independent benchmarking set of C. reinhardtii sequences, PredAlgo showed a highly improved discrimination capacity between chloroplast- and mitochondrion-localized proteins. Its predictions matched well the results of chloroplast proteomics studies. When tested on other green algae, it gave good results with Chlorophyceae and Trebouxiophyceae but tended to underpredict mitochondrial proteins in Prasinophyceae. Approximately 18% of the nuclear-encoded C. reinhardtii proteome was predicted to be targeted to the chloroplast and 15% to the mitochondrion.
| Year | Citations | |
|---|---|---|
Page 1
Page 1