Publication | Closed Access

Robust Method for Proteome Analysis by MS/MS Using an Entire Translated Genome: Demonstration on the Ciliome of Tetrahymena thermophila

Citations: 120

References: 52

Year: 2005

Abstract

To improve the utility of the growing number of unannotated and poorly annotated genomic sequences for proteome analysis, we demonstrate that proteins can be identified effectively against a large, unannotated genome. The strategy is to translate the unannotated genome sequence in all six reading frames into amino acid sequences of putative proteins, to identify peptides by tandem mass spectrometry (MS/MS), to localize them on the genome sequence, and to preliminarily annotate each protein via a BLAST similarity search. These tasks have been optimized and automated. Optimizing for multiple peptide matches in effect extends the searchable region and results in more robust protein identification. The viability of this strategy is demonstrated by the identification of 223 cilia proteins in the unicellular eukaryotic model organism Tetrahymena thermophila, whose initial genomic sequence draft was released in November 2003. To the best of our knowledge, this is the first demonstration of large-scale protein identification based on such a large, unannotated genome. Of the 223 cilia proteins, 84 have no similarity to proteins in NCBI's nonredundant (nr) database. This methodology makes it possible to locate the genes encoding these novel proteins, a necessary first step toward downstream functional genomic experiments.
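
The translation and localization steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names (six_frame_translate, locate_peptide) are hypothetical, peptides are placed by exact string matching rather than by scoring MS/MS spectra, and stop codons are kept as "*" instead of splitting the frames into open reading frames. The abstract does not say which genetic code was used; the sketch defaults to the standard code (NCBI translation table 1) and also offers the ciliate nuclear code (NCBI table 6), in which TAA and TAG encode glutamine, as an option that would likely be relevant for Tetrahymena.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons in TTT, TTC, TTA, ... order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Ciliate nuclear code (NCBI table 6): TAA and TAG encode Gln, only TGA is a stop.
# Assumption: the abstract does not state which table the authors used.
CILIATE_TABLE = {**STANDARD_TABLE, "TAA": "Q", "TAG": "Q"}


def reverse_complement(dna):
    """Return the reverse complement of an uppercase DNA string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna, table):
    """Translate complete codons only; unknown codons (e.g. with N) become 'X'."""
    return "".join(table.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3))


def six_frame_translate(dna, table=STANDARD_TABLE):
    """Map frame labels '+1'..'+3' and '-1'..'-3' to translated sequences."""
    dna = dna.upper()
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:], table)
    return frames


def locate_peptide(dna, peptide, table=STANDARD_TABLE):
    """Return (frame, start, end) hits as 0-based, half-open forward-strand coordinates."""
    hits = []
    for frame, protein in six_frame_translate(dna, table).items():
        offset = int(frame[1]) - 1
        pos = protein.find(peptide)
        while pos != -1:
            start = offset + 3 * pos
            end = start + 3 * len(peptide)
            if frame[0] == "-":
                # Convert reverse-strand positions back to forward-strand coordinates.
                start, end = len(dna) - end, len(dna) - start
            hits.append((frame, start, end))
            pos = protein.find(peptide, pos + 1)
    return hits
```

For example, six_frame_translate("ATGGCCTAA") yields "MA*" in frame +1 under the standard table, while passing CILIATE_TABLE reads the final TAA as glutamine. Returning forward-strand coordinates for hits on either strand is a design choice that keeps peptide positions directly comparable when several peptides are expected to cluster around the same gene.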
