A1 Refereed original research article in a scientific journal

Phylogenomics of Prokaryotic Ribosomal Proteins




AuthorsYutin N, Puigbo P, Koonin EV, Wolf YI

PublisherPUBLIC LIBRARY SCIENCE

Publication year2012

JournalPLoS ONE

Journal name in sourcePLOS ONE

Journal acronymPLOS ONE

Article numberARTN e36972

Volume7

Issue5

Number of pages10

ISSN1932-6203

DOIhttps://doi.org/10.1371/journal.pone.0036972


Abstract
Archaeal and bacterial ribosomes contain more than 50 proteins, including 34 that are universally conserved in the three domains of cellular life (bacteria, archaea, and eukaryotes). Despite the high sequence conservation, annotation of ribosomal (r-) protein genes is often difficult because of their short lengths and biased sequence composition. We developed an automated computational pipeline for identification of r-protein genes and applied it to 995 completely sequenced bacterial and 87 archaeal genomes available in the RefSeq database. The pipeline employs curated seed alignments of r-proteins to run position-specific scoring matrix (PSSM)-based BLAST searches against six-frame genome translations, mitigating possible gene annotation errors. As a result of this analysis, we performed a census of prokaryotic r-protein complements, enumerated missing and paralogous r-proteins, and analyzed the distributions of ribosomal protein genes among chromosomal partitions. Phyletic patterns of bacterial and archaeal r-protein genes were mapped to phylogenetic trees reconstructed from concatenated alignments of r-proteins to reveal the history of likely multiple independent gains and losses. These alignments, available for download, can be used as search profiles to improve genome annotation of r-proteins and for further comparative genomics studies.



Last updated on 2024-26-11 at 14:46