A1 Refereed original research article in a scientific journal

Sequence analysis of pooled bacterial samples enables identification of strain variation in group A streptococcus




AuthorsWeldatsadik RG, Wang JW, Puhakainen K, Jiao H, Jalava J, Raisanen K, Datta N, Skoog T, Vuopio J, Jokiranta TS, Kere J

PublisherNATURE PUBLISHING GROUP

Publication year2017

JournalScientific Reports

Journal name in sourceSCIENTIFIC REPORTS

Journal acronymSCI REP-UK

Article numberARTN 45771

Volume7

Number of pages10

ISSN2045-2322

DOIhttps://doi.org/10.1038/srep45771

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/20510928


Abstract
Knowledge of the genomic variation among different strains of a pathogenic microbial species can help in selecting optimal candidates for diagnostic assays and vaccine development. Pooled sequencing (Pool-seq) is a cost effective approach for population level genetic studies that require large numbers of samples such as various strains of a microbe. To test the use of Pool-seq in identifying variation, we pooled DNA of 100 Streptococcus pyogenes strains of different emm types in two pools, each containing 50 strains. We used four variant calling tools (Freebayes, UnifiedGenotyper, SNVer, and SAMtools) and one emm1 strain, SF370, as a reference genome. In total 63719 SNPs and 164 INDELs were identified in the two pools concordantly by at least two of the tools. Majority of the variants (93.4%) from six individually sequenced strains used in the pools could be identified from the two pools and 72.3% and 97.4% of the variants in the pools could be mined from the analysis of the 44 complete Str. pyogenes genomes and 3407 sequence runs deposited in the European Nucleotide Archive respectively. We conclude that DNA sequencing of pooled samples of large numbers of bacterial strains is a robust, rapid and cost-efficient way to discover sequence variation.

Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-26-11 at 22:45