Comparative Genomics of Completely Sequenced Lactobacillus helveticus Genomes Provides Insights into Strain-Specific Genes and Resolves Metagenomics Data Down to the Strain Level
Version
Published
Date Issued
2018-01-30
Author(s)
Schmid, Michael
Muri, Jonathan
Melidis, Damianos
Varadarajan, Adithi R.
Somerville, Vincent
Wicki, Adrian
Moser, Aline
Bourqui, Marc
Wenzel, Claudia
Frey, Juerg E.
Irmler, Stefan
Ahrens, Christian H.
Type
Article
Language
English
Abstract
Although complete genome sequences hold particular value for an accurate description of core genomes, the identification of strain-specific genes, and as the optimal basis for functional genomics studies, they are still largely underrepresented in public repositories. Based on an assessment of the genome assembly complexity for all lactobacilli, we used Pacific Biosciences' long read technology to sequence and de novo assemble the genomes of three Lactobacillus helveticus starter strains, raising the number of completely sequenced strains to 12. The first comparative genomics study for L. helveticus—to our knowledge—identified a core genome of 988 genes and sets of unique, strain-specific genes ranging from about 30 to more than 200 genes. Importantly, the comparison of MiSeq- and PacBio-based assemblies uncovered that not only accessory but also core genes can be missed in incomplete genome assemblies based on short reads. Analysis of the three genomes revealed that a large number of pseudogenes were enriched for functional Gene Ontology categories such as amino acid transmembrane transport and carbohydrate metabolism, which is in line with a reductive genome evolution in the rich natural habitat of L. helveticus. Notably, the functional Clusters of Orthologous Groups of proteins categories “cell wall/membrane biogenesis” and “defense mechanisms” were found to be enriched among the strain-specific genes. A genome mining effort uncovered examples where an experimentally observed phenotype could be linked to the underlying genotype, such as for cell envelope proteinase PrtH3 of strain FAM8627. Another possible link identified for peptidoglycan hydrolases will require further experiments. Of note, strain FAM22155 did not harbor a CRISPR/Cas system; its loss was also observed in other L. helveticus strains and lactobacillus species, thus questioning the value of the CRISPR/Cas system for diagnostic purposes. Importantly, the complete genome sequences proved to be very useful for the analysis of natural whey starter cultures with metagenomics, as a larger percentage of the sequenced reads of these complex mixtures could be unambiguously assigned down to the strain level.
Subjects
Q Science (General)
T Technology (General)
Publisher DOI
Journal
Frontiers in Microbiology
ISSN
1664-302X
Organization
Volume
9
Issue
63
Publisher
Frontiers Media
Submitter
ZimmerD
Citation apa
Schmid, M., Muri, J., Melidis, D., Varadarajan, A. R., Somerville, V., Wicki, A., Moser, A., Bourqui, M., Wenzel, C., Eugster, E., Frey, J. E., Irmler, S., & Ahrens, C. H. (2018). Comparative Genomics of Completely Sequenced Lactobacillus helveticus Genomes Provides Insights into Strain-Specific Genes and Resolves Metagenomics Data Down to the Strain Level. In Frontiers in Microbiology (Vol. 9, Issue 63). Frontiers Media. https://doi.org/10.24451/arbor.8120
File(s)![Thumbnail Image]()
Loading...
Name
fmicb-09-00063.pdf
License
Attribution 4.0 International
Version
published
Size
4.08 MB
Format
Adobe PDF
Checksum (MD5)
43c15d49e4910f02dfa80fc38b4cf515
