Comparison of 61 Sequenced Escherichia coli Genomes

Lukjancenko, Oksana; Wassenaar, Trudy M.; Ussery, David W.
November 2010
Microbial Ecology;Nov2010, Vol. 60 Issue 4, p708
Academic Journal
Escherichia coli is an important component of the biosphere and is an ideal model for studies of processes involved in bacterial genome evolution. Sixty-one publically available E. coli and Shigella spp. sequenced genomes are compared, using basic methods to produce phylogenetic and proteomics trees, and to identify the pan- and core genomes of this set of sequenced strains. A hierarchical clustering of variable genes allowed clear separation of the strains into clusters, including known pathotypes; clinically relevant serotypes can also be resolved in this way. In contrast, when in silico MLST was performed, many of the various strains appear jumbled and less well resolved. The predicted pan-genome comprises 15,741 gene families, and only 993 (6%) of the families are represented in every genome, comprising the core genome. The variable or 'accessory' genes thus make up more than 90% of the pan-genome and about 80% of a typical genome; some of these variable genes tend to be co-localized on genomic islands. The diversity within the species E. coli, and the overlap in gene content between this and related species, suggests a continuum rather than sharp species borders in this group of Enterobacteriaceae.


Related Articles

  • Evolutionary Dynamics of Small RNAs in 27 Escherichia coli and Shigella Genomes. Skippington, Elizabeth; Ragan, Mark A. // Genome Biology & Evolution;Sep2012, Vol. 4 Issue 9, p330 

    Small RNAs (sRNAs) are widespread in bacteria and play critical roles in regulating physiological processes. They are best characterized in Escherichia coli K-12 MG1655, where 83 sRNAs constitute nearly 2% of the gene complement. Most sRNAs act by base pairing with a target mRNA, modulating its...

  • Different Amounts of DNA in Newborn Cells of Escherichia coli Preclude a Role for the Chromosome in Size Control According to the "Adder" Model. Huls, Peter G.; Vischer, Norbert O. E.; Woldringh, Conrad L. // Frontiers in Microbiology;4/5/2018, p1 

    According to the recently-revived adder model for cell size control, newborn cells of Escherichia coli will grow and divide after having added a constant size or length, ΔL, irrespective of their size at birth. Assuming exponential elongation, this implies that large newborns will divide...

  • An enhanced bioindicator for calorimetric monitoring of prophage‐activating chemicals in the trace concentration range. Xu, Juan; Jiang, Feng‐Lei; Liu, Yi; Kiesel, Bärbel; Maskow, Thomas // Engineering in Life Sciences;Jul2018, Vol. 18 Issue 7, p475 

    Abstract: Viruses that infect bacteria (bacteriophages) can either lyse bacteria directly or integrate their genome into the bacterial genome. In the latter case, the viral genome (called prophage) remains dormant, and both phages and bacteria are able to survive in this state. But the silent...

  • Evolutionary diversification of an ancient gene family (rhs) through C-terminal displacement. Jackson, Andrew P.; Thomas, Gavin H.; Parkhill, Julian; Thomson, Nicholas R. // BMC Genomics;2009, Vol. 10, p584 

    Background: Rhs genes are prominent features of bacterial genomes that have previously been implicated in genomic rearrangements in E. coli. By comparing rhs repertoires across the Enterobacteriaceae, this study provides a robust explanation of rhs diversification and evolution, and a...

  • The use of comparative genomic hybridization to characterize genome dynamics and diversity among the serotypes of Shigella. Peng, Junping; Zhang, Xiaobing; Yang, Jian; Wang, Jing; Yang, E; Bin, Wen; Wei, Candong; Sun, Meisheng; Jin, Qi // BMC Genomics;2006, Vol. 7, p1 

    Background: Compelling evidence indicates that Shigella species, the etiologic agents of bacillary dysentery, as well as enteroinvasive Escherichia coli, are derived from multiple origins of Escherichia coli and form a single pathovar. To further understand the genome diversity and virulence...

  • Identification and Quantification of Genomic Repeats and Sample Contamination in Assemblies of 454 Pyrosequencing Reads. Nederbragt, Alexander J.; Rounge, Trine Ballestad; Kausrud, Kyrre L.; Jakobsen, Kjetill S. // Sequencing;2010, p1 

    Contigs assembled from 454 reads from bacterial genomes demonstrate a range of read depths, with a number of contigs having a depth that is far higher than can be expected. For reference genome sequence datasets, there exists a high correlation between the contig specific read depth and the...

  • RsfA (YbeB) Proteins Are Conserved Ribosomal Silencing Factors. Häuser, Roman; Pech, Markus; Kijek, Jaroslaw; Yamamoto, Hiroshi; Titz, Björn; Naeve, Florian; Tovchigrechko, Andrey; Yamamoto, Kaori; Szaflarski, Witold; Takeuchi, Nono; Stellberger, Thorsten; Diefenbacher, Markus E.; Nierhaus, Knud H.; Uetz, Peter // PLoS Genetics;Jul2012, Vol. 8 Issue 7, Special section p1 

    The YbeB (DUF143) family of uncharacterized proteins is encoded by almost all bacterial and eukaryotic genomes but not archaea. While they have been shown to be associated with ribosomes, their molecular function remains unclear. Here we show that YbeB is a ribosomal silencing factor (RsfA) in...

  • Ten years of bacterial genome sequencing: comparative-genomics-based discoveries. Binnewies, Tim T.; Motro, Yair; Hallin, Peter F.; Lund, Ole; Dunn, David; La, Tom; Hampson, David J.; Bellgard, Matthew; Wassenaar, Trudy M.; Ussery, David W. // Functional & Integrative Genomics;Jul2006, Vol. 6 Issue 3, p165 

    It has been more than 10 years since the first bacterial genome sequence was published. Hundreds of bacterial genome sequences are now available for comparative genomics, and searching a given protein against more than a thousand genomes will soon be possible. The subject of this review will...

  • Low-temperature bacterial viruses VR - a small but diverse group of E. coli phages. Kaliniene, Laura; Zajančkauskaitė, Aurelija; Šimoliūnas, Eugenijus; Truncaitė, Lidija; Meškys, Rolandas // Archives of Virology;May2015, Vol. 160 Issue 5, p1367 

    The complete genome sequences of four low-temperature Escherichia coli-specific tevenviruses, vb_EcoM-VR5, vb_EcoM-VR20, vb_EcoM-VR25 and vb_EcoM-VR26, were determined. Genomic comparisons including recently described genomes of vb_EcoM-VR7 and JS98 as well as phage T4 allowed the identification...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics