The genome sequence of E. coli W (ATCC 9637): Comparative genome analysis and an improved genome-scale reconstruction of E. coli

Colin T. Archer, Jihyun F. Kim, Haeyoung Jeong, Jin H. Park, Claudia E. Vickers, Sang Y. Lee, Lars K. Nielsen

Research output: Contribution to journalArticle

81 Citations (Scopus)

Abstract

Escherichia coli is a model prokaryote, an important pathogen, and a key organism for industrial biotechnology. E. coli W (ATCC 9637), one of four strains designated as safe for laboratory purposes, has not been sequenced. E. coli W is a fast-growing strain and is the only safe strain that can utilize sucrose as a carbon source. Lifecycle analysis has demonstrated that sucrose from sugarcane is a preferred carbon source for industrial bioprocesses.Results: We have sequenced and annotated the genome of E. coli W. The chromosome is 4,900,968 bp and encodes 4,764 ORFs. Two plasmids, pRK1 (102,536 bp) and pRK2 (5,360 bp), are also present. W has unique features relative to other sequenced laboratory strains (K-12, B and Crooks): it has a larger genome and belongs to phylogroup B1 rather than A. W also grows on a much broader range of carbon sources than does K-12. A genome-scale reconstruction was developed and validated in order to interrogate metabolic properties.Conclusions: The genome of W is more similar to commensal and pathogenic B1 strains than phylogroup A strains, and therefore has greater utility for comparative analyses with these strains. W should therefore be the strain of choice, or 'type strain' for group B1 comparative analyses. The genome annotation and tools created here are expected to allow further utilization and development of E. coli W as an industrial organism for sucrose-based bioprocesses. Refinements in our E. coli metabolic reconstruction allow it to more accurately define E. coli metabolism relative to previous models.

Original languageEnglish
Article number9
JournalBMC Genomics
Volume12
DOIs
Publication statusPublished - 2011 Jan 6

Fingerprint

Genome
Escherichia coli
Sucrose
Carbon
Saccharum
Chromosomes, Human, Pair 4
Biotechnology
Open Reading Frames
Plasmids

All Science Journal Classification (ASJC) codes

  • Biotechnology
  • Genetics

Cite this

Archer, Colin T. ; Kim, Jihyun F. ; Jeong, Haeyoung ; Park, Jin H. ; Vickers, Claudia E. ; Lee, Sang Y. ; Nielsen, Lars K. / The genome sequence of E. coli W (ATCC 9637) : Comparative genome analysis and an improved genome-scale reconstruction of E. coli. In: BMC Genomics. 2011 ; Vol. 12.
@article{fbe3fa21eef74edcbf93dd754b59b410,
title = "The genome sequence of E. coli W (ATCC 9637): Comparative genome analysis and an improved genome-scale reconstruction of E. coli",
abstract = "Escherichia coli is a model prokaryote, an important pathogen, and a key organism for industrial biotechnology. E. coli W (ATCC 9637), one of four strains designated as safe for laboratory purposes, has not been sequenced. E. coli W is a fast-growing strain and is the only safe strain that can utilize sucrose as a carbon source. Lifecycle analysis has demonstrated that sucrose from sugarcane is a preferred carbon source for industrial bioprocesses.Results: We have sequenced and annotated the genome of E. coli W. The chromosome is 4,900,968 bp and encodes 4,764 ORFs. Two plasmids, pRK1 (102,536 bp) and pRK2 (5,360 bp), are also present. W has unique features relative to other sequenced laboratory strains (K-12, B and Crooks): it has a larger genome and belongs to phylogroup B1 rather than A. W also grows on a much broader range of carbon sources than does K-12. A genome-scale reconstruction was developed and validated in order to interrogate metabolic properties.Conclusions: The genome of W is more similar to commensal and pathogenic B1 strains than phylogroup A strains, and therefore has greater utility for comparative analyses with these strains. W should therefore be the strain of choice, or 'type strain' for group B1 comparative analyses. The genome annotation and tools created here are expected to allow further utilization and development of E. coli W as an industrial organism for sucrose-based bioprocesses. Refinements in our E. coli metabolic reconstruction allow it to more accurately define E. coli metabolism relative to previous models.",
author = "Archer, {Colin T.} and Kim, {Jihyun F.} and Haeyoung Jeong and Park, {Jin H.} and Vickers, {Claudia E.} and Lee, {Sang Y.} and Nielsen, {Lars K.}",
year = "2011",
month = "1",
day = "6",
doi = "10.1186/1471-2164-12-9",
language = "English",
volume = "12",
journal = "BMC Genomics",
issn = "1471-2164",
publisher = "BioMed Central",

}

The genome sequence of E. coli W (ATCC 9637) : Comparative genome analysis and an improved genome-scale reconstruction of E. coli. / Archer, Colin T.; Kim, Jihyun F.; Jeong, Haeyoung; Park, Jin H.; Vickers, Claudia E.; Lee, Sang Y.; Nielsen, Lars K.

In: BMC Genomics, Vol. 12, 9, 06.01.2011.

Research output: Contribution to journalArticle

TY - JOUR

T1 - The genome sequence of E. coli W (ATCC 9637)

T2 - Comparative genome analysis and an improved genome-scale reconstruction of E. coli

AU - Archer, Colin T.

AU - Kim, Jihyun F.

AU - Jeong, Haeyoung

AU - Park, Jin H.

AU - Vickers, Claudia E.

AU - Lee, Sang Y.

AU - Nielsen, Lars K.

PY - 2011/1/6

Y1 - 2011/1/6

N2 - Escherichia coli is a model prokaryote, an important pathogen, and a key organism for industrial biotechnology. E. coli W (ATCC 9637), one of four strains designated as safe for laboratory purposes, has not been sequenced. E. coli W is a fast-growing strain and is the only safe strain that can utilize sucrose as a carbon source. Lifecycle analysis has demonstrated that sucrose from sugarcane is a preferred carbon source for industrial bioprocesses.Results: We have sequenced and annotated the genome of E. coli W. The chromosome is 4,900,968 bp and encodes 4,764 ORFs. Two plasmids, pRK1 (102,536 bp) and pRK2 (5,360 bp), are also present. W has unique features relative to other sequenced laboratory strains (K-12, B and Crooks): it has a larger genome and belongs to phylogroup B1 rather than A. W also grows on a much broader range of carbon sources than does K-12. A genome-scale reconstruction was developed and validated in order to interrogate metabolic properties.Conclusions: The genome of W is more similar to commensal and pathogenic B1 strains than phylogroup A strains, and therefore has greater utility for comparative analyses with these strains. W should therefore be the strain of choice, or 'type strain' for group B1 comparative analyses. The genome annotation and tools created here are expected to allow further utilization and development of E. coli W as an industrial organism for sucrose-based bioprocesses. Refinements in our E. coli metabolic reconstruction allow it to more accurately define E. coli metabolism relative to previous models.

AB - Escherichia coli is a model prokaryote, an important pathogen, and a key organism for industrial biotechnology. E. coli W (ATCC 9637), one of four strains designated as safe for laboratory purposes, has not been sequenced. E. coli W is a fast-growing strain and is the only safe strain that can utilize sucrose as a carbon source. Lifecycle analysis has demonstrated that sucrose from sugarcane is a preferred carbon source for industrial bioprocesses.Results: We have sequenced and annotated the genome of E. coli W. The chromosome is 4,900,968 bp and encodes 4,764 ORFs. Two plasmids, pRK1 (102,536 bp) and pRK2 (5,360 bp), are also present. W has unique features relative to other sequenced laboratory strains (K-12, B and Crooks): it has a larger genome and belongs to phylogroup B1 rather than A. W also grows on a much broader range of carbon sources than does K-12. A genome-scale reconstruction was developed and validated in order to interrogate metabolic properties.Conclusions: The genome of W is more similar to commensal and pathogenic B1 strains than phylogroup A strains, and therefore has greater utility for comparative analyses with these strains. W should therefore be the strain of choice, or 'type strain' for group B1 comparative analyses. The genome annotation and tools created here are expected to allow further utilization and development of E. coli W as an industrial organism for sucrose-based bioprocesses. Refinements in our E. coli metabolic reconstruction allow it to more accurately define E. coli metabolism relative to previous models.

UR - http://www.scopus.com/inward/record.url?scp=78650900723&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78650900723&partnerID=8YFLogxK

U2 - 10.1186/1471-2164-12-9

DO - 10.1186/1471-2164-12-9

M3 - Article

C2 - 21208457

AN - SCOPUS:78650900723

VL - 12

JO - BMC Genomics

JF - BMC Genomics

SN - 1471-2164

M1 - 9

ER -