U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Download Assembly



ASM48839v1

Organism name:
Escherichia coli 907701 (E. coli)
Taxonomy check:
OK
Infraspecific name:
Strain: 907701
BioSample:
SAMN02436804
BioProject:
PRJNA183807
Submitter:
Washington University
Date:
2013/10/29
Assembly type:
na
Assembly level:
Scaffold
Genome representation:
full
GenBank assembly accession:
GCA_000488395.1 (latest)
RefSeq assembly accession:
GCF_000488395.1 (latest)
RefSeq assembly and GenBank assembly identical:
yes
WGS Project:
AXTG01
Assembly method:
Velvet v. 1.1.06
Genome coverage:
79x
Sequencing technology:
Illumina

IDs: 75791 [UID] 841228 [GenBank] 853808 [RefSeq]

See Genome Information for Escherichia coli

There are 274343 assemblies for this organism

See more

History (Show revision history)

Comment

Bacteria provided by David Creely and William Dunne (BioMerieux, Inc., 595 Anglum Road, Hazelwood, MO 63042).

Coding sequences were predicted using GeneMark and Glimmer3. Intergenic regions not spanned by GeneMark and Glimmer3 were blasted against NCBI's non-redundant (NR) database and ... predictions generated based on protein alignments. tRNA genes were determined using tRNAscan-SE and non-coding RNA genes by RNAmmer and Rfam. The final gene set is processed through several programs such as Kegg, psortB and Interproscan to determine possible function. Gene product names are determined by BER. Gene names are generated at the contig level and may not necessarily reflect any known order or orientation between contigs.

This is a reference genome for the Human Microbiome Project. This project is co-owned with the Human Microbiome Project DACC. This work was funded by the National Human Genome Research Institute (NHGRI)/National Institutes of Health (NIH) grant 5U54HG00496804 for characterization of this genome.  more

Global statistics

Total sequence length5,030,824
Total ungapped length5,020,824
Gaps between scaffolds0
Number of scaffolds149
Scaffold N50124,043
Scaffold L5014
Number of contigs249
Contig N5045,307
Contig L5033
Total number of chromosomes and plasmids0
Number of component sequences (WGS or clone)249

Supplemental Content

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...

Global assembly definition

Download the full sequence report
Click on the table row to see sequence details in the table to the right
Assembly Unit Name
Primary Assembly
The primary assembly unit does not have any assembled chromosomes or linkage groups.
Please download the full sequence report for information on the scaffolds.

Assembly statistics

MoleculeTotal
Length
Scaffold
Count
Ungapped
Length
Scaffold
N50
Spanned
Gaps
Unspanned
Gaps
unplaced5,030,8241495,020,824124,0431000

Assembly QA

Taxonomy Check Data

Declared organism

Organism nameSpecies name
Escherichia coli 907701Escherichia coli

Best-matching type-strain assembly for declared species

AssemblyOrganism nameType category
GCA_024519395.1Escherichia coli DSM 30083 = JCM 1649 = ATCC 11775neotype

Best-matching type-strain assembly

AssemblySpecies nameType category
GCA_024519395.1Escherichia colineotype

Average Nucleotide Identity (ANI) data

ANIQuery coverageSubject coverage
Declared type99.9198.5192.53
Best-match type99.9198.5192.53

ANI result

Taxonomy check statusBest match statusComment
OKspecies-matchna