U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Download Assembly



ASM181102v1

Organism name:
Haemophilus sp. HMSC066D03 (g-proteobacteria)
Taxonomy check:
OK
Infraspecific name:
Strain: HMSC066D03
BioSample:
SAMN04477593
BioProject:
PRJNA299930
Submitter:
The Genome Institute at Washington University
Date:
2016/10/21
Assembly type:
na
Assembly level:
Scaffold
Genome representation:
full
GenBank assembly accession:
GCA_001811025.1 (latest)
RefSeq assembly accession:
GCF_001811025.1 (latest)
RefSeq assembly and GenBank assembly identical:
yes
WGS Project:
LWQN01
Assembly method:
Velvet v. 1.1.06
Genome coverage:
202x
Sequencing technology:
Illumina

IDs: 838411 [UID] 3567538 [GenBank] 3617748 [RefSeq]

See Genome Information for Haemophilus sp. HMSC066D03

History (Show revision history)

Comment

The WUSC is a large strain collection isolated from clinical samples. Each sample is associated with metadata, including source, isolation site, 16s rRNA, metabolic, and other phenotypic information. The goal is to sample an adequate number of important, yet ... minor species, further adding to the catelogue of sequenced bacterial genomes and improving the diversity of the genomes available to the public. WGS will be preformed on approximely 550 isolates. Samples were selected based on RDP analysis at the genus level. This is a reference genomes for the Human Microbiome Project and the work was funded by the National Institutes of Health (NIH) grant U54 HG004968.
Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/  more

Genome-Annotation-Data

##Genome-Annotation-Data-START##
Annotation Provider::NCBI
Annotation Date::04/22/2016 17:59:08
Annotation Pipeline::NCBI Prokaryotic Genome Annotation Pipeline
Annotation Method::Best-placed reference protein set; GeneMarkS+
Annotation Software revision::3.1
Features Annotated::Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
Genes (total)::1,960
CDS (total)::1,904
Genes (coding)::1,856
CDS (coding)::1,856
Genes (RNA)::56
rRNAs::2, 1, 4 (5S, 16S, 23S)
complete rRNAs::2, 1 (5S, 16S)
partial rRNAs::4 (23S)
tRNAs::45
ncRNAs::4
Pseudo Genes (total)::48
Pseudo Genes (ambiguous residues)::0 of 48
Pseudo Genes (frameshifted)::13 of 48
Pseudo Genes (incomplete)::34 of 48
Pseudo Genes (internal stop)::2 of 48
Pseudo Genes (multiple problems)::1 of 48
##Genome-Annotation-Data-END##

Global statistics

Total sequence length1,974,023
Total ungapped length1,970,523
Gaps between scaffolds0
Number of scaffolds35
Scaffold N50249,354
Scaffold L503
Number of contigs70
Contig N5058,774
Contig L5012
Total number of chromosomes and plasmids0
Number of component sequences (WGS or clone)70

Supplemental Content

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...

Global assembly definition

Download the full sequence report
Click on the table row to see sequence details in the table to the right
Assembly Unit Name
Primary Assembly
The primary assembly unit does not have any assembled chromosomes or linkage groups.
Please download the full sequence report for information on the scaffolds.

Assembly statistics

MoleculeTotal
Length
Scaffold
Count
Ungapped
Length
Scaffold
N50
Spanned
Gaps
Unspanned
Gaps
unplaced1,974,023351,970,523249,354350

Assembly QA

Taxonomy Check Data

Declared organism

Organism nameSpecies name
Haemophilus sp. HMSC066D03Haemophilus sp. HMSC066D03

Best-matching type-strain assembly for declared species

AssemblyOrganism nameType category
no-typeno-type

Best-matching type-strain assembly

AssemblySpecies nameType category
GCA_016127215.1Haemophilus parainfluenzaesuspected-type

Average Nucleotide Identity (ANI) data

ANIQuery coverageSubject coverage
Declared typenanana
Best-match type93.9786.4779.56

ANI result

Taxonomy check statusBest match statusComment
OKgenus-matchna