U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Download Assembly



P_latipinna-1.0

Organism name:
Poecilia latipinna (sailfin molly)
Isolate:
Pla-1442-1.1
Sex:
female
BioSample:
SAMN02048973
BioProject:
PRJNA196862
Submitter:
The Genome Institute at Washington University School of Medicine (WUGSC)
Date:
2015/11/13
Assembly level:
Scaffold
Genome representation:
full
RefSeq category:
representative genome
GenBank assembly accession:
GCA_001443285.1 (latest)
RefSeq assembly accession:
GCF_001443285.1 (latest)
RefSeq assembly and GenBank assembly identical:
yes
WGS Project:
LMXD01
Assembly method:
ARGO v. 2014
Genome coverage:
34x
Sequencing technology:
Illumina

IDs: 583111 [UID] 2684078 [GenBank] 2721448 [RefSeq]

See Genome Information for Poecilia latipinna

History (Show revision history)

Comment

Poeciliia latipinna (Sailfin molly) Sequence Assembly Release Notes
 The Sailfin molly DNA for shotgun sequencing is derived from an adult female (Poecilia latipinna; fish id # Pla-1442-1.1) within the laboratory of Dr. Manfred Schartl at the Biocenter of the ... University of Wuerzburg, Germany. Total sequence genome input coverage on the Illumina HiSeq instrument was 
34x (20x fragments, and 14x 3kb). 
 The combined sequence reads were assembled into contigs by Richa Agarwala by aligning the reads to the Poecilia formosa draft assembly. The P. latipinna assembly was further improved with the external scaffolding tool SSPACE (Boetzer 2010), and with a custom gap filling tool similar to IMAGE (Tsai 2010). This 1.0 version has been cleaned of contaminating contigs, and contigs 200bp and less were removed. The assembly is made up of a total of 17,988 scaffolds (including single contig scaffolds) with an N50 scaffold length of 250Mb (N50 contig length was 33kb). The assembly spans 815Mb. 
 This work was supported by NIH grant R24 RR032658-01 to Dr. Warren, Washington University School of Medicine.
 DNA Source Contact: Dr. Manfred Schartl, Physiological Chemistry I University of Wuerzburg Biozentrum, Am Hubland Wuerzburg 97074 Germany
 Poecillia latipinna 1.0 Sequence and Assembly Credits:
 DNA source - Dr. Manfred Schartl, Physiologische Chemie, Biozentrum, Am Hubland, Universitat Wuerzburg, Germany. Genome Sequence - McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO. Sequence Assembly - Richa Agarwala, National Center for Biotechnology Information, NIH, Bethesda, MD, USA, McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO. Assembly curation - Pat Minx, McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO.
 Citation upon use of this assembly in a manuscript:
 It is requested that users of this Poecillia latipinna 1.0 sequence assembly acknowledge Wesley Warren and McDonnell Genome Institute, Washington University School of Medicine in any publications that result from use of this sequence assembly. 

 Poecillia latipinna 1.0 assembly statistics: (scaffold = supercontig)
 *** Contiguity: Contig *** Total contig number: 54625 Total contig bases: 679780316 bp Average contig length: 12444 bp Maximum contig length: 267274 bp N50 contig length: 33278 bp N50 contig number: 5928
 *** Contiguity: Supercontig *** Total supercontig number: 17988 Average supercontig length: 37791 bp Maximum supercontig length: 1730811 bp N50 supercontig length: 250516 bp N50 supercontig number: 831
 *** Scaffold Distribution *** Scaffolds > 1M: 13 Scaffold 250K--1M: 818 Scaffold 100K--250K: 1232 Scaffold 10--100K: 3108 Scaffold 5--10K: 817 Scaffold 2--5K: 1196 Scaffold 0--2K: 10804  more

Global statistics

Total sequence length815,144,743
Total ungapped length679,780,316
Gaps between scaffolds0
Number of scaffolds17,988
Scaffold N50279,200
Scaffold L50882
Number of contigs54,625
Contig N5033,278
Contig L505,928
Total number of chromosomes and plasmids0
Number of component sequences (WGS or clone)54,625

Supplemental Content

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...

Global assembly definition

Download the full sequence report
Click on the table row to see sequence details in the table to the right
Assembly Unit Name
Primary Assembly
The primary assembly unit does not have any assembled chromosomes or linkage groups.
Please download the full sequence report for information on the scaffolds.

Assembly statistics

MoleculeTotal
Length
Scaffold
Count
Ungapped
Length
Scaffold
N50
Spanned
Gaps
Unspanned
Gaps
unplaced815,144,74317,988679,780,316279,20036,6370