mSarHar1.11

Organism name:
Sarcophilus harrisii (Tasmanian devil)
BioSample:
SAMEA6099886
BioProject:
PRJEB35073
Submitter:
SC
Date:
2019/11/07
Assembly type:
haploid (principal pseudohaplotype of diploid)
Assembly level:
Chromosome
Genome representation:
full
RefSeq category:
representative genome
GenBank assembly accession:
GCA_902635505.1 (latest)
RefSeq assembly accession:
GCF_902635505.1 (latest)
RefSeq assembly and GenBank assembly identical:
no
  • Only in RefSeq: chromosome MT
  • Data displayed for RefSeq version
WGS Project:
CACPPN01
Genome coverage:
88x
Linked assembly:
GCA_902648505.1 (alternate pseudohaplotype of diploid)

IDs: 5334631 [UID] 15441038 [GenBank] 15648578 [RefSeq]

There are 6 assemblies for this organism

Comment

The assembly mSarHar1.11 is based on 88x ONT data including 10x ultra long reads sequenced at Oxford Nanopore Technologies; 60x 10X Genomics Chromium data, BioNano data, 50x Illumina HiSeq XTen and 60x Dovetail Hi-C data generated at the Wellcome ... Sanger Institute. The assembly process included the following sequence of steps: initial ONT assembly generation with WTDBG, 10X based scaffolding with scaff10x, BioNano hybrid-scaffolding, Hi-C based scaffolding with scaffHiC, WTDBG2/Racon polishing, and two rounds of FreeBayes polishing. Finally, the assembly was analysed and manually improved using gEVAL, where haplotigs have been removed. Chromosome-scale scaffolds have been confirmed using the Hi-C data. Chromosomes are named according to established convention and the labels for chromosomes 1 and 2 are switched compared with a previous genome assembly (Devil7.0) (i.e. the chromosome labelled 1 in Devil7.0 is labelled 2 in the current assembly, and vice versa). Two contigs derived from the Y chromosome from a different individual, and sequenced with Illumina HiSeq 2000, have been added. The MT sequence from Devil7.0 has also been included.
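
Because the labels for chromosomes 1 and 2 are switched relative to Devil7.0, anything keyed by the old labels needs relabelling before it is compared with mSarHar1.11. The following is a minimal sketch of that relabelling. The names DEVIL7_TO_MSARHAR and translate_label are hypothetical, and the sketch assumes that every label other than 1 and 2 is unchanged, which the comment above does not explicitly state.

```python
# Hypothetical helper for moving chromosome labels between Devil7.0 and
# mSarHar1.11. Only the 1 <-> 2 swap comes from the assembly comment;
# leaving every other label untouched is an assumption of this sketch.
DEVIL7_TO_MSARHAR = {"1": "2", "2": "1"}


def translate_label(label: str) -> str:
    """Return the mSarHar1.11 label for a Devil7.0 chromosome label."""
    return DEVIL7_TO_MSARHAR.get(label, label)


# The swap is its own inverse, so the same function converts back.
relabelled = [translate_label(label) for label in ["1", "2", "3", "X"]]
```

This only relabels chromosomes. It does not convert coordinates between the two assemblies, which would need a sequence alignment.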

Global statistics

Total sequence length                         3,086,674,442
Total ungapped length                         3,086,627,696
Gaps between scaffolds                        0
Number of scaffolds                           106
Scaffold N50                                  611,347,268
Scaffold L50                                  3
Number of contigs                             445
Contig N50                                    62,339,597
Contig L50                                    14
Total number of chromosomes and plasmids      9
Number of component sequences (WGS or clone)  106
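
The scaffold N50 and L50 above can be reproduced from the per-molecule lengths listed in the Assembly statistics table of this record. The sketch below uses the common definition: sort the lengths in descending order, and N50 is the length at which the running sum first reaches half the total, while L50 is the number of sequences needed to get there. The function name n50_l50 is hypothetical. The 97 unplaced scaffolds are lumped into one entry because their individual lengths are not given here. This is an approximation, but it should not change the result, since their combined length is smaller than the shortest sequence that determines N50.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a collection of sequence lengths."""
    ordered = sorted(lengths, reverse=True)
    total = sum(ordered)
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        # Integer comparison avoids floating-point rounding at this scale.
        if 2 * running >= total:
            return length, count
    raise ValueError("no lengths supplied")


# Lengths from the Assembly statistics table of this record.
scaffold_lengths = [
    716_413_629,  # Chromosome 1
    662_751_787,  # Chromosome 2
    611_347_268,  # Chromosome 3
    464_895_054,  # Chromosome 4
    288_121_652,  # Chromosome 5
    254_895_979,  # Chromosome 6
    83_081_154,   # Chromosome X
    130_564,      # Chromosome Y
    5_020_238,    # 97 unplaced scaffolds, lumped (assumption of this sketch)
    17_117,       # Mitochondrion MT
]

n50, l50 = n50_l50(scaffold_lengths)
print(n50, l50)  # 611347268 3
```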

Global assembly definition

Assembly Unit Name
Primary Assembly
non-nuclear
Assembly Unit: Primary Assembly (GCF_902635504.1)
Molecule name   GenBank sequence      RefSeq sequence   Unlocalized sequences count
Chromosome 1    LR735554.1        =   NC_045426.1       0
Chromosome 2    LR735555.1        =   NC_045427.1       0
Chromosome 3    LR735556.1        =   NC_045428.1       0
Chromosome 4    LR735557.1        =   NC_045429.1       0
Chromosome 5    LR735558.1        =   NC_045430.1       0
Chromosome 6    LR735559.1        =   NC_045431.1       0
Chromosome X    LR735560.1        =   NC_045432.1       0
Chromosome Y    LR735561.1        =   NC_045433.1       0
unplaced        n/a               n/a n/a               97

Assembly statistics

Molecule          Total Length    Scaffold Count   Ungapped Length   Scaffold N50   Spanned Gaps   Unspanned Gaps
All               3,086,657,325   105              3,086,610,579     611,347,268    339            0
Chromosome 1      716,413,629     1                716,409,129       716,413,629    30             0
Chromosome 2      662,751,787     1                662,746,187       662,751,787    37             0
Chromosome 3      611,347,268     1                611,332,568       611,347,268    110            0
Chromosome 4      464,895,054     1                464,892,158       464,895,054    22             0
Chromosome 5      288,121,652     1                288,118,652       288,121,652    20             0
Chromosome 6      254,895,979     1                254,891,479       254,895,979    31             0
Chromosome X      83,081,154      1                83,070,104        83,081,154     84             0
Chromosome Y      130,564         1                130,464           130,564        1              0
unplaced          5,020,238       97               5,019,838         82,522         4              0

Molecule          Total Length
Mitochondrion MT  17,117
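
The per-molecule rows can be checked against the "All" row and the global statistics with simple column sums. The sketch below is one way to do that check, with the values transcribed by hand from the tables in this record. The names ROWS, MITOCHONDRION_LENGTH and column_sums are hypothetical.

```python
# Per-molecule rows transcribed from the Assembly statistics table:
# (total length, scaffold count, ungapped length, spanned gaps)
ROWS = {
    "Chromosome 1": (716_413_629, 1, 716_409_129, 30),
    "Chromosome 2": (662_751_787, 1, 662_746_187, 37),
    "Chromosome 3": (611_347_268, 1, 611_332_568, 110),
    "Chromosome 4": (464_895_054, 1, 464_892_158, 22),
    "Chromosome 5": (288_121_652, 1, 288_118_652, 20),
    "Chromosome 6": (254_895_979, 1, 254_891_479, 31),
    "Chromosome X": (83_081_154, 1, 83_070_104, 84),
    "Chromosome Y": (130_564, 1, 130_464, 1),
    "unplaced": (5_020_238, 97, 5_019_838, 4),
}
MITOCHONDRION_LENGTH = 17_117  # non-nuclear unit, a single gap-free sequence


def column_sums(rows):
    """Sum each column across all molecules."""
    return tuple(sum(column) for column in zip(*rows.values()))


primary = column_sums(ROWS)

# The primary-assembly sums should reproduce the "All" row.
assert primary == (3_086_657_325, 105, 3_086_610_579, 339)

# Adding the mitochondrion should reproduce the global statistics.
genome_total = primary[0] + MITOCHONDRION_LENGTH
genome_ungapped = primary[2] + MITOCHONDRION_LENGTH
gap_bases = genome_total - genome_ungapped

print(genome_total, genome_ungapped, gap_bases)
# 3086674442 3086627696 46746
```

The sums agree with the reported totals: 105 primary scaffolds plus the mitochondrion give the 106 scaffolds in the global statistics, and the 46,746 gap bases are spread over 339 spanned gaps.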