U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Download Assembly



fErpCal1.3

Organism name:
Erpetoichthys calabaricus (reedfish)
BioSample:
SAMEA104026374
BioProject:
PRJEB31579
Submitter:
Wellcome Sanger Institute
Date:
2021/07/07
Assembly type:
haploid (principal pseudohaplotype of diploid)
Assembly level:
Chromosome
Genome representation:
full
RefSeq category:
representative genome
GenBank assembly accession:
GCA_900747795.4 (latest)
RefSeq assembly accession:
GCF_900747795.2 (latest)
RefSeq assembly and GenBank assembly identical:
no (hide details)
  • Different: chromosome MT.
  • Data displayed for RefSeq version
WGS Project:
CAADRL03
Genome coverage:
58x
Linked assembly:
GCA_900700845.3 (alternate pseudohaplotype of diploid)

IDs: 10494541 [UID] 27856948 [GenBank] 37153318 [RefSeq]

See Genome Information for Erpetoichthys calabaricus

There are 2 assemblies for this organism

See more

History (Show revision history)

Comment

The assembly fErpCal1.3 is based on 58x PacBio data, 31x 10X Genomics Chromium data, and 35x Arima Hi-C data generated at the Wellcome Sanger Institute, as well as BioNano Saphyr DLE data generated at the Rockefeller University Vertebrate Genome ... Laboratory. The assembly process included the following sequence of steps: initial PacBio assembly generation with Falcon-unzip, retained haplotig separation with purge_dups, 10X based scaffolding with scaff10x, BioNano hybrid-scaffolding with Solve, Hi-C based scaffolding with SALSA2, Arrow polishing using Merfin, and two rounds of FreeBayes polishing. Finally, the assembly was analysed and manually improved using gEVAL. The mitochondrial assembly was produced at The Rockefeller University using mitoVGP. Chromosome-scale scaffolds confirmed by the Hi-C data have been named in order of size. The GC profile of chromosome 11 is very unusual when compared with the other chromosomes.  more

Global statistics

Total sequence length3,613,551,144
Total ungapped length3,583,843,487
Gaps between scaffolds0
Number of scaffolds169
Scaffold N50217,689,105
Scaffold L506
Number of contigs1,600
Contig N506,772,339
Contig L50155
Total number of chromosomes and plasmids19
Number of component sequences (WGS or clone)169

Supplemental Content

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...

Global assembly definition

Download the full sequence report
Click on the table row to see sequence details in the table to the right
Assembly Unit Name
Primary Assembly
non-nuclear
Assembly Unit: Primary Assembly (GCF_900747794.2)
Molecule nameGenBank sequenceRefSeq sequenceUnlocalized
sequences count
Chromosome 1LR536432.2=NC_041394.227
Chromosome 2LR536433.2=NC_041395.20
Chromosome 3LR536434.2=NC_041396.20
Chromosome 4LR536435.2=NC_041397.20
Chromosome 5LR536436.2=NC_041398.20
Chromosome 6LR536437.2=NC_041399.20
Chromosome 7LR536438.2=NC_041400.20
Chromosome 8LR536439.2=NC_041401.20
Chromosome 9LR536440.2=NC_041402.20
Chromosome 10LR536441.2=NC_041403.20
Chromosome 11LR536442.2=NC_041404.20
Chromosome 12LR536443.2=NC_041405.20
Chromosome 13LR536444.2=NC_041406.20
Chromosome 14LR536445.2=NC_041407.20
Chromosome 15LR536446.2=NC_041408.20
Chromosome 16LR536447.2=NC_041409.20
Chromosome 17LR536448.2=NC_041410.20
Chromosome 18LR536449.2=NC_041411.20
unplacedn/an/an/a123

Assembly statistics

MoleculeSequence RoleTotal
Length
Scaffold
Count
Ungapped
Length
Scaffold
N50
Spanned
Gaps
Unspanned
Gaps
AllAssembled molecule3,613,534,4701683,583,826,813217,689,1051,4310
Chromosome 1AllAssembled moleculeUnlocalized scaffolds382,548,849356,776,21925,772,63028127372,283,544354,514,84417,768,700356,776,219356,776,2191,656,43724416282000
Chromosome 2Assembled molecule350,268,6371347,946,127350,268,6371490
Chromosome 3Assembled molecule316,334,6991312,857,048316,334,6991430
Chromosome 4Assembled molecule337,490,6351334,085,303337,490,6351480
Chromosome 5Assembled molecule252,032,9051250,217,644252,032,9051130
Chromosome 6Assembled molecule217,689,1051215,671,918217,689,105770
Chromosome 7Assembled molecule199,443,0071198,889,654199,443,007560
Chromosome 8Assembled molecule198,537,5091197,746,344198,537,509700
Chromosome 9Assembled molecule197,358,1131196,548,287197,358,113730
Chromosome 10Assembled molecule170,808,1971170,206,067170,808,197380
Chromosome 11Assembled molecule166,547,1831166,005,173166,547,183560
Chromosome 12Assembled molecule164,003,3111163,672,455164,003,311520
Chromosome 13Assembled molecule146,904,6621146,463,019146,904,662520
Chromosome 14Assembled molecule112,108,5771111,201,929112,108,577290
Chromosome 15Assembled molecule103,294,6051103,076,135103,294,605280
Chromosome 16Assembled molecule103,693,3221103,281,916103,693,322300
Chromosome 17Assembled molecule100,954,5461100,746,826100,954,546380
Chromosome 18Assembled molecule89,129,353188,541,06989,129,353300
unplacedAssembled molecule4,387,2551234,386,35543,43950
MoleculeTotal
Length
Mitochondrion MT16,674