fErpCal1.1

  • Record removed. This version of the assembly has been suppressed.
Organism name:
Erpetoichthys calabaricus (reedfish)
BioSample:
SAMEA104026374
BioProject:
PRJEB31579
Submitter:
SC
Date:
2019/03/26
Assembly level:
Chromosome
Genome representation:
full
GenBank assembly accession:
GCA_900747795.1 (suppressed)
RefSeq assembly accession:
n/a
RefSeq assembly and GenBank assembly identical:
n/a
WGS Project:
CAADRL01
Genome coverage:
51x

IDs: 2383971 [UID] 8850928 [GenBank]

There are 2 assemblies for this organism

Comment

The assembly fErpCal1.2 is based on 51x PacBio Sequel data, 36x coverage Illumina HiSeqX data from a 10X Genomics Chromium library generated at the Wellcome Sanger Institute, BioNano Saphyr DLE data generated at the Rockefeller University ... Vertebrate Genome Laboratory, and 69x coverage HiSeqX data from a Hi-C library prepared by Arima Genomics. An initial PacBio assembly was made using Falcon-unzip. The primary contigs were scaffolded first using the 10X data with scaff10x, then with BioNano hybrid scaffolding, and finally using the Hi-C data with SALSA2. Polishing and gap-filling of both the primary scaffolds and the haplotigs were performed using the PacBio reads and Arrow, followed by two rounds of Illumina polishing using the 10X data and freebayes. The mitochondrial assembly was produced at The Rockefeller University using mitoVGP. Finally, the assembly was manually improved using gEVAL to correct mis-joins, improve concordance with the BioNano and Hi-C data, and remove retained haplotypic duplication using purge_haplotigs. Chromosomes identified from the Hi-C data have been named in order of size.

Global statistics

Total sequence length: 3,555,691,057
Total ungapped length: 3,345,461,422
Gaps between scaffolds: 0
Number of scaffolds: 41
Scaffold N50: 199,226,436
Scaffold L50: 7
Number of contigs: 4,760
Contig N50: 1,252,034
Contig L50: 773
Total number of chromosomes and plasmids: 18
Number of component sequences (WGS or clone): 41
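
The scaffold N50 and L50 above can be reproduced from the scaffold lengths. The following is a minimal sketch, not NCBI's implementation: the function name `n50_l50` is illustrative, and the lengths are taken from the Assembly statistics table on this page. The page does not list individual unlocalized scaffold lengths, so the per-chromosome unlocalized totals stand in for them. This is an assumption, but it cannot change the result here because every unlocalized total is shorter than the smallest chromosome scaffold.

```python
def n50_l50(lengths):
    """Return (N50, L50): the length of the scaffold at which the running
    sum of lengths, taken longest first, reaches half of the total, and
    the number of scaffolds needed to get there."""
    ordered = sorted(lengths, reverse=True)
    total = sum(ordered)
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if 2 * running >= total:  # integer form of running >= total / 2
            return length, count
    raise ValueError("lengths must be non-empty")


# "Assembled molecule" lengths for chromosomes 1-18, from this page.
chromosome_scaffolds = [
    350_098_611, 332_461_053, 309_313_702, 293_870_033, 232_056_431,
    209_933_285, 199_226_436, 197_106_406, 195_124_633, 172_872_454,
    165_569_754, 163_428_598, 141_079_645, 111_844_246, 103_228_439,
    102_189_247, 100_832_283, 88_365_115,
]

# ASSUMPTION: per-chromosome unlocalized totals (chromosomes 1-6 and 13)
# used in place of the individual unlocalized scaffolds.
unlocalized_totals = [
    7_678_680, 16_945_847, 2_812_089, 32_413_583,
    15_942_268, 6_377_840, 4_920_379,
]

all_lengths = chromosome_scaffolds + unlocalized_totals
print(sum(all_lengths))      # 3555691057, the total sequence length
print(n50_l50(all_lengths))  # (199226436, 7)
```

The six longest scaffolds sum to 1,727,733,115 bases, just under half of the total, so the seventh (chromosome 7) sets both the N50 and the L50.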

Global assembly definition

Assembly Unit: Primary Assembly (GCA_900747794.1)

Molecule name | GenBank sequence |     | RefSeq sequence | Unlocalized sequences count
Chromosome 1  | LR536432.1       | n/a | n/a             | 2
Chromosome 2  | LR536433.1       | n/a | n/a             | 5
Chromosome 3  | LR536434.1       | n/a | n/a             | 1
Chromosome 4  | LR536435.1       | n/a | n/a             | 6
Chromosome 5  | LR536436.1       | n/a | n/a             | 5
Chromosome 6  | LR536437.1       | n/a | n/a             | 2
Chromosome 7  | LR536438.1       | n/a | n/a             | 0
Chromosome 8  | LR536439.1       | n/a | n/a             | 0
Chromosome 9  | LR536440.1       | n/a | n/a             | 0
Chromosome 10 | LR536441.1       | n/a | n/a             | 0
Chromosome 11 | LR536442.1       | n/a | n/a             | 0
Chromosome 12 | LR536443.1       | n/a | n/a             | 0
Chromosome 13 | LR536444.1       | n/a | n/a             | 2
Chromosome 14 | LR536445.1       | n/a | n/a             | 0
Chromosome 15 | LR536446.1       | n/a | n/a             | 0
Chromosome 16 | LR536447.1       | n/a | n/a             | 0
Chromosome 17 | LR536448.1       | n/a | n/a             | 0
Chromosome 18 | LR536449.1       | n/a | n/a             | 0

Assembly statistics

Molecule      | Sequence role         | Total length  | Scaffold count | Ungapped length | Scaffold N50 | Spanned gaps | Unspanned gaps
All           | Assembled molecule    | 3,555,691,057 | 41             | 3,345,461,422   | 199,226,436  | 4,719        | 0
Chromosome 1  | All                   | 357,777,291   | 3              | 332,548,549     | 350,098,611  | 440          | 0
Chromosome 1  | Assembled molecule    | 350,098,611   | 1              | 328,090,831     | 350,098,611  | 423          | 0
Chromosome 1  | Unlocalized scaffolds | 7,678,680     | 2              | 4,457,718       | 4,163,559    | 17           | 0
Chromosome 2  | All                   | 349,406,900   | 6              | 323,988,100     | 332,461,053  | 432          | 0
Chromosome 2  | Assembled molecule    | 332,461,053   | 1              | 310,752,703     | 332,461,053  | 399          | 0
Chromosome 2  | Unlocalized scaffolds | 16,945,847    | 5              | 13,235,397      | 3,995,870    | 33           | 0
Chromosome 3  | All                   | 312,125,791   | 2              | 286,998,137     | 309,313,702  | 436          | 0
Chromosome 3  | Assembled molecule    | 309,313,702   | 1              | 285,340,281     | 309,313,702  | 429          | 0
Chromosome 3  | Unlocalized scaffolds | 2,812,089     | 1              | 1,657,856       | 2,812,089    | 7            | 0
Chromosome 4  | All                   | 326,283,616   | 7              | 306,206,804     | 293,870,033  | 457          | 0
Chromosome 4  | Assembled molecule    | 293,870,033   | 1              | 282,767,910     | 293,870,033  | 378          | 0
Chromosome 4  | Unlocalized scaffolds | 32,413,583    | 6              | 23,438,894      | 6,189,557    | 79           | 0
Chromosome 5  | All                   | 247,998,699   | 6              | 231,455,811     | 232,056,431  | 342          | 0
Chromosome 5  | Assembled molecule    | 232,056,431   | 1              | 220,984,978     | 232,056,431  | 306          | 0
Chromosome 5  | Unlocalized scaffolds | 15,942,268    | 5              | 10,470,833      | 2,772,680    | 36           | 0
Chromosome 6  | All                   | 216,311,125   | 3              | 203,162,956     | 209,933,285  | 280          | 0
Chromosome 6  | Assembled molecule    | 209,933,285   | 1              | 198,008,184     | 209,933,285  | 266          | 0
Chromosome 6  | Unlocalized scaffolds | 6,377,840     | 2              | 5,154,772       | 3,213,186    | 14           | 0
Chromosome 7  | Assembled molecule    | 199,226,436   | 1              | 187,336,947     | 199,226,436  | 284          | 0
Chromosome 8  | Assembled molecule    | 197,106,406   | 1              | 189,510,732     | 197,106,406  | 260          | 0
Chromosome 9  | Assembled molecule    | 195,124,633   | 1              | 184,273,439     | 195,124,633  | 279          | 0
Chromosome 10 | Assembled molecule    | 172,872,454   | 1              | 167,188,721     | 172,872,454  | 222          | 0
Chromosome 11 | Assembled molecule    | 165,569,754   | 1              | 155,869,887     | 165,569,754  | 238          | 0
Chromosome 12 | Assembled molecule    | 163,428,598   | 1              | 158,255,434     | 163,428,598  | 198          | 0
Chromosome 13 | All                   | 146,000,024   | 3              | 139,784,624     | 141,079,645  | 202          | 0
Chromosome 13 | Assembled molecule    | 141,079,645   | 1              | 135,067,284     | 141,079,645  | 193          | 0
Chromosome 13 | Unlocalized scaffolds | 4,920,379     | 2              | 4,717,340       | 2,462,921    | 9            | 0
Chromosome 14 | Assembled molecule    | 111,844,246   | 1              | 106,061,130     | 111,844,246  | 135          | 0
Chromosome 15 | Assembled molecule    | 103,228,439   | 1              | 97,504,817      | 103,228,439  | 158          | 0
Chromosome 16 | Assembled molecule    | 102,189,247   | 1              | 98,740,383      | 102,189,247  | 127          | 0
Chromosome 17 | Assembled molecule    | 100,832,283   | 1              | 90,583,056      | 100,832,283  | 141          | 0
Chromosome 18 | Assembled molecule    | 88,365,115    | 1              | 85,991,895      | 88,365,115   | 88           | 0
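
The per-chromosome rows can be cross-checked against the global figures. The sketch below is illustrative only: the dictionary `per_chromosome` is a hypothetical container holding the scaffold count and spanned-gap count read from each chromosome's overall row. It assumes that each spanned gap separates exactly two contigs within a scaffold, so that, with no unspanned gaps, the contig count equals scaffolds plus spanned gaps.

```python
# chromosome number -> (scaffold count, spanned gaps), from the rows above.
# For chromosomes with unlocalized scaffolds these are the "All" values.
per_chromosome = {
    1: (3, 440), 2: (6, 432), 3: (2, 436), 4: (7, 457), 5: (6, 342),
    6: (3, 280), 7: (1, 284), 8: (1, 260), 9: (1, 279), 10: (1, 222),
    11: (1, 238), 12: (1, 198), 13: (3, 202), 14: (1, 135), 15: (1, 158),
    16: (1, 127), 17: (1, 141), 18: (1, 88),
}

scaffolds = sum(count for count, _ in per_chromosome.values())
spanned_gaps = sum(gaps for _, gaps in per_chromosome.values())

# ASSUMPTION: every spanned gap splits a scaffold into one more contig.
contigs = scaffolds + spanned_gaps

print(scaffolds)     # 41, the reported number of scaffolds
print(spanned_gaps)  # 4719, the reported spanned-gap total
print(contigs)       # 4760, the reported number of contigs
```

All three sums agree with the reported totals, which also confirms that the 23 unlocalized scaffolds account for the difference between 41 scaffolds and 18 chromosomes.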