icHarAxyr1.1

Organism name:
Harmonia axyridis (beetles)
BioSample:
SAMEA7520208
BioProject:
PRJEB47373
Submitter:
WELLCOME SANGER INSTITUTE
Date:
2021/09/16
Assembly type:
haploid (principal pseudohaplotype of diploid)
Assembly level:
Chromosome
Genome representation:
full
RefSeq category:
representative genome
GenBank assembly accession:
GCA_914767665.1 (latest)
RefSeq assembly accession:
GCF_914767665.1 (latest)
RefSeq assembly and GenBank assembly identical:
no
  • Only in GenBank: chromosome MT
  • Data displayed for RefSeq version
WGS Project:
CAJZBN01
Genome coverage:
53x
Linked assembly:
GCA_914767675.1 (alternate pseudohaplotype of diploid)

IDs: 10934161 [UID] 28912828 [GenBank] 30629928 [RefSeq]

There are 8 assemblies for this organism

Comment

The assembly icHarAxyr1.1 is based on 53x PacBio data, 10X Genomics Chromium data, and Arima Hi-C data generated by the Darwin Tree of Life Project (https://www.darwintreeoflife.org/). The assembly process included the following sequence of steps: initial PacBio assembly generation ... with Hifiasm, retained haplotig separation with purge_dups, short-read polishing using FreeBayes-called variants from 10X Genomics Chromium reads aligned with LongRanger, and Hi-C based scaffolding with SALSA2. The mitochondrial genome was assembled using MitoHifi. Finally, the assembly was analysed and manually improved using gEVAL. Chromosome-scale scaffolds confirmed by the Hi-C data have been named in order of size. Seven autosomes and one allosome (X) were curated for Harmonia axyridis. Some scaffolds remain unplaced because their repetitive content gives an ambiguous Hi-C signal. A large cluster of rDNA sequences was placed on X using Hi-C data only.

Global statistics

Total sequence length: 425,524,972
Total ungapped length: 425,487,872
Gaps between scaffolds: 0
Number of scaffolds: 13
Scaffold N50: 63,675,256
Scaffold L50: 3
Number of contigs: 185
Contig N50: 22,915,403
Contig L50: 7
Total number of chromosomes and plasmids: 8
Number of component sequences (WGS or clone): 13
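
The scaffold N50 and L50 above can be reproduced from the per-scaffold lengths reported in the assembly statistics for this record. The following is a minimal sketch, not NCBI's own implementation. It assumes the usual definition (N50 is the length of the shortest scaffold in the smallest set of longest scaffolds that covers at least half of the total length, and L50 is the size of that set). The function name `n50_l50` is hypothetical, and the two unplaced scaffold lengths are inferred from the reported unplaced total (52,967) and unplaced scaffold N50 (33,345) rather than listed individually in the record.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of sequence lengths."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            return length, count
    raise ValueError("lengths must not be empty")


# Scaffold lengths taken from the assembly statistics of this record.
scaffold_lengths = [
    87_845_136,  # Chromosome 1, assembled molecule
    64_433_203,  # Chromosome 2
    63_675_256,  # Chromosome 3
    49_282_603,  # Chromosome 4
    43_975_926,  # Chromosome 5
    40_338_688,  # Chromosome 6, assembled molecule
    37_127_597,  # Chromosome 7
    38_596_305,  # Chromosome X, assembled molecule
    46_387,      # Chromosome 1, unlocalized scaffold
    120_674,     # Chromosome 6, unlocalized scaffold
    30_230,      # Chromosome X, unlocalized scaffold
    33_345,      # unplaced scaffold (inferred: equals the unplaced N50)
    19_622,      # unplaced scaffold (inferred: 52,967 - 33,345)
]

n50, l50 = n50_l50(scaffold_lengths)
print(f"Scaffold N50: {n50:,}  Scaffold L50: {l50}")
```

The contig N50 and L50 cannot be checked the same way from this record, because individual contig lengths are not listed.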

Global assembly definition

Assembly Unit: Primary Assembly (GCF_914767664.1)

Molecule name   GenBank sequence        RefSeq sequence   Unlocalized sequences count
Chromosome 1    OU611927.1         =    NC_059501.1       1
Chromosome 2    OU611928.1         =    NC_059502.1       0
Chromosome 3    OU611929.1         =    NC_059503.1       0
Chromosome 4    OU611930.1         =    NC_059504.1       0
Chromosome 5    OU611931.1         =    NC_059505.1       0
Chromosome 6    OU611932.1         =    NC_059506.1       1
Chromosome 7    OU611934.1         =    NC_059507.1       0
Chromosome X    OU611933.1         =    NC_059508.1       1
unplaced        n/a                n/a  n/a               2

Assembly statistics

Molecule      Sequence Role          Total Length  Scaffold Count  Ungapped Length  Scaffold N50  Spanned Gaps  Unspanned Gaps
All           Assembled molecule     425,524,972   13              425,487,872      63,675,256    172           0
Chromosome 1  All                    87,891,523    2               87,890,623       87,845,136    3             0
Chromosome 1  Assembled molecule     87,845,136    1               87,844,236       87,845,136    3             0
Chromosome 1  Unlocalized scaffolds  46,387        1               46,387           46,387        0             0
Chromosome 2  Assembled molecule     64,433,203    1               64,431,503       64,433,203    7             0
Chromosome 3  Assembled molecule     63,675,256    1               63,675,056       63,675,256    1             0
Chromosome 4  Assembled molecule     49,282,603    1               49,281,303       49,282,603    5             0
Chromosome 5  Assembled molecule     43,975,926    1               43,975,426       43,975,926    1             0
Chromosome 6  All                    40,459,362    2               40,457,162       40,338,688    8             0
Chromosome 6  Assembled molecule     40,338,688    1               40,336,488       40,338,688    8             0
Chromosome 6  Unlocalized scaffolds  120,674       1               120,674          120,674       0             0
Chromosome 7  Assembled molecule     37,127,597    1               37,126,397       37,127,597    3             0
Chromosome X  All                    38,626,535    2               38,597,435       38,596,305    144           0
Chromosome X  Assembled molecule     38,596,305    1               38,567,205       38,596,305    144           0
Chromosome X  Unlocalized scaffolds  30,230        1               30,230           30,230        0             0
unplaced      Assembled molecule     52,967        2               52,967           33,345        0             0
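
The per-molecule figures can be cross-checked against the global statistics for this record. The sketch below sums the "All" value for each molecule (or the single reported value where only one role is listed) and compares the results with the reported totals. The variable names are hypothetical, and the last check rests on an assumption that is not stated in the record: that every spanned gap splits a scaffold into one additional contig, so the contig count equals the scaffold count plus the spanned gap count.

```python
# (total length, scaffold count, ungapped length, spanned gaps) per molecule,
# copied from the assembly statistics of this record.
per_molecule = {
    "Chromosome 1": (87_891_523, 2, 87_890_623, 3),
    "Chromosome 2": (64_433_203, 1, 64_431_503, 7),
    "Chromosome 3": (63_675_256, 1, 63_675_056, 1),
    "Chromosome 4": (49_282_603, 1, 49_281_303, 5),
    "Chromosome 5": (43_975_926, 1, 43_975_426, 1),
    "Chromosome 6": (40_459_362, 2, 40_457_162, 8),
    "Chromosome 7": (37_127_597, 1, 37_126_397, 3),
    "Chromosome X": (38_626_535, 2, 38_597_435, 144),
    "unplaced": (52_967, 2, 52_967, 0),
}

total_length = sum(row[0] for row in per_molecule.values())
scaffold_count = sum(row[1] for row in per_molecule.values())
ungapped_length = sum(row[2] for row in per_molecule.values())
spanned_gaps = sum(row[3] for row in per_molecule.values())

# Assumption: one extra contig per spanned gap.
implied_contigs = scaffold_count + spanned_gaps

print(f"Total sequence length: {total_length:,}")
print(f"Total ungapped length: {ungapped_length:,}")
print(f"Number of scaffolds:   {scaffold_count}")
print(f"Spanned gaps:          {spanned_gaps}")
print(f"Implied contig count:  {implied_contigs}")
```

Under that assumption the implied contig count agrees with the 185 contigs listed in the global statistics.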