idAnoNiliSN_F5_01

Organism name:
Anopheles nili (mosquitos)
BioSample:
SAMEA12928243
BioProject:
PRJEB53353
Submitter:
WELLCOME SANGER INSTITUTE
Date:
2022/06/24
Assembly type:
haploid (principal pseudohaplotype of diploid)
Assembly level:
Chromosome
Genome representation:
full
RefSeq category:
representative genome
GenBank assembly accession:
GCA_943737925.1 (latest)
RefSeq assembly accession:
GCF_943737925.1 (latest)
RefSeq assembly and GenBank assembly identical:
yes
WGS Project:
CALSGA01
Genome coverage:
48x
Linked assembly:
GCA_943737935.1 (alternate pseudohaplotype of diploid)

IDs: 13067261 [UID] 34136098 [GenBank] 40766388 [RefSeq]

There are 3 assemblies for this organism

Comment

The assembly idAnoNiliSN_F5_01 is based on 48x PacBio HiFi data, 10X Genomics Chromium data, and Arima Hi-C data generated by the Anopheles Reference Genomes project. The assembly process included the following sequence of steps: initial PacBio assembly generation with ... hifiasm, retained haplotig separation with purge_dups, short-read polishing using FreeBayes-called variants from 10X Genomics Chromium reads aligned with LongRanger, and Hi-C based scaffolding with SALSA2. The mitochondrial genome was assembled using MitoHifi. Finally, the assembly was analysed and manually improved using gEVAL. Chromosome-scale scaffolds confirmed by the Hi-C data have been named, ordered and oriented based on published data.

Global statistics

Total sequence length: 195,236,048
Total ungapped length: 195,202,548
Gaps between scaffolds: 0
Number of scaffolds: 157
Scaffold N50: 75,938,266
Scaffold L50: 2
Number of contigs: 257
Contig N50: 37,444,657
Contig L50: 3
Total number of chromosomes and plasmids: 4
Number of component sequences (WGS or clone): 157
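
The N50 and L50 values above follow the usual definition: sort the sequences from longest to shortest and accumulate lengths until at least half of the total is covered; N50 is the length of the sequence that crosses that point and L50 is how many sequences were needed. The following is a minimal sketch of that calculation, not NCBI's implementation. The function name `n50_l50` and its optional `total` parameter are illustrative assumptions.

```python
def n50_l50(lengths, total=None):
    """Return (N50, L50) for a collection of sequence lengths.

    `total` defaults to sum(lengths). It can be passed explicitly when
    only the longest sequences are listed but the full assembly length
    is known (hypothetical convenience, not part of any NCBI tool).
    """
    ordered = sorted(lengths, reverse=True)
    if total is None:
        total = sum(ordered)
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        # Stop at the first sequence that brings coverage to >= 50%.
        if 2 * running >= total:
            return length, count
    raise ValueError("listed lengths do not reach half of the total")


# The three chromosome scaffolds from this record, with the reported
# total sequence length supplied explicitly.
print(n50_l50([78_106_464, 75_938_266, 15_967_621], total=195_236_048))
```

With the total sequence length of 195,236,048, chromosome 2 alone covers less than half, and chromosomes 2 and 3 together cover more than half, which is consistent with the reported scaffold N50 of 75,938,266 and L50 of 2.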

Global assembly definition

Assembly unit names: Primary Assembly, non-nuclear
Assembly Unit: Primary Assembly (GCF_943737924.1)
Molecule name  GenBank sequence     RefSeq sequence  Unlocalized sequences count
Chromosome 2   OX031310.1        =  NC_071291.1      0
Chromosome 3   OX031311.1        =  NC_071292.1      0
Chromosome X   OX031312.1        =  NC_071293.1      53
unplaced       n/a                  n/a              100

Assembly statistics

Molecule      Sequence role          Total length  Scaffold count  Ungapped length  Scaffold N50  Spanned gaps  Unspanned gaps
All           Assembled molecule     195,220,701   156             195,187,201      75,938,266    100           0
Chromosome 2  Assembled molecule     78,106,464    1               78,099,064       78,106,464    19            0
Chromosome 3  Assembled molecule     75,938,266    1               75,937,566       75,938,266    2             0
Chromosome X  All                    27,090,030    54              27,067,130       15,967,621    74            0
Chromosome X  Assembled molecule     15,967,621    1               15,946,721       15,967,621    70            0
Chromosome X  Unlocalized scaffolds  11,122,409    53              11,120,409       732,961       4             0
unplaced      Assembled molecule     14,085,941    100             14,083,441       299,075       5             0
Molecule          Total length  Scaffold count  Ungapped length  Scaffold N50  Spanned gaps  Unspanned gaps
Mitochondrion MT  15,347        1               15,347           15,347        0             0
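
The per-molecule figures can be cross-checked against the global statistics. The sketch below sums the values transcribed from the tables in this record and derives the gap bases as total length minus ungapped length. The names `MOLECULES` and `summarize` are illustrative, and the contig count rests on the assumption that each spanned gap separates exactly two contigs within a scaffold.

```python
# Values transcribed from this record:
# (total length, ungapped length, scaffold count, spanned gaps)
MOLECULES = {
    "Chromosome 2": (78_106_464, 78_099_064, 1, 19),
    "Chromosome 3": (75_938_266, 75_937_566, 1, 2),
    "Chromosome X": (27_090_030, 27_067_130, 54, 74),
    "unplaced": (14_085_941, 14_083_441, 100, 5),
    "Mitochondrion MT": (15_347, 15_347, 1, 0),
}


def summarize(molecules):
    """Aggregate per-molecule statistics into assembly-wide totals."""
    total = sum(m[0] for m in molecules.values())
    ungapped = sum(m[1] for m in molecules.values())
    scaffolds = sum(m[2] for m in molecules.values())
    spanned = sum(m[3] for m in molecules.values())
    return {
        "total_length": total,
        "ungapped_length": ungapped,
        "gap_bases": total - ungapped,
        "scaffolds": scaffolds,
        # Assumption: every spanned gap adds one contig to its scaffold.
        "contigs": scaffolds + spanned,
    }


for key, value in summarize(MOLECULES).items():
    print(f"{key}: {value:,}")
```

Under these assumptions the sums reproduce the reported total sequence length (195,236,048), total ungapped length (195,202,548), 157 scaffolds, and 257 contigs, with 33,500 bases in gaps.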