drMalSylv7.2

Organism name:
Malus sylvestris (European crab apple)
BioSample:
SAMEA9197672
BioProject:
PRJEB47504
Submitter:
WELLCOME SANGER INSTITUTE
Date:
2022/07/06
Release type:
minor
Assembly level:
Chromosome
Genome representation:
full
RefSeq category:
representative genome
GenBank assembly accession:
GCA_916048215.2 (replaced)
RefSeq assembly accession:
GCF_916048215.2 (latest)
RefSeq assembly and GenBank assembly identical:
no
  • Only in RefSeq: chromosomes MT and Pltd
  • Data displayed for RefSeq version
WGS Project:
CAJZHM02
Genome coverage:
25x

IDs: 12133841 [UID] 31799108 [GenBank] 35259038 [RefSeq]

There are 5 assemblies for this organism

Comment

The assembly drMalSylv7.2 is based on 25x PacBio data, 10X Genomics Chromium data, and Arima Hi-C data generated by the Darwin Tree of Life Project (https://www.darwintreeoflife.org/). The assembly process included the following sequence of steps: initial PacBio assembly generation ... with Hifiasm, retained haplotig separation with purge_dups, short-read polishing using FreeBayes-called variants from 10X Genomics Chromium reads aligned with LongRanger, and Hi-C based scaffolding with SALSA2. Finally, the assembly was analysed and manually improved using rapid curation.

Chromosome-scale scaffolds are named by synteny based on Malus domestica (apple) GCA_004115385.1. Shared sequences between chromosomes are visible in the Hi-C map, in agreement with the findings of https://doi.org/10.1038/ng.654 for Malus domestica. The Hi-C map provides evidence of inversions between haplotypes on chromosome 2 (24.82-26.19 Mb) and chromosome 11 (17.06-19.05 Mb). Several scaffolds could be localized to a chromosome but not placed within it; these have been labelled as unloc. Two of the largest unlocalised scaffolds, SUPER_4_unloc_1 and SUPER_14_unloc_1, appear to be larger haplotypes of two specific loci: chromosome 4 at 26.13-26.31 Mb and chromosome 14 at 782 kb, respectively. From the Hi-C map it appears that these loci currently represent the shorter haplotype. Because there is some uncertainty over these unloc scaffolds, they have been left in the primary assembly.

Global statistics

Total sequence length: 641,526,810
Total ungapped length: 641,487,110
Gaps between scaffolds: 0
Number of scaffolds: 34
Scaffold N50: 36,902,754
Scaffold L50: 8
Number of contigs: 135
Contig N50: 8,322,097
Contig L50: 25
Total number of chromosomes and plasmids: 19
Number of component sequences (WGS or clone): 34
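
The global statistics can be cross-checked with a little arithmetic. The sketch below is only a sanity check, not part of the NCBI record: it derives the number of gap bases from the gapped and ungapped totals, and estimates the number of gaps within scaffolds under the assumption that each scaffold is a chain of contigs separated by gaps (so a scaffold built from k contigs contains k - 1 gaps). The variable names are illustrative.

```python
# Values copied from the Global statistics list above.
total_length = 641_526_810
ungapped_length = 641_487_110
n_scaffolds = 34
n_contigs = 135

# Bases that fall inside gaps: gapped total minus ungapped total.
gap_bases = total_length - ungapped_length

# Assumption: every scaffold is a chain of contigs joined by gaps,
# so the number of gaps is (contigs - scaffolds).
n_gaps = n_contigs - n_scaffolds

# Average gap size implied by the two figures above.
mean_gap = gap_bases / n_gaps

print(f"gap bases: {gap_bases:,}")         # gap bases: 39,700
print(f"gaps within scaffolds: {n_gaps}")  # gaps within scaffolds: 101
print(f"mean gap length: {mean_gap:.1f}")  # mean gap length: 393.1
```

With "Gaps between scaffolds" reported as 0, all of these gaps lie within scaffolds.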

Global assembly definition

Assembly Unit Name
Primary Assembly
non-nuclear
Assembly Unit: Primary Assembly (GCF_916048214.1)
Molecule name  GenBank sequence       RefSeq sequence  Unlocalized sequences count
Chromosome 1   OU696503.1        =    NC_062260.1      0
Chromosome 2   OU696504.1        =    NC_062261.1      0
Chromosome 3   OU696505.1        =    NC_062262.1      0
Chromosome 4   OU696506.1        =    NC_062263.1      10
Chromosome 5   OU696507.1        =    NC_062264.1      0
Chromosome 6   OU696508.1        =    NC_062265.1      0
Chromosome 7   OU696509.1        =    NC_062266.1      0
Chromosome 8   OU696510.1        =    NC_062267.1      0
Chromosome 9   OU696511.1        =    NC_062268.1      0
Chromosome 10  OU696512.1        =    NC_062269.1      0
Chromosome 11  OU696513.1        =    NC_062270.1      0
Chromosome 12  OU696514.1        =    NC_062271.1      0
Chromosome 13  OU696515.1        =    NC_062272.1      2
Chromosome 14  OU696516.1        =    NC_062273.1      1
Chromosome 15  OU696517.1        =    NC_062274.1      0
Chromosome 16  OU696518.1        =    NC_062275.1      0
Chromosome 17  OU696519.1        =    NC_062276.1      0
unplaced       n/a               n/a  n/a              2

Assembly statistics

Molecule       Sequence Role          Total Length  Scaffold Count  Ungapped Length  Scaffold N50  Spanned Gaps  Unspanned Gaps
All            Assembled molecule     640,969,901   32              640,930,201      36,902,754    101           0
Chromosome 1   Assembled molecule     30,125,771    1               30,124,071       30,125,771    4             0
Chromosome 2   Assembled molecule     37,960,862    1               37,958,162       37,960,862    6             0
Chromosome 3   Assembled molecule     36,902,754    1               36,900,754       36,902,754    4             0
Chromosome 4   All                    31,934,890    11              31,932,690       31,139,927    5             0
Chromosome 4   Assembled molecule     31,139,927    1               31,137,727       31,139,927    5             0
Chromosome 4   Unlocalized scaffolds  794,963       10              794,963          436,959       0             0
Chromosome 5   Assembled molecule     47,127,301    1               47,124,501       47,127,301    8             0
Chromosome 6   Assembled molecule     34,960,253    1               34,958,653       34,960,253    5             0
Chromosome 7   Assembled molecule     35,530,347    1               35,528,247       35,530,347    6             0
Chromosome 8   Assembled molecule     30,374,751    1               30,373,851       30,374,751    3             0
Chromosome 9   Assembled molecule     35,179,981    1               35,176,781       35,179,981    10            0
Chromosome 10  Assembled molecule     43,197,675    1               43,194,975       43,197,675    6             0
Chromosome 11  Assembled molecule     41,347,971    1               41,343,471       41,347,971    9             0
Chromosome 12  Assembled molecule     32,552,200    1               32,549,800       32,552,200    6             0
Chromosome 13  All                    44,275,477    3               44,274,077       44,069,681    4             0
Chromosome 13  Assembled molecule     44,069,681    1               44,068,281       44,069,681    4             0
Chromosome 13  Unlocalized scaffolds  205,796       2               205,796          108,254       0             0
Chromosome 14  All                    30,544,686    2               30,542,786       30,380,474    5             0
Chromosome 14  Assembled molecule     30,380,474    1               30,378,574       30,380,474    5             0
Chromosome 14  Unlocalized scaffolds  164,212       1               164,212          164,212       0             0
Chromosome 15  Assembled molecule     54,898,662    1               54,896,162       54,898,662    5             0
Chromosome 16  Assembled molecule     40,115,383    1               40,112,583       40,115,383    8             0
Chromosome 17  Assembled molecule     33,873,494    1               33,871,194       33,873,494    7             0
unplaced       Assembled molecule     67,443        2               67,443           39,690        0             0

Molecule          Total Length
All               556,909
Mitochondrion MT  396,940
Chloroplast Pltd  159,969
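
The reported scaffold N50 and L50 can be reproduced from the figures in this record. The sketch below is illustrative and is not NCBI's own implementation; the function name `n50_l50` is hypothetical. It sorts scaffold lengths from longest to shortest and accumulates them until at least half of the total sequence length is covered. Only the 17 chromosome-scale scaffolds ("Assembled molecule" rows) are listed individually, so the reported total sequence length is passed in explicitly. This rests on the assumption, consistent with the tables above, that every unlocalized, unplaced, and organelle sequence is shorter than the shortest chromosome and therefore sorts after all of them.

```python
def n50_l50(lengths, total=None):
    """Return (N50, L50) for a collection of sequence lengths.

    N50 is the length of the sequence at which the running sum of
    lengths, taken from longest to shortest, first reaches half of
    `total`; L50 is how many sequences were needed to get there.
    """
    if total is None:
        total = sum(lengths)
    running = 0
    for count, length in enumerate(sorted(lengths, reverse=True), start=1):
        running += length
        if 2 * running >= total:
            return length, count
    raise ValueError("lengths cover less than half of the given total")


# "Assembled molecule" total lengths for chromosomes 1-17, from the table above.
chromosome_scaffolds = [
    30_125_771, 37_960_862, 36_902_754, 31_139_927, 47_127_301,
    34_960_253, 35_530_347, 30_374_751, 35_179_981, 43_197_675,
    41_347_971, 32_552_200, 44_069_681, 30_380_474, 54_898_662,
    40_115_383, 33_873_494,
]

# Total sequence length from the Global statistics (includes unlocalized,
# unplaced, and non-nuclear sequences that are not listed individually).
n50, l50 = n50_l50(chromosome_scaffolds, total=641_526_810)
print(f"Scaffold N50: {n50:,}")  # Scaffold N50: 36,902,754
print(f"Scaffold L50: {l50}")    # Scaffold L50: 8
```

The result matches the reported values: the eight longest scaffolds (chromosomes 15, 5, 13, 10, 11, 16, 2, and 3) together cover just over half of the assembly, and the eighth, chromosome 3, sets the N50.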