NCBI Dendroctonus ponderosae Annotation Release 101

The RefSeq genome records for Dendroctonus ponderosae were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
BUSCO results: Annotation completeness assessed with BUSCO
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Similarity of current and previous assembly: The similarity of the current and previous assembly
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Dendroctonus ponderosae Annotation Release 101.

Annotation release ID: 101
Date of Entrez queries for transcripts and proteins: Jun 8 2022
Date of submission of annotation to the public databases: Jun 12 2022
Software version: 9.0

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
Dpon_F_20191213v2	GCF_020466585.1	Natural Resources Canada	10-13-2021	Reference	unplaced scaffolds

Gene and feature statistics

Counts and lengths of annotated features are provided below for each assembly.

Feature counts

Feature	Dpon_F_20191213v2
Genes and pseudogenes	15,054
protein-coding	12,777
non-coding	2,237
Transcribed pseudogenes	2
Non-transcribed pseudogenes	38
genes with variants	4,726
Immunoglobulin/T-cell receptor gene segments	0
other	0
mRNAs	23,095
fully-supported	22,374
with > 5% ab initio	334
partial	1,805
with filled gap(s)	1,631
known RefSeq (NM_)	0
model RefSeq (XM_)	23,095
non-coding RNAs	3,068
fully-supported	2,793
with > 5% ab initio	0
partial	7
with filled gap(s)	7
known RefSeq (NR_)	0
model RefSeq (XR_)	2,868
pseudo transcripts	2
fully-supported	2
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	2
CDSs	23,095
fully-supported	22,374
with > 5% ab initio	364
partial	1,522
with major correction(s)	474
known RefSeq (NP_)	0
model RefSeq (XP_)	23,095
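
The gene rows of the feature counts table can be cross-checked with simple arithmetic. The sketch below copies the four gene category rows from the table and confirms that they add up to the reported "Genes and pseudogenes" total. The dictionary and function names are illustrative only and are not part of any NCBI tool.

```python
# Gene category rows copied from the feature counts table for Dpon_F_20191213v2.
# The names below are illustrative, not part of any NCBI software.
GENE_COUNTS = {
    "protein-coding": 12_777,
    "non-coding": 2_237,
    "Transcribed pseudogenes": 2,
    "Non-transcribed pseudogenes": 38,
}
REPORTED_TOTAL = 15_054  # "Genes and pseudogenes" row


def category_sum(counts):
    """Add up the per-category gene counts."""
    return sum(counts.values())


def pseudogene_count(counts):
    """Add up the rows whose label mentions pseudogenes."""
    return sum(n for label, n in counts.items() if "pseudogene" in label.lower())


if __name__ == "__main__":
    total = category_sum(GENE_COUNTS)
    print(f"sum of categories: {total:,} (reported: {REPORTED_TOTAL:,})")
    print(f"genes excluding pseudogenes: {total - pseudogene_count(GENE_COUNTS):,}")
```

The second figure is the gene count once the 40 pseudogenes are set aside, which is the number used in the feature length table.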

Detailed reports

The counts below do not include pseudogenes.

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	15,014	11,037	3,481	68	904,557
All transcripts	26,163	2,615	1,977	68	71,086
mRNA	23,095	2,806	2,131	129	71,086
misc_RNA	561	2,458	2,001	148	11,500
tRNA	200	74	73	71	84
lncRNA	2,239	984	699	92	28,664
snoRNA	26	107	83	68	210
snRNA	29	135	123	86	192
rRNA	13	494	120	120	3,291
Single-exon transcripts	620	1,445	1,232	129	11,223
coding transcripts (NM_/XM_ )	620	1,445	1,232	129	11,223
CDSs	23,095	1,925	1,410	111	70,074
Exons	112,833	313	198	2	26,763
in coding transcripts (NM_/XM_ )	106,865	308	197	2	18,567
in non-coding transcripts (NR_/XR_ )	8,328	361	213	2	26,763
Introns	95,147	1,762	86	30	535,680
in coding transcripts (NM_/XM_ )	91,575	1,725	83	30	535,680
in non-coding transcripts (NR_/XR_ )	5,841	2,376	151	32	347,616
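
As a consistency check on the table above, the "All transcripts" count should equal the sum of the per-type transcript rows. The following minimal sketch copies those rows and also reports each type's share of the total. The names are hypothetical and chosen for this example.

```python
# Per-type transcript counts copied from the feature lengths table.
TRANSCRIPTS_BY_TYPE = {
    "mRNA": 23_095,
    "misc_RNA": 561,
    "tRNA": 200,
    "lncRNA": 2_239,
    "snoRNA": 26,
    "snRNA": 29,
    "rRNA": 13,
}
ALL_TRANSCRIPTS = 26_163  # "All transcripts" row


def type_shares(counts):
    """Return each transcript type's share of the total, in percent."""
    total = sum(counts.values())
    return {name: 100.0 * n / total for name, n in counts.items()}


if __name__ == "__main__":
    total = sum(TRANSCRIPTS_BY_TYPE.values())
    print(f"sum of types: {total:,} (reported: {ALL_TRANSCRIPTS:,})")
    for name, share in type_shares(TRANSCRIPTS_BY_TYPE).items():
        print(f"{name}: {share:.1f}%")
```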

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	1.75	1	1	39
Number of exons per transcript	8.55	7	1	91

BUSCO analysis of gene annotation

BUSCO v4.1.4 was run in "protein" mode on the annotated gene set, using the longest protein of each gene and the endopterygota_odb10 lineage dataset. Results are reported for the gene set from the primary assembly unit and are presented in BUSCO notation.

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the Drosophila melanogaster known RefSeq proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 12,777 coding genes, 10,058 (78.7%) had a protein with an alignment covering 50% or more of the query, and 3,306 (25.9%) had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.
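
The two coverage definitions can be illustrated with a short sketch. It assumes a single contiguous alignment described by 1-based, inclusive start and end coordinates on each sequence. The report does not say how gaps or multiple alignment segments are counted, so this is a simplification, and the function name and example numbers are made up for illustration.

```python
def coverage_percent(aligned_start, aligned_end, sequence_length):
    """Percent of a sequence spanned by one alignment.

    Coordinates are 1-based and inclusive. This assumes a single
    contiguous span, which is a simplification for illustration.
    """
    if sequence_length <= 0:
        raise ValueError("sequence_length must be positive")
    if not 1 <= aligned_start <= aligned_end <= sequence_length:
        raise ValueError("alignment coordinates fall outside the sequence")
    return 100.0 * (aligned_end - aligned_start + 1) / sequence_length


if __name__ == "__main__":
    # Hypothetical alignment: residues 21-380 of a 400-residue annotated
    # protein (query) aligned to residues 1-300 of a 600-residue target.
    query_cov = coverage_percent(21, 380, 400)
    target_cov = coverage_percent(1, 300, 600)
    print(f"query coverage: {query_cov:.1f}%")
    print(f"target coverage: {target_cov:.1f}%")
```

In this made-up case the query coverage is high while the target coverage is only half, which is why the two values are reported separately.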

[Figure: cumulative graph of the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline are included. Query: annotated proteins. Target: Drosophila melanogaster known RefSeq proteins.]

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker (if calculated), for each assembly. RepeatMasker results are only calculated for organisms with complete Dfam HMM model collections.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with WindowMasker
Dpon_F_20191213v2	GCF_020466585.1	26.30%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign, minimap2, or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species GenBank	2,442	2,380 (97.46%)	2,189 (89.64%)	99.56%	97.90%
Same-species EST	185,437	172,884 (93.23%)	166,385 (89.73%)	99.50%	99.30%
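
The percentages in the table above are each count divided by the number of sequences retrieved from Entrez. A minimal sketch that recomputes them from the counts follows. The data structure and function name are illustrative assumptions.

```python
# (retrieved, aligned by Splign, passed to Gnomon), copied from the table above.
TRANSCRIPT_SOURCES = {
    "Same-species GenBank": (2_442, 2_380, 2_189),
    "Same-species EST": (185_437, 172_884, 166_385),
}


def percent_of_retrieved(count, retrieved):
    """Express a count as a percentage of retrieved sequences, to 2 decimals."""
    return round(100.0 * count / retrieved, 2)


if __name__ == "__main__":
    for source, (retrieved, aligned, passed) in TRANSCRIPT_SOURCES.items():
        print(
            f"{source}: aligned {percent_of_retrieved(aligned, retrieved):.2f}%, "
            f"passed to Gnomon {percent_of_retrieved(passed, retrieved):.2f}%"
        )
```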

RNA-Seq alignments

The following RNA-Seq reads from the Sequence Read Archive were also used for gene prediction:

Alignment statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Publication	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	NA	Aggregate of all aligned samples	6,527,973,453	71%	15%	114,010
SAMN00847512	22516182	Adult antennae (Dendroctonus ponderosae, SAMN00847512)	1,125,738	64%	68%	50,367
SAMN00847513	22516182	Midguts and fat bodies from juvenile hormone-treated adults (Dendroctonus ponderosae, SAMN00847513)	964,388	41%	69%	31,526
SAMN00847514	22516182	Midgut and fat bodies of phloem-fed adults (Dendroctonus ponderosae, SAMN00847514)	1,132,989	53%	69%	37,313
SAMN01999089	NA	adult male control (Dendroctonus ponderosae, male, SAMN01999089)	61,406,588	69%	17%	93,580
SAMN02010585	NA	adult female control (Dendroctonus ponderosae, female, SAMN02010585)	73,458,152	70%	17%	89,397
SAMN02010586	NA	adult male treatment (Dendroctonus ponderosae, male, SAMN02010586)	70,120,464	71%	17%	92,639
SAMN02010587	NA	adult female treatment (Dendroctonus ponderosae, female, SAMN02010587)	74,957,410	70%	16%	88,189
SAMN03256981	26792242	fat body, acetone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256981)	11,161,152	36%	20%	53,295
SAMN03256982	26792242	fat body, acetone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256982)	17,658,950	65%	18%	59,582
SAMN03256983	26792242	fat body, acetone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256983)	34,260,822	63%	10%	61,017
SAMN03256984	26792242	fat body, acetone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256984)	153,560,980	55%	9%	69,839
SAMN03256985	26792242	fat body, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256985)	11,518,038	42%	19%	52,666
SAMN03256986	26792242	fat body, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256986)	30,627,212	64%	18%	63,382
SAMN03256987	26792242	fat body, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256987)	22,438,006	61%	10%	54,682
SAMN03256988	26792242	fat body, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256988)	119,711,400	60%	8%	65,179
SAMN03256989	26792242	anterior midgut, acetone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256989)	19,503,594	76%	10%	55,183
SAMN03256990	26792242	anterior midgut, acetone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256990)	18,588,544	81%	10%	53,479
SAMN03256991	26792242	anterior midgut, acetone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256991)	14,919,902	82%	10%	48,639
SAMN03256992	26792242	anterior midgut, acetone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256992)	104,738,122	68%	9%	69,395
SAMN03256993	26792242	anterior midgut, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256993)	20,385,250	79%	10%	54,271
SAMN03256994	26792242	anterior midgut, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256994)	27,877,218	80%	10%	56,869
SAMN03256995	26792242	anterior midgut, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256995)	17,140,094	81%	10%	51,929
SAMN03256996	26792242	anterior midgut, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, female, SAMN03256996)	110,505,342	79%	9%	68,255
SAMN03256997	26792242	fat body, acetone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03256997)	21,122,818	66%	19%	64,631
SAMN03256998	26792242	fat body, acetone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03256998)	32,942,008	62%	19%	66,607
SAMN03256999	26792242	fat body, acetone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03256999)	27,339,696	58%	10%	55,875
SAMN03257000	26792242	fat body, acetone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03257000)	91,538,948	63%	9%	71,294
SAMN03257001	26792242	fat body, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03257001)	16,209,236	42%	18%	55,528
SAMN03257002	26792242	fat body, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03257002)	30,953,022	61%	18%	66,273
SAMN03257003	26792242	fat body, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03257003)	20,062,770	49%	10%	50,358
SAMN03257004	26792242	fat body, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03257004)	128,042,068	65%	9%	71,130
SAMN03257005	26792242	anterior midgut, acetone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03257005)	27,515,114	77%	10%	60,948
SAMN03257006	26792242	anterior midgut, acetone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03257006)	22,195,402	77%	10%	57,915
SAMN03257007	26792242	anterior midgut, acetone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03257007)	15,487,220	81%	10%	52,255
SAMN03257008	26792242	anterior midgut, acetone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03257008)	101,446,724	80%	9%	71,045
SAMN03257009	26792242	anterior midgut, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03257009)	24,552,470	76%	9%	57,241
SAMN03257010	26792242	anterior midgut, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03257010)	19,224,904	79%	10%	53,144
SAMN03257011	26792242	anterior midgut, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03257011)	18,990,828	80%	10%	53,691
SAMN03257012	26792242	anterior midgut, juvenile hormone treatment (Dendroctonus ponderosae, emerged adult, male, SAMN03257012)	111,664,138	76%	9%	71,613
SAMN03702321	NA	larva (Dendroctonus ponderosae, SAMN03702321)	117,022,808	68%	9%	84,006
SAMN03702322	NA	larva (Dendroctonus ponderosae, SAMN03702322)	157,292,540	73%	9%	86,349
SAMN03702323	NA	larva (Dendroctonus ponderosae, SAMN03702323)	126,894,078	73%	17%	92,375
SAMN03702324	NA	larva (Dendroctonus ponderosae, SAMN03702324)	105,036,914	59%	17%	88,497
SAMN03702325	NA	larva (Dendroctonus ponderosae, SAMN03702325)	121,486,548	80%	9%	86,136
SAMN03702326	NA	larva (Dendroctonus ponderosae, SAMN03702326)	98,627,024	82%	10%	82,825
SAMN03703203	NA	whole insect (Dendroctonus ponderosae, SAMN03703203)	84,240,450	60%	16%	83,537
SAMN03703206	NA	whole insect (Dendroctonus ponderosae, SAMN03703206)	127,246,356	82%	18%	96,625
SAMN03703207	NA	whole insect (Dendroctonus ponderosae, SAMN03703207)	109,446,478	82%	9%	88,165
SAMN03703208	NA	whole insect (Dendroctonus ponderosae, SAMN03703208)	110,786,774	80%	9%	90,387
SAMN03703209	NA	whole insect (Dendroctonus ponderosae, SAMN03703209)	105,803,314	78%	17%	90,153
SAMN03703210	NA	whole insect (Dendroctonus ponderosae, SAMN03703210)	121,147,690	82%	18%	94,583
SAMN03703211	NA	whole insect (Dendroctonus ponderosae, SAMN03703211)	113,200,186	80%	9%	87,737
SAMN03703212	NA	whole insect (Dendroctonus ponderosae, SAMN03703212)	110,128,076	67%	8%	88,585
SAMN03703213	NA	whole insect (Dendroctonus ponderosae, SAMN03703213)	119,850,720	83%	16%	92,109
SAMN03703214	NA	whole insect (Dendroctonus ponderosae, SAMN03703214)	96,088,744	81%	17%	94,787
SAMN04595104	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, female, SAMN04595104)	65,739,618	73%	19%	75,694
SAMN04595105	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, female, SAMN04595105)	67,809,660	62%	18%	71,691
SAMN04595106	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, female, SAMN04595106)	49,918,424	71%	18%	69,211
SAMN04595107	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, female, SAMN04595107)	65,656,814	61%	18%	71,159
SAMN04595108	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, male, SAMN04595108)	67,679,060	65%	18%	73,667
SAMN04595109	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, male, SAMN04595109)	61,232,086	72%	18%	71,038
SAMN04595110	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, male, SAMN04595110)	59,628,414	66%	18%	68,849
SAMN04595111	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, male, SAMN04595111)	45,641,120	71%	19%	67,954
SAMN04595112	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, female, SAMN04595112)	40,844,362	72%	17%	75,168
SAMN04595113	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, female, SAMN04595113)	42,317,030	74%	17%	77,610
SAMN04595114	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, female, SAMN04595114)	41,416,422	72%	18%	71,488
SAMN04595115	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, female, SAMN04595115)	37,300,162	70%	17%	69,220
SAMN04595116	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, male, SAMN04595116)	49,717,508	69%	18%	79,709
SAMN04595117	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, male, SAMN04595117)	57,185,100	70%	18%	80,670
SAMN04595118	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, male, SAMN04595118)	43,539,264	71%	18%	72,079
SAMN04595119	NA	midgut/fat body (Dendroctonus ponderosae, emerged adult, male, SAMN04595119)	53,085,606	70%	18%	79,762
SAMN07839741	NA	head (Dendroctonus ponderosae, male, SAMN07839741)	106,209,744	48%	23%	71,746
SAMN07839742	NA	head (Dendroctonus ponderosae, female, SAMN07839742)	121,999,752	54%	24%	80,747
SAMN07839743	NA	head (Dendroctonus ponderosae, female, SAMN07839743)	104,480,944	58%	25%	81,711
SAMN07839744	NA	ovaries (Dendroctonus ponderosae, female, SAMN07839744)	136,382,448	62%	24%	77,287
SAMN07839745	NA	testes (Dendroctonus ponderosae, male, SAMN07839745)	95,290,172	61%	23%	88,053
SAMN07839746	NA	ovaries (Dendroctonus ponderosae, female, SAMN07839746)	28,442,992	69%	23%	73,010
SAMN07839747	NA	ovaries (Dendroctonus ponderosae, female, SAMN07839747)	31,179,924	70%	24%	75,955
SAMN07839748	NA	ovaries (Dendroctonus ponderosae, female, SAMN07839748)	17,744,610	66%	24%	69,341
SAMN07839749	NA	ovaries (Dendroctonus ponderosae, female, SAMN07839749)	32,181,102	64%	23%	72,028
SAMN07839750	NA	testes (Dendroctonus ponderosae, male, SAMN07839750)	30,744,096	64%	23%	76,761
SAMN07839751	NA	testes (Dendroctonus ponderosae, male, SAMN07839751)	14,468,220	59%	25%	67,919
SAMN07839752	NA	testes (Dendroctonus ponderosae, male, SAMN07839752)	17,415,174	66%	23%	71,581
SAMN07839753	NA	testes (Dendroctonus ponderosae, male, SAMN07839753)	23,329,606	66%	23%	76,332
SAMN08706805	NA	pharate pupa (prepupa), head (Dendroctonus ponderosae, pooled male and female, SAMN08706805)	187,703,656	60%	21%	94,662
SAMN16231675	NA	Full body (Dendroctonus ponderosae, female, SAMN16231675)	25,754,235	80%	17%	80,769
SAMN16231676	NA	Full body (Dendroctonus ponderosae, female, SAMN16231676)	49,329,583	47%	14%	79,985
SAMN16231677	NA	Full body (Dendroctonus ponderosae, female, SAMN16231677)	108,821,454	81%	17%	92,837
SAMN16231678	NA	Full body (Dendroctonus ponderosae, female, SAMN16231678)	96,514,255	80%	17%	90,961
SAMN16231679	NA	Full body (Dendroctonus ponderosae, female, SAMN16231679)	77,227,338	82%	18%	90,134
SAMN16231680	NA	Full body (Dendroctonus ponderosae, female, SAMN16231680)	73,582,124	81%	18%	89,092
SAMN16231681	NA	Full body (Dendroctonus ponderosae, female, SAMN16231681)	65,201,267	80%	18%	86,367
SAMN16231682	NA	Full body (Dendroctonus ponderosae, female, SAMN16231682)	69,685,842	83%	18%	86,920
SAMN16231683	NA	Full body (Dendroctonus ponderosae, female, SAMN16231683)	45,755,525	83%	18%	84,612
SAMN16231684	NA	Full body (Dendroctonus ponderosae, female, SAMN16231684)	25,126,794	78%	18%	78,320
SAMN16231685	NA	Full body (Dendroctonus ponderosae, female, SAMN16231685)	73,958,213	82%	18%	88,222
SAMN16231686	NA	Full body (Dendroctonus ponderosae, female, SAMN16231686)	44,266,005	82%	18%	83,342
SAMN16231687	NA	Full body (Dendroctonus ponderosae, female, SAMN16231687)	67,888,529	80%	18%	88,678
SAMN16231688	NA	Full body (Dendroctonus ponderosae, female, SAMN16231688)	62,072,490	83%	18%	88,176
SAMN16231689	NA	Full body (Dendroctonus ponderosae, female, SAMN16231689)	77,965,562	74%	18%	89,489
SAMN16231690	NA	Full body (Dendroctonus ponderosae, female, SAMN16231690)	78,620,099	80%	17%	90,594
SAMN16231691	NA	Full body (Dendroctonus ponderosae, female, SAMN16231691)	46,820,390	77%	17%	86,287
SAMN16231692	NA	Full body (Dendroctonus ponderosae, female, SAMN16231692)	91,754,237	76%	17%	90,088

Alignment statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
SRR449538	SRX132062	SRP011990	SAMN00847512	75,425	66%	66%
SRR449539	SRX132062	SRP011990	SAMN00847512	525,846	64%	68%
SRR449540	SRX132062	SRP011990	SAMN00847512	524,467	64%	68%
SRR449542	SRX132064	SRP011990	SAMN00847513	79,520	38%	67%
SRR449543	SRX132064	SRP011990	SAMN00847513	446,675	43%	70%
SRR449544	SRX132064	SRP011990	SAMN00847513	438,193	41%	68%
SRR449545	SRX132065	SRP011990	SAMN00847514	74,004	47%	66%
SRR449546	SRX132065	SRP011990	SAMN00847514	522,915	54%	69%
SRR449547	SRX132065	SRP011990	SAMN00847514	536,070	54%	69%
SRR867432	SRX278476	SRP022859	SAMN01999089	17,832,774	66%	10%
SRR867433	SRX278477	SRP022859	SAMN01999089	12,183,782	71%	20%
SRR867434	SRX278478	SRP022859	SAMN01999089	15,136,996	72%	20%
SRR867436	SRX278479	SRP022859	SAMN01999089	16,253,036	70%	19%
SRR867160	SRX278464	SRP022859	SAMN02010585	18,614,028	73%	9%
SRR867161	SRX278467	SRP022859	SAMN02010585	18,125,008	73%	19%
SRR867162	SRX278468	SRP022859	SAMN02010585	16,883,616	68%	20%
SRR867176	SRX278469	SRP022859	SAMN02010585	19,835,500	66%	20%
SRR867438	SRX278480	SRP022859	SAMN02010586	19,508,508	72%	10%
SRR867439	SRX278481	SRP022859	SAMN02010586	14,638,068	74%	20%
SRR867440	SRX278482	SRP022859	SAMN02010586	17,109,650	67%	20%
SRR867441	SRX278483	SRP022859	SAMN02010586	18,864,238	71%	20%
SRR867179	SRX278470	SRP022859	SAMN02010587	25,226,386	65%	10%
SRR867183	SRX278471	SRP022859	SAMN02010587	16,780,682	72%	18%
SRR867186	SRX278472	SRP022859	SAMN02010587	13,266,286	77%	18%
SRR867188	SRX278473	SRP022859	SAMN02010587	19,684,056	69%	20%
SRR1702878	SRX803899	SRP051036	SAMN03256981	11,161,152	36%	20%
SRR1702890	SRX803902	SRP051036	SAMN03256982	17,658,950	65%	18%
SRR1702891	SRX803905	SRP051036	SAMN03256983	34,260,822	63%	10%
SRR1702894	SRX803908	SRP051036	SAMN03256984	153,560,980	55%	9%
SRR1702913	SRX803918	SRP051036	SAMN03256985	11,518,038	42%	19%
SRR1702916	SRX803920	SRP051036	SAMN03256986	30,627,212	64%	18%
SRR1702919	SRX803921	SRP051036	SAMN03256987	22,438,006	61%	10%
SRR1702923	SRX803922	SRP051036	SAMN03256988	119,711,400	60%	8%
SRR1702898	SRX803911	SRP051036	SAMN03256989	19,503,594	76%	10%
SRR1702901	SRX803914	SRP051036	SAMN03256990	18,588,544	81%	10%
SRR1702904	SRX803915	SRP051036	SAMN03256991	14,919,902	82%	10%
SRR1702910	SRX803916	SRP051036	SAMN03256992	104,738,122	68%	9%
SRR1702925	SRX803924	SRP051036	SAMN03256993	20,385,250	79%	10%
SRR1702927	SRX803925	SRP051036	SAMN03256994	27,877,218	80%	10%
SRR1702929	SRX803926	SRP051036	SAMN03256995	17,140,094	81%	10%
SRR1702930	SRX803938	SRP051036	SAMN03256996	110,505,342	79%	9%
SRR1702932	SRX803990	SRP051036	SAMN03256997	21,122,818	66%	19%
SRR1702933	SRX803991	SRP051036	SAMN03256998	32,942,008	62%	19%
SRR1702934	SRX803992	SRP051036	SAMN03256999	27,339,696	58%	10%
SRR1702950	SRX803993	SRP051036	SAMN03257000	91,538,948	63%	9%
SRR1702992	SRX803998	SRP051036	SAMN03257001	16,209,236	42%	18%
SRR1703009	SRX803999	SRP051036	SAMN03257002	30,953,022	61%	18%
SRR1703010	SRX804001	SRP051036	SAMN03257003	20,062,770	49%	10%
SRR1703012	SRX804002	SRP051036	SAMN03257004	128,042,068	65%	9%
SRR1702966	SRX803994	SRP051036	SAMN03257005	27,515,114	77%	10%
SRR1702979	SRX803995	SRP051036	SAMN03257006	22,195,402	77%	10%
SRR1702987	SRX803996	SRP051036	SAMN03257007	15,487,220	81%	10%
SRR1702988	SRX803997	SRP051036	SAMN03257008	101,446,724	80%	9%
SRR1703014	SRX804004	SRP051036	SAMN03257009	24,552,470	76%	9%
SRR1703016	SRX804005	SRP051036	SAMN03257010	19,224,904	79%	10%
SRR1703018	SRX804006	SRP051036	SAMN03257011	18,990,828	80%	10%
SRR1703019	SRX804007	SRP051036	SAMN03257012	111,664,138	76%	9%
SRR2044895	SRX1043704	SRP058540	SAMN03702321	117,022,808	68%	9%
SRR2044896	SRX1043705	SRP058540	SAMN03702322	157,292,540	73%	9%
SRR2044897	SRX1043706	SRP058540	SAMN03702323	126,894,078	73%	17%
SRR2044898	SRX1043707	SRP058540	SAMN03702324	105,036,914	59%	17%
SRR2044899	SRX1043708	SRP058540	SAMN03702325	121,486,548	80%	9%
SRR2044900	SRX1043709	SRP058540	SAMN03702326	98,627,024	82%	10%
SRR2044901	SRX1043710	SRP058540	SAMN03703203	84,240,450	60%	16%
SRR2044902	SRX1043711	SRP058540	SAMN03703206	127,246,356	82%	18%
SRR2044903	SRX1043712	SRP058540	SAMN03703207	109,446,478	82%	9%
SRR2044904	SRX1043713	SRP058540	SAMN03703208	110,786,774	80%	9%
SRR2044905	SRX1043714	SRP058540	SAMN03703209	105,803,314	78%	17%
SRR2044906	SRX1043715	SRP058540	SAMN03703210	121,147,690	82%	18%
SRR2044907	SRX1043716	SRP058540	SAMN03703211	113,200,186	80%	9%
SRR2044908	SRX1043717	SRP058540	SAMN03703212	110,128,076	67%	8%
SRR2044909	SRX1043718	SRP058540	SAMN03703213	119,850,720	83%	16%
SRR2044910	SRX1043719	SRP058540	SAMN03703214	96,088,744	81%	17%
SRR3323584	SRX1675474	SRP072778	SAMN04595104	65,739,618	73%	19%
SRR3340435	SRX1683565	SRP072778	SAMN04595105	67,809,660	62%	18%
SRR3340436	SRX1683566	SRP072778	SAMN04595106	49,918,424	71%	18%
SRR3340437	SRX1683567	SRP072778	SAMN04595107	65,656,814	61%	18%
SRR3340438	SRX1683568	SRP072778	SAMN04595108	67,679,060	65%	18%
SRR3340439	SRX1683569	SRP072778	SAMN04595109	61,232,086	72%	18%
SRR3340440	SRX1683571	SRP072778	SAMN04595110	59,628,414	66%	18%
SRR3340441	SRX1683572	SRP072778	SAMN04595111	45,641,120	71%	19%
SRR3342468	SRX1684679	SRP072778	SAMN04595112	40,844,362	72%	17%
SRR3342469	SRX1684680	SRP072778	SAMN04595113	42,317,030	74%	17%
SRR3342470	SRX1684681	SRP072778	SAMN04595114	41,416,422	72%	18%
SRR3342471	SRX1684682	SRP072778	SAMN04595115	37,300,162	70%	17%
SRR3342472	SRX1684683	SRP072778	SAMN04595116	49,717,508	69%	18%
SRR3342473	SRX1684684	SRP072778	SAMN04595117	57,185,100	70%	18%
SRR3342474	SRX1684685	SRP072778	SAMN04595118	43,539,264	71%	18%
SRR3342475	SRX1684686	SRP072778	SAMN04595119	53,085,606	70%	18%
SRR6279173	SRX3381425	SRP124743	SAMN07839741	106,209,744	48%	23%
SRR6279159	SRX3381439	SRP124743	SAMN07839742	121,999,752	54%	24%
SRR6279160	SRX3381438	SRP124743	SAMN07839743	104,480,944	58%	25%
SRR6279157	SRX3381441	SRP124743	SAMN07839744	136,382,448	62%	24%
SRR6279158	SRX3381440	SRP124743	SAMN07839745	95,290,172	61%	23%
SRR6279155	SRX3381443	SRP124743	SAMN07839746	28,442,992	69%	23%
SRR6279156	SRX3381442	SRP124743	SAMN07839747	31,179,924	70%	24%
SRR6279153	SRX3381445	SRP124743	SAMN07839748	17,744,610	66%	24%
SRR6279154	SRX3381444	SRP124743	SAMN07839749	32,181,102	64%	23%
SRR6279151	SRX3381447	SRP124743	SAMN07839750	30,744,096	64%	23%
SRR6279152	SRX3381446	SRP124743	SAMN07839751	14,468,220	59%	25%
SRR6279172	SRX3381426	SRP124743	SAMN07839752	17,415,174	66%	23%
SRR6279171	SRX3381427	SRP124743	SAMN07839753	23,329,606	66%	23%
SRR6830978	SRX3787060	SRP135495	SAMN08706805	187,703,656	60%	21%
SRR12682708	SRX9162735	SRP284250	SAMN16231675	25,754,235	80%	17%
SRR12682707	SRX9162736	SRP284250	SAMN16231676	49,329,583	47%	14%
SRR12682698	SRX9162745	SRP284250	SAMN16231677	108,821,454	81%	17%
SRR12682697	SRX9162746	SRP284250	SAMN16231678	96,514,255	80%	17%
SRR12682696	SRX9162747	SRP284250	SAMN16231679	77,227,338	82%	18%
SRR12682695	SRX9162748	SRP284250	SAMN16231680	73,582,124	81%	18%
SRR12682694	SRX9162749	SRP284250	SAMN16231681	65,201,267	80%	18%
SRR12682693	SRX9162750	SRP284250	SAMN16231682	69,685,842	83%	18%
SRR12682692	SRX9162751	SRP284250	SAMN16231683	45,755,525	83%	18%
SRR12682691	SRX9162752	SRP284250	SAMN16231684	25,126,794	78%	18%
SRR12682706	SRX9162737	SRP284250	SAMN16231685	73,958,213	82%	18%
SRR12682705	SRX9162738	SRP284250	SAMN16231686	44,266,005	82%	18%
SRR12682704	SRX9162739	SRP284250	SAMN16231687	67,888,529	80%	18%
SRR12682703	SRX9162740	SRP284250	SAMN16231688	62,072,490	83%	18%
SRR12682702	SRX9162741	SRP284250	SAMN16231689	77,965,562	74%	18%
SRR12682701	SRX9162742	SRP284250	SAMN16231690	78,620,099	80%	17%
SRR12682700	SRX9162743	SRP284250	SAMN16231691	46,820,390	77%	17%
SRR12682699	SRX9162744	SRP284250	SAMN16231692	91,754,237	76%	17%
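
The per-run table relates to the per-sample table by summation: the read count of a sample is the sum of the read counts of its runs. The sketch below parses a few tab-separated rows copied from the run table and totals the reads per sample. The parsing code is an illustrative assumption about how one might process such rows, not something the pipeline provides.

```python
from collections import defaultdict

# A few rows copied from the per-run table (tab-separated columns:
# run, experiment, project, sample, reads, % aligned, % with introns).
RUN_ROWS = """\
SRR449538\tSRX132062\tSRP011990\tSAMN00847512\t75,425\t66%\t66%
SRR449539\tSRX132062\tSRP011990\tSAMN00847512\t525,846\t64%\t68%
SRR449540\tSRX132062\tSRP011990\tSAMN00847512\t524,467\t64%\t68%
SRR867432\tSRX278476\tSRP022859\tSAMN01999089\t17,832,774\t66%\t10%
SRR867433\tSRX278477\tSRP022859\tSAMN01999089\t12,183,782\t71%\t20%
SRR867434\tSRX278478\tSRP022859\tSAMN01999089\t15,136,996\t72%\t20%
SRR867436\tSRX278479\tSRP022859\tSAMN01999089\t16,253,036\t70%\t19%
"""


def reads_per_sample(text):
    """Sum the read counts of all runs belonging to each sample."""
    totals = defaultdict(int)
    for line in text.strip().splitlines():
        _run, _experiment, _project, sample, reads, _aligned, _introns = line.split("\t")
        totals[sample] += int(reads.replace(",", ""))
    return dict(totals)


if __name__ == "__main__":
    for sample, total in reads_per_sample(RUN_ROWS).items():
        print(f"{sample}: {total:,} reads")
```

For these two samples the totals agree with the "Number of reads" values in the per-sample table (1,125,738 and 61,406,588).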

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Nicrophorus vespilloides high-quality model RefSeq (XP_)	6,488	6,248 (96.30%)	6,248 (96.30%)	65.30%	71.46%
Insecta GenBank	114,426	82,804 (72.36%)	82,804 (72.36%)	65.73%	62.87%
Acyrthosiphon pisum known RefSeq (NP_)	1,819	1,439 (79.11%)	1,439 (79.11%)	63.46%	64.19%
Tribolium castaneum high-quality model RefSeq (XP_)	7,031	6,760 (96.15%)	6,760 (96.15%)	66.58%	73.31%
Tribolium castaneum known RefSeq (NP_)	627	584 (93.14%)	584 (93.14%)	68.03%	66.69%
Drosophila melanogaster known RefSeq (NP_)	30,157	20,807 (69.00%)	20,807 (69.00%)	62.15%	53.09%
Nasonia vitripennis known RefSeq (NP_)	1,101	710 (64.49%)	710 (64.49%)	62.44%	50.57%
Apis mellifera known RefSeq (NP_)	528	401 (75.95%)	401 (75.95%)	64.59%	61.27%
Same-species GenBank	2,395	2,378 (99.29%)	2,378 (99.29%)	90.23%	93.68%

Assembly-assembly alignments of current to previous assembly

When the assembly changes between two rounds of annotation, genes in the current and the previous annotation are mapped to each other using the genomic alignments of the current assembly to the previous assembly so that gene identifiers can be preserved. The success of the remapping depends largely on how well the two assembly versions align to each other.

Below are the percent coverage of one assembly by the other and the average percent identity of the alignments. The 'First pass' alignments are reciprocal best hits, while the 'Total' alignments also include 'Second pass' or non-reciprocal best alignments. For more information about the assembly-assembly alignment process, please visit the NCBI Genome Remapping Service page.

	First Pass	Total
Dpon_F_20191213v2 (Current) coverage	80.26%	89.04%
DendPond_male_1.0 (Previous) coverage	85.05%	92.47%
Percent identity	87.01%	88.57%

Comparison of the current and previous annotations

The annotation produced for this release (101) was compared to the annotation in the previous release (100) for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides links to the details of the comparison in tabular format and in a Genome Workbench project.

	Dpon_F_20191213v2 (Current) to DendPond_male_1.0 (Previous)
Identical	4%
Minor changes	57%
Major changes	13%
New	22%
Deprecated	20%
Other	4%
Download the report	tabular, Genome Workbench
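
The percentages in the table above add up to more than 100%. One plausible reading, which the report does not state explicitly, is that deprecated genes exist only in the previous annotation and are therefore counted on top of the current gene set. The sketch below checks that reading. The category grouping is an assumption made for illustration.

```python
# Percentages copied from the comparison table (percent of current genes).
CHANGES = {
    "Identical": 4,
    "Minor changes": 57,
    "Major changes": 13,
    "New": 22,
    "Deprecated": 20,
    "Other": 4,
}


def total_excluding(changes, excluded=("Deprecated",)):
    """Sum the percentages of all categories not listed in `excluded`.

    Treating "Deprecated" as absent from the current gene set is an
    assumption of this sketch, not a statement from the report.
    """
    return sum(pct for name, pct in changes.items() if name not in excluded)


if __name__ == "__main__":
    print(f"all categories: {sum(CHANGES.values())}%")
    print(f"categories present in the current release: {total_excluding(CHANGES)}%")
```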

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
BUSCO: Manni M, Berkeley MR, Seppey M, Simão FA, Zdobnov EM. Molecular Biology and Evolution 2021, 38(10):4647-4654
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 22(2):134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20
Minimap2: Li H. Bioinformatics 2018, 34(18):3094-3100
