NCBI Daphnia magna Annotation Release 101

The RefSeq genome records for Daphnia magna were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
BUSCO results: Annotation completeness assessed with BUSCO
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Similarity of current and previous assembly: The similarity of the current and previous assembly
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Daphnia magna Annotation Release 101

Annotation release ID: 101
Date of Entrez queries for transcripts and proteins: Nov 16 2021
Date of submission of annotation to the public databases: Nov 22 2021
Software version: 9.0

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
ASM2063170v1.1	GCF_020631705.1	Sungkyunkwan University	10-27-2021	Reference	11 assembled chromosomes; unplaced scaffolds

Gene and feature statistics

Counts and length of annotated features are provided below for each assembly.

Feature counts

Feature	ASM2063170v1.1
Genes and pseudogenes	28,335
protein-coding	16,891
non-coding	9,240
Transcribed pseudogenes	8
Non-transcribed pseudogenes	2,195
genes with variants	6,849
Immunoglobulin/T-cell receptor gene segments	0
other	1
mRNAs	28,208
fully-supported	26,047
with > 5% ab initio	1,615
partial	215
with filled gap(s)	43
known RefSeq (NM_)	0
model RefSeq (XM_)	28,208
non-coding RNAs	17,407
fully-supported	12,834
with > 5% ab initio	0
partial	4
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	13,750
pseudo transcripts	8
fully-supported	8
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	8
CDSs	28,221
fully-supported	26,047
with > 5% ab initio	1,688
partial	205
with major correction(s)	1,484
known RefSeq (NP_)	0
model RefSeq (XP_)	28,221

Detailed reports

The counts below do not include pseudogenes.

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	26,132	3,187	1,722	62	213,801
All transcripts	45,615	2,241	1,744	62	33,213
mRNA	28,208	2,579	1,979	252	33,213
misc_RNA	2,049	2,917	2,305	191	13,459
tRNA	3,655	74	73	62	101
lncRNA	10,801	2,137	1,695	111	20,829
snoRNA	58	214	223	71	224
snRNA	307	142	142	100	196
rRNA	536	131	119	119	4,377
Single-exon transcripts	1,170	1,631	1,230	279	18,690
coding transcripts (NM_/XM_ )	1,164	1,634	1,230	279	18,690
non-coding transcripts (NR_/XR_ )	6	1,075	1,017	734	1,715
CDSs	28,221	1,808	1,341	168	31,230
Exons	163,702	322	185	1	25,957
in coding transcripts (NM_/XM_ )	133,251	314	183	1	25,957
in non-coding transcripts (NR_/XR_ )	34,625	341	188	2	13,098
Introns	133,457	376	75	30	98,208
in coding transcripts (NM_/XM_ )	112,342	382	74	30	94,974
in non-coding transcripts (NR_/XR_ )	25,308	347	77	31	98,208

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	1.87	1	1	50
Number of exons per transcript	8	6	1	87

BUSCO analysis of gene annotation

BUSCO v4.1.4 (Simão et al 2015, PMID: 26059717) was run in "protein" mode on the annotated gene set picking one longest protein per gene, and run using the arthropoda_odb10 lineage dataset. Results are reported for the gene set from the primary assembly unit, and presented in BUSCO notation (C:complete [S:single-copy, D:duplicated], F:fragmented, M:missing, n:number of genes used).

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the UniProtKB/Swiss-Prot curated proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 16878 coding genes, 10337 genes had a protein with an alignment covering 50% or more of the query and 2220 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: UniProtKB/Swiss-Prot curated proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker (if calculated), for each assembly. RepeatMasker results are only calculated for organisms with complete Dfam HMM model collections.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with WindowMasker
ASM2063170v1.1	GCF_020631705.1	29.40%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign, minimap2, or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species Genbank	4,475	4,468 (99.84%)	4,461 (99.69%)	99.62%	99.80%
Same-species EST	15,366	12,142 (79.02%)	11,504 (74.87%)	99.17%	99.11%
Crustacea Genbank	46,598	20,691 (44.40%)	12,488 (26.80%)	93.39%	97.80%
Crustacea EST	904,592	18,729 (2.07%)	10,829 (1.20%)	90.57%	97.09%

RNA-Seq alignments

The following RNA-Seq reads from the Sequence Read Archive were also used for gene prediction:

Hide alignments statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Publication	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	NA	Aggregate of all aligned samples	9,104,367,049	72%	42%	180,071
SAMN03470176	NA	Whole body (Daphnia magna, SAMN03470176)	9,544,614	92%	47%	99,458
SAMN03470177	NA	Whole body (Daphnia magna, SAMN03470177)	11,180,752	91%	47%	102,523
SAMN03470178	NA	Whole body (Daphnia magna, SAMN03470178)	11,066,122	92%	56%	102,016
SAMN03470179	NA	Whole body (Daphnia magna, SAMN03470179)	8,144,674	90%	55%	102,633
SAMN07375994	28968576	whole organism (Daphnia magna, 7 day old, SAMN07375994)	42,593,524	86%	34%	131,135
SAMN07375995	28968576	whole organism (Daphnia magna, 7 day old, SAMN07375995)	40,226,894	87%	34%	129,250
SAMN07375996	28968576	whole organism (Daphnia magna, 7 day old, SAMN07375996)	39,466,370	86%	36%	129,674
SAMN07375997	28968576	whole organism (Daphnia magna, 7 day old, SAMN07375997)	38,507,684	87%	37%	130,337
SAMN08974731	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974731)	15,898,674	85%	29%	84,027
SAMN08974732	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974732)	27,556,298	88%	31%	121,215
SAMN08974733	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974733)	28,806,104	82%	29%	124,839
SAMN08974734	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974734)	31,060,696	80%	27%	120,524
SAMN08974735	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974735)	36,978,052	80%	28%	123,975
SAMN08974736	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974736)	31,917,780	79%	25%	109,860
SAMN08974737	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974737)	31,141,250	79%	27%	119,964
SAMN08974738	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974738)	26,197,014	81%	26%	115,244
SAMN08974739	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974739)	24,372,554	80%	26%	111,957
SAMN08974740	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974740)	25,800,034	82%	28%	118,957
SAMN08974741	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974741)	31,159,336	84%	28%	122,227
SAMN08974742	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974742)	25,255,914	84%	28%	116,887
SAMN08974743	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974743)	27,747,854	85%	28%	119,922
SAMN08974744	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974744)	34,335,246	80%	27%	121,832
SAMN08974745	NA	whole organism (Daphnia magna, 48-72 hrs, female, SAMN08974745)	32,961,164	84%	27%	121,685
SAMN09425737	31158668	whole animal (Daphnia magna, female, SAMN09425737)	192,911,810	79%	32%	148,804
SAMN10054298	NA	Whole body (Daphnia magna, pooled male and female, SAMN10054298)	49,260,620	70%	10%	91,297
SAMN10392302	30817885	whole body (Daphnia magna, SAMN10392302)	52,641,300	86%	44%	135,914
SAMN10392303	30817885	whole body (Daphnia magna, SAMN10392303)	28,099,414	85%	43%	116,767
SAMN10392304	30817885	whole body (Daphnia magna, SAMN10392304)	51,468,014	88%	44%	131,643
SAMN10392305	30817885	whole body (Daphnia magna, SAMN10392305)	37,523,128	87%	45%	127,951
SAMN10392306	30817885	whole body (Daphnia magna, SAMN10392306)	50,159,606	82%	43%	127,959
SAMN10392307	30817885	whole body (Daphnia magna, SAMN10392307)	48,568,660	82%	44%	136,094
SAMN10392308	30817885	whole body (Daphnia magna, SAMN10392308)	32,586,894	87%	44%	123,018
SAMN10392309	30817885	whole body (Daphnia magna, SAMN10392309)	37,738,348	83%	44%	111,661
SAMN10392310	30817885	whole body (Daphnia magna, SAMN10392310)	37,074,630	84%	43%	132,820
SAMN10392311	30817885	whole body (Daphnia magna, SAMN10392311)	39,189,100	87%	44%	124,394
SAMN10392312	30817885	whole body (Daphnia magna, SAMN10392312)	53,474,610	88%	44%	132,995
SAMN10392313	30817885	whole body (Daphnia magna, SAMN10392313)	48,814,452	86%	45%	133,309
SAMN10392314	30817885	whole body (Daphnia magna, SAMN10392314)	49,595,586	89%	44%	131,540
SAMN10392315	30817885	whole body (Daphnia magna, SAMN10392315)	38,193,976	86%	44%	133,739
SAMN10392316	30817885	whole body (Daphnia magna, SAMN10392316)	41,442,898	89%	44%	123,550
SAMN10392317	30817885	whole body (Daphnia magna, SAMN10392317)	36,257,122	89%	45%	130,608
SAMN10392318	30817885	whole body (Daphnia magna, SAMN10392318)	46,216,210	88%	44%	131,789
SAMN10392319	30817885	whole body (Daphnia magna, SAMN10392319)	31,905,808	86%	44%	129,782
SAMN10392320	30817885	whole body (Daphnia magna, SAMN10392320)	31,365,228	87%	44%	129,359
SAMN10392321	30817885	whole body (Daphnia magna, SAMN10392321)	34,896,732	88%	44%	131,135
SAMN10392322	30817885	whole body (Daphnia magna, SAMN10392322)	53,889,058	86%	44%	137,768
SAMN10392323	30817885	whole body (Daphnia magna, SAMN10392323)	43,061,164	87%	44%	134,354
SAMN10392324	30817885	whole body (Daphnia magna, SAMN10392324)	51,892,850	89%	43%	124,334
SAMN10392325	30817885	whole body (Daphnia magna, SAMN10392325)	36,499,622	89%	45%	126,108
SAMN10392326	30817885	whole body (Daphnia magna, SAMN10392326)	51,210,316	87%	44%	137,327
SAMN10392327	30817885	whole body (Daphnia magna, SAMN10392327)	29,291,560	87%	44%	121,126
SAMN10392328	30817885	whole body (Daphnia magna, SAMN10392328)	36,948,834	88%	44%	125,649
SAMN10392329	30817885	whole body (Daphnia magna, SAMN10392329)	46,263,892	87%	44%	133,569
SAMN10392330	30817885	whole body (Daphnia magna, SAMN10392330)	57,931,324	88%	43%	136,747
SAMN10392331	30817885	whole body (Daphnia magna, SAMN10392331)	52,005,272	87%	44%	129,758
SAMN10392332	30817885	whole body (Daphnia magna, SAMN10392332)	51,395,772	88%	44%	136,388
SAMN10392333	30817885	whole body (Daphnia magna, SAMN10392333)	40,746,918	88%	44%	134,011
SAMN10392334	30817885	whole body (Daphnia magna, SAMN10392334)	45,765,206	88%	44%	134,124
SAMN10392335	30817885	whole body (Daphnia magna, SAMN10392335)	51,083,134	89%	45%	134,733
SAMN10392336	30817885	whole body (Daphnia magna, SAMN10392336)	39,985,208	87%	43%	132,561
SAMN10392337	30817885	whole body (Daphnia magna, SAMN10392337)	38,942,396	88%	43%	126,862
SAMN10392338	30817885	whole body (Daphnia magna, SAMN10392338)	52,522,198	89%	43%	130,603
SAMN10392339	30817885	whole body (Daphnia magna, SAMN10392339)	40,560,964	84%	43%	124,022
SAMN10392340	30817885	whole body (Daphnia magna, SAMN10392340)	63,277,028	89%	45%	135,930
SAMN10392341	30817885	whole body (Daphnia magna, SAMN10392341)	25,703,308	88%	44%	125,945
SAMN10392342	30817885	whole body (Daphnia magna, SAMN10392342)	29,885,540	85%	44%	124,483
SAMN10392343	30817885	whole body (Daphnia magna, SAMN10392343)	42,069,094	89%	45%	129,320
SAMN10392344	30817885	whole body (Daphnia magna, SAMN10392344)	56,437,596	87%	43%	132,015
SAMN10392345	30817885	whole body (Daphnia magna, SAMN10392345)	44,239,504	87%	43%	134,914
SAMN10392346	30817885	whole body (Daphnia magna, SAMN10392346)	38,725,372	87%	45%	129,171
SAMN10392347	30817885	whole body (Daphnia magna, SAMN10392347)	48,404,070	84%	43%	127,046
SAMN10392348	30817885	whole body (Daphnia magna, SAMN10392348)	56,080,258	86%	43%	136,084
SAMN10392349	30817885	whole body (Daphnia magna, SAMN10392349)	41,448,590	80%	43%	120,761
SAMN10606273	NA	body (Daphnia magna, 48h, female, SAMN10606273)	51,525,786	27%	49%	62,879
SAMN10739542	31158668	Whole body (Daphnia magna, SAMN10739542)	504,047,974	57%	33%	150,953
SAMN14600439	NA	embryo (Daphnia magna, 12 h after ovulation, female, SAMN14600439)	54,818,240	80%	45%	112,403
SAMN14600440	NA	embryo (Daphnia magna, 12 h after ovulation, female, SAMN14600440)	69,179,060	84%	47%	103,388
SAMN14600441	NA	embryo (Daphnia magna, 12 h after ovulation, female, SAMN14600441)	57,163,424	83%	45%	100,183
SAMN14600442	NA	embryo (Daphnia magna, 12 h after ovulation, female, SAMN14600442)	62,724,520	81%	45%	107,749
SAMN14600443	NA	embryo (Daphnia magna, 12 h after ovulation, female, SAMN14600443)	48,220,792	79%	43%	102,420
SAMN14600444	NA	embryo (Daphnia magna, 12 h after ovulation, female, SAMN14600444)	58,627,612	82%	49%	108,928
SAMN14791943	NA	silica_3 (Daphnia magna, SAMN14791943)	33,705,894	58%	42%	122,051
SAMN14791944	NA	silica_2 (Daphnia magna, SAMN14791944)	30,046,946	65%	42%	120,657
SAMN14791945	NA	silica_1 (Daphnia magna, SAMN14791945)	87,057,586	73%	42%	135,486
SAMN14791946	NA	control_2 (Daphnia magna, SAMN14791946)	44,592,258	61%	43%	126,904
SAMN14791947	NA	control_1 (Daphnia magna, SAMN14791947)	31,919,940	61%	43%	121,774
SAMN14791948	NA	nano_high_5 (Daphnia magna, SAMN14791948)	82,849,222	72%	41%	133,871
SAMN14791949	NA	nano_high_4 (Daphnia magna, SAMN14791949)	149,616,056	80%	40%	139,460
SAMN14791950	NA	nano_high_3 (Daphnia magna, SAMN14791950)	59,705,851	76%	42%	129,316
SAMN14791951	NA	nano_high_2 (Daphnia magna, SAMN14791951)	67,086,511	74%	41%	132,670
SAMN14791952	NA	control_5 (Daphnia magna, SAMN14791952)	21,182,615	65%	43%	116,274
SAMN14791953	NA	control_4 (Daphnia magna, SAMN14791953)	42,141,994	74%	43%	126,421
SAMN14791954	NA	control_3 (Daphnia magna, SAMN14791954)	50,270,205	59%	42%	126,469
SAMN14791955	NA	nano_high_1 (Daphnia magna, SAMN14791955)	98,187,509	76%	42%	136,769
SAMN14791956	NA	nano_low_5 (Daphnia magna, SAMN14791956)	59,821,353	71%	42%	131,543
SAMN14791957	NA	nano_low_4 (Daphnia magna, SAMN14791957)	24,755,096	70%	42%	117,008
SAMN14791958	NA	nano_low_3 (Daphnia magna, SAMN14791958)	16,042,960	72%	41%	111,576
SAMN14791959	NA	nano_low_2 (Daphnia magna, SAMN14791959)	67,020,752	70%	42%	132,659
SAMN14791960	NA	nano_low_1 (Daphnia magna, SAMN14791960)	28,529,420	66%	43%	120,071
SAMN14791973	NA	silica_5 (Daphnia magna, SAMN14791973)	13,110,727	68%	41%	103,111
SAMN14791974	NA	silica_4 (Daphnia magna, SAMN14791974)	16,710,850	67%	43%	113,586
SAMN14970493	32866176	Line B male A (Daphnia magna, 40 hour post oviposition, SAMN14970493)	19,980,912	77%	40%	108,635
SAMN14970494	NA	WTHF3 (Daphnia magna, post natal day 12, releasing 2nd clutch, female, SAMN14970494)	26,638,108	88%	45%	122,583
SAMN14970495	32866176	Line A male C (Daphnia magna, 40 hour post oviposition, SAMN14970495)	22,042,858	76%	39%	115,558
SAMN14970496	32866176	Line A male B (Daphnia magna, 40 hour post oviposition, SAMN14970496)	22,076,110	77%	40%	115,518
SAMN14970497	32866176	Line A male A (Daphnia magna, 40 hour post oviposition, SAMN14970497)	19,706,280	80%	41%	110,712
SAMN14970498	32866176	NIES female C (Daphnia magna, 40 hour post oviposition, SAMN14970498)	21,642,734	64%	35%	110,110
SAMN14970499	32866176	NIES female B (Daphnia magna, 40 hour post oviposition, SAMN14970499)	22,540,028	75%	39%	113,908
SAMN14970500	32866176	NIES female A (Daphnia magna, 40 hour post oviposition, SAMN14970500)	20,672,872	76%	39%	111,587
SAMN14970501	NA	WTHF2 (Daphnia magna, post natal day 12, releasing 2nd clutch, female, SAMN14970501)	26,716,794	88%	44%	120,461
SAMN14970502	NA	WTHF1 (Daphnia magna, post natal day 12, releasing 2nd clutch, female, SAMN14970502)	26,974,612	85%	43%	115,924
SAMN14970513	32866176	NIES male C (Daphnia magna, 40 hour post oviposition, SAMN14970513)	24,003,360	73%	38%	110,948
SAMN14970514	32866176	NIES male B (Daphnia magna, 40 hour post oviposition, SAMN14970514)	29,447,386	78%	40%	118,372
SAMN14970515	32866176	NIES male A (Daphnia magna, 40 hour post oviposition, SAMN14970515)	19,721,322	80%	41%	110,706
SAMN14970516	32866176	Line B male C (Daphnia magna, 40 hour post oviposition, SAMN14970516)	27,937,174	76%	39%	119,375
SAMN14970517	32866176	Line B male B (Daphnia magna, 40 hour post oviposition, SAMN14970517)	20,488,770	73%	37%	112,871
SAMN15894648	33199138	whole-body (Daphnia magna, SAMN15894648)	49,753,964	67%	39%	110,007
SAMN15894649	33199138	whole-body (Daphnia magna, SAMN15894649)	46,908,844	60%	42%	119,616
SAMN15894650	33199138	whole-body (Daphnia magna, SAMN15894650)	52,304,980	57%	43%	119,735
SAMN15894651	33199138	whole-body (Daphnia magna, SAMN15894651)	54,859,622	68%	42%	120,276
SAMN15894652	33199138	whole-body (Daphnia magna, SAMN15894652)	48,458,086	65%	41%	117,539
SAMN15894653	33199138	whole-body (Daphnia magna, SAMN15894653)	48,712,248	70%	40%	109,501
SAMN15894654	33199138	whole-body (Daphnia magna, SAMN15894654)	57,084,900	69%	39%	111,321
SAMN15894655	33199138	whole-body (Daphnia magna, SAMN15894655)	51,294,428	66%	40%	122,784
SAMN15894656	33199138	whole-body (Daphnia magna, SAMN15894656)	51,996,428	73%	42%	122,335
SAMN15894657	33199138	whole-body (Daphnia magna, SAMN15894657)	55,164,570	47%	44%	121,677
SAMN15894658	33199138	whole-body (Daphnia magna, SAMN15894658)	54,170,818	54%	43%	123,772
SAMN15894659	33199138	whole-body (Daphnia magna, SAMN15894659)	59,783,960	47%	43%	120,868
SAMN16197875	NA	MT_HF3 (Daphnia magna, post natal day 12, releasing 2nd clutch, female, SAMN16197875)	30,189,894	88%	46%	125,370
SAMN16197876	NA	MT_HF2 (Daphnia magna, post natal day 12, releasing 2nd clutch, female, SAMN16197876)	28,485,278	88%	45%	124,464
SAMN16197877	NA	MT_HF1 (Daphnia magna, post natal day 12, releasing 2nd clutch, female, SAMN16197877)	31,305,916	81%	41%	114,072
SAMN16197878	NA	MT_LF3 (Daphnia magna, post natal day 12, releasing 2nd clutch, female, SAMN16197878)	27,671,416	85%	43%	128,570
SAMN16197879	NA	MT_LF2 (Daphnia magna, post natal day 12, releasing 2nd clutch, female, SAMN16197879)	25,667,578	87%	44%	128,936
SAMN16197880	NA	MT_LF1 (Daphnia magna, post natal day 12, releasing 2nd clutch, female, SAMN16197880)	31,748,720	85%	43%	129,558
SAMN16197881	NA	WT_LF3 (Daphnia magna, post natal day 12, releasing 2nd clutch, female, SAMN16197881)	24,995,834	86%	44%	126,390
SAMN16197882	NA	WT_LF2 (Daphnia magna, post natal day 12, releasing 2nd clutch, female, SAMN16197882)	27,071,642	88%	44%	128,750
SAMN16197883	NA	WT_LF1 (Daphnia magna, post natal day 12, releasing 2nd clutch, female, SAMN16197883)	27,146,228	86%	45%	128,226
SAMN19980729	NA	Whole Body (Daphnia magna, Adult, female, SAMN19980729)	71,615,944	69%	42%	131,667
SAMN20427111	NA	Invertebrate sample from Daphnia magna (Daphnia magna, SAMN20427111)	294,999,366	61%	44%	149,197
SAMN20427112	NA	Invertebrate sample from Daphnia magna (Daphnia magna, SAMN20427112)	285,078,956	57%	44%	148,691
SAMN20427113	NA	Invertebrate sample from Daphnia magna (Daphnia magna, SAMN20427113)	430,157,724	64%	47%	151,432
SAMN20427114	NA	Invertebrate sample from Daphnia magna (Daphnia magna, SAMN20427114)	397,161,092	63%	48%	150,833
SAMN20427115	NA	Invertebrate sample from Daphnia magna (Daphnia magna, SAMN20427115)	414,572,860	64%	48%	149,748
SAMN20427116	NA	Invertebrate sample from Daphnia magna (Daphnia magna, SAMN20427116)	332,740,476	60%	45%	149,635
SAMN20427117	NA	Invertebrate sample from Daphnia magna (Daphnia magna, SAMN20427117)	370,863,214	59%	43%	144,235
SAMN20427118	NA	Invertebrate sample from Daphnia magna (Daphnia magna, SAMN20427118)	373,582,792	62%	47%	152,278

Show alignments statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
SRR1964030	SRX986251	SRP057045	SAMN03470176	9,544,614	92%	47%
SRR1964033	SRX986252	SRP057045	SAMN03470177	11,180,752	91%	47%
SRR1964031	SRX986249	SRP057045	SAMN03470178	11,066,122	92%	56%
SRR1964032	SRX986250	SRP057045	SAMN03470179	8,144,674	90%	55%
SRR5859136	SRX3027898	SRP113320	SAMN07375994	42,593,524	86%	34%
SRR5859135	SRX3027897	SRP113320	SAMN07375995	40,226,894	87%	34%
SRR5859134	SRX3027896	SRP113320	SAMN07375996	39,466,370	86%	36%
SRR5859133	SRX3027895	SRP113320	SAMN07375997	38,507,684	87%	37%
SRR7058656	SRX3989601	SRP142416	SAMN08974731	15,898,674	85%	29%
SRR7058655	SRX3989602	SRP142416	SAMN08974732	27,556,298	88%	31%
SRR7058654	SRX3989603	SRP142416	SAMN08974733	28,806,104	82%	29%
SRR7058653	SRX3989604	SRP142416	SAMN08974734	31,060,696	80%	27%
SRR7058652	SRX3989605	SRP142416	SAMN08974735	36,978,052	80%	28%
SRR7058651	SRX3989606	SRP142416	SAMN08974736	31,917,780	79%	25%
SRR7058650	SRX3989607	SRP142416	SAMN08974737	31,141,250	79%	27%
SRR7058649	SRX3989608	SRP142416	SAMN08974738	26,197,014	81%	26%
SRR7058648	SRX3989609	SRP142416	SAMN08974739	24,372,554	80%	26%
SRR7058647	SRX3989610	SRP142416	SAMN08974740	25,800,034	82%	28%
SRR7058644	SRX3989613	SRP142416	SAMN08974741	31,159,336	84%	28%
SRR7058643	SRX3989614	SRP142416	SAMN08974742	25,255,914	84%	28%
SRR7058646	SRX3989611	SRP142416	SAMN08974743	27,747,854	85%	28%
SRR7058645	SRX3989612	SRP142416	SAMN08974744	34,335,246	80%	27%
SRR7058642	SRX3989615	SRP142416	SAMN08974745	32,961,164	84%	27%
SRR7419487	SRX4290444	SRP151236	SAMN09425737	192,911,810	79%	32%
SRR8439016	SRX5246444	SRP151236	SAMN10739542	97,898,274	54%	34%
SRR8439015	SRX5246445	SRP151236	SAMN10739542	153,135,000	69%	36%
SRR8439014	SRX5246446	SRP151236	SAMN10739542	103,463,964	53%	24%
SRR8439013	SRX5246447	SRP151236	SAMN10739542	62,612,596	52%	34%
SRR8439012	SRX5246448	SRP151236	SAMN10739542	86,938,140	48%	34%
SRR7825507	SRX4676526	SRP161660	SAMN10054298	49,260,620	70%	10%
SRR8172431	SRX4992974	SRP168044	SAMN10392302	52,641,300	86%	44%
SRR8172424	SRX4992967	SRP168044	SAMN10392303	28,099,414	85%	43%
SRR8172423	SRX4992966	SRP168044	SAMN10392304	51,468,014	88%	44%
SRR8172422	SRX4992965	SRP168044	SAMN10392305	37,523,128	87%	45%
SRR8172421	SRX4992964	SRP168044	SAMN10392306	50,159,606	82%	43%
SRR8172420	SRX4992963	SRP168044	SAMN10392307	48,568,660	82%	44%
SRR8172419	SRX4992962	SRP168044	SAMN10392308	32,586,894	87%	44%
SRR8172418	SRX4992961	SRP168044	SAMN10392309	37,738,348	83%	44%
SRR8172417	SRX4992960	SRP168044	SAMN10392310	37,074,630	84%	43%
SRR8172416	SRX4992959	SRP168044	SAMN10392311	39,189,100	87%	44%
SRR8172415	SRX4992958	SRP168044	SAMN10392312	53,474,610	88%	44%
SRR8172442	SRX4992985	SRP168044	SAMN10392313	48,814,452	86%	45%
SRR8172441	SRX4992984	SRP168044	SAMN10392314	49,595,586	89%	44%
SRR8172440	SRX4992983	SRP168044	SAMN10392315	38,193,976	86%	44%
SRR8172439	SRX4992982	SRP168044	SAMN10392316	41,442,898	89%	44%
SRR8172438	SRX4992981	SRP168044	SAMN10392317	36,257,122	89%	45%
SRR8172437	SRX4992980	SRP168044	SAMN10392318	46,216,210	88%	44%
SRR8172436	SRX4992979	SRP168044	SAMN10392319	31,905,808	86%	44%
SRR8172435	SRX4992978	SRP168044	SAMN10392320	31,365,228	87%	44%
SRR8172434	SRX4992977	SRP168044	SAMN10392321	34,896,732	88%	44%
SRR8172432	SRX4992975	SRP168044	SAMN10392322	53,889,058	86%	44%
SRR8172433	SRX4992976	SRP168044	SAMN10392323	43,061,164	87%	44%
SRR8172462	SRX4993005	SRP168044	SAMN10392324	51,892,850	89%	43%
SRR8172461	SRX4993004	SRP168044	SAMN10392325	36,499,622	89%	45%
SRR8172460	SRX4993003	SRP168044	SAMN10392326	51,210,316	87%	44%
SRR8172459	SRX4993002	SRP168044	SAMN10392327	29,291,560	87%	44%
SRR8172458	SRX4993001	SRP168044	SAMN10392328	36,948,834	88%	44%
SRR8172457	SRX4993000	SRP168044	SAMN10392329	46,263,892	87%	44%
SRR8172456	SRX4992999	SRP168044	SAMN10392330	57,931,324	88%	43%
SRR8172455	SRX4992998	SRP168044	SAMN10392331	52,005,272	87%	44%
SRR8172454	SRX4992997	SRP168044	SAMN10392332	51,395,772	88%	44%
SRR8172453	SRX4992996	SRP168044	SAMN10392333	40,746,918	88%	44%
SRR8172452	SRX4992995	SRP168044	SAMN10392334	45,765,206	88%	44%
SRR8172451	SRX4992994	SRP168044	SAMN10392335	51,083,134	89%	45%
SRR8172450	SRX4992993	SRP168044	SAMN10392336	39,985,208	87%	43%
SRR8172449	SRX4992992	SRP168044	SAMN10392337	38,942,396	88%	43%
SRR8172448	SRX4992991	SRP168044	SAMN10392338	52,522,198	89%	43%
SRR8172447	SRX4992990	SRP168044	SAMN10392339	40,560,964	84%	43%
SRR8172446	SRX4992989	SRP168044	SAMN10392340	63,277,028	89%	45%
SRR8172445	SRX4992988	SRP168044	SAMN10392341	25,703,308	88%	44%
SRR8172444	SRX4992987	SRP168044	SAMN10392342	29,885,540	85%	44%
SRR8172443	SRX4992986	SRP168044	SAMN10392343	42,069,094	89%	45%
SRR8172430	SRX4992973	SRP168044	SAMN10392344	56,437,596	87%	43%
SRR8172429	SRX4992972	SRP168044	SAMN10392345	44,239,504	87%	43%
SRR8172428	SRX4992971	SRP168044	SAMN10392346	38,725,372	87%	45%
SRR8172427	SRX4992970	SRP168044	SAMN10392347	48,404,070	84%	43%
SRR8172426	SRX4992969	SRP168044	SAMN10392348	56,080,258	86%	43%
SRR8172425	SRX4992968	SRP168044	SAMN10392349	41,448,590	80%	43%
SRR8352226	SRX5163187	SRP173886	SAMN10606273	51,525,786	27%	49%
SRR11548367	SRX8118412	SRP256437	SAMN14600439	54,818,240	80%	45%
SRR11548366	SRX8118413	SRP256437	SAMN14600440	69,179,060	84%	47%
SRR11548365	SRX8118414	SRP256437	SAMN14600441	57,163,424	83%	45%
SRR11548364	SRX8118415	SRP256437	SAMN14600442	62,724,520	81%	45%
SRR11548363	SRX8118416	SRP256437	SAMN14600443	48,220,792	79%	43%
SRR11548362	SRX8118417	SRP256437	SAMN14600444	58,627,612	82%	49%
SRR11680737	SRX8241597	SRP259943	SAMN14791943	33,705,894	58%	42%
SRR11680736	SRX8241596	SRP259943	SAMN14791944	30,046,946	65%	42%
SRR11680735	SRX8241595	SRP259943	SAMN14791945	87,057,586	73%	42%
SRR11680731	SRX8241591	SRP259943	SAMN14791946	44,592,258	61%	43%
SRR11680730	SRX8241590	SRP259943	SAMN14791947	31,919,940	61%	43%
SRR11680729	SRX8241589	SRP259943	SAMN14791948	82,849,222	72%	41%
SRR11680728	SRX8241588	SRP259943	SAMN14791949	149,616,056	80%	40%
SRR11680727	SRX8241587	SRP259943	SAMN14791950	59,705,851	76%	42%
SRR11680726	SRX8241586	SRP259943	SAMN14791951	67,086,511	74%	41%
SRR11680734	SRX8241594	SRP259943	SAMN14791952	21,182,615	65%	43%
SRR11680733	SRX8241593	SRP259943	SAMN14791953	42,141,994	74%	43%
SRR11680732	SRX8241592	SRP259943	SAMN14791954	50,270,205	59%	42%
SRR11680725	SRX8241585	SRP259943	SAMN14791955	98,187,509	76%	42%
SRR11680724	SRX8241584	SRP259943	SAMN14791956	59,821,353	71%	42%
SRR11680723	SRX8241583	SRP259943	SAMN14791957	24,755,096	70%	42%
SRR11680722	SRX8241582	SRP259943	SAMN14791958	16,042,960	72%	41%
SRR11680721	SRX8241581	SRP259943	SAMN14791959	67,020,752	70%	42%
SRR11680720	SRX8241580	SRP259943	SAMN14791960	28,529,420	66%	43%
SRR11680739	SRX8241599	SRP259943	SAMN14791973	13,110,727	68%	41%
SRR11680738	SRX8241598	SRP259943	SAMN14791974	16,710,850	67%	43%
SRR11811043	SRX8362316	SRP262286	SAMN14970493	19,980,912	77%	40%
SRR11811042	SRX8362315	SRP262286	SAMN14970495	22,042,858	76%	39%
SRR11811041	SRX8362314	SRP262286	SAMN14970496	22,076,110	77%	40%
SRR11811040	SRX8362313	SRP262286	SAMN14970497	19,706,280	80%	41%
SRR11811039	SRX8362312	SRP262286	SAMN14970498	21,642,734	64%	35%
SRR11811038	SRX8362311	SRP262286	SAMN14970499	22,540,028	75%	39%
SRR11811037	SRX8362310	SRP262286	SAMN14970500	20,672,872	76%	39%
SRR11811048	SRX8362321	SRP262286	SAMN14970513	24,003,360	73%	38%
SRR11811047	SRX8362320	SRP262286	SAMN14970514	29,447,386	78%	40%
SRR11811046	SRX8362319	SRP262286	SAMN14970515	19,721,322	80%	41%
SRR11811045	SRX8362318	SRP262286	SAMN14970516	27,937,174	76%	39%
SRR11811044	SRX8362317	SRP262286	SAMN14970517	20,488,770	73%	37%
SRR11811061	SRX8362334	SRP262288	SAMN14970494	26,638,108	88%	45%
SRR11811060	SRX8362333	SRP262288	SAMN14970501	26,716,794	88%	44%
SRR11811059	SRX8362332	SRP262288	SAMN14970502	26,974,612	85%	43%
SRR12508270	SRX8999045	SRP278637	SAMN15894648	49,753,964	67%	39%
SRR12508267	SRX8999042	SRP278637	SAMN15894649	46,908,844	60%	42%
SRR12508266	SRX8999041	SRP278637	SAMN15894650	52,304,980	57%	43%
SRR12508265	SRX8999039	SRP278637	SAMN15894651	54,859,622	68%	42%
SRR12508264	SRX8999038	SRP278637	SAMN15894652	48,458,086	65%	41%
SRR12508269	SRX8999044	SRP278637	SAMN15894653	48,712,248	70%	40%
SRR12508268	SRX8999043	SRP278637	SAMN15894654	57,084,900	69%	39%
SRR12508263	SRX8999037	SRP278637	SAMN15894655	51,294,428	66%	40%
SRR12508262	SRX8999036	SRP278637	SAMN15894656	51,996,428	73%	42%
SRR12508261	SRX8999048	SRP278637	SAMN15894657	55,164,570	47%	44%
SRR12508272	SRX8999047	SRP278637	SAMN15894658	54,170,818	54%	43%
SRR12508271	SRX8999046	SRP278637	SAMN15894659	59,783,960	47%	43%
SRR12660012	SRX9141064	SRP282875	SAMN16197875	30,189,894	88%	46%
SRR12660011	SRX9141063	SRP282875	SAMN16197876	28,485,278	88%	45%
SRR12660010	SRX9141062	SRP282875	SAMN16197877	31,305,916	81%	41%
SRR12660009	SRX9141061	SRP282875	SAMN16197878	27,671,416	85%	43%
SRR12660008	SRX9141060	SRP282875	SAMN16197879	25,667,578	87%	44%
SRR12660007	SRX9141059	SRP282875	SAMN16197880	31,748,720	85%	43%
SRR12660006	SRX9141058	SRP282875	SAMN16197881	24,995,834	86%	44%
SRR12660005	SRX9141057	SRP282875	SAMN16197882	27,071,642	88%	44%
SRR12660004	SRX9141056	SRP282875	SAMN16197883	27,146,228	86%	45%
SRR15012076	SRX11324167	SRP326419	SAMN19980729	71,615,944	69%	42%
SRR15257905	SRX11563255	SRP330012	SAMN20427111	100,612,284	57%	45%
SRR15257904	SRX11563256	SRP330012	SAMN20427111	98,480,352	61%	44%
SRR15257893	SRX11563267	SRP330012	SAMN20427111	95,906,730	63%	44%
SRR15257882	SRX11563278	SRP330012	SAMN20427112	94,533,750	55%	44%
SRR15257881	SRX11563279	SRP330012	SAMN20427112	104,626,834	61%	44%
SRR15257880	SRX11563280	SRP330012	SAMN20427112	85,918,372	54%	43%
SRR15257879	SRX11563281	SRP330012	SAMN20427113	105,475,424	63%	48%
SRR15257878	SRX11563282	SRP330012	SAMN20427113	108,425,972	65%	46%
SRR15257877	SRX11563283	SRP330012	SAMN20427113	113,660,338	63%	48%
SRR15257876	SRX11563284	SRP330012	SAMN20427113	102,595,990	66%	47%
SRR15257903	SRX11563257	SRP330012	SAMN20427114	100,236,938	61%	47%
SRR15257902	SRX11563258	SRP330012	SAMN20427114	92,486,828	63%	48%
SRR15257901	SRX11563259	SRP330012	SAMN20427114	101,631,740	64%	48%
SRR15257900	SRX11563260	SRP330012	SAMN20427114	102,805,586	64%	49%
SRR15257899	SRX11563261	SRP330012	SAMN20427115	98,642,390	65%	46%
SRR15257898	SRX11563262	SRP330012	SAMN20427115	103,542,488	63%	48%
SRR15257897	SRX11563263	SRP330012	SAMN20427115	105,027,408	63%	49%
SRR15257896	SRX11563264	SRP330012	SAMN20427115	107,360,574	64%	49%
SRR15257895	SRX11563265	SRP330012	SAMN20427116	88,592,214	65%	44%
SRR15257894	SRX11563266	SRP330012	SAMN20427116	83,185,388	59%	45%
SRR15257892	SRX11563268	SRP330012	SAMN20427116	79,748,324	58%	45%
SRR15257891	SRX11563269	SRP330012	SAMN20427116	81,214,550	57%	46%
SRR15257890	SRX11563270	SRP330012	SAMN20427117	87,544,846	61%	44%
SRR15257889	SRX11563271	SRP330012	SAMN20427117	94,486,442	58%	43%
SRR15257888	SRX11563272	SRP330012	SAMN20427117	93,649,398	60%	44%
SRR15257887	SRX11563273	SRP330012	SAMN20427117	95,182,528	59%	42%
SRR15257886	SRX11563274	SRP330012	SAMN20427118	93,488,030	62%	48%
SRR15257885	SRX11563275	SRP330012	SAMN20427118	95,301,982	63%	47%
SRR15257884	SRX11563276	SRP330012	SAMN20427118	88,704,858	60%	47%
SRR15257883	SRX11563277	SRP330012	SAMN20427118	96,087,922	64%	46%

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Hyalella azteca high-quality model RefSeq (XP_)	9,395	5,221 (55.57%)	5,221 (55.57%)	60.38%	50.32%
Same-species GenBank	4,469	2,720 (60.86%)	2,720 (60.86%)	92.75%	96.50%
Caenorhabditis elegans known RefSeq (NP_)	28,435	10,081 (35.45%)	10,081 (35.45%)	57.30%	41.53%
Crustacea GenBank	41,056	28,260 (68.83%)	28,260 (68.83%)	76.71%	81.36%
Daphnia pulex Other	31,008	21,972 (70.86%)	21,972 (70.86%)	72.12%	78.52%
Tribolium castaneum GenBank	672	289 (43.01%)	289 (43.01%)	67.86%	66.35%
Tribolium castaneum high-quality model RefSeq (XP_)	11,487	7,775 (67.69%)	7,775 (67.69%)	60.39%	53.48%
Tribolium castaneum known RefSeq (NP_)	627	500 (79.74%)	500 (79.74%)	64.65%	55.51%
Drosophila melanogaster known RefSeq (NP_)	30,704	13,755 (44.80%)	13,755 (44.80%)	61.11%	49.64%
Eurytemora affinis high-quality model RefSeq (XP_)	14,540	7,274 (50.03%)	7,274 (50.03%)	58.59%	46.20%

Assembly-assembly alignments of current to previous assembly

When the assembly changes between two rounds of annotation, genes in the current and the previous annotation are mapped to each other using the genomic alignments of the current assembly to the previous assembly so that gene identifiers can be preserved. The success of the remapping depends largely on how well the two assembly versions align to each other.

Below are the percent coverage of one assembly by the other and the average percent identity of the alignments. The 'First pass' alignments are reciprocal best hits, while the 'Total' alignments also include 'Second pass' or non-reciprocal best alignments. For more information about the assembly-assembly alignment process, please visit the NCBI Genome Remapping Service page.

First Pass	Total
JAIFAF01 (Current) Coverage: 64.48%	JAIFAF01 (Current) Coverage: 76.78%
QYSF01 (Previous) Coverage: 85.38%	QYSF01 (Previous) Coverage: 89.39%
Percent Identity: 98.66%	Percent Identity: 98.17%

Comparison of the current and previous annotations

The annotation produced for this release (101) was compared to the annotation in the previous release (100) for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides links to the details of the comparison in tabular format and in a Genome Workbench project.

	ASM2063170v1.1 (Current) to ASM399081v1 (Previous)
Identical	3%
Minor changes	41%
Major changes	12%
New	41%
Deprecated	18%
Other	3%
Download the report	tabular, Genome Workbench

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 2:134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20
Minimap2: Li H. Bioinformatics 2018 Sep 15;34(18):3094-3100

RefSeq

Integrated reference sequences