NCBI Anas platyrhynchos Annotation Release 104

The RefSeq genome records for Anas platyrhynchos were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Similarity of current and previous assembly: The similarity of the current and previous assembly
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Anas platyrhynchos Annotation Release 104

Annotation release ID: 104
Date of Entrez queries for transcripts and proteins: Dec 2 2020
Date of submission of annotation to the public databases: Dec 15 2020
Software version: 8.5

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
ZJU1.0	GCF_015476345.1	Zhejiang University	11-23-2020	Reference	34 assembled chromosomes; unplaced scaffolds

Gene and feature statistics

Counts and length of annotated features are provided below for each assembly.

Feature counts

Feature	ZJU1.0
Genes and pseudogenes	25,093
protein-coding	16,836
non-coding	7,966
transcribed pseudogenes	2
non-transcribed pseudogenes	241
genes with variants	10,056
immunoglobulin/T-cell receptor gene segments	48
other	0
mRNAs	45,212
fully-supported	44,103
with > 5% ab initio	458
partial	367
with filled gap(s)	134
known RefSeq (NM_)	144
model RefSeq (XM_)	45,068
non-coding RNAs	15,030
fully-supported	14,308
with > 5% ab initio	0
partial	2
with filled gap(s)	2
known RefSeq (NR_)	0
model RefSeq (XR_)	14,627
pseudo transcripts	2
fully-supported	2
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	2
CDSs	45,273
fully-supported	44,103
with > 5% ab initio	535
partial	356
with major correction(s)	1,784
known RefSeq (NP_)	144
model RefSeq (XP_)	45,081

Detailed reports

The counts below do not include pseudogenes.

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	24,802	29,554	10,034	59	1,330,789
All transcripts	60,242	4,139	3,254	59	103,952
mRNA	45,212	4,373	3,566	251	103,952
misc_RNA	2,201	4,046	3,176	200	39,440
tRNA	401	74	73	66	87
lncRNA	12,107	3,518	2,230	129	39,297
snoRNA	201	109	95	62	319
snRNA	68	150	162	59	194
guide_RNA	15	179	137	129	312
rRNA	37	379	119	118	3,964
Single-exon transcripts	828	1,802	999	279	15,255
coding transcripts (NM_/XM_ )	828	1,802	999	279	15,255
CDSs	45,225	2,218	1,593	96	102,741
Exons	249,884	402	143	1	41,642
in coding transcripts (NM_/XM_ )	216,012	335	139	1	41,642
in non-coding transcripts (NR_/XR_ )	44,788	678	170	2	37,617
Introns	221,456	4,273	974	30	613,998
in coding transcripts (NM_/XM_ )	196,998	4,041	938	30	613,998
in non-coding transcripts (NR_/XR_ )	35,005	5,465	1,279	30	488,129

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	2.45	1	1	50
Number of exons per transcript	12.17	9	1	323

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the UniProtKB/Swiss-Prot curated proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 16823 coding genes, 16246 genes had a protein with an alignment covering 50% or more of the query and 10982 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: UniProtKB/Swiss-Prot curated proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker for each assembly. RepeatMasker results are only used for organisms for which a comprehensive repeat library is available.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with RepeatMasker	% Masked with WindowMasker
ZJU1.0	GCF_015476345.1	8.01%	23.92%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Depending on the other evidence available, long 454 reads (with average length above 250 nt) may be aligned as traditional evidence and reported in the Transcript alignments section or aligned with RNA-Seq reads and reported in the RNA-Seq alignments section.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species known RefSeq (NM_/NR_)	148	146 (98.65%)	141 (95.27%)	99.42%	99.44%
Same-species Genbank	1,390	1,342 (96.55%)	984 (70.79%)	99.09%	92.45%
Same-species EST	5,247	3,993 (76.10%)	3,151 (60.05%)	98.69%	97.20%

RNA-Seq alignments

The following RNA-Seq reads from the Sequence Read Archive were also used for gene prediction:

Hide alignments statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Publication	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	NA	Aggregate of all aligned samples	9,472,138,472	76%	42%	302,813
SAMN05929735	NA	pectoral muscle (Anas platyrhynchos, E27, SAMN05929735)	42,247,472	72%	38%	160,926
SAMN05929736	NA	pectoral muscle (Anas platyrhynchos, E27, SAMN05929736)	47,513,284	73%	37%	165,117
SAMN05929737	NA	pectoral muscle (Anas platyrhynchos, 5 days post-hatching, SAMN05929737)	47,936,036	72%	39%	166,109
SAMN05929770	NA	pectoral muscle (Anas platyrhynchos, 5 days post-hatching, SAMN05929770)	45,789,270	74%	39%	165,621
SAMN05929771	NA	pectoral muscle (Anas platyrhynchos, E27, SAMN05929771)	49,324,100	73%	37%	166,992
SAMN05929772	NA	pectoral muscle (Anas platyrhynchos, E21, SAMN05929772)	39,680,224	79%	36%	170,526
SAMN05929773	NA	pectoral muscle (Anas platyrhynchos, E21, SAMN05929773)	48,457,180	79%	36%	174,464
SAMN05929774	NA	pectoral muscle (Anas platyrhynchos, E21, SAMN05929774)	59,657,382	79%	36%	179,328
SAMN05929775	NA	pectoral muscle (Anas platyrhynchos, 5 days post-hatching, SAMN05929775)	46,367,826	78%	39%	170,029
SAMN05929776	NA	pectoral muscle (Anas platyrhynchos, 5 days post-hatching, SAMN05929776)	48,616,526	75%	39%	168,919
SAMN05929777	NA	pectoral muscle (Anas platyrhynchos, 5 days post-hatching, SAMN05929777)	46,763,316	77%	39%	169,669
SAMN05929778	NA	pectoral muscle (Anas platyrhynchos, E27, SAMN05929778)	56,950,116	75%	39%	168,812
SAMN05929779	NA	pectoral muscle (Anas platyrhynchos, E27, SAMN05929779)	45,334,460	74%	37%	165,134
SAMN05929780	NA	pectoral muscle (Anas platyrhynchos, E27, SAMN05929780)	47,893,660	75%	37%	165,196
SAMN05929781	NA	pectoral muscle (Anas platyrhynchos, E21, SAMN05929781)	46,366,420	79%	37%	173,242
SAMN05929782	NA	pectoral muscle (Anas platyrhynchos, E21, SAMN05929782)	40,470,122	79%	38%	169,991
SAMN05929783	NA	pectoral muscle (Anas platyrhynchos, E21, SAMN05929783)	43,875,544	79%	37%	172,004
SAMN05929784	NA	pectoral muscle (Anas platyrhynchos, 5 days post-hatching, SAMN05929784)	46,220,546	74%	39%	166,732
SAMN06131559	NA	ovary (Anas platyrhynchos, 340 days, SAMN06131559)	92,273,954	77%	38%	227,868
SAMN06131560	NA	ovary (Anas platyrhynchos, 340 days, SAMN06131560)	96,739,366	76%	40%	229,944
SAMN06131561	NA	ovary (Anas platyrhynchos, 340 days, SAMN06131561)	104,755,624	67%	40%	225,549
SAMN06131562	NA	ovary (Anas platyrhynchos, 340 days, SAMN06131562)	98,032,380	75%	38%	230,235
SAMN06131564	NA	ovary (Anas platyrhynchos, 340 days, SAMN06131564)	85,682,124	78%	39%	228,898
SAMN06131565	NA	ovary (Anas platyrhynchos, 340 days, SAMN06131565)	120,202,132	75%	38%	229,803
SAMN08449642	30318291	somatic reprogrammed cells, (Anas platyrhynchos, SAMN08449642)	50,850,880	80%	28%	159,032
SAMN08449643	30318291	embryonic fibroblasts, (Anas platyrhynchos, SAMN08449643)	64,402,448	75%	18%	137,019
SAMN08449644	30318291	blastodermal cells, (Anas platyrhynchos, SAMN08449644)	72,316,080	53%	12%	133,938
SAMN08449645	30318291	blastodermal cells, (Anas platyrhynchos, SAMN08449645)	69,153,112	49%	17%	132,736
SAMN08449646	30318291	embryonic fibroblasts, (Anas platyrhynchos, SAMN08449646)	84,676,600	83%	21%	163,456
SAMN08449647	30318291	somatic reprogrammed cells, (Anas platyrhynchos, SAMN08449647)	79,256,620	78%	27%	170,150
SAMN08667434	NA	Liver (Anas platyrhynchos, male, SAMN08667434)	69,574,378	76%	52%	151,330
SAMN08667435	NA	Liver (Anas platyrhynchos, male, SAMN08667435)	86,256,452	78%	52%	156,025
SAMN08667436	NA	Liver (Anas platyrhynchos, male, SAMN08667436)	63,166,500	79%	52%	156,818
SAMN08667437	NA	Liver (Anas platyrhynchos, male, SAMN08667437)	73,128,306	84%	56%	162,004
SAMN08667438	NA	Liver (Anas platyrhynchos, male, SAMN08667438)	70,861,430	84%	54%	159,721
SAMN08667439	NA	Liver (Anas platyrhynchos, male, SAMN08667439)	68,653,432	83%	54%	159,527
SAMN08667440	NA	Liver (Anas platyrhynchos, male, SAMN08667440)	56,781,984	81%	56%	152,591
SAMN08667441	NA	Liver (Anas platyrhynchos, male, SAMN08667441)	68,298,598	81%	56%	155,130
SAMN08667442	NA	Liver (Anas platyrhynchos, male, SAMN08667442)	83,351,864	78%	53%	159,771
SAMN09976275	NA	subcutaneous adipose (Anas platyrhynchos, not determined, SAMN09976275)	1,908,767,014	75%	45%	241,803
SAMN10022760	NA	duodenum (Anas platyrhynchos, 60 days, female, SAMN10022760)	73,125,470	78%	43%	179,530
SAMN10022761	NA	duodenum (Anas platyrhynchos, 61 days, female, SAMN10022761)	73,684,340	77%	42%	183,702
SAMN10022762	NA	duodenum (Anas platyrhynchos, 62 days, female, SAMN10022762)	66,154,828	76%	42%	181,359
SAMN10022763	NA	duodenum (Anas platyrhynchos, 63 days, female, SAMN10022763)	65,376,768	76%	42%	181,742
SAMN10022764	NA	duodenum (Anas platyrhynchos, 64 days, female, SAMN10022764)	67,326,318	75%	41%	186,462
SAMN10022765	NA	duodenum (Anas platyrhynchos, 65 days, female, SAMN10022765)	60,511,912	75%	41%	183,397
SAMN10022766	NA	jejunum (Anas platyrhynchos, 66 days, female, SAMN10022766)	67,822,662	76%	42%	182,316
SAMN10022767	NA	jejunum (Anas platyrhynchos, 67 days, female, SAMN10022767)	65,484,404	76%	42%	184,171
SAMN10022768	NA	jejunum (Anas platyrhynchos, 68 days, female, SAMN10022768)	65,166,726	76%	42%	181,774
SAMN10022769	NA	jejunum (Anas platyrhynchos, 69 days, female, SAMN10022769)	63,470,418	77%	40%	185,125
SAMN10022770	NA	jejunum (Anas platyrhynchos, 70 days, female, SAMN10022770)	68,388,962	76%	41%	182,476
SAMN10022771	NA	jejunum (Anas platyrhynchos, 71 days, female, SAMN10022771)	64,449,210	76%	42%	181,567
SAMN10022772	NA	ileum (Anas platyrhynchos, 72 days, female, SAMN10022772)	70,635,206	77%	42%	188,374
SAMN10022773	NA	ileum (Anas platyrhynchos, 73 days, female, SAMN10022773)	71,477,382	79%	41%	185,116
SAMN10022774	NA	ileum (Anas platyrhynchos, 74 days, female, SAMN10022774)	70,974,922	79%	41%	185,467
SAMN10022775	NA	ileum (Anas platyrhynchos, 75 days, female, SAMN10022775)	71,531,484	79%	40%	190,607
SAMN10022776	NA	ileum (Anas platyrhynchos, 76 days, female, SAMN10022776)	67,092,032	79%	40%	182,076
SAMN10022777	NA	ileum (Anas platyrhynchos, 77 days, female, SAMN10022777)	63,647,252	78%	41%	183,083
SAMN10240507	NA	liver (Anas platyrhynchos, 300 days, female, SAMN10240507)	62,188,514	81%	48%	161,546
SAMN10240508	NA	liver (Anas platyrhynchos, 300 days, female, SAMN10240508)	58,230,552	84%	51%	154,891
SAMN10240509	NA	liver (Anas platyrhynchos, 300 days, female, SAMN10240509)	56,256,120	81%	49%	156,072
SAMN10240510	NA	liver (Anas platyrhynchos, 300 days, female, SAMN10240510)	58,527,830	84%	51%	156,919
SAMN10240511	NA	liver (Anas platyrhynchos, 300 days, female, SAMN10240511)	61,087,080	82%	48%	160,661
SAMN10240512	NA	liver (Anas platyrhynchos, 300 days, female, SAMN10240512)	53,315,708	82%	50%	157,125
SAMN10240513	NA	ovary (Anas platyrhynchos, 300 days, female, SAMN10240513)	56,609,726	83%	36%	217,855
SAMN10240514	NA	ovary (Anas platyrhynchos, 300 days, female, SAMN10240514)	48,330,944	82%	35%	213,046
SAMN10240515	NA	ovary (Anas platyrhynchos, 300 days, female, SAMN10240515)	57,968,174	83%	36%	222,741
SAMN10240516	NA	ovary (Anas platyrhynchos, 300 days, female, SAMN10240516)	58,306,252	83%	37%	219,800
SAMN10240517	NA	ovary (Anas platyrhynchos, 300 days, female, SAMN10240517)	61,615,204	83%	36%	220,585
SAMN10240518	NA	ovary (Anas platyrhynchos, 300 days, female, SAMN10240518)	69,560,418	84%	35%	220,230
SAMN10537197	NA	spleen (Anas platyrhynchos, six month, female, SAMN10537197)	65,567,242	81%	39%	177,962
SAMN10537198	NA	spleen (Anas platyrhynchos, six month, female, SAMN10537198)	69,451,288	79%	39%	180,033
SAMN10537199	NA	spleen (Anas platyrhynchos, six month, female, SAMN10537199)	71,435,520	78%	40%	179,459
SAMN10537200	NA	spleen (Anas platyrhynchos, six month, female, SAMN10537200)	75,457,384	78%	40%	182,060
SAMN10537201	NA	spleen (Anas platyrhynchos, six month, female, SAMN10537201)	69,008,260	81%	39%	172,301
SAMN10537202	NA	spleen (Anas platyrhynchos, six month, female, SAMN10537202)	70,789,526	75%	31%	168,282
SAMN10537203	NA	spleen (Anas platyrhynchos, six month, female, SAMN10537203)	74,295,262	78%	40%	177,706
SAMN10537204	NA	spleen (Anas platyrhynchos, six month, female, SAMN10537204)	71,637,864	81%	41%	179,709
SAMN12837616	NA	heart (Anas platyrhynchos, male, SAMN12837616)	60,551,710	63%	47%	155,668
SAMN12837617	NA	heart (Anas platyrhynchos, male, SAMN12837617)	60,289,600	63%	47%	155,217
SAMN12837618	NA	heart (Anas platyrhynchos, male, SAMN12837618)	73,781,668	62%	46%	160,012
SAMN12837619	NA	heart (Anas platyrhynchos, male, SAMN12837619)	66,727,436	64%	47%	161,909
SAMN12837620	NA	heart (Anas platyrhynchos, male, SAMN12837620)	63,985,318	71%	49%	164,117
SAMN12837621	NA	heart (Anas platyrhynchos, male, SAMN12837621)	76,769,352	69%	48%	169,273
SAMN12837622	NA	kindey (Anas platyrhynchos, male, SAMN12837622)	51,994,972	77%	46%	167,742
SAMN12837623	NA	kindey (Anas platyrhynchos, male, SAMN12837623)	51,151,534	77%	46%	170,950
SAMN12837624	NA	kindey (Anas platyrhynchos, male, SAMN12837624)	50,112,036	80%	47%	152,539
SAMN12837625	NA	kindey (Anas platyrhynchos, male, SAMN12837625)	50,185,624	79%	45%	169,422
SAMN12837626	NA	kindey (Anas platyrhynchos, male, SAMN12837626)	54,662,328	79%	46%	172,448
SAMN12837627	NA	kindey (Anas platyrhynchos, male, SAMN12837627)	54,457,776	79%	45%	175,135
SAMN12837628	NA	spleen (Anas platyrhynchos, male, SAMN12837628)	52,161,486	79%	45%	171,253
SAMN12837629	NA	spleen (Anas platyrhynchos, male, SAMN12837629)	51,060,614	80%	50%	167,146
SAMN12837630	NA	spleen (Anas platyrhynchos, male, SAMN12837630)	49,400,804	80%	45%	170,749
SAMN12837631	NA	spleen (Anas platyrhynchos, male, SAMN12837631)	50,436,938	78%	46%	170,443
SAMN12837632	NA	spleen (Anas platyrhynchos, male, SAMN12837632)	49,928,442	80%	47%	168,948
SAMN12837633	NA	spleen (Anas platyrhynchos, male, SAMN12837633)	48,901,752	79%	44%	171,694
SAMN13689808	32244328	brain (Anas platyrhynchos, 10 day old, SAMN13689808)	50,284,812	78%	29%	178,719
SAMN13689809	32244328	brain (Anas platyrhynchos, 10 day old, SAMN13689809)	40,835,734	79%	27%	177,200
SAMN13689810	32244328	brain (Anas platyrhynchos, 10 day old, SAMN13689810)	58,372,706	77%	31%	181,032
SAMN13689811	32244328	brain (Anas platyrhynchos, 10 day old, SAMN13689811)	52,859,536	78%	27%	181,661
SAMN13689812	32244328	brain (Anas platyrhynchos, 10 day old, SAMN13689812)	50,060,388	78%	28%	181,498
SAMN13689813	32244328	brain (Anas platyrhynchos, 10 day old, SAMN13689813)	52,151,546	78%	28%	177,475
SAMN14057052	NA	muscle (Anas platyrhynchos, 6week, male, SAMN14057052)	53,154,850	69%	47%	157,097
SAMN14057053	NA	muscle (Anas platyrhynchos, 6week, male, SAMN14057053)	77,003,220	69%	46%	160,747
SAMN14057054	NA	muscle (Anas platyrhynchos, 6week, male, SAMN14057054)	41,696,710	71%	46%	153,678
SAMN14057055	NA	muscle (Anas platyrhynchos, 6week, male, SAMN14057055)	42,245,120	71%	47%	155,094
SAMN14057056	NA	muscle (Anas platyrhynchos, 6week, male, SAMN14057056)	56,010,568	70%	43%	158,463
SAMN14057057	NA	muscle (Anas platyrhynchos, 6week, male, SAMN14057057)	41,035,150	71%	46%	154,663
SAMN14057058	NA	bone (Anas platyrhynchos, 6week, male, SAMN14057058)	52,801,406	79%	46%	157,397
SAMN14057059	NA	bone (Anas platyrhynchos, 6week, male, SAMN14057059)	53,753,048	77%	42%	152,585
SAMN14057060	NA	bone (Anas platyrhynchos, 6week, male, SAMN14057060)	62,487,466	83%	46%	161,664
SAMN14057061	NA	bone (Anas platyrhynchos, 6week, male, SAMN14057061)	40,761,828	83%	41%	162,535
SAMN14057062	NA	bone (Anas platyrhynchos, 6week, male, SAMN14057062)	50,161,898	79%	46%	160,159
SAMN14057063	NA	bone (Anas platyrhynchos, 6week, male, SAMN14057063)	62,912,236	82%	41%	166,701
SAMN16251828	NA	leg muscle (Anas platyrhynchos, day 17 of embryo, female, SAMN16251828)	53,528,944	71%	46%	160,882
SAMN16251829	NA	leg muscle (Anas platyrhynchos, day 17 of embryo, female, SAMN16251829)	64,049,872	72%	46%	168,967
SAMN16251830	NA	leg muscle (Anas platyrhynchos, day 17 of embryo, female, SAMN16251830)	47,188,640	66%	42%	154,721
SAMN16251831	NA	leg muscle (Anas platyrhynchos, day 21 of embryo, female, SAMN16251831)	53,101,146	74%	46%	163,830
SAMN16251832	NA	leg muscle (Anas platyrhynchos, day 21 of embryo, female, SAMN16251832)	66,211,580	73%	46%	169,119
SAMN16251833	NA	leg muscle (Anas platyrhynchos, day 21 of embryo, female, SAMN16251833)	44,467,622	71%	45%	162,283
SAMN16251834	NA	leg muscle (Anas platyrhynchos, day 27 of embryo, female, SAMN16251834)	42,713,632	68%	42%	148,791
SAMN16251835	NA	leg muscle (Anas platyrhynchos, day 27 of embryo, female, SAMN16251835)	46,046,128	69%	42%	151,042
SAMN16251836	NA	leg muscle (Anas platyrhynchos, day 27 of embryo, female, SAMN16251836)	44,464,314	66%	39%	146,703
SAMN16251837	NA	leg muscle (Anas platyrhynchos, postnatal 6-month-old, female, SAMN16251837)	54,467,886	74%	47%	163,950
SAMN16251838	NA	leg muscle (Anas platyrhynchos, postnatal 6-month-old, female, SAMN16251838)	47,642,322	74%	46%	159,876
SAMN16251839	NA	leg muscle (Anas platyrhynchos, postnatal 6-month-old, female, SAMN16251839)	50,504,848	71%	45%	154,729

Show alignments statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
SRR4434794	SRX2254023	SRP091845	SAMN05929735	42,247,472	72%	38%
SRR4434793	SRX2254022	SRP091845	SAMN05929736	47,513,284	73%	37%
SRR4434797	SRX2254026	SRP091845	SAMN05929737	47,936,036	72%	39%
SRR4434796	SRX2254025	SRP091845	SAMN05929770	45,789,270	74%	39%
SRR4434792	SRX2254021	SRP091845	SAMN05929771	49,324,100	73%	37%
SRR4434791	SRX2254020	SRP091845	SAMN05929772	39,680,224	79%	36%
SRR4434790	SRX2254019	SRP091845	SAMN05929773	48,457,180	79%	36%
SRR4434789	SRX2254018	SRP091845	SAMN05929774	59,657,382	79%	36%
SRR4434788	SRX2254017	SRP091845	SAMN05929775	46,367,826	78%	39%
SRR4434787	SRX2254016	SRP091845	SAMN05929776	48,616,526	75%	39%
SRR4434786	SRX2254015	SRP091845	SAMN05929777	46,763,316	77%	39%
SRR4434785	SRX2254014	SRP091845	SAMN05929778	56,950,116	75%	39%
SRR4434784	SRX2254013	SRP091845	SAMN05929779	45,334,460	74%	37%
SRR4434783	SRX2254012	SRP091845	SAMN05929780	47,893,660	75%	37%
SRR4434782	SRX2254011	SRP091845	SAMN05929781	46,366,420	79%	37%
SRR4434781	SRX2254010	SRP091845	SAMN05929782	40,470,122	79%	38%
SRR4434780	SRX2254009	SRP091845	SAMN05929783	43,875,544	79%	37%
SRR4434795	SRX2254024	SRP091845	SAMN05929784	46,220,546	74%	39%
SRR5098023	SRX2414823	SRP094933	SAMN06131559	92,273,954	77%	38%
SRR5098022	SRX2414822	SRP094933	SAMN06131560	96,739,366	76%	40%
SRR5098029	SRX2414829	SRP094933	SAMN06131561	104,755,624	67%	40%
SRR5098028	SRX2414828	SRP094933	SAMN06131562	98,032,380	75%	38%
SRR5098026	SRX2414826	SRP094933	SAMN06131564	85,682,124	78%	39%
SRR5098025	SRX2414825	SRP094933	SAMN06131565	120,202,132	75%	38%
SRR6660792	SRX3637889	SRP131933	SAMN08449642	50,850,880	80%	28%
SRR6660797	SRX3637894	SRP131933	SAMN08449643	64,402,448	75%	18%
SRR6660796	SRX3637893	SRP131933	SAMN08449644	72,316,080	53%	12%
SRR6660795	SRX3637892	SRP131933	SAMN08449645	69,153,112	49%	17%
SRR6660794	SRX3637891	SRP131933	SAMN08449646	84,676,600	83%	21%
SRR6660793	SRX3637890	SRP131933	SAMN08449647	79,256,620	78%	27%
SRR6820460	SRX3777310	SRP134223	SAMN08667434	69,574,378	76%	52%
SRR6820461	SRX3777309	SRP134223	SAMN08667435	86,256,452	78%	52%
SRR6820458	SRX3777312	SRP134223	SAMN08667436	63,166,500	79%	52%
SRR6820459	SRX3777311	SRP134223	SAMN08667437	73,128,306	84%	56%
SRR6820464	SRX3777306	SRP134223	SAMN08667438	70,861,430	84%	54%
SRR6820465	SRX3777305	SRP134223	SAMN08667439	68,653,432	83%	54%
SRR6820462	SRX3777308	SRP134223	SAMN08667440	56,781,984	81%	56%
SRR6820463	SRX3777307	SRP134223	SAMN08667441	68,298,598	81%	56%
SRR6820457	SRX3777313	SRP134223	SAMN08667442	83,351,864	78%	53%
SRR7791702	SRX4646720	SRP159776	SAMN09976275	55,646,436	75%	45%
SRR7791701	SRX4646721	SRP159776	SAMN09976275	53,816,862	71%	45%
SRR7791700	SRX4646722	SRP159776	SAMN09976275	41,884,454	69%	45%
SRR7791699	SRX4646723	SRP159776	SAMN09976275	55,515,470	71%	45%
SRR7791698	SRX4646724	SRP159776	SAMN09976275	47,998,700	74%	45%
SRR7791697	SRX4646725	SRP159776	SAMN09976275	48,214,506	67%	44%
SRR7791696	SRX4646726	SRP159776	SAMN09976275	54,286,950	74%	45%
SRR7791695	SRX4646727	SRP159776	SAMN09976275	52,713,266	72%	47%
SRR7791694	SRX4646728	SRP159776	SAMN09976275	54,944,306	76%	45%
SRR7791693	SRX4646729	SRP159776	SAMN09976275	51,052,308	72%	44%
SRR7791692	SRX4646730	SRP159776	SAMN09976275	53,636,952	74%	46%
SRR7791691	SRX4646731	SRP159776	SAMN09976275	55,808,862	75%	45%
SRR7791690	SRX4646732	SRP159776	SAMN09976275	58,911,204	79%	46%
SRR7791689	SRX4646733	SRP159776	SAMN09976275	53,438,356	78%	46%
SRR7791688	SRX4646734	SRP159776	SAMN09976275	57,938,570	78%	45%
SRR7791687	SRX4646735	SRP159776	SAMN09976275	53,449,772	77%	46%
SRR7791686	SRX4646736	SRP159776	SAMN09976275	50,267,862	75%	44%
SRR7791685	SRX4646737	SRP159776	SAMN09976275	49,284,918	76%	44%
SRR7791684	SRX4646738	SRP159776	SAMN09976275	53,606,662	76%	45%
SRR7791683	SRX4646739	SRP159776	SAMN09976275	55,623,384	75%	45%
SRR7791682	SRX4646740	SRP159776	SAMN09976275	59,358,582	77%	45%
SRR7791681	SRX4646741	SRP159776	SAMN09976275	57,257,328	77%	45%
SRR7791680	SRX4646742	SRP159776	SAMN09976275	55,982,502	72%	44%
SRR7791679	SRX4646743	SRP159776	SAMN09976275	46,399,166	75%	44%
SRR7791678	SRX4646744	SRP159776	SAMN09976275	47,548,734	78%	46%
SRR7791677	SRX4646745	SRP159776	SAMN09976275	49,337,540	78%	45%
SRR7791676	SRX4646746	SRP159776	SAMN09976275	62,987,634	77%	45%
SRR7791675	SRX4646747	SRP159776	SAMN09976275	53,942,062	77%	44%
SRR7791674	SRX4646748	SRP159776	SAMN09976275	47,060,890	77%	46%
SRR7791673	SRX4646749	SRP159776	SAMN09976275	51,694,728	77%	45%
SRR7791672	SRX4646750	SRP159776	SAMN09976275	56,287,654	78%	46%
SRR7791671	SRX4646751	SRP159776	SAMN09976275	56,444,216	79%	45%
SRR7791670	SRX4646752	SRP159776	SAMN09976275	52,170,718	77%	46%
SRR7791669	SRX4646753	SRP159776	SAMN09976275	51,195,074	77%	45%
SRR7791668	SRX4646754	SRP159776	SAMN09976275	51,796,438	76%	46%
SRR7791667	SRX4646755	SRP159776	SAMN09976275	51,263,948	78%	45%
SRR7811372	SRX4663014	SRP160428	SAMN10022760	73,125,470	78%	43%
SRR7811373	SRX4663013	SRP160428	SAMN10022761	73,684,340	77%	42%
SRR7811367	SRX4663019	SRP160428	SAMN10022762	66,154,828	76%	42%
SRR7811370	SRX4663016	SRP160428	SAMN10022763	65,376,768	76%	42%
SRR7811380	SRX4663006	SRP160428	SAMN10022764	67,326,318	75%	41%
SRR7811384	SRX4663002	SRP160428	SAMN10022765	60,511,912	75%	41%
SRR7811382	SRX4663004	SRP160428	SAMN10022766	67,822,662	76%	42%
SRR7811371	SRX4663015	SRP160428	SAMN10022767	65,484,404	76%	42%
SRR7811369	SRX4663017	SRP160428	SAMN10022768	65,166,726	76%	42%
SRR7811368	SRX4663018	SRP160428	SAMN10022769	63,470,418	77%	40%
SRR7811376	SRX4663010	SRP160428	SAMN10022770	68,388,962	76%	41%
SRR7811377	SRX4663009	SRP160428	SAMN10022771	64,449,210	76%	42%
SRR7811378	SRX4663008	SRP160428	SAMN10022772	70,635,206	77%	42%
SRR7811379	SRX4663007	SRP160428	SAMN10022773	71,477,382	79%	41%
SRR7811381	SRX4663005	SRP160428	SAMN10022774	70,974,922	79%	41%
SRR7811383	SRX4663003	SRP160428	SAMN10022775	71,531,484	79%	40%
SRR7811374	SRX4663012	SRP160428	SAMN10022776	67,092,032	79%	40%
SRR7811375	SRX4663011	SRP160428	SAMN10022777	63,647,252	78%	41%
SRR8053817	SRX4883418	SRP165719	SAMN10240507	62,188,514	81%	48%
SRR8053818	SRX4883417	SRP165719	SAMN10240508	58,230,552	84%	51%
SRR8053819	SRX4883416	SRP165719	SAMN10240509	56,256,120	81%	49%
SRR8053820	SRX4883415	SRP165719	SAMN10240510	58,527,830	84%	51%
SRR8053824	SRX4883411	SRP165719	SAMN10240511	61,087,080	82%	48%
SRR8053821	SRX4883414	SRP165719	SAMN10240512	53,315,708	82%	50%
SRR8053815	SRX4883420	SRP165719	SAMN10240513	56,609,726	83%	36%
SRR8053816	SRX4883419	SRP165719	SAMN10240514	48,330,944	82%	35%
SRR8053822	SRX4883413	SRP165719	SAMN10240515	57,968,174	83%	36%
SRR8053823	SRX4883412	SRP165719	SAMN10240516	58,306,252	83%	37%
SRR8053813	SRX4883422	SRP165719	SAMN10240517	61,615,204	83%	36%
SRR8053814	SRX4883421	SRP165719	SAMN10240518	69,560,418	84%	35%
SRR8296268	SRX5110772	SRP172981	SAMN10537197	65,567,242	81%	39%
SRR8296269	SRX5110771	SRP172981	SAMN10537198	69,451,288	79%	39%
SRR8296270	SRX5110770	SRP172981	SAMN10537199	71,435,520	78%	40%
SRR8296271	SRX5110769	SRP172981	SAMN10537200	75,457,384	78%	40%
SRR8296264	SRX5110776	SRP172981	SAMN10537201	69,008,260	81%	39%
SRR8296265	SRX5110775	SRP172981	SAMN10537202	70,789,526	75%	31%
SRR8296266	SRX5110774	SRP172981	SAMN10537203	74,295,262	78%	40%
SRR8296267	SRX5110773	SRP172981	SAMN10537204	71,637,864	81%	41%
SRR10176886	SRX6898300	SRP223134	SAMN12837616	60,551,710	63%	47%
SRR10176885	SRX6898301	SRP223134	SAMN12837617	60,289,600	63%	47%
SRR10176894	SRX6898292	SRP223134	SAMN12837618	73,781,668	62%	46%
SRR10176893	SRX6898293	SRP223134	SAMN12837619	66,727,436	64%	47%
SRR10176892	SRX6898294	SRP223134	SAMN12837620	63,985,318	71%	49%
SRR10176891	SRX6898295	SRP223134	SAMN12837621	76,769,352	69%	48%
SRR10176890	SRX6898296	SRP223134	SAMN12837622	51,994,972	77%	46%
SRR10176889	SRX6898297	SRP223134	SAMN12837623	51,151,534	77%	46%
SRR10176888	SRX6898298	SRP223134	SAMN12837624	50,112,036	80%	47%
SRR10176887	SRX6898299	SRP223134	SAMN12837625	50,185,624	79%	45%
SRR10176884	SRX6898302	SRP223134	SAMN12837626	54,662,328	79%	46%
SRR10176883	SRX6898303	SRP223134	SAMN12837627	54,457,776	79%	45%
SRR10176900	SRX6898286	SRP223134	SAMN12837628	52,161,486	79%	45%
SRR10176899	SRX6898287	SRP223134	SAMN12837629	51,060,614	80%	50%
SRR10176898	SRX6898288	SRP223134	SAMN12837630	49,400,804	80%	45%
SRR10176897	SRX6898289	SRP223134	SAMN12837631	50,436,938	78%	46%
SRR10176896	SRX6898290	SRP223134	SAMN12837632	49,928,442	80%	47%
SRR10176895	SRX6898291	SRP223134	SAMN12837633	48,901,752	79%	44%
SRR10775276	SRX7449194	SRP238901	SAMN13689808	50,284,812	78%	29%
SRR10775275	SRX7449193	SRP238901	SAMN13689809	40,835,734	79%	27%
SRR10775274	SRX7449192	SRP238901	SAMN13689810	58,372,706	77%	31%
SRR10775273	SRX7449191	SRP238901	SAMN13689811	52,859,536	78%	27%
SRR10775272	SRX7449190	SRP238901	SAMN13689812	50,060,388	78%	28%
SRR10775271	SRX7449189	SRP238901	SAMN13689813	52,151,546	78%	28%
SRR11051697	SRX7701600	SRP247948	SAMN14057052	53,154,850	69%	47%
SRR11051696	SRX7701601	SRP247948	SAMN14057053	77,003,220	69%	46%
SRR11051693	SRX7701604	SRP247948	SAMN14057054	41,696,710	71%	46%
SRR11051692	SRX7701605	SRP247948	SAMN14057055	42,245,120	71%	47%
SRR11051691	SRX7701606	SRP247948	SAMN14057056	56,010,568	70%	43%
SRR11051690	SRX7701607	SRP247948	SAMN14057057	41,035,150	71%	46%
SRR11051689	SRX7701608	SRP247948	SAMN14057058	52,801,406	79%	46%
SRR11051688	SRX7701609	SRP247948	SAMN14057059	53,753,048	77%	42%
SRR11051687	SRX7701610	SRP247948	SAMN14057060	62,487,466	83%	46%
SRR11051686	SRX7701611	SRP247948	SAMN14057061	40,761,828	83%	41%
SRR11051695	SRX7701602	SRP247948	SAMN14057062	50,161,898	79%	46%
SRR11051694	SRX7701603	SRP247948	SAMN14057063	62,912,236	82%	41%
SRR12701877	SRX9181356	SRP285179	SAMN16251828	53,528,944	71%	46%
SRR12701876	SRX9181357	SRP285179	SAMN16251829	64,049,872	72%	46%
SRR12701873	SRX9181360	SRP285179	SAMN16251830	47,188,640	66%	42%
SRR12701872	SRX9181361	SRP285179	SAMN16251831	53,101,146	74%	46%
SRR12701871	SRX9181362	SRP285179	SAMN16251832	66,211,580	73%	46%
SRR12701870	SRX9181363	SRP285179	SAMN16251833	44,467,622	71%	45%
SRR12701869	SRX9181364	SRP285179	SAMN16251834	42,713,632	68%	42%
SRR12701868	SRX9181365	SRP285179	SAMN16251835	46,046,128	69%	42%
SRR12701867	SRX9181366	SRP285179	SAMN16251836	44,464,314	66%	39%
SRR12701866	SRX9181367	SRP285179	SAMN16251837	54,467,886	74%	47%
SRR12701875	SRX9181358	SRP285179	SAMN16251838	47,642,322	74%	46%
SRR12701874	SRX9181359	SRP285179	SAMN16251839	50,504,848	71%	45%

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Aves GenBank	15,130	8,281 (54.73%)	8,281 (54.73%)	74.19%	83.93%
Aves known RefSeq (NP_)	7,926	7,520 (94.88%)	7,520 (94.88%)	77.77%	85.38%
Gallus gallus high-quality model RefSeq (XP_)	9,464	9,074 (95.88%)	9,074 (95.88%)	77.27%	84.19%
Homo sapiens known RefSeq (NP_)	60,887	40,260 (66.12%)	40,260 (66.12%)	70.35%	76.03%

Assembly-assembly alignments of current to previous assembly

When the assembly changes between two rounds of annotation, genes in the current and the previous annotation are mapped to each other using the genomic alignments of the current assembly to the previous assembly so that gene identifiers can be preserved. The success of the remapping depends largely on how well the two assembly versions align to each other.

Below are the percent coverage of one assembly by the other and the average percent identity of the alignments. The 'First pass' alignments are reciprocal best hits, while the 'Total' alignments also include 'Second pass' or non-reciprocal best alignments. For more information about the assembly-assembly alignment process, please visit the NCBI Genome Remapping Service page.

First Pass	Total
ZJU1.0 (Current) Coverage: 89.61%	ZJU1.0 (Current) Coverage: 91.75%
IASCAAS_PekingDuck_PBH1.5 (Previous) Coverage: 94.57%	IASCAAS_PekingDuck_PBH1.5 (Previous) Coverage: 97.53%
Percent Identity: 99.37%	Percent Identity: 99.27%

Comparison of the current and previous annotations

The annotation produced for this release (104) was compared to the annotation in the previous release (103) for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides links to the details of the comparison in tabular format and in a Genome Workbench project.

	ZJU1.0 (Current) to IASCAAS_PekingDuck_PBH1.5 (Previous)
Identical	3%
Minor changes	57%
Major changes	15%
New	23%
Deprecated	21%
Other	2%
Download the report	tabular, Genome Workbench

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 2:134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20

RefSeq

Integrated reference sequences