NCBI Oncorhynchus mykiss Annotation Release 101

The RefSeq genome records for Oncorhynchus mykiss were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Similarity of current and previous assembly: The similarity of the current and previous assembly
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Oncorhynchus mykiss Annotation Release 101

Annotation release ID: 101
Date of Entrez queries for transcripts and proteins: Oct 6 2020
Date of submission of annotation to the public databases: Oct 30 2020
Software version: 8.5

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
USDA_OmykA_1.1	GCF_013265735.2	USDA/ARS	09-23-2020	Reference	33 assembled chromosomes; unplaced scaffolds

Gene and feature statistics

Counts and length of annotated features are provided below for each assembly.

Feature counts

Feature	USDA_OmykA_1.1
Genes and pseudogenes	72,772
protein-coding	41,896
non-coding	28,006
transcribed pseudogenes	46
non-transcribed pseudogenes	2,666
genes with variants	21,619
immunoglobulin/T-cell receptor gene segments	157
other	1
mRNAs	97,731
fully-supported	95,237
with > 5% ab initio	1,236
partial	404
with filled gap(s)	46
known RefSeq (NM_)	1,216
model RefSeq (XM_)	96,515
non-coding RNAs	34,014
fully-supported	10,119
with > 5% ab initio	0
partial	5
with filled gap(s)	5
known RefSeq (NR_)	0
model RefSeq (XR_)	21,486
pseudo transcripts	53
fully-supported	41
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	53
CDSs	97,900
fully-supported	95,237
with > 5% ab initio	1,420
partial	350
with major correction(s)	480
known RefSeq (NP_)	1,228
model RefSeq (XP_)	96,515

Detailed reports

The counts below do not include pseudogenes.

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	69,903	18,975	4,422	53	1,190,608
All transcripts	131,745	3,071	2,550	53	100,803
mRNA	97,731	3,855	3,202	129	100,803
misc_RNA	4,082	3,473	3,008	144	28,295
tRNA	12,526	74	73	67	91
lncRNA	6,038	1,639	1,300	72	29,307
snoRNA	917	154	137	54	309
snRNA	1,356	138	141	53	197
guide_RNA	17	197	154	130	342
rRNA	9,078	278	119	117	3,982
Single-exon transcripts	1,399	1,903	1,596	291	29,149
coding transcripts (NM_/XM_ )	1,399	1,903	1,596	291	29,149
CDSs	97,744	2,270	1,611	96	99,522
Exons	526,477	327	145	1	29,149
in coding transcripts (NM_/XM_ )	503,767	318	144	1	29,149
in non-coding transcripts (NR_/XR_ )	45,984	359	150	2	10,230
Introns	467,265	3,111	496	26	1,018,851
in coding transcripts (NM_/XM_ )	452,276	3,137	506	26	1,018,851
in non-coding transcripts (NR_/XR_ )	37,911	2,670	403	30	690,030

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	2.08	1	1	50
Number of exons per transcript	12.2	9	1	240

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the UniProtKB/Swiss-Prot curated proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 41,883 coding genes, 38,418 genes had a protein with an alignment covering 50% or more of the query and 18,052 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.
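The two coverage definitions above can be illustrated with a minimal Python sketch. The function name `coverage_percent`, the 1-based inclusive coordinates and the example hit are assumptions made for illustration; they are not part of the annotation pipeline or of BLASTP output.

```python
def coverage_percent(aligned_start: int, aligned_end: int, sequence_length: int) -> float:
    """Percent of a sequence included in one alignment (1-based, inclusive coordinates)."""
    if sequence_length <= 0:
        raise ValueError("sequence_length must be positive")
    if not 1 <= aligned_start <= aligned_end <= sequence_length:
        raise ValueError("alignment coordinates must lie within the sequence")
    aligned_length = aligned_end - aligned_start + 1
    return 100.0 * aligned_length / sequence_length


# Hypothetical hit: residues 11-400 of a 400-residue annotated protein (query)
# align to residues 1-390 of a 520-residue Swiss-Prot protein (target).
query_cov = coverage_percent(11, 400, 400)
target_cov = coverage_percent(1, 390, 520)
print(f"query coverage:  {query_cov:.1f}%")
print(f"target coverage: {target_cov:.1f}%")
```

In this made-up case the same alignment covers most of the query but only three quarters of the target, which is why the report tracks the two values separately.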

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: UniProtKB/Swiss-Prot curated proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker for each assembly. RepeatMasker results are only used for organisms for which a comprehensive repeat library is available.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with RepeatMasker	% Masked with WindowMasker
USDA_OmykA_1.1	GCF_013265735.2	7.30%	52.10%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Depending on the other evidence available, long 454 reads (with average length above 250 nt) may be aligned as traditional evidence and reported in the Transcript alignments section or aligned with RNA-Seq reads and reported in the RNA-Seq alignments section.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species known RefSeq (NM_/NR_)	1,257	1,256 (99.92%)	1,205 (95.86%)	99.36%	99.20%
Same-species Genbank	5,649	5,589 (98.94%)	4,938 (87.41%)	99.14%	95.88%
Same-species EST	287,565	252,326 (87.75%)	230,896 (80.29%)	98.96%	98.66%
Salmo salar known RefSeq (NM_/NR_)	3,547	3,514 (99.07%)	2,222 (62.64%)	95.28%	97.40%
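The percentages in the table above appear to be computed relative to the number of sequences retrieved from Entrez, both for the Splign column and for the Gnomon column. The short Python check below reproduces them from the counts in the table; the helper name `percent_of_retrieved` is hypothetical.

```python
def percent_of_retrieved(count: int, retrieved: int) -> str:
    """Format count as a percentage of the retrieved sequences, two decimals."""
    return f"{100.0 * count / retrieved:.2f}%"


# (retrieved from Entrez, aligned by Splign, passed to Gnomon), copied from the table
transcript_sources = {
    "Same-species known RefSeq (NM_/NR_)": (1_257, 1_256, 1_205),
    "Same-species Genbank": (5_649, 5_589, 4_938),
    "Same-species EST": (287_565, 252_326, 230_896),
    "Salmo salar known RefSeq (NM_/NR_)": (3_547, 3_514, 2_222),
}

for source, (retrieved, aligned, passed) in transcript_sources.items():
    print(
        source,
        percent_of_retrieved(aligned, retrieved),
        percent_of_retrieved(passed, retrieved),
        sep="\t",
    )
```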

RefSeq transcript alignment quality report

The known RefSeq transcripts (NM_ and NR_ accessions) are a set of high-quality transcripts maintained by the RefSeq group at NCBI. Alignment statistics for this group of transcripts, such as percent and number of sequences not aligning at all, percent best alignments split between multiple scaffolds, and percent alignments not covering the full CDS are indicative of the genome quality and are provided below.

	USDA_OmykA_1.1 Primary Assembly
Number of sequences retrieved from Entrez	1,257
Number (%) of sequences not aligning	1 (0.08%)
Number (%) of sequences with multiple best alignments (split genes)	4 (0.32%)
Number (%) of sequences with CDS coverage < 95%	18 (1.43%)

RNA-Seq alignments

The following RNA-Seq reads from the Sequence Read Archive were also used for gene prediction:

Alignment statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Publication	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	NA	Aggregate of all aligned samples	6,888,224,975	76%	34%	582,277
SAMD00008810	23109217	Rainbow trout (Oncorhynchus mykiss) gonad (Oncorhynchus mykiss, SAMD00008810)	344,736	65%	25%	18,918
SAMD00009826	21712077	Rainbow trout adipose (Oncorhynchus mykiss, SAMD00009826)	33,131	64%	42%	3,175
SAMEA1462629	NA	male embryonic gonads (Oncorhynchus mykiss, male, SAMEA1462629)	413,634	63%	54%	64,525
SAMEA1462630	NA	female embryonic gonads (Oncorhynchus mykiss, female, SAMEA1462630)	415,493	62%	58%	66,890
SAMEA6786275	NA	brain (Oncorhynchus mykiss, not available, SAMEA6786275)	33,853,192	81%	27%	371,126
SAMEA6786276	NA	eye (Oncorhynchus mykiss, not available, SAMEA6786276)	32,274,250	84%	34%	353,677
SAMEA6786277	NA	gut (Oncorhynchus mykiss, not available, SAMEA6786277)	35,563,892	84%	39%	311,975
SAMEA6786278	NA	gill (Oncorhynchus mykiss, not available, SAMEA6786278)	32,324,782	82%	33%	344,974
SAMEA6786279	NA	heart (Oncorhynchus mykiss, not available, SAMEA6786279)	33,571,608	80%	40%	305,013
SAMEA6786280	NA	head kidney (Oncorhynchus mykiss, not available, SAMEA6786280)	40,767,124	86%	39%	314,228
SAMEA6786281	NA	kidney (Oncorhynchus mykiss, not available, SAMEA6786281)	35,849,764	82%	35%	362,916
SAMEA6786282	NA	liver (Oncorhynchus mykiss, not available, SAMEA6786282)	35,435,312	88%	43%	252,429
SAMEA6786283	NA	liver (Oncorhynchus mykiss, not available, SAMEA6786283)	35,871,482	87%	43%	266,660
SAMEA6786284	NA	liver (Oncorhynchus mykiss, not available, SAMEA6786284)	36,388,016	83%	39%	250,039
SAMEA6786285	NA	muscle (Oncorhynchus mykiss, not available, SAMEA6786285)	31,476,580	90%	54%	253,848
SAMEA6786286	NA	pyloric caeca (Oncorhynchus mykiss, not available, SAMEA6786286)	33,718,598	85%	42%	276,831
SAMEA6786287	NA	skin (Oncorhynchus mykiss, not available, SAMEA6786287)	33,385,204	84%	41%	334,515
SAMEA6786288	NA	spleen (Oncorhynchus mykiss, not available, SAMEA6786288)	31,222,190	85%	33%	311,458
SAMN00002956	20942956	Generic sample from Oncorhynchus mykiss (Oncorhynchus mykiss, SAMN00002956)	1,298,911	72%	30%	158,655
SAMN00139479	NA	Non-normalized cDNA from Oncorhynchus mykiss stage I testis (Oncorhynchus mykiss, SAMN00139479)	400,181	71%	42%	98,794
SAMN00139480	NA	Non-normalized cDNA from Oncorhynchus mykiss stage III testis (Oncorhynchus mykiss, SAMN00139480)	834,079	78%	46%	122,587
SAMN00139481	NA	Non-normalised cDNA from Oncorhynchus mykiss MOF brain (Oncorhynchus mykiss, SAMN00139481)	270,704	77%	29%	26,762
SAMN00139505	NA	Non-normalised cDNA from Oncorhynchus mykiss MOM brain (Oncorhynchus mykiss, SAMN00139505)	210,659	70%	24%	9,860
SAMN00139508	NA	Non-normalised cDNA from matured post-vitellogenic Oncorhynchus mykiss oocytes (Oncorhynchus mykiss, SAMN00139508)	366,966	80%	41%	82,392
SAMN00139517	NA	Non-normalised cDNA from non-matured post-vitellogenic Oncorhynchus mykiss oocytes (Oncorhynchus mykiss, SAMN00139517)	371,544	81%	42%	86,340
SAMN00139521	NA	Non-normalised cDNA from Oncorhynchus mykiss gills II (Oncorhynchus mykiss, SAMN00139521)	288,355	58%	32%	34,734
SAMN00139522	NA	Non-normalised cDNA from Oncorhynchus mykiss gills I (Oncorhynchus mykiss, SAMN00139522)	324,868	67%	36%	47,691
SAMN00210791	NA	cDNA from white muscle of Oncorhynchus mykiss adult (Oncorhynchus mykiss, SAMN00210791)	181,618	80%	73%	12,485
SAMN00210792	NA	cDNA from white muscle of Oncorhynchus mykiss adult (Oncorhynchus mykiss, SAMN00210792)	111,866	79%	76%	8,096
SAMN00210793	NA	cDNA from white muscle of Oncorhynchus mykiss adult (Oncorhynchus mykiss, SAMN00210793)	124,449	78%	75%	9,872
SAMN00210794	NA	cDNA from white muscle of Oncorhynchus mykiss adult (Oncorhynchus mykiss, SAMN00210794)	90,381	77%	77%	7,043
SAMN00210795	NA	cDNA from white muscle of Oncorhynchus mykiss adult (Oncorhynchus mykiss, SAMN00210795)	64,768	80%	78%	5,943
SAMN00210796	NA	cDNA from white muscle of Oncorhynchus mykiss adult (Oncorhynchus mykiss, SAMN00210796)	128,976	77%	74%	9,290
SAMN00210797	NA	cDNA from from 38 days larvae trunks (Oncorhynchus mykiss, SAMN00210797)	103,188	76%	73%	13,805
SAMN00210798	NA	cDNA from from 38 days larvae trunks (Oncorhynchus mykiss, SAMN00210798)	113,272	73%	69%	16,263
SAMN00210799	NA	cDNA from from 38 days larvae trunks (Oncorhynchus mykiss, SAMN00210799)	176,153	77%	78%	17,242
SAMN00210800	NA	cDNA from from 38 days larvae trunks (Oncorhynchus mykiss, SAMN00210800)	169,107	78%	72%	19,790
SAMN00210801	NA	cDNA from from 38 days larvae trunks (Oncorhynchus mykiss, SAMN00210801)	142,019	76%	75%	16,607
SAMN00210802	NA	cDNA from from 38 days larvae trunks (Oncorhynchus mykiss, SAMN00210802)	101,477	76%	72%	14,688
SAMN00631607	NA	gill, brain, liver, spleen, kidney and muscle (Oncorhynchus mykiss, SAMN00631607)	3,293,826	75%	59%	227,872
SAMN02178820	NA	whole embryo (Oncorhynchus mykiss, 15 days post fertilization, male, SAMN02178820)	96,077	61%	70%	33,133
SAMN02178821	NA	whole embryo (Oncorhynchus mykiss, 15 days post fertilization, male, SAMN02178821)	231,981	62%	63%	55,947
SAMN02178822	NA	whole embryo (Oncorhynchus mykiss, 15 days post fertilization, male, SAMN02178822)	64,002	58%	68%	24,962
SAMN02178823	NA	whole embryo (Oncorhynchus mykiss, 15 days post fertilization, male, SAMN02178823)	90,235	60%	69%	30,976
SAMN02178824	NA	whole embryo (Oncorhynchus mykiss, 15 days post fertilization, female, SAMN02178824)	88,186	58%	58%	28,353
SAMN02178825	NA	head kidney (Oncorhynchus mykiss, 1 year old, male, SAMN02178825, SAMN02178825)	95,196	55%	65%	23,680
SAMN02178826	NA	head kidney (Oncorhynchus mykiss, 1 year old, male, SAMN02178826, SAMN02178826)	108,170	46%	72%	20,789
SAMN02178827	NA	head kidney (Oncorhynchus mykiss, 1 year old, male, SAMN02178826, SAMN02178827)	126,150	47%	80%	20,457
SAMN02178829	NA	head kidney (Oncorhynchus mykiss, 1 year old, female, SAMN02178829, SAMN02178829)	197,793	48%	64%	31,751
SAMN02343402	NA	bone (Oncorhynchus mykiss, female, SAMN02343402)	652,133	47%	26%	38,047
SAMN05195102	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195102)	34,517,081	76%	14%	259,601
SAMN05195103	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195103)	30,439,400	79%	13%	280,944
SAMN05195104	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195104)	28,788,050	80%	13%	237,125
SAMN05195105	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195105)	30,964,667	80%	19%	326,442
SAMN05195106	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195106)	26,582,061	80%	16%	278,505
SAMN05195107	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195107)	30,968,411	78%	13%	254,169
SAMN05195108	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195108)	30,320,750	80%	16%	301,283
SAMN05195109	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195109)	34,590,705	80%	16%	322,032
SAMN05195110	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195110)	26,989,750	80%	15%	266,526
SAMN05195111	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195111)	27,959,436	79%	16%	294,376
SAMN05195112	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195112)	28,968,733	80%	16%	273,499
SAMN05195113	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195113)	27,893,893	79%	16%	272,577
SAMN05195114	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195114)	26,396,892	81%	14%	241,029
SAMN05195115	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195115)	30,445,934	80%	14%	239,985
SAMN05195116	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195116)	28,923,991	80%	16%	275,128
SAMN05195117	NA	brain (Oncorhynchus mykiss, not collected, not collected, SAMN05195117)	28,530,403	78%	13%	230,579
SAMN06640870	NA	Liver (Oncorhynchus mykiss, 10 months, female, SAMN06640870)	35,435,312	79%	34%	165,839
SAMN06640871	NA	Liver (Oncorhynchus mykiss, 10 months, female, SAMN06640871)	35,871,482	87%	43%	266,660
SAMN06640872	NA	Liver (Oncorhynchus mykiss, 10 months, male, SAMN06640872)	36,388,016	83%	39%	250,039
SAMN06640873	NA	Skin (Oncorhynchus mykiss, 10 months, female, SAMN06640873)	33,385,204	84%	41%	334,515
SAMN06640874	NA	Muscle (Oncorhynchus mykiss, 10 months, female, SAMN06640874)	31,476,580	82%	49%	241,865
SAMN06640875	NA	Hearth (Oncorhynchus mykiss, 10 months, female, SAMN06640875)	33,571,608	80%	40%	305,013
SAMN06640876	NA	Gut (Oncorhynchus mykiss, 10 months, female, SAMN06640876)	35,563,892	84%	39%	311,975
SAMN06640877	NA	Pyloric caeca (Oncorhynchus mykiss, 10 months, female, SAMN06640877)	33,718,598	80%	38%	268,048
SAMN06640878	NA	Kidney (Oncorhynchus mykiss, 10 months, female, SAMN06640878)	35,849,764	67%	19%	295,371
SAMN06640879	NA	Head kidney (Oncorhynchus mykiss, 10 months, female, SAMN06640879)	40,767,124	86%	39%	314,228
SAMN06640880	NA	Spleen (Oncorhynchus mykiss, 10 months, female, SAMN06640880)	31,222,190	85%	33%	311,458
SAMN06640881	NA	Gill (Oncorhynchus mykiss, 10 months, female, SAMN06640881)	32,324,782	82%	33%	344,974
SAMN06640882	NA	Eye (Oncorhynchus mykiss, 10 months, female, SAMN06640882)	32,274,250	79%	29%	349,380
SAMN06640883	NA	Brain (Oncorhynchus mykiss, 10 months, female, SAMN06640883)	33,853,192	75%	19%	348,043
SAMN07203781	25793877,26864089,26895175	Brain (Oncorhynchus mykiss, male, SAMN07203781)	84,816,430	75%	22%	410,385
SAMN07203782	25793877,26864089,26895175	Fat (Oncorhynchus mykiss, male, SAMN07203782)	93,546,068	77%	26%	354,908
SAMN07203783	25793877,26864089,26895175	Gill (Oncorhynchus mykiss, male, SAMN07203783)	92,670,670	75%	25%	379,734
SAMN07203784	25793877,26864089,26895175	Head kidney (Oncorhynchus mykiss, male, SAMN07203784)	92,168,818	63%	25%	323,445
SAMN07203785	25793877,26864089,26895175	Intestine (Oncorhynchus mykiss, male, SAMN07203785)	91,613,688	73%	25%	389,323
SAMN07203786	25793877,26864089,26895175	Kidney (Oncorhynchus mykiss, male, SAMN07203786)	89,642,288	66%	22%	369,156
SAMN07203787	25793877,26864089,26895175	Liver (Oncorhynchus mykiss, male, SAMN07203787)	85,281,910	80%	28%	296,169
SAMN07203788	25793877,26864089,26895175	Red muscle (Oncorhynchus mykiss, male, SAMN07203788)	93,064,168	66%	32%	331,331
SAMN07203789	25793877,26864089,26895175	Skin (Oncorhynchus mykiss, male, SAMN07203789)	87,743,778	71%	27%	371,463
SAMN07203790	25793877,26864089,26895175	Spleen (Oncorhynchus mykiss, male, SAMN07203790)	93,532,200	76%	23%	331,437
SAMN07203791	25793877,26864089,26895175	Stomach (Oncorhynchus mykiss, male, SAMN07203791)	91,231,186	74%	30%	329,349
SAMN07203792	25793877,26864089,26895175	Testis (Oncorhynchus mykiss, male, SAMN07203792)	85,389,746	83%	27%	394,921
SAMN07203793	25793877,26864089,26895175	White muscle (Oncorhynchus mykiss, male, SAMN07203793)	86,643,770	84%	36%	272,571
SAMN07203794	25793877,26864089,26895175	Pineal gland (Oncorhynchus mykiss, Not determined, SAMN07203794)	78,802,668	76%	26%	395,837
SAMN07203795	25793877,26864089,26895175	Oocyte (Oncorhynchus mykiss, female, SAMN07203795)	90,135,204	71%	27%	300,063
SAMN08580422	NA	blood, red blood cells, (Oncorhynchus mykiss, not determined, SAMN08580422)	921,902,640	52%	25%	188,465
SAMN09748259	NA	mature oocytes (Oncorhynchus mykiss, SAMN09748259)	129,094,962	70%	37%	281,694
SAMN09748260	NA	mature oocytes (Oncorhynchus mykiss, SAMN09748260)	131,078,494	72%	37%	287,664
SAMN09748261	NA	mature oocytes (Oncorhynchus mykiss, SAMN09748261)	131,762,542	76%	37%	284,740
SAMN09748262	NA	mature oocytes (Oncorhynchus mykiss, SAMN09748262)	117,142,210	75%	36%	281,993
SAMN09748263	NA	mature oocytes (Oncorhynchus mykiss, SAMN09748263)	124,100,404	69%	38%	286,331
SAMN09748264	NA	mature oocytes (Oncorhynchus mykiss, SAMN09748264)	105,311,062	74%	37%	281,966
SAMN09748265	NA	mature oocytes (Oncorhynchus mykiss, SAMN09748265)	109,168,128	70%	35%	271,807
SAMN09748266	NA	mature oocytes (Oncorhynchus mykiss, SAMN09748266)	108,770,596	63%	35%	265,548
SAMN09748267	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748267)	116,124,196	88%	41%	445,363
SAMN09748268	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748268)	110,761,042	88%	41%	443,007
SAMN09748269	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748269)	117,102,770	85%	40%	442,581
SAMN09748270	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748270)	114,133,758	86%	40%	441,139
SAMN09748271	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748271)	109,219,180	82%	40%	439,669
SAMN09748272	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748272)	99,984,838	88%	40%	438,448
SAMN09748273	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748273)	121,918,782	85%	40%	447,070
SAMN09748274	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748274)	115,327,802	85%	40%	446,252
SAMN09748275	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748275)	124,516,240	88%	41%	447,670
SAMN09748276	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748276)	127,064,432	88%	41%	449,242
SAMN09748277	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748277)	112,856,114	86%	40%	439,555
SAMN09748278	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748278)	123,724,148	85%	40%	447,902
SAMN09748279	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748279)	102,099,838	88%	41%	439,503
SAMN09748280	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748280)	106,744,570	85%	40%	440,490
SAMN09748281	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748281)	116,173,706	86%	41%	442,266
SAMN09748282	NA	embryos (Oncorhynchus mykiss, not determined, SAMN09748282)	112,969,476	87%	40%	435,635
SAMN09939439	NA	interbranchial lymphoid tissue (Oncorhynchus mykiss, not determined, SAMN09939439)	31,379,578	59%	25%	301,101
SAMN10853962	NA	Skeletal muscle (Oncorhynchus mykiss, SAMN10853962)	362,702,446	81%	42%	378,958

Alignment statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
DRR000835	DRX000493	DRP000322	SAMD00009826	33,131	64%	42%
DRR002162	DRX001594	DRP000599	SAMD00008810	344,736	65%	25%
ERR034907	ERX013225	ERP000696	SAMEA1462629	413,634	63%	54%
ERR034906	ERX013224	ERP000696	SAMEA1462630	415,493	62%	58%
ERR4029201	ERX4030517	ERP121186	SAMEA6786275	33,853,192	81%	27%
ERR4029202	ERX4030518	ERP121186	SAMEA6786276	32,274,250	84%	34%
ERR4029203	ERX4030519	ERP121186	SAMEA6786277	35,563,892	84%	39%
ERR4029204	ERX4030520	ERP121186	SAMEA6786278	32,324,782	82%	33%
ERR4029205	ERX4030521	ERP121186	SAMEA6786279	33,571,608	80%	40%
ERR4029206	ERX4030522	ERP121186	SAMEA6786280	40,767,124	86%	39%
ERR4029207	ERX4030523	ERP121186	SAMEA6786281	35,849,764	82%	35%
ERR4029208	ERX4030524	ERP121186	SAMEA6786282	35,435,312	88%	43%
ERR4029209	ERX4030525	ERP121186	SAMEA6786283	35,871,482	87%	43%
ERR4029210	ERX4030526	ERP121186	SAMEA6786284	36,388,016	83%	39%
ERR4029211	ERX4030527	ERP121186	SAMEA6786285	31,476,580	90%	54%
ERR4029212	ERX4030528	ERP121186	SAMEA6786286	33,718,598	85%	42%
ERR4029213	ERX4030529	ERP121186	SAMEA6786287	33,385,204	84%	41%
ERR4029214	ERX4030530	ERP121186	SAMEA6786288	31,222,190	85%	33%
SRR020739	SRX007396	SRP001007	SAMN00002956	642,488	72%	30%
SRR020740	SRX007396	SRP001007	SAMN00002956	656,423	71%	30%
SRR090448	SRX032910	SRP004756	SAMN00139479	400,181	71%	42%
SRR090449	SRX032911	SRP004756	SAMN00139480	300,661	77%	44%
SRR090450	SRX032911	SRP004756	SAMN00139480	533,418	78%	48%
SRR090451	SRX033005	SRP004756	SAMN00139481	270,704	77%	29%
SRR089816	SRX033006	SRP004756	SAMN00139505	32,149	54%	26%
SRR090454	SRX033006	SRP004756	SAMN00139505	178,510	73%	23%
SRR089808	SRX033001	SRP004756	SAMN00139508	366,966	80%	41%
SRR089807	SRX033002	SRP004756	SAMN00139517	371,544	81%	42%
SRR090452	SRX033004	SRP004756	SAMN00139521	288,355	58%	32%
SRR090453	SRX033003	SRP004756	SAMN00139522	324,868	67%	36%
SRR099272	SRX041526	SRP005674	SAMN00210791	181,618	80%	73%
SRR099273	SRX041527	SRP005674	SAMN00210792	111,866	79%	76%
SRR099274	SRX041528	SRP005674	SAMN00210793	124,449	78%	75%
SRR099275	SRX041529	SRP005674	SAMN00210794	90,381	77%	77%
SRR099515	SRX041530	SRP005674	SAMN00210795	64,768	80%	78%
SRR099516	SRX041531	SRP005674	SAMN00210796	128,976	77%	74%
SRR099517	SRX041532	SRP005674	SAMN00210797	103,188	76%	73%
SRR099518	SRX041533	SRP005674	SAMN00210798	113,272	73%	69%
SRR099519	SRX041534	SRP005674	SAMN00210799	176,153	77%	78%
SRR099520	SRX041535	SRP005674	SAMN00210800	169,107	78%	72%
SRR099521	SRX041536	SRP005674	SAMN00210801	142,019	76%	75%
SRR099522	SRX041537	SRP005674	SAMN00210802	101,477	76%	72%
SRR316698	SRX085156	SRP007371	SAMN00631607	1,413,483	75%	59%
SRR316699	SRX085156	SRP007371	SAMN00631607	1,880,343	75%	59%
SRR942786	SRX327605	SRP028233	SAMN02178820	96,077	61%	70%
SRR942788	SRX327608	SRP028233	SAMN02178821	231,981	62%	63%
SRR942892	SRX327707	SRP028233	SAMN02178822	64,002	58%	68%
SRR942894	SRX327709	SRP028233	SAMN02178823	90,235	60%	69%
SRR942896	SRX327711	SRP028233	SAMN02178824	88,186	58%	58%
SRR942899	SRX327714	SRP028233	SAMN02178825	95,196	55%	65%
SRR942901	SRX327716	SRP028233	SAMN02178826	108,170	46%	72%
SRR942902	SRX327718	SRP028233	SAMN02178827	126,150	47%	80%
SRR942906	SRX327721	SRP028233	SAMN02178829	197,793	48%	64%
SRR961872	SRX342748	SRP029437	SAMN02343402	652,133	47%	26%
SRR3623956	SRX1818924	SRP076103	SAMN05195102	34,517,081	76%	14%
SRR3623957	SRX1818925	SRP076103	SAMN05195103	30,439,400	79%	13%
SRR3623964	SRX1818932	SRP076103	SAMN05195104	28,788,050	80%	13%
SRR3623965	SRX1818933	SRP076103	SAMN05195105	30,964,667	80%	19%
SRR3623966	SRX1818934	SRP076103	SAMN05195106	26,582,061	80%	16%
SRR3623967	SRX1818935	SRP076103	SAMN05195107	30,968,411	78%	13%
SRR3623968	SRX1818936	SRP076103	SAMN05195108	30,320,750	80%	16%
SRR3623969	SRX1818937	SRP076103	SAMN05195109	34,590,705	80%	16%
SRR3623970	SRX1818941	SRP076103	SAMN05195110	26,989,750	80%	15%
SRR3623971	SRX1818942	SRP076103	SAMN05195111	27,959,436	79%	16%
SRR3623958	SRX1818926	SRP076103	SAMN05195112	28,968,733	80%	16%
SRR3623959	SRX1818927	SRP076103	SAMN05195113	27,893,893	79%	16%
SRR3623960	SRX1818928	SRP076103	SAMN05195114	26,396,892	81%	14%
SRR3623961	SRX1818929	SRP076103	SAMN05195115	30,445,934	80%	14%
SRR3623962	SRX1818930	SRP076103	SAMN05195116	28,923,991	80%	16%
SRR3623963	SRX1818931	SRP076103	SAMN05195117	28,530,403	78%	13%
SRR5373283	SRX2668657	SRP102416	SAMN06640870	35,435,312	79%	34%
SRR5373282	SRX2668656	SRP102416	SAMN06640871	35,871,482	87%	43%
SRR5373281	SRX2668655	SRP102416	SAMN06640872	36,388,016	83%	39%
SRR5373280	SRX2668653	SRP102416	SAMN06640873	33,385,204	84%	41%
SRR5373279	SRX2668652	SRP102416	SAMN06640874	31,476,580	82%	49%
SRR5373278	SRX2668651	SRP102416	SAMN06640875	33,571,608	80%	40%
SRR5373277	SRX2668650	SRP102416	SAMN06640876	35,563,892	84%	39%
SRR5373276	SRX2668649	SRP102416	SAMN06640877	33,718,598	80%	38%
SRR5373275	SRX2668648	SRP102416	SAMN06640878	35,849,764	67%	19%
SRR5373274	SRX2668647	SRP102416	SAMN06640879	40,767,124	86%	39%
SRR5373273	SRX2668646	SRP102416	SAMN06640880	31,222,190	85%	33%
SRR5373272	SRX2668645	SRP102416	SAMN06640881	32,324,782	82%	33%
SRR5373271	SRX2668644	SRP102416	SAMN06640882	32,274,250	79%	29%
SRR5373270	SRX2668643	SRP102416	SAMN06640883	33,853,192	75%	19%
SRR5657600	SRX2894156	SRP108798	SAMN07203781	84,816,430	75%	22%
SRR5657601	SRX2894155	SRP108798	SAMN07203782	93,546,068	77%	26%
SRR5657598	SRX2894158	SRP108798	SAMN07203783	92,670,670	75%	25%
SRR5657599	SRX2894157	SRP108798	SAMN07203784	92,168,818	63%	25%
SRR5657596	SRX2894160	SRP108798	SAMN07203785	91,613,688	73%	25%
SRR5657597	SRX2894159	SRP108798	SAMN07203786	89,642,288	66%	22%
SRR5657594	SRX2894162	SRP108798	SAMN07203787	85,281,910	80%	28%
SRR5657595	SRX2894161	SRP108798	SAMN07203788	93,064,168	66%	32%
SRR5657592	SRX2894164	SRP108798	SAMN07203789	87,743,778	71%	27%
SRR5657593	SRX2894163	SRP108798	SAMN07203790	93,532,200	76%	23%
SRR5657605	SRX2894151	SRP108798	SAMN07203791	91,231,186	74%	30%
SRR5657606	SRX2894150	SRP108798	SAMN07203792	85,389,746	83%	27%
SRR5657603	SRX2894153	SRP108798	SAMN07203793	86,643,770	84%	36%
SRR5657604	SRX2894152	SRP108798	SAMN07203794	78,802,668	76%	26%
SRR5657602	SRX2894154	SRP108798	SAMN07203795	90,135,204	71%	27%
SRR6806642	SRX3765307	SRP133501	SAMN08580422	49,208,806	31%	20%
SRR6806641	SRX3765308	SRP133501	SAMN08580422	68,789,728	17%	3%
SRR6806637	SRX3765312	SRP133501	SAMN08580422	56,766,778	86%	28%
SRR7262894	SRX4167054	SRP133501	SAMN08580422	22,253,042	29%	22%
SRR7262893	SRX4167055	SRP133501	SAMN08580422	25,533,688	27%	22%
SRR7262892	SRX4167056	SRP133501	SAMN08580422	21,475,620	30%	6%
SRR7262891	SRX4167057	SRP133501	SAMN08580422	22,769,100	53%	28%
SRR7262890	SRX4167058	SRP133501	SAMN08580422	35,399,352	14%	4%
SRR7262889	SRX4167059	SRP133501	SAMN08580422	24,389,626	10%	5%
SRR8164498	SRX4985314	SRP133501	SAMN08580422	27,879,400	28%	7%
SRR8164497	SRX4985315	SRP133501	SAMN08580422	18,417,806	34%	7%
SRR8164496	SRX4985316	SRP133501	SAMN08580422	20,682,690	29%	6%
SRR9644936	SRX6406742	SRP133501	SAMN08580422	49,041,908	69%	22%
SRR9644935	SRX6406743	SRP133501	SAMN08580422	49,666,830	74%	23%
SRR9644934	SRX6406744	SRP133501	SAMN08580422	50,785,626	84%	28%
SRR9644933	SRX6406745	SRP133501	SAMN08580422	52,374,186	80%	26%
SRR9644930	SRX6406748	SRP133501	SAMN08580422	35,937,260	24%	5%
SRR9645030	SRX6406836	SRP133501	SAMN08580422	78,249,590	67%	33%
SRR9645029	SRX6406837	SRP133501	SAMN08580422	72,865,674	64%	33%
SRR9645028	SRX6406838	SRP133501	SAMN08580422	69,256,588	52%	24%
SRR9645027	SRX6406839	SRP133501	SAMN08580422	70,159,342	60%	31%
SRR7631512	SRX4495272	SRP155932	SAMN09748259	129,094,962	70%	37%
SRR7631511	SRX4495273	SRP155932	SAMN09748260	131,078,494	72%	37%
SRR7631510	SRX4495274	SRP155932	SAMN09748261	131,762,542	76%	37%
SRR7631509	SRX4495275	SRP155932	SAMN09748262	117,142,210	75%	36%
SRR7631516	SRX4495268	SRP155932	SAMN09748263	124,100,404	69%	38%
SRR7631515	SRX4495269	SRP155932	SAMN09748264	105,311,062	74%	37%
SRR7631514	SRX4495270	SRP155932	SAMN09748265	109,168,128	70%	35%
SRR7631513	SRX4495271	SRP155932	SAMN09748266	108,770,596	63%	35%
SRR7631518	SRX4495266	SRP155932	SAMN09748267	116,124,196	88%	41%
SRR7631517	SRX4495267	SRP155932	SAMN09748268	110,761,042	88%	41%
SRR7631500	SRX4495284	SRP155932	SAMN09748269	117,102,770	85%	40%
SRR7631499	SRX4495285	SRP155932	SAMN09748270	114,133,758	86%	40%
SRR7631502	SRX4495282	SRP155932	SAMN09748271	109,219,180	82%	40%
SRR7631501	SRX4495283	SRP155932	SAMN09748272	99,984,838	88%	40%
SRR7631504	SRX4495280	SRP155932	SAMN09748273	121,918,782	85%	40%
SRR7631503	SRX4495281	SRP155932	SAMN09748274	115,327,802	85%	40%
SRR7631506	SRX4495278	SRP155932	SAMN09748275	124,516,240	88%	41%
SRR7631505	SRX4495279	SRP155932	SAMN09748276	127,064,432	88%	41%
SRR7631508	SRX4495276	SRP155932	SAMN09748277	112,856,114	86%	40%
SRR7631507	SRX4495277	SRP155932	SAMN09748278	123,724,148	85%	40%
SRR7631519	SRX4495265	SRP155932	SAMN09748279	102,099,838	88%	41%
SRR7631520	SRX4495264	SRP155932	SAMN09748280	106,744,570	85%	40%
SRR7631521	SRX4495263	SRP155932	SAMN09748281	116,173,706	86%	41%
SRR7631522	SRX4495262	SRP155932	SAMN09748282	112,969,476	87%	40%
SRR8112130	SRX4938590	SRP159187	SAMN09939439	31,379,578	59%	25%
SRR8513647	SRX5317260	SRP183074	SAMN10853962	99,551,370	85%	42%
SRR8513646	SRX5317261	SRP183074	SAMN10853962	90,201,836	84%	42%
SRR8513645	SRX5317262	SRP183074	SAMN10853962	61,644,404	84%	41%
SRR8513644	SRX5317263	SRP183074	SAMN10853962	111,304,836	74%	43%
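Several samples in the by-sample table combine more than one run from the by-run table. The Python sketch below shows one way to roll runs up to their sample, assuming the sample-level read count is the sum over runs and the sample-level percent aligned is a read-weighted mean; that weighting is an assumption, not a documented pipeline rule. Because the published run percentages are already rounded, a recomputed value can differ from the published sample value by a percentage point.

```python
from collections import defaultdict

# (run, sample, number of reads, percent aligned reads), copied from the by-run table
runs = [
    ("SRR089816", "SAMN00139505", 32_149, 54),
    ("SRR090454", "SAMN00139505", 178_510, 73),
    ("SRR090449", "SAMN00139480", 300_661, 77),
    ("SRR090450", "SAMN00139480", 533_418, 78),
]


def aggregate_by_sample(run_rows):
    """Return {sample: (total reads, read-weighted percent aligned, rounded)}."""
    total_reads = defaultdict(int)
    aligned_reads = defaultdict(float)
    for _run, sample, n_reads, pct_aligned in run_rows:
        total_reads[sample] += n_reads
        aligned_reads[sample] += n_reads * pct_aligned / 100.0
    return {
        sample: (total_reads[sample], round(100.0 * aligned_reads[sample] / total_reads[sample]))
        for sample in total_reads
    }


per_sample = aggregate_by_sample(runs)
for sample, (n_reads, pct) in per_sample.items():
    print(f"{sample}\t{n_reads:,}\t{pct}%")
```

For these two samples the recomputed totals and percentages agree with the by-sample table (210,659 reads at 70% and 834,079 reads at 78%).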

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Actinopterygii GenBank	82,850	68,009 (82.09%)	68,009 (82.09%)	71.34%	82.74%
Actinopterygii known RefSeq (NP_)	24,216	6,613 (27.31%)	6,613 (27.31%)	68.76%	79.07%
Same-species GenBank	3,783	2,093 (55.33%)	2,093 (55.33%)	79.84%	88.25%
Same-species known RefSeq (NP_)	1,257	1,176 (93.56%)	1,176 (93.56%)	78.03%	86.47%
Homo sapiens known RefSeq (NP_)	59,974	39,459 (65.79%)	39,459 (65.79%)	68.24%	73.10%

Assembly-assembly alignments of current to previous assembly

When the assembly changes between two rounds of annotation, genes in the current and the previous annotation are mapped to each other using the genomic alignments of the current assembly to the previous assembly so that gene identifiers can be preserved. The success of the remapping depends largely on how well the two assembly versions align to each other.

Below are the percent coverage of one assembly by the other and the average percent identity of the alignments. The 'First pass' alignments are reciprocal best hits, while the 'Total' alignments also include 'Second pass' or non-reciprocal best alignments. For more information about the assembly-assembly alignment process, please visit the NCBI Genome Remapping Service page.

	First Pass	Total
USDA_OmykA_1.1 (Current) Coverage	71.10%	72.62%
Omyk_1.0 (Previous) Coverage	78.34%	79.81%
Percent Identity	98.68%	98.59%
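The notion of reciprocal best hits used for the 'First pass' alignments can be sketched in a few lines of Python. This is a toy illustration only: the region names, the scores and the functions `best_hits` and `reciprocal_best_hits` are hypothetical, and the actual pipeline works on genomic alignments rather than on a pre-scored dictionary.

```python
def best_hits(scores):
    """Map each left-hand region to its highest-scoring right-hand partner."""
    best = {}
    for (left, right), score in scores.items():
        if left not in best or score > best[left][1]:
            best[left] = (right, score)
    return {left: right for left, (right, _score) in best.items()}


def reciprocal_best_hits(scores):
    """Keep pairs where each region is the other's best hit."""
    forward = best_hits(scores)
    reverse = best_hits({(right, left): s for (left, right), s in scores.items()})
    return sorted((left, right) for left, right in forward.items() if reverse.get(right) == left)


# Hypothetical alignment scores between regions of the current (cur_*)
# and previous (prev_*) assemblies.
scores = {
    ("cur_1", "prev_1"): 950,
    ("cur_1", "prev_2"): 400,
    ("cur_2", "prev_2"): 900,
    ("cur_3", "prev_2"): 700,
}

print(reciprocal_best_hits(scores))
```

Here `cur_3` has `prev_2` as its best hit, but `prev_2` prefers `cur_2`, so that pair is not reciprocal. In the report's terms, such a non-reciprocal best alignment would count toward 'Total' but not toward 'First pass'.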

Comparison of the current and previous annotations

The annotation produced for this release (101) was compared to the annotation in the previous release (100) for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides links to the details of the comparison in tabular format and in a Genome Workbench project.

	USDA_OmykA_1.1 (Current) to Omyk_1.0 (Previous)
Identical	3%
Minor changes	40%
Major changes	12%
New	43%
Deprecated	20%
Other	2%
Download the report	tabular, Genome Workbench

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 22(2):134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20
