NCBI Oreochromis niloticus Annotation Release 104

The RefSeq genome records for Oreochromis niloticus were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Similarity of current and previous assembly: The similarity of the current and previous assembly
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Oreochromis niloticus Annotation Release 104.

Annotation release ID: 104
Date of Entrez queries for transcripts and proteins: Jul 20 2018
Date of submission of annotation to the public databases: Jul 26 2018
Software version: 8.1

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
O_niloticus_UMD_NMBU	GCF_001858045.2	University of Maryland	06-29-2018	Reference	23 assembled chromosomes; unplaced scaffolds

Gene and feature statistics

Counts and lengths of annotated features are provided below for each assembly.

Feature counts

Feature	O_niloticus_UMD_NMBU
Genes and pseudogenes	42,622
protein-coding	29,550
non-coding	12,030
transcribed pseudogenes	4
non-transcribed pseudogenes	685
genes with variants	14,756
immunoglobulin/T-cell receptor gene segments	353
other	0
mRNAs	61,666
fully-supported	59,685
with > 5% ab initio	931
partial	398
with filled gap(s)	77
known RefSeq (NM_)	183
model RefSeq (XM_)	61,483
non-coding RNAs	17,707
fully-supported	14,403
with > 5% ab initio	0
partial	9
with filled gap(s)	2
known RefSeq (NR_)	0
model RefSeq (XR_)	15,713
pseudo transcripts	5
fully-supported	4
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	5
CDSs	62,032
fully-supported	59,685
with > 5% ab initio	1,070
partial	403
with major correction(s)	765
known RefSeq (NP_)	183
model RefSeq (XP_)	61,496
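
As a quick consistency check on the counts above, the gene categories can be summed and compared with the reported total. The following is a minimal Python sketch. The variable names are illustrative and not part of the NCBI report, and the choice of which rows are mutually exclusive categories is an assumption inferred from the numbers.

```python
# Gene-level counts copied from the Feature counts table.
# Assumption: these six rows are mutually exclusive gene categories;
# "genes with variants" is treated as an overlapping subset and left out.
gene_counts = {
    "protein-coding": 29_550,
    "non-coding": 12_030,
    "transcribed pseudogenes": 4,
    "non-transcribed pseudogenes": 685,
    "immunoglobulin/T-cell receptor gene segments": 353,
    "other": 0,
}
reported_total = 42_622  # "Genes and pseudogenes" row

computed_total = sum(gene_counts.values())
print(computed_total, computed_total == reported_total)

# mRNAs are split into known (NM_) and model (XM_) RefSeq records.
mrna_known, mrna_model, mrna_total = 183, 61_483, 61_666
print(mrna_known + mrna_model == mrna_total)
```

Under that assumption both checks agree with the reported totals.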

Detailed reports

The counts below do not include pseudogenes.

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	41,580	14,889	5,802	56	1,346,887
All transcripts	79,373	3,207	2,488	56	87,473
mRNA	61,666	3,717	2,946	192	87,473
misc_RNA	2,358	2,951	2,388	148	17,107
tRNA	1,992	74	73	66	84
lncRNA	12,073	1,461	1,012	93	17,635
snoRNA	179	117	105	65	321
snRNA	439	137	113	56	199
guide_RNA	7	227	273	130	370
rRNA	659	837	119	114	3,928
Single-exon transcripts	1,102	2,005	1,613	324	12,435
coding transcripts (NM_/XM_ )	1,100	2,005	1,613	324	12,435
non-coding transcripts (NR_/XR_ )	2	1,701	2,180	1,221	2,180
CDSs	61,679	2,120	1,482	96	86,457
Exons	367,751	334	144	1	21,949
in coding transcripts (NM_/XM_ )	331,018	324	142	1	21,949
in non-coding transcripts (NR_/XR_ )	48,688	371	150	2	13,694
Introns	321,425	2,051	392	30	1,173,321
in coding transcripts (NM_/XM_ )	296,055	2,030	388	30	1,173,321
in non-coding transcripts (NR_/XR_ )	37,013	2,196	429	30	706,871
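
The per-type transcript rows can be read programmatically and compared with the "All transcripts" row. The sketch below assumes the table is available as tab-separated text, as in this report. The `to_int` helper and the embedded `TABLE` string are illustrative, not an NCBI-provided format.

```python
import csv
import io

# Transcript rows copied from the Feature lengths table (tab-separated).
TABLE = """Feature\tCount\tMean length (bp)\tMedian length (bp)
All transcripts\t79,373\t3,207\t2,488
mRNA\t61,666\t3,717\t2,946
misc_RNA\t2,358\t2,951\t2,388
tRNA\t1,992\t74\t73
lncRNA\t12,073\t1,461\t1,012
snoRNA\t179\t117\t105
snRNA\t439\t137\t113
guide_RNA\t7\t227\t273
rRNA\t659\t837\t119
"""


def to_int(text):
    """Convert a count such as '61,666' to an integer."""
    return int(text.replace(",", ""))


rows = list(csv.DictReader(io.StringIO(TABLE), delimiter="\t"))
counts = {row["Feature"]: to_int(row["Count"]) for row in rows}

# Sum every transcript type except the aggregate row itself.
per_type_sum = sum(n for name, n in counts.items() if name != "All transcripts")
non_coding_sum = per_type_sum - counts["mRNA"]

print(per_type_sum, counts["All transcripts"])
print(non_coding_sum)
```

The per-type sum matches the "All transcripts" count, and the sum without mRNA matches the 17,707 non-coding RNAs reported in the Feature counts table.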

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	1.95	1	1	50
Number of exons per transcript	11.27	8	1	239

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the UniProtKB/Swiss-Prot curated proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 29,537 coding genes, 25,215 genes had a protein with an alignment covering 50% or more of the query and 10,948 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.
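
To make the two coverage definitions concrete, here is a minimal Python sketch. The alignment coordinates are hypothetical, and the function treats coverage as the span between the first and last aligned residue. How the pipeline counts gapped positions is not stated in this report, so that part is an assumption.

```python
def coverage_percent(aligned_start, aligned_end, sequence_length):
    """Percentage of a sequence spanned by an alignment (1-based, inclusive)."""
    aligned_length = aligned_end - aligned_start + 1
    return 100.0 * aligned_length / sequence_length


# Hypothetical hit: residues 11-200 of a 250-aa annotated protein (query)
# align to residues 1-190 of a 200-aa Swiss-Prot protein (target).
query_cov = coverage_percent(11, 200, 250)   # 190 of 250 residues
target_cov = coverage_percent(1, 190, 200)   # 190 of 200 residues
print(query_cov, target_cov)

# Share of coding genes reaching each query-coverage threshold,
# using the counts reported in the text above.
coding_genes = 29_537
share_cov50 = 100 * 25_215 / coding_genes
share_cov95 = 100 * 10_948 / coding_genes
print(round(share_cov50, 1), round(share_cov95, 1))
```

In this example the hit would count toward the 50% query-coverage threshold but not the 95% one, even though it covers 95% of the target.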

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: UniProtKB/Swiss-Prot curated proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker for each assembly. RepeatMasker results are only used for organisms for which a comprehensive repeat library is available.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with RepeatMasker	% Masked with WindowMasker
O_niloticus_UMD_NMBU	GCF_001858045.2	5.40%	30.48%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Depending on the other evidence available, long 454 reads (with average length above 250 nt) may be aligned as traditional evidence and reported in the Transcript alignments section or aligned with RNA-Seq reads and reported in the RNA-Seq alignments section.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species known RefSeq (NM_/NR_)	185	185 (100.00%)	183 (98.92%)	99.01%	99.13%
Same-species Genbank	930	924 (99.35%)	797 (85.70%)	98.73%	98.79%
Same-species EST	120,986	112,094 (92.65%)	108,158 (89.40%)	99.24%	98.98%

RNA-Seq alignments

The following RNA-Seq reads from the Sequence Read Archive were also used for gene prediction:

Alignment statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Publication	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	NA	Aggregate of all aligned samples	5,109,928,945	84%	38%	490,305
SAMEA2482241	NA	5_ (Oreochromis niloticus, SAMEA2482241)	21,114,889	87%	32%	185,583
SAMEA2482242	NA	ANS (Oreochromis niloticus, SAMEA2482242)	17,443,589	87%	33%	187,723
SAMEA2482243	NA	7_ (Oreochromis niloticus, SAMEA2482243)	19,575,859	86%	32%	195,261
SAMEA2482244	NA	PNS (Oreochromis niloticus, SAMEA2482244)	18,583,158	85%	32%	194,472
SAMEA4513650	NA	liver (Oreochromis niloticus, male, SAMEA4513650)	25,043,138	74%	18%	133,976
SAMEA4513651	NA	liver (Oreochromis niloticus, male, SAMEA4513651)	24,292,904	76%	18%	137,080
SAMEA4513652	NA	liver (Oreochromis niloticus, male, SAMEA4513652)	24,082,752	76%	17%	135,768
SAMEA4513695	NA	muscle (Oreochromis niloticus, male, SAMEA4513695)	25,118,416	83%	20%	115,837
SAMEA4513696	NA	muscle (Oreochromis niloticus, male, SAMEA4513696)	23,570,208	85%	21%	108,293
SAMEA4513697	NA	muscle (Oreochromis niloticus, male, SAMEA4513697)	27,052,956	86%	22%	112,677
SAMN00767853	25186727	kidney (Oreochromis niloticus, Female, SAMN00767853)	87,556,570	80%	26%	232,105
SAMN00767854	25186727	heart (Oreochromis niloticus, Male, SAMN00767854)	60,263,332	79%	29%	207,140
SAMN00767855	25186727	skin (Oreochromis niloticus, Female, SAMN00767855)	45,307,090	81%	28%	208,640
SAMN00767856	25186727	eye (Oreochromis niloticus, Male, SAMN00767856)	56,997,428	82%	21%	225,535
SAMN00767857	25186727	blood (Oreochromis niloticus, Male, SAMN00767857)	77,932,704	90%	25%	170,104
SAMN00767858	25186727	ovary (Oreochromis niloticus, Female, SAMN00767858)	72,105,534	92%	34%	230,791
SAMN00767859	25186727	liver (Oreochromis niloticus, Female, SAMN00767859)	65,913,292	86%	33%	157,282
SAMN00767860	25186727	testis (Oreochromis niloticus, Male, SAMN00767860)	61,827,174	85%	29%	302,210
SAMN00767861	25186727	brain (Oreochromis niloticus, Female, SAMN00767861)	61,938,566	85%	19%	235,545
SAMN00767862	25186727	muscle (Oreochromis niloticus, Female, SAMN00767862)	56,207,368	87%	36%	177,945
SAMN00767863	25186727	embryo (Oreochromis niloticus, Male, SAMN00767863)	53,967,792	86%	29%	254,539
SAMN01086111	NA	90 dah XX gonad (Oreochromis niloticus, SAMN01086111)	26,466,666	88%	30%	195,711
SAMN01087769	NA	180 dah XY gonad (Oreochromis niloticus, SAMN01087769)	50,258,478	86%	27%	265,573
SAMN01087779	NA	90 dah XY gonad (Oreochromis niloticus, SAMN01087779)	26,292,052	83%	26%	242,762
SAMN01091651	NA	180 dah XX gonad (Oreochromis niloticus, SAMN01091651)	51,485,734	93%	30%	222,465
SAMN01093674	NA	30 dah XX gonad (Oreochromis niloticus, SAMN01093674)	53,140,336	90%	34%	249,093
SAMN01093676	NA	30 dah XY gonad (Oreochromis niloticus, SAMN01093676)	52,579,466	87%	27%	250,782
SAMN01985096	24068429	spleen and kidney (Oreochromis niloticus, SAMN01985096)	50,409,546	82%	27%	216,328
SAMN02212662	NA	spleen (Oreochromis niloticus, 60 days, SAMN02212662)	26,622,238	87%	31%	197,151
SAMN02374859	NA	General Sample for Oreochromis niloticus (Oreochromis niloticus, SAMN02374859)	100,744,586	91%	29%	264,875
SAMN03013320	NA	various (Oreochromis niloticus, nd, not determined, SAMN03013320)	59,755,940	76%	43%	296,240
SAMN03097727	NA	MIGS Eukaryotic sample from Oreochromis niloticus (Oreochromis niloticus, SAMN03097727)	112,827,374	87%	33%	313,846
SAMN03097728	NA	MIGS Eukaryotic sample from Oreochromis niloticus (Oreochromis niloticus, SAMN03097728)	146,217,044	90%	38%	293,537
SAMN03246845	NA	brain (Oreochromis niloticus, Six months old, SAMN03246845)	2,549,418	52%	17%	57,297
SAMN03246846	NA	gill (Oreochromis niloticus, Six months old, SAMN03246846)	17,142,398	11%	10%	39,059
SAMN03246849	NA	gill (Oreochromis niloticus, Six months old, SAMN03246849)	9,204,226	51%	9%	84,337
SAMN03246850	NA	brain (Oreochromis niloticus, Six months old, SAMN03246850)	4,926,518	43%	17%	75,209
SAMN03246852	NA	heart (Oreochromis niloticus, Six months old, SAMN03246852)	3,028,006	56%	17%	54,056
SAMN03246853	NA	spleen (Oreochromis niloticus, Six months old, SAMN03246853)	6,703,286	56%	11%	70,634
SAMN03246854	NA	liver (Oreochromis niloticus, Six months old, SAMN03246854)	11,800,620	51%	23%	26,151
SAMN03246855	NA	gill (Oreochromis niloticus, Six months old, SAMN03246855)	18,860,972	25%	8%	50,989
SAMN03246856	NA	heart (Oreochromis niloticus, Six months old, SAMN03246856)	9,911,876	74%	17%	85,501
SAMN03246857	NA	muscle (Oreochromis niloticus, Six months old, SAMN03246857)	20,502,444	56%	19%	32,323
SAMN03246858	NA	liver (Oreochromis niloticus, Six months old, SAMN03246858)	3,326,376	61%	11%	29,115
SAMN03246859	NA	kidney (Oreochromis niloticus, Six months old, SAMN03246859)	7,719,226	47%	12%	70,922
SAMN03246860	NA	liver (Oreochromis niloticus, Six months old, SAMN03246860)	6,538,476	61%	7%	29,858
SAMN03246861	NA	heart (Oreochromis niloticus, Six months old, SAMN03246861)	7,762,524	50%	25%	66,950
SAMN03246862	NA	kidney (Oreochromis niloticus, Six months old, SAMN03246862)	13,101,238	54%	14%	61,103
SAMN03246863	NA	kidney (Oreochromis niloticus, Six months old, SAMN03246863)	3,625,112	51%	14%	44,678
SAMN03246864	NA	spleen (Oreochromis niloticus, Six months old, SAMN03246864)	4,989,082	60%	21%	64,238
SAMN03246865	NA	spleen (Oreochromis niloticus, Six months old, SAMN03246865)	6,193,006	56%	13%	64,112
SAMN03246866	NA	muscle (Oreochromis niloticus, Six months old, SAMN03246866)	8,535,870	20%	26%	29,961
SAMN03246867	NA	muscle (Oreochromis niloticus, Six months old, SAMN03246867)	10,796,478	22%	15%	33,846
SAMN03273284	26265749	liver (Oreochromis niloticus, 10 weeks feeding, not determined, SAMN03273284)	49,091,968	88%	39%	170,811
SAMN03273285	26265749	liver (Oreochromis niloticus, 10 weeks feeding, not determined, SAMN03273285)	66,190,746	86%	36%	186,880
SAMN03273286	26265749	liver (Oreochromis niloticus, 10 weeks feeding, not determined, SAMN03273286)	56,844,730	88%	39%	175,490
SAMN03779459	27356472	gill (Oreochromis niloticus, Six months, SAMN03779459)	23,882,388	81%	36%	221,240
SAMN03779460	27356472	gill (Oreochromis niloticus, Six months, SAMN03779460)	28,263,450	80%	35%	224,562
SAMN03779461	27356472	gill (Oreochromis niloticus, Six months, SAMN03779461)	28,590,338	82%	36%	226,358
SAMN03779462	27356472	gill (Oreochromis niloticus, Six months, SAMN03779462)	31,351,944	82%	37%	223,236
SAMN03779463	27356472	gill (Oreochromis niloticus, Six months, SAMN03779463)	26,855,700	82%	37%	209,924
SAMN03779464	27356472	gill (Oreochromis niloticus, Six months, SAMN03779464)	34,053,640	84%	38%	223,222
SAMN03779465	27356472	gill (Oreochromis niloticus, Six months, SAMN03779465)	29,712,256	80%	34%	218,432
SAMN03779468	27356472	gill (Oreochromis niloticus, Six months, SAMN03779468)	30,430,518	80%	35%	220,279
SAMN03779469	27356472	gill (Oreochromis niloticus, Six months, SAMN03779469)	24,852,508	83%	37%	209,721
SAMN03785591	NA	gill (Oreochromis niloticus, one year old, pooled male and female, SAMN03785591)	39,185,076	83%	14%	203,280
SAMN05171417	27356472	gill (Oreochromis niloticus, Six months old, SAMN05171417)	18,057,772	81%	34%	206,757
SAMN05171418	27356472	gill (Oreochromis niloticus, Six months old, SAMN05171418)	22,125,982	83%	34%	211,514
SAMN05171419	27356472	gill (Oreochromis niloticus, Six months old, SAMN05171419)	26,495,110	83%	34%	202,681
SAMN06335325	28821885	liver (Oreochromis niloticus, SAMN06335325)	40,083,434	84%	45%	150,947
SAMN06335326	28821885	liver (Oreochromis niloticus, SAMN06335326)	42,428,160	82%	49%	151,518
SAMN06335327	28821885	liver (Oreochromis niloticus, SAMN06335327)	48,116,524	83%	44%	138,540
SAMN06335328	28821885	liver (Oreochromis niloticus, SAMN06335328)	40,826,712	84%	50%	146,467
SAMN06335333	28821885	liver (Oreochromis niloticus, SAMN06335333)	13,029,563	0%	34%	9
SAMN06335334	28821885	liver (Oreochromis niloticus, SAMN06335334)	13,602,611	0%	7%	9
SAMN06335335	28821885	liver (Oreochromis niloticus, SAMN06335335)	50,461,524	86%	49%	161,787
SAMN06335336	28821885	liver (Oreochromis niloticus, SAMN06335336)	43,120,542	83%	42%	141,610
SAMN06556235	NA	Liver (Oreochromis niloticus, Juvenil (4 month), SAMN06556235)	22,460,558	91%	38%	157,813
SAMN06556236	NA	Liver (Oreochromis niloticus, Juvenil (4 month), SAMN06556236)	23,646,020	92%	39%	159,568
SAMN06556237	NA	Thalamus-pituitary (Oreochromis niloticus, Juvenil (4 month), SAMN06556237)	27,153,210	82%	25%	222,378
SAMN06556238	NA	Cerebellum (Oreochromis niloticus, Juvenil (4 month), SAMN06556238)	15,379,478	81%	27%	193,096
SAMN06556239	NA	Cerebellum (Oreochromis niloticus, Juvenil (4 month), SAMN06556239)	29,028,782	87%	26%	222,230
SAMN06556240	NA	Thalamus-pituitary (Oreochromis niloticus, Juvenil (4 month), SAMN06556240)	33,012,412	78%	26%	227,289
SAMN06556241	NA	Thalamus-pituitary (Oreochromis niloticus, Juvenil (4 month), SAMN06556241)	29,009,660	85%	27%	229,829
SAMN06556242	NA	Cerebellum (Oreochromis niloticus, Juvenil (4 month), SAMN06556242)	9,645,382	81%	27%	178,598
SAMN06556243	NA	Cerebellum (Oreochromis niloticus, Juvenil (4 month), SAMN06556243)	26,629,214	88%	25%	214,318
SAMN06556244	NA	Cerebellum (Oreochromis niloticus, Juvenil (4 month), SAMN06556244)	22,464,140	88%	25%	211,946
SAMN06556245	NA	Thalamus-pituitary (Oreochromis niloticus, Juvenil (4 month), SAMN06556245)	20,620,676	87%	24%	217,425
SAMN06556246	NA	Cerebellum (Oreochromis niloticus, Juvenil (4 month), SAMN06556246)	30,442,050	87%	24%	223,807
SAMN06556247	NA	Liver (Oreochromis niloticus, Juvenil (4 month), SAMN06556247)	21,540,748	89%	38%	147,091
SAMN06556248	NA	Thalamus-pituitary (Oreochromis niloticus, Juvenil (4 month), SAMN06556248)	30,629,254	82%	24%	226,842
SAMN06556249	NA	Thalamus-pituitary (Oreochromis niloticus, Juvenil (4 month), SAMN06556249)	29,668,724	89%	26%	231,023
SAMN06556250	NA	Liver (Oreochromis niloticus, Juvenil (4 month), SAMN06556250)	18,424,964	91%	37%	150,345
SAMN06556251	NA	Liver (Oreochromis niloticus, Juvenil (4 month), SAMN06556251)	24,215,052	86%	42%	144,186
SAMN06556252	NA	Liver (Oreochromis niloticus, Juvenil (4 month), SAMN06556252)	8,451,196	80%	44%	98,854
SAMN07160535	NA	Gonad (Oreochromis niloticus, female, SAMN07160535)	39,208,280	91%	48%	242,150
SAMN07160536	NA	Gonad (Oreochromis niloticus, juvenile, female, SAMN07160536)	29,470,706	91%	49%	221,237
SAMN07160537	NA	Gonad (Oreochromis niloticus, juvenile, female, SAMN07160537)	29,828,306	91%	48%	226,692
SAMN07160548	NA	Gonad (Oreochromis niloticus, juvenile, female, SAMN07160548)	32,120,386	91%	50%	230,023
SAMN07160551	NA	Gonad (Oreochromis niloticus, juvenile, female, SAMN07160551)	30,823,082	91%	49%	234,022
SAMN07160559	NA	Gonad (Oreochromis niloticus, juvenile, female, SAMN07160559)	39,692,226	91%	49%	244,385
SAMN07160561	NA	Gonad (Oreochromis niloticus, juvenile, male, SAMN07160561)	33,260,180	88%	46%	271,519
SAMN07160562	NA	Gonad (Oreochromis niloticus, juvenile, male, SAMN07160562)	41,150,444	88%	43%	283,822
SAMN07160563	NA	Gonad (Oreochromis niloticus, juvenile, male, SAMN07160563)	49,871,394	86%	45%	280,581
SAMN07788113	NA	Liver (Oreochromis niloticus, 4 month, male, SAMN07788113)	65,602,206	86%	55%	164,855
SAMN07788114	NA	Liver (Oreochromis niloticus, 4 month, male, SAMN07788114)	62,577,732	87%	54%	174,863
SAMN07788115	NA	Liver (Oreochromis niloticus, 4 month, male, SAMN07788115)	52,218,986	89%	55%	173,835
SAMN07788116	NA	Liver (Oreochromis niloticus, 4 month, male, SAMN07788116)	61,943,158	85%	55%	158,758
SAMN07788117	NA	Liver (Oreochromis niloticus, 4 month, male, SAMN07788117)	72,557,568	85%	52%	174,831
SAMN07788118	NA	Liver (Oreochromis niloticus, 4 month, male, SAMN07788118)	44,339,096	88%	55%	170,473
SAMN07788119	NA	Liver (Oreochromis niloticus, 4 month, male, SAMN07788119)	66,843,268	87%	55%	169,306
SAMN07788120	NA	Liver (Oreochromis niloticus, 4 month, male, SAMN07788120)	54,696,356	88%	54%	174,467
SAMN07788121	NA	Liver (Oreochromis niloticus, 4 month, male, SAMN07788121)	76,142,700	89%	54%	189,226
SAMN08511615	NA	spleen (Oreochromis niloticus, 8weeks, male, SAMN08511615)	61,164,956	85%	46%	208,021
SAMN08511616	NA	spleen (Oreochromis niloticus, 8weeks, male, SAMN08511616)	55,488,936	88%	53%	196,661
SAMN08511617	NA	spleen (Oreochromis niloticus, 8weeks, male, SAMN08511617)	51,024,058	86%	48%	206,816
SAMN08511618	NA	spleen (Oreochromis niloticus, 8weeks, male, SAMN08511618)	50,497,204	86%	48%	195,481
SAMN08511619	NA	spleen (Oreochromis niloticus, 8weeks, male, SAMN08511619)	56,315,604	87%	52%	196,071
SAMN08511620	NA	spleen (Oreochromis niloticus, 8weeks, male, SAMN08511620)	46,033,474	85%	41%	193,733
SAMN08511621	NA	head kidney (Oreochromis niloticus, 8weeks, male, SAMN08511621)	53,229,692	87%	50%	187,107
SAMN08511622	NA	head kidney (Oreochromis niloticus, 8weeks, male, SAMN08511622)	51,273,428	88%	51%	188,333
SAMN08511623	NA	head kidney (Oreochromis niloticus, 8weeks, male, SAMN08511623)	55,833,682	86%	42%	176,066
SAMN08511624	NA	head kidney (Oreochromis niloticus, 8weeks, male, SAMN08511624)	53,706,764	87%	49%	188,022
SAMN08511625	NA	head kidney (Oreochromis niloticus, 8weeks, male, SAMN08511625)	55,909,212	87%	51%	200,272
SAMN08511626	NA	head kidney (Oreochromis niloticus, 8weeks, male, SAMN08511626)	48,843,460	85%	43%	178,298
SAMN08887793	NA	Brain (Oreochromis niloticus, 17 dah, female, SAMN08887793)	43,867,610	88%	38%	249,058
SAMN08887794	NA	Brain (Oreochromis niloticus, 17 dah, female, SAMN08887794)	41,813,218	87%	35%	239,506
SAMN08887795	NA	Brain (Oreochromis niloticus, 17 dah, female, SAMN08887795)	39,424,796	88%	35%	239,814
SAMN08887796	NA	Brain (Oreochromis niloticus, 17 dah, female, SAMN08887796)	35,877,596	87%	34%	237,492
SAMN08887797	NA	Brain (Oreochromis niloticus, 17 dah, female, SAMN08887797)	36,220,772	86%	36%	247,044
SAMN08887798	NA	Brain (Oreochromis niloticus, 17 dah, female, SAMN08887798)	36,297,218	87%	35%	241,273
SAMN08887799	NA	Brain (Oreochromis niloticus, 115 dah, female, SAMN08887799)	37,393,922	87%	37%	237,781
SAMN08887800	NA	Brain (Oreochromis niloticus, 115 dah, female, SAMN08887800)	42,794,270	87%	36%	244,283
SAMN08887801	NA	Brain (Oreochromis niloticus, 115 dah, female, SAMN08887801)	41,560,110	89%	31%	240,594
SAMN08887802	NA	Brain (Oreochromis niloticus, 115 dah, female, SAMN08887802)	42,621,506	87%	33%	238,887
SAMN08887803	NA	Brain (Oreochromis niloticus, 115 dah, female, SAMN08887803)	42,527,046	87%	34%	238,540
SAMN08887804	NA	Brain (Oreochromis niloticus, 115 dah, female, SAMN08887804)	37,878,290	87%	36%	241,128

Alignment statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
ERR490212	ERX455666	ERP005655	SAMEA2482241	21,114,889	87%	32%
ERR490213	ERX455667	ERP005655	SAMEA2482242	17,443,589	87%	33%
ERR490214	ERX455668	ERP005655	SAMEA2482243	19,575,859	86%	32%
ERR490215	ERX455669	ERP005655	SAMEA2482244	18,583,158	85%	32%
ERR1940500	ERX2001033	ERP016820	SAMEA4513650	6,235,544	73%	18%
ERR1940590	ERX2001123	ERP016820	SAMEA4513650	6,246,870	73%	18%
ERR1952915	ERX2018511	ERP016820	SAMEA4513650	6,292,186	74%	18%
ERR1953005	ERX2018601	ERP016820	SAMEA4513650	6,268,538	74%	18%
ERR1940501	ERX2001034	ERP016820	SAMEA4513651	6,051,746	75%	18%
ERR1940591	ERX2001124	ERP016820	SAMEA4513651	6,059,690	76%	18%
ERR1952916	ERX2018512	ERP016820	SAMEA4513651	6,103,438	76%	18%
ERR1953006	ERX2018602	ERP016820	SAMEA4513651	6,078,030	76%	18%
ERR1940502	ERX2001035	ERP016820	SAMEA4513652	6,000,146	76%	17%
ERR1940592	ERX2001125	ERP016820	SAMEA4513652	6,015,886	76%	17%
ERR1952917	ERX2018513	ERP016820	SAMEA4513652	6,047,500	76%	17%
ERR1953007	ERX2018603	ERP016820	SAMEA4513652	6,019,220	76%	17%
ERR1940545	ERX2001078	ERP016820	SAMEA4513695	6,267,226	82%	20%
ERR1940635	ERX2001168	ERP016820	SAMEA4513695	6,286,458	83%	20%
ERR1952960	ERX2018556	ERP016820	SAMEA4513695	6,302,840	83%	20%
ERR1953050	ERX2018646	ERP016820	SAMEA4513695	6,261,892	83%	21%
ERR1940546	ERX2001079	ERP016820	SAMEA4513696	5,881,482	85%	21%
ERR1940636	ERX2001169	ERP016820	SAMEA4513696	5,904,388	85%	21%
ERR1952961	ERX2018557	ERP016820	SAMEA4513696	5,903,518	85%	21%
ERR1953051	ERX2018647	ERP016820	SAMEA4513696	5,880,820	85%	21%
ERR1940547	ERX2001080	ERP016820	SAMEA4513697	6,763,716	86%	22%
ERR1940637	ERX2001170	ERP016820	SAMEA4513697	6,776,520	86%	22%
ERR1952962	ERX2018558	ERP016820	SAMEA4513697	6,775,956	86%	22%
ERR1953052	ERX2018648	ERP016820	SAMEA4513697	6,736,764	86%	22%
SRR391680	SRX112566	SRP009911	SAMN00767853	28,435,226	78%	26%
SRR391684	SRX112566	SRP009911	SAMN00767853	29,765,322	81%	26%
SRR391689	SRX112566	SRP009911	SAMN00767853	29,356,022	80%	26%
SRR391681	SRX112567	SRP009911	SAMN00767854	20,499,202	80%	29%
SRR391686	SRX112567	SRP009911	SAMN00767854	20,167,908	80%	29%
SRR391696	SRX112567	SRP009911	SAMN00767854	19,596,222	78%	28%
SRR391682	SRX112568	SRP009911	SAMN00767855	14,763,778	79%	28%
SRR391694	SRX112568	SRP009911	SAMN00767855	15,386,980	82%	28%
SRR391710	SRX112568	SRP009911	SAMN00767855	15,156,332	82%	28%
SRR391683	SRX112569	SRP009911	SAMN00767856	19,090,574	83%	21%
SRR391700	SRX112569	SRP009911	SAMN00767856	19,401,466	83%	21%
SRR391712	SRX112569	SRP009911	SAMN00767856	18,505,388	81%	21%
SRR391685	SRX112570	SRP009911	SAMN00767857	26,102,090	91%	25%
SRR391692	SRX112570	SRP009911	SAMN00767857	25,315,020	88%	25%
SRR391703	SRX112570	SRP009911	SAMN00767857	26,515,594	91%	25%
SRR391687	SRX112571	SRP009911	SAMN00767858	23,389,304	90%	34%
SRR391691	SRX112571	SRP009911	SAMN00767858	24,544,706	93%	34%
SRR391693	SRX112571	SRP009911	SAMN00767858	24,171,524	92%	34%
SRR391688	SRX112572	SRP009911	SAMN00767859	22,059,186	87%	33%
SRR391698	SRX112572	SRP009911	SAMN00767859	21,468,202	84%	32%
SRR391708	SRX112572	SRP009911	SAMN00767859	22,385,904	87%	33%
SRR391690	SRX112573	SRP009911	SAMN00767860	21,025,146	86%	29%
SRR391695	SRX112573	SRP009911	SAMN00767860	20,695,416	86%	29%
SRR391701	SRX112573	SRP009911	SAMN00767860	20,106,612	83%	28%
SRR391697	SRX112574	SRP009911	SAMN00767861	20,763,590	85%	19%
SRR391699	SRX112574	SRP009911	SAMN00767861	20,087,900	83%	19%
SRR391709	SRX112574	SRP009911	SAMN00767861	21,087,076	86%	19%
SRR391702	SRX112575	SRP009911	SAMN00767862	19,126,854	88%	36%
SRR391704	SRX112575	SRP009911	SAMN00767862	18,235,448	86%	35%
SRR391706	SRX112575	SRP009911	SAMN00767862	18,845,066	88%	36%
SRR391705	SRX112576	SRP009911	SAMN00767863	17,513,910	85%	29%
SRR391707	SRX112576	SRP009911	SAMN00767863	18,376,206	87%	29%
SRR391711	SRX112576	SRP009911	SAMN00767863	18,077,676	87%	29%
SRR519096	SRX158097	SRP014017	SAMN01086111	26,466,666	88%	30%
SRR521273	SRX159747	SRP014017	SAMN01087769	50,258,478	86%	27%
SRR521274	SRX159748	SRP014017	SAMN01087779	26,292,052	83%	26%
SRR524807	SRX160791	SRP014017	SAMN01091651	51,485,734	93%	30%
SRR526901	SRX170662	SRP014017	SAMN01093674	53,140,336	90%	34%
SRR526903	SRX170664	SRP014017	SAMN01093676	52,579,466	87%	27%
SRR1011283	SRX364007	SRP014017	SAMN02374859	52,172,864	91%	29%
SRR1011284	SRX364009	SRP014017	SAMN02374859	48,571,722	91%	29%
SRR797490	SRX254258	SRP019938	SAMN01985096	50,409,546	82%	27%
SRR1291960	SRX320099	SRP026706	SAMN02212662	26,622,238	87%	31%
SRR1598978	SRX722101	SRP045891	SAMN03013320	11,240,926	81%	46%
SRR1598979	SRX722101	SRP045891	SAMN03013320	12,379,972	84%	42%
SRR1598980	SRX722101	SRP045891	SAMN03013320	15,589,964	85%	35%
SRR1598982	SRX722101	SRP045891	SAMN03013320	20,545,078	61%	50%
SRR1606274	SRX727305	SRP048688	SAMN03097727	112,827,374	87%	33%
SRR1606273	SRX727306	SRP048688	SAMN03097728	146,217,044	90%	38%
SRR1685917	SRX790840	SRP050368	SAMN03246845	2,549,418	52%	17%
SRR1685921	SRX790844	SRP050368	SAMN03246846	372,392	54%	25%
SRR1685922	SRX790844	SRP050368	SAMN03246846	16,770,006	10%	8%
SRR1685920	SRX790843	SRP050368	SAMN03246849	9,204,226	51%	9%
SRR1685918	SRX790841	SRP050368	SAMN03246850	4,926,518	43%	17%
SRR1685926	SRX790848	SRP050368	SAMN03246852	3,028,006	56%	17%
SRR1685939	SRX790860	SRP050368	SAMN03246853	6,703,286	56%	11%
SRR1685930	SRX790852	SRP050368	SAMN03246854	11,800,620	51%	23%
SRR1685923	SRX790845	SRP050368	SAMN03246855	18,860,972	25%	8%
SRR1685924	SRX790846	SRP050368	SAMN03246856	9,911,876	74%	17%
SRR1685934	SRX790856	SRP050368	SAMN03246857	20,502,444	56%	19%
SRR1685931	SRX790853	SRP050368	SAMN03246858	3,326,376	61%	11%
SRR1685927	SRX790849	SRP050368	SAMN03246859	7,719,226	47%	12%
SRR1685932	SRX790854	SRP050368	SAMN03246860	6,538,476	61%	7%
SRR1685925	SRX790847	SRP050368	SAMN03246861	7,762,524	50%	25%
SRR1685929	SRX790851	SRP050368	SAMN03246862	13,101,238	54%	14%
SRR1685928	SRX790850	SRP050368	SAMN03246863	3,625,112	51%	14%
SRR1685937	SRX790858	SRP050368	SAMN03246864	4,989,082	60%	21%
SRR1685938	SRX790859	SRP050368	SAMN03246865	6,193,006	56%	13%
SRR1685935	SRX790857	SRP050368	SAMN03246866	3,354,104	50%	27%
SRR1685936	SRX790857	SRP050368	SAMN03246866	5,181,766	0%	3%
SRR1685933	SRX790855	SRP050368	SAMN03246867	10,796,478	22%	15%
SRR1752053	SRX1257755	SRP051563	SAMN03273284	49,091,968	88%	39%
SRR1752057	SRX838027	SRP051563	SAMN03273285	66,190,746	86%	36%
SRR1752058	SRX1257756	SRP051563	SAMN03273286	56,844,730	88%	39%
SRR2067864	SRX1063344	SRP059605	SAMN03779459	23,882,388	81%	36%
SRR2067865	SRX1063345	SRP059605	SAMN03779460	28,263,450	80%	35%
SRR2067866	SRX1063346	SRP059605	SAMN03779461	28,590,338	82%	36%
SRR2067867	SRX1063347	SRP059605	SAMN03779462	31,351,944	82%	37%
SRR2067868	SRX1063348	SRP059605	SAMN03779463	26,855,700	82%	37%
SRR2067869	SRX1063349	SRP059605	SAMN03779464	34,053,640	84%	38%
SRR2067870	SRX1063350	SRP059605	SAMN03779465	29,712,256	80%	34%
SRR2067873	SRX1063353	SRP059605	SAMN03779468	30,430,518	80%	35%
SRR2067874	SRX1063354	SRP059605	SAMN03779469	24,852,508	83%	37%
SRR3579892	SRX1796660	SRP059605	SAMN05171417	18,057,772	81%	34%
SRR3579893	SRX1796661	SRP059605	SAMN05171418	22,125,982	83%	34%
SRR3579894	SRX1796662	SRP059605	SAMN05171419	26,495,110	83%	34%
SRR2074560	SRX1069584	SRP059768	SAMN03785591	16,461,040	83%	14%
SRR2074567	SRX1072840	SRP059768	SAMN03785591	22,724,036	83%	14%
SRR5258695	SRX2563706	SRP099810	SAMN06335325	40,083,434	84%	45%
SRR5258694	SRX2563705	SRP099810	SAMN06335326	42,428,160	82%	49%
SRR5258693	SRX2563704	SRP099810	SAMN06335327	48,116,524	83%	44%
SRR5258692	SRX2563703	SRP099810	SAMN06335328	40,826,712	84%	50%
SRR5258699	SRX2563710	SRP099810	SAMN06335333	13,029,563	0%	34%
SRR5258698	SRX2563709	SRP099810	SAMN06335334	13,602,611	0%	7%
SRR5258697	SRX2563708	SRP099810	SAMN06335335	50,461,524	86%	49%
SRR5258696	SRX2563707	SRP099810	SAMN06335336	43,120,542	83%	42%
SRR5331663	SRX2630490	SRP101661	SAMN06556235	22,460,558	91%	38%
SRR5331662	SRX2630489	SRP101661	SAMN06556236	23,646,020	92%	39%
SRR5331654	SRX2630481	SRP101661	SAMN06556237	27,153,210	82%	25%
SRR5331647	SRX2630474	SRP101661	SAMN06556238	15,379,478	81%	27%
SRR5331646	SRX2630473	SRP101661	SAMN06556239	29,028,782	87%	26%
SRR5331653	SRX2630480	SRP101661	SAMN06556240	33,012,412	78%	26%
SRR5331652	SRX2630479	SRP101661	SAMN06556241	29,009,660	85%	27%
SRR5331648	SRX2630475	SRP101661	SAMN06556242	9,645,382	81%	27%
SRR5331649	SRX2630476	SRP101661	SAMN06556243	26,629,214	88%	25%
SRR5331650	SRX2630477	SRP101661	SAMN06556244	22,464,140	88%	25%
SRR5331657	SRX2630484	SRP101661	SAMN06556245	20,620,676	87%	24%
SRR5331651	SRX2630478	SRP101661	SAMN06556246	30,442,050	87%	24%
SRR5331661	SRX2630488	SRP101661	SAMN06556247	21,540,748	89%	38%
SRR5331655	SRX2630482	SRP101661	SAMN06556248	30,629,254	82%	24%
SRR5331656	SRX2630483	SRP101661	SAMN06556249	29,668,724	89%	26%
SRR5331660	SRX2630487	SRP101661	SAMN06556250	18,424,964	91%	37%
SRR5331659	SRX2630486	SRP101661	SAMN06556251	24,215,052	86%	42%
SRR5331658	SRX2630485	SRP101661	SAMN06556252	8,451,196	80%	44%
SRR5590984	SRX2848088	SRP107916	SAMN07160535	39,208,280	91%	48%
SRR5590985	SRX2848089	SRP107916	SAMN07160536	29,470,706	91%	49%
SRR5590988	SRX2848092	SRP107916	SAMN07160537	29,828,306	91%	48%
SRR5590989	SRX2848093	SRP107916	SAMN07160548	32,120,386	91%	50%
SRR5590990	SRX2848094	SRP107916	SAMN07160551	30,823,082	91%	49%
SRR5590994	SRX2848098	SRP107916	SAMN07160559	39,692,226	91%	49%
SRR5590995	SRX2848099	SRP107916	SAMN07160561	33,260,180	88%	46%
SRR5590996	SRX2848100	SRP107916	SAMN07160562	41,150,444	88%	43%
SRR5590997	SRX2848101	SRP107916	SAMN07160563	49,871,394	86%	45%
SRR6181336	SRX3291942	SRP120142	SAMN07788113	65,602,206	86%	55%
SRR6181335	SRX3291943	SRP120142	SAMN07788114	62,577,732	87%	54%
SRR6181338	SRX3291940	SRP120142	SAMN07788115	52,218,986	89%	55%
SRR6181337	SRX3291941	SRP120142	SAMN07788116	61,943,158	85%	55%
SRR6181332	SRX3291946	SRP120142	SAMN07788117	72,557,568	85%	52%
SRR6181331	SRX3291947	SRP120142	SAMN07788118	44,339,096	88%	55%
SRR6181334	SRX3291944	SRP120142	SAMN07788119	66,843,268	87%	55%
SRR6181333	SRX3291945	SRP120142	SAMN07788120	54,696,356	88%	54%
SRR6181339	SRX3291939	SRP120142	SAMN07788121	76,142,700	89%	54%
SRR6701612	SRX3675821	SRP132530	SAMN08511621	53,229,692	87%	50%
SRR6701613	SRX3675820	SRP132530	SAMN08511622	51,273,428	88%	51%
SRR6701610	SRX3675823	SRP132530	SAMN08511623	55,833,682	86%	42%
SRR6701611	SRX3675822	SRP132530	SAMN08511624	53,706,764	87%	49%
SRR6701614	SRX3675819	SRP132530	SAMN08511625	55,909,212	87%	51%
SRR6701615	SRX3675818	SRP132530	SAMN08511626	48,843,460	85%	43%
SRR6701620	SRX3675825	SRP132531	SAMN08511615	61,164,956	85%	46%
SRR6701621	SRX3675824	SRP132531	SAMN08511616	55,488,936	88%	53%
SRR6701618	SRX3675827	SRP132531	SAMN08511617	51,024,058	86%	48%
SRR6701619	SRX3675826	SRP132531	SAMN08511618	50,497,204	86%	48%
SRR6701616	SRX3675829	SRP132531	SAMN08511619	56,315,604	87%	52%
SRR6701617	SRX3675828	SRP132531	SAMN08511620	46,033,474	85%	41%
SRR6957088	SRX3900039	SRP137948	SAMN08887793	43,867,610	88%	38%
SRR6957089	SRX3900038	SRP137948	SAMN08887794	41,813,218	87%	35%
SRR6957086	SRX3900041	SRP137948	SAMN08887795	39,424,796	88%	35%
SRR6957087	SRX3900040	SRP137948	SAMN08887796	35,877,596	87%	34%
SRR6957084	SRX3900043	SRP137948	SAMN08887797	36,220,772	86%	36%
SRR6957085	SRX3900042	SRP137948	SAMN08887798	36,297,218	87%	35%
SRR6957082	SRX3900045	SRP137948	SAMN08887799	37,393,922	87%	37%
SRR6957083	SRX3900044	SRP137948	SAMN08887800	42,794,270	87%	36%
SRR6957090	SRX3900037	SRP137948	SAMN08887801	41,560,110	89%	31%
SRR6957091	SRX3900036	SRP137948	SAMN08887802	42,621,506	87%	33%
SRR6957080	SRX3900047	SRP137948	SAMN08887803	42,527,046	87%	34%
SRR6957081	SRX3900046	SRP137948	SAMN08887804	37,878,290	87%	36%

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Actinopterygii GenBank	80,710	77,257 (95.72%)	77,257 (95.72%)	68.46%	81.07%
Actinopterygii known RefSeq (NP_)	24,871	23,873 (95.99%)	23,873 (95.99%)	67.64%	78.58%
Homo sapiens known RefSeq (NP_)	51,073	43,883 (85.92%)	43,883 (85.92%)	65.37%	67.93%
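
The percentages in the table above follow directly from the counts. As a minimal sketch, the Python snippet below recomputes the share of retrieved sequences that ProSplign aligned; the counts are copied from the table, while the names `counts`, `percent_of` and `aligned_pct` are illustrative and not part of the pipeline.

```python
# Counts copied from the protein alignment table:
# source -> (sequences retrieved from Entrez, sequences aligned by ProSplign)
counts = {
    "Actinopterygii GenBank": (80_710, 77_257),
    "Actinopterygii known RefSeq (NP_)": (24_871, 23_873),
    "Homo sapiens known RefSeq (NP_)": (51_073, 43_883),
}


def percent_of(part: int, whole: int) -> float:
    """Return part as a percentage of whole, rounded to two decimals."""
    return round(100.0 * part / whole, 2)


# Hypothetical helper mapping: source -> percent of retrieved sequences aligned
aligned_pct = {
    source: percent_of(aligned, retrieved)
    for source, (retrieved, aligned) in counts.items()
}

for source, pct in aligned_pct.items():
    print(f"{source}: {pct:.2f}%")
```

In this release the number of sequences passed to Gnomon equals the number aligned for every source, so the same calculation reproduces both percentage columns.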

Assembly-assembly alignments of current to previous assembly

When the assembly changes between two rounds of annotation, genes in the current and the previous annotation are mapped to each other using the genomic alignments of the current assembly to the previous assembly so that gene identifiers can be preserved. The success of the remapping depends largely on how well the two assembly versions align to each other.

The table below reports the percent coverage of each assembly by the other and the average percent identity of the alignments. The 'First pass' alignments are reciprocal best hits, while the 'Total' alignments also include 'Second pass' or non-reciprocal best alignments. For more information about the assembly-assembly alignment process, please visit the NCBI Genome Remapping Service page.

	First Pass	Total
O_niloticus_UMD_NMBU (Current) Coverage	99.99%	99.99%
MKQE01 (Previous) Coverage	99.58%	99.58%
Percent Identity	100.00%	100.00%
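
The distinction between reciprocal and non-reciprocal best hits can be illustrated with a small Python sketch. This is not the NCBI alignment procedure, only the general idea under simplifying assumptions: each candidate alignment is reduced to a single score, and the region names and scores are hypothetical.

```python
def best_hits(scores):
    """Map each query to its highest-scoring target.

    scores: dict mapping (query, target) -> alignment score.
    """
    best = {}
    for (query, target), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (target, score)
    return {query: target for query, (target, _) in best.items()}


def split_passes(scores):
    """Split best hits into reciprocal and non-reciprocal (current, previous) pairs."""
    forward = best_hits(scores)  # current region -> best previous region
    reverse = best_hits({(t, q): s for (q, t), s in scores.items()})  # previous -> best current
    # Reciprocal best hits: each region is the other's best hit.
    first_pass = {(q, t) for q, t in forward.items() if reverse.get(t) == q}
    # Best hits in either direction that are not reciprocal.
    all_best = set(forward.items()) | {(q, t) for t, q in reverse.items()}
    second_pass = all_best - first_pass
    return first_pass, second_pass


# Hypothetical scores between regions of a current and a previous assembly.
scores = {
    ("cur_A", "prev_A"): 100,
    ("cur_B", "prev_A"): 80,
}

first_pass, second_pass = split_passes(scores)
print("first pass:", sorted(first_pass))
print("second pass:", sorted(second_pass))
```

Here `cur_A` and `prev_A` pick each other, so the pair is a reciprocal best hit. `cur_B` also aligns best to `prev_A`, but `prev_A` prefers `cur_A`, so that pair is only a non-reciprocal best alignment.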

Comparison of the current and previous annotations

The annotation produced for this release (104) was compared to the annotation in the previous release (103) for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.
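
The report does not give the scoring formulas. Purely as an illustration of what scoring by "overlap in exon sequence" and "matches in exon boundaries" could look like, the Python sketch below computes two simple ratios for a pair of features. The intersection-over-union definitions, the function names and the coordinates are all assumptions, not the pipeline's actual method.

```python
def exon_bases(exons):
    """Expand (start, end) exon intervals (1-based, inclusive) into a set of positions."""
    return {pos for start, end in exons for pos in range(start, end + 1)}


def overlap_score(current, previous):
    """Shared exon bases divided by all exon bases in either feature."""
    cur, prev = exon_bases(current), exon_bases(previous)
    union = cur | prev
    return len(cur & prev) / len(union) if union else 0.0


def boundary_score(current, previous):
    """Shared exon boundaries divided by all exon boundaries in either feature."""
    def boundaries(exons):
        # Tag starts and ends separately so a start never matches an end.
        return ({("start", s) for s, _ in exons} |
                {("end", e) for _, e in exons})

    cur, prev = boundaries(current), boundaries(previous)
    union = cur | prev
    return len(cur & prev) / len(union) if union else 0.0


# Hypothetical two-exon features on the same coordinate system.
current_feature = [(100, 200), (300, 400)]
previous_feature = [(100, 200), (300, 380)]

print(round(overlap_score(current_feature, previous_feature), 3))
print(round(boundary_score(current_feature, previous_feature), 3))
```

In this example the two features differ only at the end of the second exon, so most exon bases are shared while one of the four boundaries differs on each side.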

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides links to the details of the comparison in tabular format and in a Genome Workbench project.

	O_niloticus_UMD_NMBU (Current) to ASM185804v2 (Previous)
Identical	9%
Minor changes	66%
Major changes	9%
New	16%
Deprecated	6%
Other	<1%
Download the report	tabular, Genome Workbench
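
A quick arithmetic check helps when reading the table. The sketch below sums the categories, using the percentages copied from the table. It rests on an interpretation that the report does not spell out: that Identical, Minor changes, Major changes and New describe genes present in the current release, whereas Deprecated genes exist only in the previous release and are expressed relative to the current gene count.

```python
# Percentages copied from the comparison table ("Other" is reported only as <1%).
current_gene_categories = {
    "Identical": 9,
    "Minor changes": 66,
    "Major changes": 9,
    "New": 16,
}
deprecated_pct = 6  # assumed to describe genes found only in the previous release

current_total = sum(current_gene_categories.values())
print(f"Categories of current genes sum to {current_total}%")

# Share of current genes that have a counterpart in the previous release.
carried_over = current_total - current_gene_categories["New"]
print(f"Current genes matched to a previous gene: {carried_over}%")
```

Under this reading, the rounded categories for current genes add up to 100%, and the Deprecated row sits outside that total.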

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 22(2):134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20
