NCBI Salmo salar Annotation Release 102

The RefSeq genome records for Salmo salar were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
BUSCO results: Annotation completeness assessed with BUSCO
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Similarity of current and previous assembly: The similarity of the current and previous assembly
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Salmo salar Annotation Release 102

Annotation release ID: 102
Date of Entrez queries for transcripts and proteins: Dec 15 2021
Date of submission of annotation to the public databases: Jan 7 2022
Software version: 9.0

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
Ssal_v3.1	GCF_905237065.1	NORWEGIAN UNIVERSITY OF LIFE SCIENCES	04-21-2021	Reference	29 assembled chromosomes; unplaced scaffolds

Gene and feature statistics

Counts and length of annotated features are provided below for each assembly.

Feature counts

Feature	Ssal_v3.1
Genes and pseudogenes	65,343
protein-coding	42,985
non-coding	17,870
Transcribed pseudogenes	201
Non-transcribed pseudogenes	4,136
genes with variants	20,600
Immunoglobulin/T-cell receptor gene segments	126
other	25
mRNAs	95,298
fully-supported	92,394
with > 5% ab initio	1,470
partial	802
with filled gap(s)	154
known RefSeq (NM_)	3,480
model RefSeq (XM_)	91,818
non-coding RNAs	23,261
fully-supported	10,720
with > 5% ab initio	0
partial	15
with filled gap(s)	9
known RefSeq (NR_)	0
model RefSeq (XR_)	17,387
pseudo transcripts	212
fully-supported	183
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	212
CDSs	95,423
fully-supported	92,394
with > 5% ab initio	1,655
partial	746
with major correction(s)	1,613
known RefSeq (NP_)	3,479
model RefSeq (XP_)	91,818

Detailed reports

The counts below do not include pseudogenes.

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	60,880	23,172	7,050	53	1,288,197
All transcripts	118,559	3,378	2,772	53	96,313
mRNA	95,298	3,937	3,239	135	96,313
misc_RNA	3,636	3,595	3,115	180	18,138
tRNA	5,874	74	73	68	96
lncRNA	7,096	1,548	1,153	87	15,933
snoRNA	869	116	93	63	319
snRNA	1,611	132	117	53	199
rRNA	4,151	124	119	115	3,926
Single-exon transcripts	1,461	1,999	1,732	213	18,717
coding transcripts (NM_/XM_ )	1,461	1,999	1,732	213	18,717
CDSs	95,298	2,255	1,587	96	95,031
Exons	530,103	328	145	1	27,288
in coding transcripts (NM_/XM_ )	504,484	320	144	1	27,288
in non-coding transcripts (NR_/XR_ )	46,900	361	147	2	11,828
Introns	470,681	3,254	492	26	1,143,324
in coding transcripts (NM_/XM_ )	453,519	3,285	498	26	1,143,324
in non-coding transcripts (NR_/XR_ )	38,009	2,685	422	30	507,922

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	2.05	1	1	50
Number of exons per transcript	12.55	9	1	237

BUSCO analysis of gene annotation

BUSCO v4.1.4 (Simão et al 2015, PMID: 26059717) was run in "protein" mode on the annotated gene set picking one longest protein per gene, and run using the actinopterygii_odb10 lineage dataset. Results are reported for the gene set from the primary assembly unit, and presented in BUSCO notation (C:complete [S:single-copy, D:duplicated], F:fragmented, M:missing, n:number of genes used).

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the UniProtKB/Swiss-Prot curated proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 42985 coding genes, 39457 genes had a protein with an alignment covering 50% or more of the query and 18337 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: UniProtKB/Swiss-Prot curated proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker (if calculated), for each assembly. RepeatMasker results are only calculated for organisms with complete Dfam HMM model collections.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with WindowMasker
Ssal_v3.1	GCF_905237065.1	57.86%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign, minimap2, or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species known RefSeq (NM_/NR_)	3,547	3,516 (99.13%)	3,449 (97.24%)	99.80%	99.26%
Same-species Genbank	13,718	13,597 (99.12%)	13,065 (95.24%)	99.65%	96.14%
Same-species EST	498,177	474,048 (95.16%)	452,243 (90.78%)	99.49%	99.30%

RefSeq transcript alignment quality report

The known RefSeq transcripts (NM_ and NR_ accessions) are a set of hiqh-quality transcripts maintained by the RefSeq group at NCBI. Alignment statistics for this group of transcripts, such as percent and number of sequences not aligning at all, percent best alignments split between multiple scaffolds, and percent alignments not covering the full CDS are indicative of the genome quality and are provided below.

	Ssal_v3.1 Primary Assembly
Number of sequences retrieved from Entrez	3,547
Number (%) of sequences not aligning	31 (0.87%)
Number (%) of sequences with multiple best alignments (split genes)	11 (0.31%)
Number (%) of sequences with CDS coverage < 95%	37 (1.05%)

RNA-Seq alignments

The following RNA-Seq reads from the Sequence Read Archive were also used for gene prediction:

Hide alignments statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Publication	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	NA	Aggregate of all aligned samples	8,287,377,006	75%	33%	562,074
SAMEA104177543	NA	Hindgut (Salmo salar, SAMEA104177543)	26,452,487	87%	29%	289,297
SAMEA104177544	NA	Hindgut (Salmo salar, SAMEA104177544)	22,991,524	87%	28%	278,955
SAMEA104177545	NA	Hindgut (Salmo salar, SAMEA104177545)	27,702,494	87%	28%	301,922
SAMEA104177546	NA	Hindgut (Salmo salar, SAMEA104177546)	27,115,113	86%	29%	306,098
SAMEA104177547	NA	Hindgut (Salmo salar, SAMEA104177547)	17,318,937	84%	29%	261,459
SAMEA104177548	NA	Hindgut (Salmo salar, SAMEA104177548)	20,889,469	87%	31%	267,280
SAMEA104177549	NA	Hindgut (Salmo salar, SAMEA104177549)	24,219,443	85%	30%	266,193
SAMEA104177550	NA	Hindgut (Salmo salar, SAMEA104177550)	30,097,616	86%	35%	296,812
SAMEA104177551	NA	Hindgut (Salmo salar, SAMEA104177551)	29,961,765	86%	27%	286,380
SAMEA104177552	NA	Hindgut (Salmo salar, SAMEA104177552)	17,504,760	86%	29%	252,684
SAMEA104177553	NA	Hindgut (Salmo salar, SAMEA104177553)	24,234,208	86%	29%	269,178
SAMEA104177555	NA	Hindgut (Salmo salar, SAMEA104177555)	24,211,082	86%	23%	264,986
SAMEA104177556	NA	Hindgut (Salmo salar, SAMEA104177556)	27,570,481	83%	29%	280,480
SAMEA104177557	NA	Hindgut (Salmo salar, SAMEA104177557)	23,087,537	84%	27%	280,931
SAMEA104177558	NA	Pyloric caeca (Salmo salar, SAMEA104177558)	23,727,500	85%	27%	279,879
SAMEA104177559	NA	Pyloric caeca (Salmo salar, SAMEA104177559)	25,055,891	86%	30%	297,831
SAMEA104177560	NA	Pyloric caeca (Salmo salar, SAMEA104177560)	19,502,174	82%	25%	250,812
SAMEA104177561	NA	Pyloric caeca (Salmo salar, SAMEA104177561)	16,486,381	84%	29%	250,164
SAMEA104177562	NA	Pyloric caeca (Salmo salar, SAMEA104177562)	18,181,711	85%	28%	247,000
SAMEA104177563	NA	Pyloric caeca (Salmo salar, SAMEA104177563)	19,616,231	83%	25%	241,635
SAMEA104177564	NA	Pyloric caeca (Salmo salar, SAMEA104177564)	20,608,437	89%	32%	235,634
SAMEA104177565	NA	Pyloric caeca (Salmo salar, SAMEA104177565)	13,929,393	80%	26%	190,677
SAMEA104177566	NA	Pyloric caeca (Salmo salar, SAMEA104177566)	16,619,184	85%	28%	237,679
SAMEA104177567	NA	Pyloric caeca (Salmo salar, SAMEA104177567)	17,134,505	86%	32%	223,431
SAMEA104177568	NA	Pyloric caeca (Salmo salar, SAMEA104177568)	22,860,286	85%	29%	269,492
SAMEA104177569	NA	Pyloric caeca (Salmo salar, SAMEA104177569)	29,851,032	86%	31%	279,084
SAMEA104177570	NA	Pyloric caeca (Salmo salar, SAMEA104177570)	16,582,804	86%	30%	220,680
SAMEA104177571	NA	Pyloric caeca (Salmo salar, SAMEA104177571)	17,539,895	85%	30%	230,286
SAMEA104177572	NA	Pyloric caeca (Salmo salar, SAMEA104177572)	18,725,648	85%	29%	228,089
SAMEA104177573	NA	Stomach (Salmo salar, SAMEA104177573)	34,684,640	87%	29%	331,432
SAMEA104177574	NA	Stomach (Salmo salar, SAMEA104177574)	30,808,465	87%	25%	245,649
SAMEA104177575	NA	Stomach (Salmo salar, SAMEA104177575)	26,808,331	90%	36%	254,791
SAMEA104177576	NA	Stomach (Salmo salar, SAMEA104177576)	15,636,694	86%	31%	242,641
SAMEA104177577	NA	Stomach (Salmo salar, SAMEA104177577)	30,652,166	90%	36%	263,667
SAMEA104177578	NA	Stomach (Salmo salar, SAMEA104177578)	19,136,728	88%	28%	251,920
SAMEA104177579	NA	Stomach (Salmo salar, SAMEA104177579)	21,799,556	89%	34%	281,533
SAMEA104177580	NA	Stomach (Salmo salar, SAMEA104177580)	18,351,696	88%	34%	230,811
SAMEA104177581	NA	Stomach (Salmo salar, SAMEA104177581)	38,100,278	87%	30%	299,408
SAMEA104177582	NA	Stomach (Salmo salar, SAMEA104177582)	29,779,732	88%	30%	243,400
SAMEA104177583	NA	Stomach (Salmo salar, SAMEA104177583)	17,395,720	85%	25%	242,601
SAMEA104177584	NA	Stomach (Salmo salar, SAMEA104177584)	16,749,770	87%	32%	228,054
SAMEA104177585	NA	Stomach (Salmo salar, SAMEA104177585)	15,529,275	85%	27%	242,593
SAMEA104177586	NA	Stomach (Salmo salar, SAMEA104177586)	21,601,362	85%	27%	215,425
SAMEA104177587	NA	Stomach (Salmo salar, SAMEA104177587)	21,338,512	87%	26%	278,414
SAMEA3502741	NA	A.salmon: Intestine (Salmo salar, SAMEA3502741)	7,384,308	56%	59%	226,951
SAMEA3502742	NA	A.salmon: Intestine (Salmo salar, SAMEA3502742)	8,532,298	63%	62%	251,106
SAMEA3502743	NA	A.salmon: Intestine (Salmo salar, SAMEA3502743)	3,200,694	65%	56%	186,812
SAMEA3502744	NA	A.salmon: Intestine (Salmo salar, SAMEA3502744)	5,078,554	65%	61%	217,932
SAMEA3502745	NA	A.salmon: Intestine (Salmo salar, SAMEA3502745)	6,235,980	60%	60%	220,873
SAMEA3502746	NA	A.salmon: Intestine (Salmo salar, SAMEA3502746)	6,594,656	63%	61%	238,536
SAMEA3502747	NA	A.salmon: Intestine (Salmo salar, SAMEA3502747)	7,230,462	61%	59%	220,573
SAMEA3502748	NA	A.salmon: Intestine (Salmo salar, SAMEA3502748)	4,450,136	68%	56%	230,706
SAMEA3502749	NA	A.salmon: Intestine (Salmo salar, SAMEA3502749)	3,038,706	64%	58%	187,522
SAMEA3502750	NA	A.salmon: Intestine (Salmo salar, SAMEA3502750)	3,222,422	64%	58%	192,051
SAMEA3502751	NA	A.salmon: Intestine (Salmo salar, SAMEA3502751)	3,679,530	63%	60%	206,782
SAMN02597923	NA	notochord (Salmo salar, 510day degree, SAMN02597923)	23,030,660	88%	29%	231,782
SAMN02597924	NA	notochord (Salmo salar, 510day degree, SAMN02597924)	19,549,162	88%	28%	222,260
SAMN02597925	NA	notochord (Salmo salar, 510day degree, SAMN02597925)	19,882,380	88%	29%	235,594
SAMN02597926	NA	notochord (Salmo salar, 610day degree, SAMN02597926)	16,277,062	88%	30%	223,498
SAMN02597927	NA	notochord (Salmo salar, 610day degree, SAMN02597927)	19,895,740	89%	29%	229,036
SAMN02597928	NA	notochord (Salmo salar, 610day degree, SAMN02597928)	24,744,996	88%	28%	243,552
SAMN02597929	NA	notochord (Salmo salar, 710day degree, SAMN02597929)	21,245,750	87%	23%	214,675
SAMN02597930	NA	notochord (Salmo salar, 710day degree, SAMN02597930)	22,555,096	88%	26%	237,282
SAMN02597931	NA	notochord (Salmo salar, 710day degree, SAMN02597931)	21,445,728	88%	25%	232,458
SAMN02863983	20887641,10548724,27088604	ovary (Salmo salar, adult, female, SAMN02863983)	84,051,830	91%	33%	324,599
SAMN02863984	20887641,10548724,27088604	testis (Salmo salar, adult, male, SAMN02863984)	185,311,952	82%	25%	451,702
SAMN02864008	20887641,10548724,27088604	brain (Salmo salar, juvenile, male, SAMN02864008)	58,939,250	77%	20%	365,232
SAMN02864009	20887641,10548724,27088604	eye (Salmo salar, juvenile, male, SAMN02864009)	60,380,888	80%	23%	299,790
SAMN02864010	20887641,10548724,27088604	gill (Salmo salar, juvenile, male, SAMN02864010)	59,793,962	82%	28%	371,815
SAMN02864011	20887641,10548724,27088604	gut (Salmo salar, juvenile, male, SAMN02864011)	59,806,348	83%	31%	333,254
SAMN02864012	20887641,10548724,27088604	head kidney (Salmo salar, juvenile, male, SAMN02864012)	59,084,708	87%	31%	333,219
SAMN02864013	20887641,10548724,27088604	heart (Salmo salar, juvenile, male, SAMN02864013)	58,163,180	80%	29%	297,356
SAMN02864014	20887641,10548724,27088604	kidney (Salmo salar, juvenile, male, SAMN02864014)	61,054,936	77%	28%	349,736
SAMN02864015	20887641,10548724,27088604	liver (Salmo salar, juvenile, male, SAMN02864015)	58,784,272	83%	29%	250,217
SAMN02864016	20887641,10548724,27088604	muscle (Salmo salar, juvenile, male, SAMN02864016)	61,426,586	88%	35%	313,647
SAMN02864017	20887641,10548724,27088604	nose (Salmo salar, juvenile, male, SAMN02864017)	59,545,012	84%	26%	354,542
SAMN02864018	20887641,10548724,27088604	pyloric caecum (Salmo salar, juvenile, male, SAMN02864018)	61,602,874	86%	32%	321,843
SAMN02864019	20887641,10548724,27088604	skin (Salmo salar, juvenile, male, SAMN02864019)	270,961,440	51%	28%	102,367
SAMN02864020	20887641,10548724,27088604	spleen (Salmo salar, juvenile, male, SAMN02864020)	60,203,316	85%	29%	321,597
SAMN02864156	20887641,10548724,27088604	mixed (Salmo salar, 24 days, male, SAMN02864156)	200,931,860	83%	24%	450,180
SAMN02864157	20887641,10548724,27088604	mixed (Salmo salar, 24 days, female, SAMN02864157)	127,023,780	80%	28%	434,140
SAMN02864158	20887641,10548724,27088604	mixed (Salmo salar, 24 days, female, SAMN02864158)	187,581,008	82%	27%	451,361
SAMN02864159	20887641,10548724,27088604	mixed (Salmo salar, 24 days, male, SAMN02864159)	191,647,362	83%	27%	453,631
SAMN02864160	20887641,10548724,27088604	mixed (Salmo salar, 34 days, male, SAMN02864160)	220,655,182	81%	26%	459,689
SAMN02864161	20887641,10548724,27088604	mixed (Salmo salar, 34 days, female, SAMN02864161)	172,913,674	77%	38%	455,916
SAMN02864162	20887641,10548724,27088604	mixed (Salmo salar, 34 days, female, SAMN02864162)	171,503,954	80%	26%	460,322
SAMN02864163	20887641,10548724,27088604	mixed (Salmo salar, 34 days, male, SAMN02864163)	248,970,246	74%	24%	468,802
SAMN02864164	20887641,10548724,27088604	mixed (Salmo salar, 44 days, male, SAMN02864164)	229,249,072	77%	22%	443,038
SAMN02864165	20887641,10548724,27088604	mixed (Salmo salar, 44 days, female, SAMN02864165)	166,425,050	85%	28%	434,057
SAMN02864166	20887641,10548724,27088604	mixed (Salmo salar, 44 days, female, SAMN02864166)	207,827,708	80%	32%	456,991
SAMN02864167	20887641,10548724,27088604	mixed (Salmo salar, 44 days, male, SAMN02864167)	145,060,418	84%	32%	451,152
SAMN02864168	20887641,10548724,27088604	mixed (Salmo salar, 85 days, male, SAMN02864168)	113,882,898	84%	34%	455,805
SAMN02864169	20887641,10548724,27088604	mixed (Salmo salar, 85 days, female, SAMN02864169)	147,822,962	83%	33%	474,405
SAMN02864170	20887641,10548724,27088604	mixed (Salmo salar, 85 days, female, SAMN02864170)	148,718,702	82%	31%	478,657
SAMN02864171	20887641,10548724,27088604	mixed (Salmo salar, 85 days, male, SAMN02864171)	271,223,826	83%	30%	497,478
SAMN02867526	24951567	brain (Salmo salar, SAMN02867526)	835,301	85%	22%	64,045
SAMN02867527	24951567	pituitary gland (Salmo salar, SAMN02867527)	1,013,257	81%	26%	78,071
SAMN02867528	24951567	hypothalamus (Salmo salar, SAMN02867528)	443,891	84%	15%	27,089
SAMN02929440	NA	pool (Salmo salar, 3 months, SAMN02929440)	16,355,118	58%	53%	267,367
SAMN03758386	27088604	skin (Salmo salar, one year, female, SAMN03758386)	10,630,980	23%	50%	220,049
SAMN03758389	27088604	liver (Salmo salar, one year, female, SAMN03758389)	14,919,064	27%	63%	154,166
SAMN03758391	27088604	gill (Salmo salar, one year, female, SAMN03758391)	14,011,936	14%	20%	146,793
SAMN03758392	27088604	muscle (Salmo salar, one year, female, SAMN03758392)	11,891,276	29%	78%	132,588
SAMN03758393	27088604	brain (Salmo salar, one year, female, SAMN03758393)	13,199,982	17%	33%	220,789
SAMN03758395	27088604	spleen (Salmo salar, one year, female, SAMN03758395)	12,640,948	28%	50%	231,252
SAMN03758397	27088604	heart (Salmo salar, one year, female, SAMN03758397)	10,958,370	20%	44%	170,296
SAMN03758402	27088604	pyloric caecum (Salmo salar, one year, female, SAMN03758402)	8,463,286	17%	49%	132,738
SAMN03758403	27088604	pancreas (Salmo salar, one year, female, SAMN03758403)	8,461,936	12%	43%	111,437
SAMN03761426	27088604	gut (Salmo salar, one year, female, SAMN03761426)	14,802,698	24%	41%	203,331
SAMN04338468	27088604	sperm (Salmo salar, adult, male, SAMN04338468)	95,652,320	56%	11%	269,263
SAMN06196406	NA	gut (Salmo salar, female, SAMN06196406)	5,774,000	27%	54%	170,229
SAMN06196407	NA	gut (Salmo salar, male, SAMN06196407)	5,667,084	36%	55%	183,371
SAMN06196408	NA	gut (Salmo salar, female, SAMN06196408)	4,785,238	35%	56%	179,407
SAMN06196409	NA	gut (Salmo salar, male, SAMN06196409)	5,347,388	29%	54%	141,742
SAMN06196410	NA	gill (Salmo salar, female, SAMN06196410)	6,434,500	34%	55%	208,719
SAMN06196411	NA	gill (Salmo salar, male, SAMN06196411)	5,484,284	34%	55%	189,209
SAMN06196412	NA	gill (Salmo salar, female, SAMN06196412)	5,733,196	33%	53%	198,109
SAMN06196413	NA	gill (Salmo salar, male, SAMN06196413)	6,394,264	31%	53%	205,118
SAMN06196414	NA	brain (Salmo salar, female, SAMN06196414)	6,499,880	32%	40%	203,457
SAMN06196415	NA	brain (Salmo salar, male, SAMN06196415)	6,706,172	33%	40%	217,049
SAMN06196416	NA	brain (Salmo salar, female, SAMN06196416)	8,188,566	33%	42%	250,946
SAMN06196417	NA	brain (Salmo salar, male, SAMN06196417)	7,986,002	37%	41%	263,323
SAMN06196418	NA	kidney (Salmo salar, female, SAMN06196418)	6,856,148	34%	56%	199,163
SAMN06196419	NA	kidney (Salmo salar, male, SAMN06196419)	7,429,440	33%	54%	195,413
SAMN06196420	NA	kidney (Salmo salar, female, SAMN06196420)	6,979,072	36%	57%	206,856
SAMN06196421	NA	kidney (Salmo salar, male, SAMN06196421)	7,223,910	36%	57%	225,104
SAMN06196422	NA	liver (Salmo salar, female, SAMN06196422)	8,096,192	31%	55%	120,549
SAMN06196423	NA	liver (Salmo salar, male, SAMN06196423)	7,387,980	31%	57%	112,390
SAMN06196424	NA	liver (Salmo salar, female, SAMN06196424)	6,768,474	36%	60%	131,141
SAMN06196425	NA	liver (Salmo salar, male, SAMN06196425)	7,489,226	35%	61%	123,350
SAMN06196426	NA	eye (Salmo salar, female, SAMN06196426)	7,054,558	36%	58%	203,836
SAMN06196427	NA	eye (Salmo salar, male, SAMN06196427)	6,778,750	36%	57%	143,568
SAMN06196428	NA	eye (Salmo salar, female, SAMN06196428)	7,189,788	39%	64%	225,743
SAMN06196429	NA	eye (Salmo salar, male, SAMN06196429)	6,661,804	36%	60%	215,286
SAMN06196430	NA	spleen (Salmo salar, female, SAMN06196430)	7,291,102	37%	64%	158,807
SAMN06196431	NA	spleen (Salmo salar, male, SAMN06196431)	7,127,918	38%	59%	162,326
SAMN06196432	NA	spleen (Salmo salar, female, SAMN06196432)	8,339,566	37%	57%	225,177
SAMN06196433	NA	spleen (Salmo salar, male, SAMN06196433)	7,822,836	41%	59%	232,988
SAMN06196434	NA	skin (Salmo salar, female, SAMN06196434)	7,445,954	37%	54%	225,549
SAMN06196435	NA	skin (Salmo salar, male, SAMN06196435)	7,115,706	43%	52%	229,924
SAMN06196436	NA	skin (Salmo salar, female, SAMN06196436)	7,531,288	40%	56%	239,068
SAMN06196437	NA	skin (Salmo salar, male, SAMN06196437)	6,830,328	39%	53%	226,325
SAMN06196438	NA	olfactory pit (Salmo salar, female, SAMN06196438)	6,527,770	30%	53%	188,092
SAMN06196439	NA	olfactory pit (Salmo salar, male, SAMN06196439)	6,008,182	31%	44%	161,385
SAMN06196440	NA	olfactory pit (Salmo salar, female, SAMN06196440)	9,676,234	35%	54%	268,114
SAMN06196441	NA	olfactory pit (Salmo salar, male, SAMN06196441)	6,037,862	23%	42%	165,040
SAMN06196442	NA	pyloric cecae (Salmo salar, female, SAMN06196442)	6,358,000	20%	48%	112,724
SAMN06196443	NA	pyloric cecae (Salmo salar, male, SAMN06196443)	8,370,206	35%	59%	195,749
SAMN06196444	NA	pyloric cecae (Salmo salar, female, SAMN06196444)	7,694,092	30%	56%	173,615
SAMN06196445	NA	pyloric cecae (Salmo salar, male, SAMN06196445)	6,352,824	26%	55%	149,582
SAMN08237568	NA	Post-smolt, Head kidney, non-infected-R1, non-infected (Salmo salar, SAMN08237568)	5,267,214	79%	23%	169,088
SAMN08237569	NA	Post-smolt, Head kidney, P. salmonis T-LF89-R1, Early stage (Salmo salar, SAMN08237569)	6,075,638	66%	28%	208,024
SAMN08237570	NA	Post-smolt, Head kidney, P. salmonis T-EM90-R1, Early stage (Salmo salar, SAMN08237570)	6,083,230	65%	25%	198,601
SAMN08237571	NA	Post-smolt, Head kidney, P. salmonis CH-LF89-R1, Late stage (Salmo salar, SAMN08237571)	5,164,838	71%	28%	191,709
SAMN08237572	NA	Post-smolt, Head kidney, P. salmonis CH-LF90-R1, Late stage (Salmo salar, SAMN08237572)	6,050,036	77%	29%	179,607
SAMN09225659	NA	Gills (Salmo salar, SAMN09225659)	11,657,046	56%	52%	297,104
SAMN09225660	NA	Gills (Salmo salar, SAMN09225660)	12,827,560	58%	49%	302,119
SAMN09225661	NA	Gills (Salmo salar, SAMN09225661)	16,277,912	56%	50%	308,324
SAMN09225662	NA	Gills (Salmo salar, SAMN09225662)	13,451,584	57%	49%	290,691
SAMN09225663	NA	Liver (Salmo salar, SAMN09225663)	13,496,002	59%	57%	199,586
SAMN09225664	NA	Liver (Salmo salar, SAMN09225664)	11,971,598	52%	51%	180,514
SAMN09225665	NA	Liver (Salmo salar, SAMN09225665)	19,068,866	57%	47%	190,322
SAMN09225666	NA	Liver (Salmo salar, SAMN09225666)	18,346,104	58%	52%	200,817
SAMN09225667	NA	Head kidney (Salmo salar, SAMN09225667)	7,682,751	87%	51%	255,455
SAMN09225668	NA	Head kidney (Salmo salar, SAMN09225668)	7,364,655	88%	50%	235,676
SAMN09225669	NA	Head kidney (Salmo salar, SAMN09225669)	13,572,674	58%	49%	205,850
SAMN09225670	NA	Head kidney (Salmo salar, SAMN09225670)	13,861,374	69%	53%	235,801
SAMN10449040	NA	Hindgut (Salmo salar, 1 yr, SAMN10449040)	17,910,327	85%	35%	265,033
SAMN10449041	NA	Hindgut (Salmo salar, 1 yr, SAMN10449041)	16,088,214	85%	36%	258,421
SAMN10449042	NA	Hindgut (Salmo salar, 1 yr, SAMN10449042)	17,478,124	83%	35%	264,546
SAMN10449043	NA	Hindgut (Salmo salar, 1 yr, SAMN10449043)	17,151,193	85%	36%	261,015
SAMN10449044	NA	Midgut (Salmo salar, 1 yr, SAMN10449044)	18,575,165	85%	34%	260,631
SAMN10449045	NA	Midgut (Salmo salar, 1 yr, SAMN10449045)	20,114,090	84%	32%	265,980
SAMN10449046	NA	Midgut (Salmo salar, 1 yr, SAMN10449046)	13,649,922	86%	35%	241,421
SAMN10449047	NA	Midgut (Salmo salar, 1 yr, SAMN10449047)	15,973,644	86%	36%	254,398
SAMN10449048	NA	Hindgut (Salmo salar, 1 yr, SAMN10449048)	22,407,945	87%	36%	272,417
SAMN10449049	NA	Hindgut (Salmo salar, 1 yr, SAMN10449049)	23,628,387	86%	34%	271,922
SAMN10449050	NA	Hindgut (Salmo salar, 1 yr, SAMN10449050)	21,390,383	86%	32%	267,241
SAMN10449051	NA	Midgut (Salmo salar, 1 yr, SAMN10449051)	15,403,721	86%	35%	251,922
SAMN10449052	NA	Midgut (Salmo salar, 1 yr, SAMN10449052)	15,190,252	85%	34%	246,287
SAMN10449053	NA	Midgut (Salmo salar, 1 yr, SAMN10449053)	17,399,893	87%	35%	257,408
SAMN13046555	NA	Head kidney (Salmo salar, SAMN13046555)	2,549,660	6%	99%	6
SAMN13046556	NA	Head kidney (Salmo salar, SAMN13046556)	2,763,420	12%	99%	4
SAMN13046557	NA	Head kidney (Salmo salar, SAMN13046557)	1,311,772	5%	99%	2
SAMN13046558	NA	Head kidney (Salmo salar, SAMN13046558)	1,844,904	0%	93%	6
SAMN13046559	NA	Head kidney (Salmo salar, SAMN13046559)	1,630,476	0%	100%	3
SAMN13046560	NA	Head kidney (Salmo salar, SAMN13046560)	2,482,242	0%	21%	5
SAMN13046561	NA	Head kidney (Salmo salar, SAMN13046561)	2,576,086	6%	99%	3
SAMN13046562	NA	Head kidney (Salmo salar, SAMN13046562)	2,257,276	0%	100%	3
SAMN13046563	NA	Head kidney (Salmo salar, SAMN13046563)	2,670,600	0%	99%	4
SAMN16981425	33986770	pre-smolt, Gill (Salmo salar, SAMN16981425)	118,281,020	81%	40%	382,796
SAMN16981426	33986770	during smoltification, Gill (Salmo salar, SAMN16981426)	103,372,108	79%	38%	384,515
SAMN16981427	33986770	saltwater post-smolt period, Gill (Salmo salar, SAMN16981427)	110,626,442	81%	41%	391,784
SAMN16981428	33986770	pre-smolt, Liver (Salmo salar, SAMN16981428)	115,823,074	84%	47%	287,542
SAMN16981429	33986770	during smoltification, Liver (Salmo salar, SAMN16981429)	106,642,332	84%	46%	290,232
SAMN16981430	33986770	saltwater post-smolt period, Liver (Salmo salar, SAMN16981430)	120,348,126	82%	47%	308,536
SAMN16981431	33986770	pre-smolt, Head kidney (Salmo salar, SAMN16981431)	145,839,210	81%	44%	361,066
SAMN16981432	33986770	during smoltification, Head kidney (Salmo salar, SAMN16981432)	103,031,788	82%	47%	348,617
SAMN16981433	33986770	saltwater post-smolt period, Head kidney (Salmo salar, SAMN16981433)	114,822,586	84%	46%	307,360
SAMN16981434	33986770	adult, Head kidney (Salmo salar, SAMN16981434)	649,952,748	58%	43%	426,196
SAMN16981435	33986770	adult, Head kidney, Salmonid Alpha Virus (Salmo salar, SAMN16981435)	170,271,158	53%	42%	371,510

Show alignments statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
ERR981255	ERX1059239	ERP011527	SAMEA3502741	7,384,308	56%	59%
ERR981256	ERX1059240	ERP011527	SAMEA3502742	8,532,298	63%	62%
ERR981257	ERX1059241	ERP011527	SAMEA3502743	3,200,694	65%	56%
ERR981258	ERX1059242	ERP011527	SAMEA3502744	5,078,554	65%	61%
ERR981259	ERX1059243	ERP011527	SAMEA3502745	6,235,980	60%	60%
ERR981260	ERX1059244	ERP011527	SAMEA3502746	6,594,656	63%	61%
ERR981261	ERX1059245	ERP011527	SAMEA3502747	7,230,462	61%	59%
ERR981262	ERX1059246	ERP011527	SAMEA3502748	4,450,136	68%	56%
ERR981263	ERX1059247	ERP011527	SAMEA3502749	3,038,706	64%	58%
ERR981264	ERX1059248	ERP011527	SAMEA3502750	3,222,422	64%	58%
ERR981265	ERX1059249	ERP011527	SAMEA3502751	3,679,530	63%	60%
ERR2045701	ERX2104758	ERP024298	SAMEA104177543	26,452,487	87%	29%
ERR2045702	ERX2104759	ERP024298	SAMEA104177544	22,991,524	87%	28%
ERR2045703	ERX2104760	ERP024298	SAMEA104177545	27,702,494	87%	28%
ERR2045704	ERX2104761	ERP024298	SAMEA104177546	27,115,113	86%	29%
ERR2045705	ERX2104762	ERP024298	SAMEA104177547	17,318,937	84%	29%
ERR2045706	ERX2104763	ERP024298	SAMEA104177548	20,889,469	87%	31%
ERR2045707	ERX2104764	ERP024298	SAMEA104177549	24,219,443	85%	30%
ERR2045708	ERX2104765	ERP024298	SAMEA104177550	30,097,616	86%	35%
ERR2045709	ERX2104766	ERP024298	SAMEA104177551	29,961,765	86%	27%
ERR2045710	ERX2104767	ERP024298	SAMEA104177552	17,504,760	86%	29%
ERR2045711	ERX2104768	ERP024298	SAMEA104177553	24,234,208	86%	29%
ERR2045713	ERX2104770	ERP024298	SAMEA104177555	24,211,082	86%	23%
ERR2045714	ERX2104771	ERP024298	SAMEA104177556	27,570,481	83%	29%
ERR2045715	ERX2104772	ERP024298	SAMEA104177557	23,087,537	84%	27%
ERR2045716	ERX2104773	ERP024298	SAMEA104177558	23,727,500	85%	27%
ERR2045717	ERX2104774	ERP024298	SAMEA104177559	25,055,891	86%	30%
ERR2045718	ERX2104775	ERP024298	SAMEA104177560	19,502,174	82%	25%
ERR2045719	ERX2104776	ERP024298	SAMEA104177561	16,486,381	84%	29%
ERR2045720	ERX2104777	ERP024298	SAMEA104177562	18,181,711	85%	28%
ERR2045721	ERX2104778	ERP024298	SAMEA104177563	19,616,231	83%	25%
ERR2045722	ERX2104779	ERP024298	SAMEA104177564	20,608,437	89%	32%
ERR2045723	ERX2104780	ERP024298	SAMEA104177565	13,929,393	80%	26%
ERR2045724	ERX2104781	ERP024298	SAMEA104177566	16,619,184	85%	28%
ERR2045725	ERX2104782	ERP024298	SAMEA104177567	17,134,505	86%	32%
ERR2045726	ERX2104783	ERP024298	SAMEA104177568	22,860,286	85%	29%
ERR2045727	ERX2104784	ERP024298	SAMEA104177569	29,851,032	86%	31%
ERR2045728	ERX2104785	ERP024298	SAMEA104177570	16,582,804	86%	30%
ERR2045729	ERX2104786	ERP024298	SAMEA104177571	17,539,895	85%	30%
ERR2045730	ERX2104787	ERP024298	SAMEA104177572	18,725,648	85%	29%
ERR2045731	ERX2104788	ERP024298	SAMEA104177573	34,684,640	87%	29%
ERR2045732	ERX2104789	ERP024298	SAMEA104177574	30,808,465	87%	25%
ERR2045733	ERX2104790	ERP024298	SAMEA104177575	26,808,331	90%	36%
ERR2045734	ERX2104791	ERP024298	SAMEA104177576	15,636,694	86%	31%
ERR2045735	ERX2104792	ERP024298	SAMEA104177577	30,652,166	90%	36%
ERR2045736	ERX2104793	ERP024298	SAMEA104177578	19,136,728	88%	28%
ERR2045737	ERX2104794	ERP024298	SAMEA104177579	21,799,556	89%	34%
ERR2045738	ERX2104795	ERP024298	SAMEA104177580	18,351,696	88%	34%
ERR2045739	ERX2104796	ERP024298	SAMEA104177581	38,100,278	87%	30%
ERR2045740	ERX2104797	ERP024298	SAMEA104177582	29,779,732	88%	30%
ERR2045741	ERX2104798	ERP024298	SAMEA104177583	17,395,720	85%	25%
ERR2045742	ERX2104799	ERP024298	SAMEA104177584	16,749,770	87%	32%
ERR2045743	ERX2104800	ERP024298	SAMEA104177585	15,529,275	85%	27%
ERR2045744	ERX2104801	ERP024298	SAMEA104177586	21,601,362	85%	27%
ERR2045745	ERX2104802	ERP024298	SAMEA104177587	21,338,512	87%	26%
SRR1422871	SRX608620	SRP011583	SAMN02863983	84,051,830	91%	33%
SRR1422872	SRX608621	SRP011583	SAMN02863984	185,311,952	82%	25%
SRR1422856	SRX608607	SRP011583	SAMN02864008	58,939,250	77%	20%
SRR1422857	SRX608616	SRP011583	SAMN02864009	60,380,888	80%	23%
SRR1422858	SRX608399	SRP011583	SAMN02864010	59,793,962	82%	28%
SRR1422859	SRX608567	SRP011583	SAMN02864011	59,806,348	83%	31%
SRR1422860	SRX608569	SRP011583	SAMN02864012	59,084,708	87%	31%
SRR1422862	SRX608571	SRP011583	SAMN02864013	58,163,180	80%	29%
SRR1422864	SRX608574	SRP011583	SAMN02864014	61,054,936	77%	28%
SRR1422865	SRX608575	SRP011583	SAMN02864015	58,784,272	83%	29%
SRR1422866	SRX608579	SRP011583	SAMN02864016	61,426,586	88%	35%
SRR1422867	SRX608583	SRP011583	SAMN02864017	59,545,012	84%	26%
SRR1422868	SRX608588	SRP011583	SAMN02864018	61,602,874	86%	32%
SRR1422869	SRX608594	SRP011583	SAMN02864019	270,961,440	51%	28%
SRR1422870	SRX608599	SRP011583	SAMN02864020	60,203,316	85%	29%
SRR1422567	SRX608400	SRP011583	SAMN02864156	200,931,860	83%	24%
SRR1422637	SRX608401	SRP011583	SAMN02864157	127,023,780	80%	28%
SRR1422362	SRX608402	SRP011583	SAMN02864158	187,581,008	82%	27%
SRR1422389	SRX608403	SRP011583	SAMN02864159	191,647,362	83%	27%
SRR1422840	SRX608404	SRP011583	SAMN02864160	220,655,182	81%	26%
SRR1422850	SRX608405	SRP011583	SAMN02864161	172,913,674	77%	38%
SRR1422847	SRX608406	SRP011583	SAMN02864162	171,503,954	80%	26%
SRR1422848	SRX608407	SRP011583	SAMN02864163	248,970,246	74%	24%
SRR1422849	SRX608517	SRP011583	SAMN02864164	229,249,072	77%	22%
SRR1422652	SRX608521	SRP011583	SAMN02864165	166,425,050	85%	28%
SRR1422851	SRX608546	SRP011583	SAMN02864166	207,827,708	80%	32%
SRR1422656	SRX608553	SRP011583	SAMN02864167	145,060,418	84%	32%
SRR1422852	SRX608556	SRP011583	SAMN02864168	113,882,898	84%	34%
SRR1422853	SRX608557	SRP011583	SAMN02864169	147,822,962	83%	33%
SRR1422854	SRX608562	SRP011583	SAMN02864170	148,718,702	82%	31%
SRR1422855	SRX608564	SRP011583	SAMN02864171	271,223,826	83%	30%
SRR1146263	SRX450908	SRP035898	SAMN02597923	23,030,660	88%	29%
SRR1146684	SRX451802	SRP035898	SAMN02597924	19,549,162	88%	28%
SRR1146685	SRX451804	SRP035898	SAMN02597925	19,882,380	88%	29%
SRR1146689	SRX451808	SRP035898	SAMN02597926	16,277,062	88%	30%
SRR1151340	SRX456448	SRP035898	SAMN02597927	19,895,740	89%	29%
SRR1151341	SRX456449	SRP035898	SAMN02597928	24,744,996	88%	28%
SRR1151342	SRX456450	SRP035898	SAMN02597929	21,245,750	87%	23%
SRR1151343	SRX456451	SRP035898	SAMN02597930	22,555,096	88%	26%
SRR1151344	SRX456452	SRP035898	SAMN02597931	21,445,728	88%	25%
SRR1435329	SRX611317	SRP043420	SAMN02867526	835,301	85%	22%
SRR1435393	SRX611318	SRP043420	SAMN02867527	1,013,257	81%	26%
SRR1435483	SRX611319	SRP043420	SAMN02867528	443,891	84%	15%
SRR1522121	SRX658605	SRP044702	SAMN02929440	16,355,118	58%	53%
SRR2054768	SRX1046658	SRP059010	SAMN03758386	4,987,008	22%	49%
SRR2054771	SRX1046658	SRP059010	SAMN03758386	5,643,972	24%	51%
SRR2054779	SRX1052181	SRP059010	SAMN03758389	7,804,452	28%	63%
SRR2054781	SRX1052181	SRP059010	SAMN03758389	7,114,612	25%	62%
SRR2054776	SRX1052182	SRP059010	SAMN03758391	7,313,016	15%	20%
SRR2054777	SRX1052182	SRP059010	SAMN03758391	6,698,920	12%	19%
SRR2054783	SRX1052184	SRP059010	SAMN03758392	6,216,212	30%	78%
SRR2054784	SRX1052184	SRP059010	SAMN03758392	5,675,064	27%	77%
SRR2054789	SRX1052187	SRP059010	SAMN03758393	6,858,632	18%	33%
SRR2054790	SRX1052187	SRP059010	SAMN03758393	6,341,350	17%	32%
SRR2054791	SRX1052188	SRP059010	SAMN03758395	6,614,600	29%	51%
SRR2054792	SRX1052188	SRP059010	SAMN03758395	6,026,348	26%	49%
SRR2054794	SRX1052189	SRP059010	SAMN03758397	5,756,202	21%	45%
SRR2054795	SRX1052189	SRP059010	SAMN03758397	5,202,168	19%	43%
SRR2054796	SRX1052190	SRP059010	SAMN03758402	4,444,742	18%	50%
SRR2054797	SRX1052190	SRP059010	SAMN03758402	4,018,544	17%	48%
SRR2054798	SRX1052191	SRP059010	SAMN03758403	4,459,568	12%	44%
SRR2054799	SRX1052191	SRP059010	SAMN03758403	4,002,368	11%	42%
SRR2054800	SRX1052192	SRP059010	SAMN03761426	7,717,336	25%	42%
SRR2054801	SRX1052192	SRP059010	SAMN03761426	7,085,362	22%	40%
SRR3002516	SRX1483196	SRP059010	SAMN04338468	95,652,320	56%	11%
SRR5138353	SRX2457484	SRP095919	SAMN06196406	5,774,000	27%	54%
SRR5138368	SRX2457499	SRP095919	SAMN06196407	5,667,084	36%	55%
SRR5138385	SRX2457516	SRP095919	SAMN06196408	4,785,238	35%	56%
SRR5138371	SRX2457502	SRP095919	SAMN06196409	5,347,388	29%	54%
SRR5138384	SRX2457515	SRP095919	SAMN06196410	6,434,500	34%	55%
SRR5138357	SRX2457488	SRP095919	SAMN06196411	5,484,284	34%	55%
SRR5138352	SRX2457483	SRP095919	SAMN06196412	5,733,196	33%	53%
SRR5138370	SRX2457501	SRP095919	SAMN06196413	6,394,264	31%	53%
SRR5138381	SRX2457512	SRP095919	SAMN06196414	6,499,880	32%	40%
SRR5138363	SRX2457494	SRP095919	SAMN06196415	6,706,172	33%	40%
SRR5138362	SRX2457493	SRP095919	SAMN06196416	8,188,566	33%	42%
SRR5138380	SRX2457511	SRP095919	SAMN06196417	7,986,002	37%	41%
SRR5138354	SRX2457485	SRP095919	SAMN06196418	6,856,148	34%	56%
SRR5138355	SRX2457486	SRP095919	SAMN06196419	7,429,440	33%	54%
SRR5138372	SRX2457503	SRP095919	SAMN06196420	6,979,072	36%	57%
SRR5138379	SRX2457510	SRP095919	SAMN06196421	7,223,910	36%	57%
SRR5138367	SRX2457498	SRP095919	SAMN06196422	8,096,192	31%	55%
SRR5138350	SRX2457481	SRP095919	SAMN06196423	7,387,980	31%	57%
SRR5138351	SRX2457482	SRP095919	SAMN06196424	6,768,474	36%	60%
SRR5138378	SRX2457509	SRP095919	SAMN06196425	7,489,226	35%	61%
SRR5138369	SRX2457500	SRP095919	SAMN06196426	7,054,558	36%	58%
SRR5138386	SRX2457517	SRP095919	SAMN06196427	6,778,750	36%	57%
SRR5138356	SRX2457487	SRP095919	SAMN06196428	7,189,788	39%	64%
SRR5138364	SRX2457495	SRP095919	SAMN06196429	6,661,804	36%	60%
SRR5138374	SRX2457505	SRP095919	SAMN06196430	7,291,102	37%	64%
SRR5138365	SRX2457496	SRP095919	SAMN06196431	7,127,918	38%	59%
SRR5138349	SRX2457480	SRP095919	SAMN06196432	8,339,566	37%	57%
SRR5138360	SRX2457491	SRP095919	SAMN06196433	7,822,836	41%	59%
SRR5138377	SRX2457508	SRP095919	SAMN06196434	7,445,954	37%	54%
SRR5138383	SRX2457514	SRP095919	SAMN06196435	7,115,706	43%	52%
SRR5138382	SRX2457513	SRP095919	SAMN06196436	7,531,288	40%	56%
SRR5138376	SRX2457507	SRP095919	SAMN06196437	6,830,328	39%	53%
SRR5138358	SRX2457489	SRP095919	SAMN06196438	6,527,770	30%	53%
SRR5138387	SRX2457518	SRP095919	SAMN06196439	6,008,182	31%	44%
SRR5138366	SRX2457497	SRP095919	SAMN06196440	9,676,234	35%	54%
SRR5138359	SRX2457490	SRP095919	SAMN06196441	6,037,862	23%	42%
SRR5138373	SRX2457504	SRP095919	SAMN06196442	6,358,000	20%	48%
SRR5138361	SRX2457492	SRP095919	SAMN06196443	8,370,206	35%	59%
SRR5138375	SRX2457506	SRP095919	SAMN06196444	7,694,092	30%	56%
SRR5138348	SRX2457479	SRP095919	SAMN06196445	6,352,824	26%	55%
SRR6415121	SRX3508134	SRP126697	SAMN08237568	5,267,214	79%	23%
SRR6415120	SRX3508135	SRP126697	SAMN08237569	6,075,638	66%	28%
SRR6415119	SRX3508136	SRP126697	SAMN08237570	6,083,230	65%	25%
SRR6415118	SRX3508137	SRP126697	SAMN08237571	5,164,838	71%	28%
SRR6415122	SRX3508133	SRP126697	SAMN08237572	6,050,036	77%	29%
SRR7184469	SRX4101235	SRP148469	SAMN09225659	11,657,046	56%	52%
SRR7184468	SRX4101236	SRP148469	SAMN09225660	12,827,560	58%	49%
SRR7184471	SRX4101233	SRP148469	SAMN09225661	16,277,912	56%	50%
SRR7184470	SRX4101234	SRP148469	SAMN09225662	13,451,584	57%	49%
SRR7184465	SRX4101239	SRP148469	SAMN09225663	13,496,002	59%	57%
SRR7184464	SRX4101240	SRP148469	SAMN09225664	11,971,598	52%	51%
SRR7184467	SRX4101237	SRP148469	SAMN09225665	19,068,866	57%	47%
SRR7184466	SRX4101238	SRP148469	SAMN09225666	18,346,104	58%	52%
SRR7184463	SRX4101241	SRP148469	SAMN09225667	7,682,751	87%	51%
SRR7184462	SRX4101242	SRP148469	SAMN09225668	7,364,655	88%	50%
SRR7184461	SRX4101243	SRP148469	SAMN09225669	13,572,674	58%	49%
SRR7184460	SRX4101244	SRP148469	SAMN09225670	13,861,374	69%	53%
SRR8208898	SRX5028087	SRP169832	SAMN10449040	17,910,327	85%	35%
SRR8208899	SRX5028086	SRP169832	SAMN10449041	16,088,214	85%	36%
SRR8208895	SRX5028089	SRP169832	SAMN10449042	17,478,124	83%	35%
SRR8208896	SRX5028088	SRP169832	SAMN10449043	17,151,193	85%	36%
SRR8208897	SRX5028091	SRP169832	SAMN10449044	18,575,165	85%	34%
SRR8208894	SRX5028090	SRP169832	SAMN10449045	20,114,090	84%	32%
SRR8208892	SRX5028093	SRP169832	SAMN10449046	13,649,922	86%	35%
SRR8208893	SRX5028092	SRP169832	SAMN10449047	15,973,644	86%	36%
SRR8208900	SRX5028085	SRP169832	SAMN10449048	22,407,945	87%	36%
SRR8208901	SRX5028084	SRP169832	SAMN10449049	23,628,387	86%	34%
SRR8208890	SRX5028095	SRP169832	SAMN10449050	21,390,383	86%	32%
SRR8208891	SRX5028094	SRP169832	SAMN10449051	15,403,721	86%	35%
SRR8208888	SRX5028097	SRP169832	SAMN10449052	15,190,252	85%	34%
SRR8208889	SRX5028096	SRP169832	SAMN10449053	17,399,893	87%	35%
SRR10298035	SRX7010791	SRP226009	SAMN13046555	2,549,660	6%	99%
SRR10298034	SRX7010792	SRP226009	SAMN13046556	2,763,420	12%	99%
SRR10298033	SRX7010793	SRP226009	SAMN13046557	1,311,772	5%	99%
SRR10298032	SRX7010794	SRP226009	SAMN13046558	1,844,904	0%	93%
SRR10298031	SRX7010795	SRP226009	SAMN13046559	1,630,476	0%	100%
SRR10298030	SRX7010796	SRP226009	SAMN13046560	2,482,242	0%	21%
SRR10298029	SRX7010797	SRP226009	SAMN13046561	2,576,086	6%	99%
SRR10298028	SRX7010798	SRP226009	SAMN13046562	2,257,276	0%	100%
SRR10298027	SRX7010799	SRP226009	SAMN13046563	2,670,600	0%	99%
SRR13202168	SRX9635584	SRP296448	SAMN16981425	118,281,020	81%	40%
SRR13202156	SRX9635596	SRP296448	SAMN16981426	103,372,108	79%	38%
SRR13202152	SRX9635600	SRP296448	SAMN16981427	110,626,442	81%	41%
SRR13202150	SRX9635602	SRP296448	SAMN16981428	115,823,074	84%	47%
SRR13202148	SRX9635604	SRP296448	SAMN16981429	106,642,332	84%	46%
SRR13202166	SRX9635586	SRP296448	SAMN16981430	120,348,126	82%	47%
SRR13202164	SRX9635588	SRP296448	SAMN16981431	145,839,210	81%	44%
SRR13202162	SRX9635590	SRP296448	SAMN16981432	103,031,788	82%	47%
SRR13202160	SRX9635592	SRP296448	SAMN16981433	114,822,586	84%	46%
SRR13202158	SRX9635594	SRP296448	SAMN16981434	649,952,748	58%	43%
SRR13202155	SRX9635597	SRP296448	SAMN16981435	170,271,158	53%	42%

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Actinopterygii GenBank	76,971	64,432 (83.71%)	64,432 (83.71%)	70.56%	82.08%
Actinopterygii known RefSeq (NP_)	21,927	6,656 (30.36%)	6,656 (30.36%)	68.62%	78.84%
Same-species GenBank	12,030	5,425 (45.10%)	5,425 (45.10%)	79.85%	87.32%
Same-species known RefSeq (NP_)	3,547	3,439 (96.96%)	3,439 (96.96%)	79.60%	86.69%
Homo sapiens known RefSeq (NP_)	62,879	40,775 (64.85%)	40,775 (64.85%)	68.19%	72.83%

Assembly-assembly alignments of current to previous assembly

When the assembly changes between two rounds of annotation, genes in the current and the previous annotation are mapped to each other using the genomic alignments of the current assembly to the previous assembly so that gene identifiers can be preserved. The success of the remapping depends largely on how well the two assembly versions align to each other.

Below are the percent coverage of one assembly by the other and the average percent identity of the alignments. The 'First pass' alignments are reciprocal best hits, while the 'Total' alignments also include 'Second pass' or non-reciprocal best alignments. For more information about the assembly-assembly alignment process, please visit the NCBI Genome Remapping Service page.

First Pass	Total
Ssal_v3.1 (Current) Coverage: 77.48%	Ssal_v3.1 (Current) Coverage: 81.28%
ICSASG_v2 (Previous) Coverage: 76.94%	ICSASG_v2 (Previous) Coverage: 79.54%
Percent Identity: 99.21%	Percent Identity: 99.00%

Comparison of the current and previous annotations

The annotation produced for this release (102) was compared to the annotation in the previous release (100) for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides links to the details of the comparison in tabular format and in a Genome Workbench project.

	Ssal_v3.1 (Current) to ICSASG_v2 (Previous)
Identical	4%
Minor changes	49%
Major changes	12%
New	31%
Deprecated	22%
Other	4%
Download the report	tabular, Genome Workbench

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 2:134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20
Minimap2: Li H. Bioinformatics 2018 Sep 15;34(18):3094-3100

RefSeq

Integrated reference sequences