NCBI Acinonyx jubatus Annotation Release 101

The RefSeq genome records for Acinonyx jubatus were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Similarity of current and previous assembly: The similarity of the current and previous assembly
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Acinonyx jubatus Annotation Release 101

Annotation release ID: 101
Date of Entrez queries for transcripts and proteins: Nov 7 2018
Date of submission of annotation to the public databases: Nov 13 2018
Software version: 8.1

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
Aci_jub_2	GCF_003709585.1	Felidae consortium	10-22-2018	Reference	1 assembled chromosomes; unplaced scaffolds

Gene and feature statistics

Counts and length of annotated features are provided below for each assembly.

Feature counts

Feature	Aci_jub_2
Genes and pseudogenes	34,482
protein-coding	19,529
non-coding	11,000
transcribed pseudogenes	4
non-transcribed pseudogenes	3,849
genes with variants	12,720
immunoglobulin/T-cell receptor gene segments	100
other	0
mRNAs	56,248
fully-supported	55,059
with > 5% ab initio	544
partial	283
with filled gap(s)	0
known RefSeq (NM_)	1
model RefSeq (XM_)	56,247
non-coding RNAs	16,404
fully-supported	14,223
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	15,902
pseudo transcripts	4
fully-supported	4
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	4
CDSs	56,361
fully-supported	55,059
with > 5% ab initio	669
partial	295
with major correction(s)	801
known RefSeq (NP_)	14
model RefSeq (XP_)	56,247

Detailed reports

The counts below do not include pseudogenes.

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	30,529	40,924	11,391	49	2,869,184
All transcripts	72,652	3,161	2,506	45	105,690
mRNA	56,248	3,626	2,935	153	105,690
misc_RNA	2,278	2,784	2,297	165	28,057
tRNA	500	74	73	59	85
lncRNA	11,945	1,604	1,133	45	17,847
snoRNA	569	111	106	49	329
snRNA	1,069	114	107	60	199
guide_RNA	31	165	134	81	411
rRNA	12	828	153	119	2,471
Single-exon transcripts	1,532	1,503	991	153	9,619
coding transcripts (NM_/XM_ )	1,532	1,503	991	153	9,619
CDSs	56,261	2,156	1,557	96	104,622
Exons	273,655	325	141	1	21,782
in coding transcripts (NM_/XM_ )	240,534	302	137	1	21,782
in non-coding transcripts (NR_/XR_ )	47,070	403	161	2	13,184
Introns	240,742	6,689	1,564	30	1,122,777
in coding transcripts (NM_/XM_ )	218,208	6,509	1,517	30	1,122,777
in non-coding transcripts (NR_/XR_ )	36,085	7,327	1,855	30	575,575

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	2.4	1	1	50
Number of exons per transcript	11.75	8	1	332

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the UniProtKB/Swiss-Prot curated proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 19516 coding genes, 19077 genes had a protein with an alignment covering 50% or more of the query and 16598 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: UniProtKB/Swiss-Prot curated proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker for each assembly. RepeatMasker results are only used for organisms for which a comprehensive repeat library is available.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with RepeatMasker	% Masked with WindowMasker
Aci_jub_2	GCF_003709585.1	42.66%	32.31%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Depending on the other evidence available, long 454 reads (with average length above 250 nt) may be aligned as traditional evidence and reported in the Transcript alignments section or aligned with RNA-Seq reads and reported in the RNA-Seq alignments section.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species known RefSeq (NM_/NR_)	1	1 (100.00%)	1 (100.00%)	99.00%	100.00%
Same-species Genbank	11	11 (100.00%)	10 (90.91%)	99.60%	100.00%
Felis catus known RefSeq (NM_/NR_)	418	418 (100.00%)	389 (93.06%)	98.56%	99.04%
Felis catus Genbank	1,579	1,497 (94.81%)	976 (61.81%)	97.04%	88.97%
Felis catus EST	919	862 (93.80%)	810 (88.14%)	97.64%	99.05%

RNA-Seq alignments

The following RNA-Seq reads from the Sequence Read Archive were also used for gene prediction:

Hide alignments statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Publication	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	NA	Aggregate of all aligned samples	5,958,187,234	84%	24%	340,460
SAMN01831871	25677554	normal liver (Felis catus, adult, SAMN01831871)	36,356,376	82%	10%	119,228
SAMN01831872	25677554	normal liver (Felis catus, adult, SAMN01831872)	37,302,574	81%	10%	117,510
SAMN01831873	25677554	normal liver (Felis catus, adult, SAMN01831873)	43,786,726	82%	10%	120,368
SAMN01831921	25677554	normal kidney (Felis catus, adult, SAMN01831921)	42,717,142	80%	7%	141,370
SAMN01831922	25677554	normal kidney (Felis catus, adult, SAMN01831922)	43,667,072	80%	7%	133,230
SAMN01831923	25677554	normal kidney (Felis catus, adult, SAMN01831923)	36,100,760	79%	9%	136,573
SAMN01831965	25677554	normal brain, frontal parts (Felis catus, adult, SAMN01831965)	46,356,088	81%	6%	156,822
SAMN01831966	25677554	normal brain, frontal parts (Felis catus, adult, SAMN01831966)	48,799,196	80%	6%	156,497
SAMN01831967	25677554	normal brain, frontal parts (Felis catus, adult, SAMN01831967)	39,439,684	81%	7%	150,446
SAMN02058438	NA	Subcutaneous adipose, Short Day (Felis catus, SAMN02058438)	25,217,636	93%	29%	135,399
SAMN02058439	NA	Subcutaneous adipose, Long Day (Felis catus, SAMN02058439)	27,572,827	93%	25%	151,999
SAMN02058440	NA	Subcutaneous adipose, Long Day (Felis catus, SAMN02058440)	26,276,810	94%	30%	144,376
SAMN02058441	NA	Subcutaneous adipose, Short Day (Felis catus, SAMN02058441)	27,280,242	93%	27%	154,825
SAMN02058442	NA	Subcutaneous adipose, Long Day (Felis catus, SAMN02058442)	27,959,031	93%	26%	145,099
SAMN02058443	NA	Subcutaneous adipose, Short Day (Felis catus, SAMN02058443)	29,321,049	93%	28%	155,670
SAMN02058444	NA	Subcutaneous adipose, Long Day (Felis catus, SAMN02058444)	27,074,727	93%	23%	157,351
SAMN02058445	NA	Subcutaneous adipose, Short Day (Felis catus, SAMN02058445)	27,384,876	93%	25%	149,873
SAMN02058446	NA	Subcutaneous adipose, Long Day (Felis catus, SAMN02058446)	27,650,878	93%	27%	153,417
SAMN02058447	NA	Subcutaneous adipose, Short Day (Felis catus, SAMN02058447)	26,426,247	93%	24%	152,963
SAMN02058448	NA	Subcutaneous adipose, Short Day (Felis catus, SAMN02058448)	30,277,827	94%	32%	141,504
SAMN02058449	NA	Subcutaneous adipose, Long Day (Felis catus, SAMN02058449)	30,827,991	93%	23%	160,412
SAMN02058450	NA	Subcutaneous adipose, Short Day (Felis catus, SAMN02058450)	30,835,006	93%	29%	152,415
SAMN02058451	NA	Subcutaneous adipose, Long Day (Felis catus, SAMN02058451)	33,864,272	93%	29%	149,911
SAMN02058452	NA	Subcutaneous adipose, Long Day (Felis catus, SAMN02058452)	28,897,848	93%	26%	155,147
SAMN02058453	NA	Subcutaneous adipose, Short Day (Felis catus, SAMN02058453)	29,065,945	93%	28%	153,423
SAMN02058454	NA	Subcutaneous adipose, Short Day (Felis catus, SAMN02058454)	30,055,186	93%	23%	154,433
SAMN02058455	NA	Subcutaneous adipose, Long Day (Felis catus, SAMN02058455)	28,341,003	93%	27%	152,209
SAMN02058456	NA	Subcutaneous adipose, Long Day (Felis catus, SAMN02058456)	31,620,077	93%	25%	165,429
SAMN02058457	NA	Subcutaneous adipose, Short Day (Felis catus, SAMN02058457)	32,392,358	93%	26%	164,000
SAMN04099974	27149523	Iridiocorneal angle (Felis catus, SAMN04099974)	177,835,580	93%	27%	234,824
SAMN04099975	27149523	Iridiocorneal angle (Felis catus, SAMN04099975)	182,421,744	93%	26%	230,793
SAMN04498517	NA	embryo (fetus) (Felis catus, SAMN04498517)	166,944,018	81%	24%	225,310
SAMN04498518	NA	embryo (fetus) (Felis catus, SAMN04498518)	201,883,266	83%	29%	230,012
SAMN04498519	NA	lung (Felis catus, male, SAMN04498519)	123,208,322	79%	21%	216,555
SAMN04498520	NA	pancreas (Felis catus, male, SAMN04498520)	166,157,056	78%	48%	150,133
SAMN04498521	NA	heart (Felis catus, male, SAMN04498521)	215,449,116	84%	16%	201,608
SAMN04498522	NA	muscle (Felis catus, male, SAMN04498522)	106,996,830	86%	41%	152,153
SAMN04498523	NA	ear cartilage (Felis catus, male, SAMN04498523)	148,955,788	82%	25%	227,882
SAMN04498524	NA	spinal cord (Felis catus, male, SAMN04498524)	162,679,594	82%	19%	211,689
SAMN04498525	NA	thymus (Felis catus, male, SAMN04498525)	104,390,084	71%	17%	168,295
SAMN04498526	NA	kidney (Felis catus, male, SAMN04498526)	94,438,504	82%	25%	190,589
SAMN04498527	NA	testes (Felis catus, male, SAMN04498527)	160,548,060	80%	30%	254,384
SAMN04498528	NA	cerebellum (brain) (Felis catus, male, SAMN04498528)	131,799,730	85%	20%	215,073
SAMN04498529	NA	parietal lobe (brain) (Felis catus, female, SAMN04498529)	96,748,380	81%	20%	207,248
SAMN04498530	NA	hippocampus (brain) (Felis catus, female, SAMN04498530)	110,467,612	84%	22%	220,334
SAMN04498531	NA	liver (Felis catus, female, SAMN04498531)	132,973,642	85%	33%	167,336
SAMN04498532	NA	cerebellum (brain) (Felis catus, female, SAMN04498532)	128,178,488	80%	21%	215,971
SAMN04498533	NA	temporal lobe (brain), Seizures (Felis catus, female, SAMN04498533)	119,882,660	81%	22%	191,778
SAMN04498534	NA	salivary gland (Felis catus, female, SAMN04498534)	113,655,162	75%	24%	190,113
SAMN04498535	NA	bone marrow (Felis catus, female, SAMN04498535)	72,193,490	76%	26%	157,325
SAMN04498536	NA	head (embryo) (Felis catus, SAMN04498536)	122,891,208	74%	20%	196,268
SAMN04498537	NA	body (embryo) (Felis catus, SAMN04498537)	123,289,946	80%	25%	225,244
SAMN04498538	NA	retina (Felis catus, male, SAMN04498538)	108,620,542	87%	26%	217,372
SAMN04498539	NA	skin (Felis catus, female, SAMN04498539)	133,555,718	82%	24%	223,805
SAMN04498540	NA	retina (Felis catus, female, SAMN04498540)	117,242,720	80%	16%	219,454
SAMN04498541	NA	kidney (Felis catus, female, SAMN04498541)	120,590,350	84%	23%	202,169
SAMN04498542	NA	spleen (Felis catus, female, SAMN04498542)	121,451,690	81%	22%	221,403
SAMN04498543	NA	ear tip (Felis catus, female, SAMN04498543)	123,699,138	85%	27%	210,690
SAMN04498544	NA	uterus (Felis catus, female, SAMN04498544)	103,269,448	78%	18%	197,272
SAMN04498545	NA	spleen (Felis catus, female, SAMN04498545)	85,691,628	81%	24%	174,269
SAMN04498546	NA	skin (orange color) (Felis catus, missing, SAMN04498546)	155,642,984	81%	25%	227,345
SAMN04498547	NA	skin (white color) (Felis catus, missing, SAMN04498547)	117,414,704	80%	23%	217,095
SAMN04498548	NA	occipital (brain) (Felis catus, female, SAMN04498548)	149,283,588	85%	19%	220,980
SAMN06319450	28320483	feline primary adipose-derived MSC culture, (Felis catus, SAMN06319450)	76,484,434	93%	28%	164,020
SAMN06319451	28320483	feline primary adipose-derived MSC culture, (Felis catus, SAMN06319451)	70,830,070	93%	29%	167,422
SAMN06319452	28320483	feline primary adipose-derived MSC culture, (Felis catus, SAMN06319452)	96,437,318	93%	28%	170,494
SAMN08432501	29916804	FHV-1,Raltegravir-3 (Felis catus, SAMN08432501)	16,316,683	94%	25%	111,063
SAMN08432502	29916804	FHV-1,Raltegravir-2 (Felis catus, SAMN08432502)	15,464,572	95%	25%	102,891
SAMN08432503	29916804	FHV-1,Raltegravir-1 (Felis catus, SAMN08432503)	16,287,160	95%	27%	109,042
SAMN08432504	29916804	FHV-1,DMSO-3 (Felis catus, SAMN08432504)	14,027,987	91%	18%	108,264
SAMN08432505	29916804	FHV-1,DMSO-2 (Felis catus, SAMN08432505)	13,442,786	92%	19%	105,643
SAMN08432506	29916804	FHV-1,DMSO-1 (Felis catus, SAMN08432506)	15,281,316	92%	21%	111,960
SAMN08432507	29916804	Mock,Raltegravir-3 (Felis catus, SAMN08432507)	26,628,178	95%	27%	130,958
SAMN08432508	29916804	Mock,Raltegravir-2 (Felis catus, SAMN08432508)	30,416,721	96%	27%	134,290
SAMN08432509	29916804	Mock,Raltegravir-1 (Felis catus, SAMN08432509)	28,952,259	96%	28%	130,874
SAMN08432510	29916804	Mock,DMSO-3 (Felis catus, SAMN08432510)	28,022,864	96%	28%	134,752
SAMN08432511	29916804	Mock,DMSO-2 (Felis catus, SAMN08432511)	31,886,603	96%	28%	137,543
SAMN08432512	29916804	Mock,DMSO-1 (Felis catus, SAMN08432512)	24,390,039	96%	29%	128,947

Show alignments statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
SRR636854	SRX211594	SRP017611	SAMN01831871	36,356,376	82%	10%
SRR636855	SRX211595	SRP017611	SAMN01831872	37,302,574	81%	10%
SRR636856	SRX211596	SRP017611	SAMN01831873	43,786,726	82%	10%
SRR636904	SRX211644	SRP017611	SAMN01831921	42,717,142	80%	7%
SRR636905	SRX211645	SRP017611	SAMN01831922	43,667,072	80%	7%
SRR636906	SRX211646	SRP017611	SAMN01831923	36,100,760	79%	9%
SRR636948	SRX211688	SRP017611	SAMN01831965	46,356,088	81%	6%
SRR636949	SRX211689	SRP017611	SAMN01831966	48,799,196	80%	6%
SRR636950	SRX211690	SRP017611	SAMN01831967	39,439,684	81%	7%
SRR835484	SRX272130	SRP021539	SAMN02058438	25,217,636	93%	29%
SRR835485	SRX272131	SRP021539	SAMN02058439	27,572,827	93%	25%
SRR835486	SRX272132	SRP021539	SAMN02058440	26,276,810	94%	30%
SRR835487	SRX272133	SRP021539	SAMN02058441	27,280,242	93%	27%
SRR835488	SRX272134	SRP021539	SAMN02058442	27,959,031	93%	26%
SRR835489	SRX272135	SRP021539	SAMN02058443	29,321,049	93%	28%
SRR835490	SRX272136	SRP021539	SAMN02058444	27,074,727	93%	23%
SRR835491	SRX272137	SRP021539	SAMN02058445	27,384,876	93%	25%
SRR835492	SRX272138	SRP021539	SAMN02058446	27,650,878	93%	27%
SRR835493	SRX272139	SRP021539	SAMN02058447	26,426,247	93%	24%
SRR835494	SRX272140	SRP021539	SAMN02058448	30,277,827	94%	32%
SRR835495	SRX272141	SRP021539	SAMN02058449	30,827,991	93%	23%
SRR835496	SRX272142	SRP021539	SAMN02058450	30,835,006	93%	29%
SRR835497	SRX272143	SRP021539	SAMN02058451	33,864,272	93%	29%
SRR835498	SRX272144	SRP021539	SAMN02058452	28,897,848	93%	26%
SRR835499	SRX272145	SRP021539	SAMN02058453	29,065,945	93%	28%
SRR835500	SRX272146	SRP021539	SAMN02058454	30,055,186	93%	23%
SRR835501	SRX272147	SRP021539	SAMN02058455	28,341,003	93%	27%
SRR835502	SRX272148	SRP021539	SAMN02058456	31,620,077	93%	25%
SRR835503	SRX272149	SRP021539	SAMN02058457	32,392,358	93%	26%
SRR2470307	SRX1268863	SRP063937	SAMN04099974	177,835,580	93%	27%
SRR2470308	SRX1268864	SRP063937	SAMN04099975	182,421,744	93%	26%
SRR3200448	SRX1610301	SRP071078	SAMN04498517	166,944,018	81%	24%
SRR3200450	SRX1610303	SRP071078	SAMN04498518	201,883,266	83%	29%
SRR3200449	SRX1610302	SRP071078	SAMN04498519	123,208,322	79%	21%
SRR3200469	SRX1610322	SRP071078	SAMN04498520	166,157,056	78%	48%
SRR3200471	SRX1610324	SRP071078	SAMN04498521	215,449,116	84%	16%
SRR3200451	SRX1610304	SRP071078	SAMN04498522	106,996,830	86%	41%
SRR3200455	SRX1610308	SRP071078	SAMN04498523	148,955,788	82%	25%
SRR3200466	SRX1610319	SRP071078	SAMN04498524	162,679,594	82%	19%
SRR3218715	SRX1625945	SRP071078	SAMN04498525	104,390,084	71%	17%
SRR3200473	SRX1610326	SRP071078	SAMN04498526	94,438,504	82%	25%
SRR3200462	SRX1610315	SRP071078	SAMN04498527	160,548,060	80%	30%
SRR3218718	SRX1625949	SRP071078	SAMN04498528	131,799,730	85%	20%
SRR3200472	SRX1610325	SRP071078	SAMN04498529	96,748,380	81%	20%
SRR3200452	SRX1610305	SRP071078	SAMN04498530	110,467,612	84%	22%
SRR3200453	SRX1610306	SRP071078	SAMN04498531	132,973,642	85%	33%
SRR3200456	SRX1610309	SRP071078	SAMN04498532	128,178,488	80%	21%
SRR3200461	SRX1610314	SRP071078	SAMN04498533	119,882,660	81%	22%
SRR3218717	SRX1625948	SRP071078	SAMN04498534	113,655,162	75%	24%
SRR3200459	SRX1610312	SRP071078	SAMN04498535	72,193,490	76%	26%
SRR3200464	SRX1610317	SRP071078	SAMN04498536	122,891,208	74%	20%
SRR3200468	SRX1610321	SRP071078	SAMN04498537	123,289,946	80%	25%
SRR3200457	SRX1610310	SRP071078	SAMN04498538	108,620,542	87%	26%
SRR3200470	SRX1610323	SRP071078	SAMN04498539	133,555,718	82%	24%
SRR3200465	SRX1610318	SRP071078	SAMN04498540	117,242,720	80%	16%
SRR3200460	SRX1610313	SRP071078	SAMN04498541	120,590,350	84%	23%
SRR3218714	SRX1625944	SRP071078	SAMN04498542	121,451,690	81%	22%
SRR3200454	SRX1610307	SRP071078	SAMN04498543	123,699,138	85%	27%
SRR3200458	SRX1610311	SRP071078	SAMN04498544	103,269,448	78%	18%
SRR3218716	SRX1625946	SRP071078	SAMN04498545	85,691,628	81%	24%
SRR3218712	SRX1625943	SRP071078	SAMN04498546	155,642,984	81%	25%
SRR3200467	SRX1610320	SRP071078	SAMN04498547	117,414,704	80%	23%
SRR3200463	SRX1610316	SRP071078	SAMN04498548	149,283,588	85%	19%
SRR5243168	SRX2549942	SRP099203	SAMN06319450	76,484,434	93%	28%
SRR5243167	SRX2549941	SRP099203	SAMN06319451	70,830,070	93%	29%
SRR5243169	SRX2549943	SRP099203	SAMN06319452	96,437,318	93%	28%
SRR6639058	SRX3626710	SRP131692	SAMN08432501	16,316,683	94%	25%
SRR6639057	SRX3626709	SRP131692	SAMN08432502	15,464,572	95%	25%
SRR6639056	SRX3626708	SRP131692	SAMN08432503	16,287,160	95%	27%
SRR6639055	SRX3626707	SRP131692	SAMN08432504	14,027,987	91%	18%
SRR6639054	SRX3626706	SRP131692	SAMN08432505	13,442,786	92%	19%
SRR6639053	SRX3626705	SRP131692	SAMN08432506	15,281,316	92%	21%
SRR6639052	SRX3626704	SRP131692	SAMN08432507	26,628,178	95%	27%
SRR6639051	SRX3626703	SRP131692	SAMN08432508	30,416,721	96%	27%
SRR6639050	SRX3626702	SRP131692	SAMN08432509	28,952,259	96%	28%
SRR6639049	SRX3626701	SRP131692	SAMN08432510	28,022,864	96%	28%
SRR6639048	SRX3626700	SRP131692	SAMN08432511	31,886,603	96%	28%
SRR6639047	SRX3626699	SRP131692	SAMN08432512	24,390,039	96%	29%

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species GenBank	11	11 (100.00%)	11 (100.00%)	86.33%	96.62%
Same-species known RefSeq (NP_)	1	1 (100.00%)	1 (100.00%)	99.00%	100.00%
Carnivora GenBank	5,096	4,951 (97.15%)	4,951 (97.15%)	78.76%	86.14%
Carnivora known RefSeq (NP_)	2,376	2,347 (98.78%)	2,347 (98.78%)	77.56%	88.79%
Homo sapiens known RefSeq (NP_)	51,932	50,973 (98.15%)	50,973 (98.15%)	76.23%	83.98%
Felis catus high-quality model RefSeq (XP_)	14,790	14,703 (99.41%)	14,703 (99.41%)	80.63%	88.63%

Assembly-assembly alignments of current to previous assembly

When the assembly changes between two rounds of annotation, genes in the current and the previous annotation are mapped to each other using the genomic alignments of the current assembly to the previous assembly so that gene identifiers can be preserved. The success of the remapping depends largely on how well the two assembly versions align to each other.

Below are the percent coverage of one assembly by the other and the average percent identity of the alignments. The 'First pass' alignments are reciprocal best hits, while the 'Total' alignments also include 'Second pass' or non-reciprocal best alignments. For more information about the assembly-assembly alignment process, please visit the NCBI Genome Remapping Service page.

First Pass	Total
Aci_jub_2 (Current) Coverage: 96.88%	Aci_jub_2 (Current) Coverage: 97.25%
aciJub1 (Previous) Coverage: 98.67%	aciJub1 (Previous) Coverage: 98.86%
Percent Identity: 99.58%	Percent Identity: 99.57%

Comparison of the current and previous annotations

The annotation produced for this release (101) was compared to the annotation in the previous release (100) for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides links to the details of the comparison in tabular format and in a Genome Workbench project.

	Aci_jub_2 (Current) to aciJub1 (Previous)
Identical	3%
Minor changes	41%
Major changes	19%
New	35%
Deprecated	9%
Other	2%
Download the report	tabular, Genome Workbench

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 2:134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20

RefSeq

Integrated reference sequences