NCBI Python bivittatus Annotation Release 102

The RefSeq genome records for Python bivittatus were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, and the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Python bivittatus Annotation Release 102.

Annotation release ID: 102
Date of Entrez queries for transcripts and proteins: May 16 2018
Date of submission of annotation to the public databases: May 24 2018
Software version: 8.0

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
Python_molurus_bivittatus-5.0.2	GCF_000186305.1	The Consortium for Comparative Genomics, UC Denver	09-15-2013	Reference	1 assembled chromosome; unplaced scaffolds

Gene and feature statistics

Counts and lengths of annotated features are provided below for each assembly.

Feature counts

Feature	Python_molurus_bivittatus-5.0.2
Genes and pseudogenes	22,427
protein-coding	19,793
non-coding	2,089
transcribed pseudogenes	0
non-transcribed pseudogenes	455
genes with variants	6,319
immunoglobulin/T-cell receptor gene segments	90
other	0
mRNAs	32,711
fully-supported	28,609
with > 5% ab initio	1,546
partial	3,003
with filled gap(s)	2
known RefSeq (NM_)	9
model RefSeq (XM_)	32,702
non-coding RNAs	3,198
fully-supported	2,664
with > 5% ab initio	0
partial	4
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	2,952
pseudo transcripts	0
fully-supported	0
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	0
CDSs	32,814
fully-supported	28,609
with > 5% ab initio	1,875
partial	3,023
with major correction(s)	340
known RefSeq (NP_)	9
model RefSeq (XP_)	32,715
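
As a quick consistency check on the table above, the gene categories can be summed and compared with the reported total. This is only a sketch: the report does not state which rows are mutually exclusive, so the grouping below (which leaves out "genes with variants" as an overlapping attribute rather than a separate category) is an assumption inferred from the numbers.

```python
# Counts copied from the feature-count table above.
# Assumption: these six rows are disjoint categories; "genes with variants"
# is treated as an overlapping attribute and left out of the sum.
gene_counts = {
    "protein-coding": 19_793,
    "non-coding": 2_089,
    "transcribed pseudogenes": 0,
    "non-transcribed pseudogenes": 455,
    "immunoglobulin/T-cell receptor gene segments": 90,
    "other": 0,
}
reported_gene_total = 22_427


def check_total(parts, total):
    """Return True if the category counts add up to the reported total."""
    return sum(parts.values()) == total


genes_ok = check_total(gene_counts, reported_gene_total)

# mRNAs: known (NM_) plus model (XM_) records against the reported mRNA count.
mrnas_ok = check_total({"known RefSeq (NM_)": 9, "model RefSeq (XM_)": 32_702}, 32_711)
```

Under these assumptions both checks pass.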

Detailed reports

The counts below do not include pseudogenes.

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	21,882	26,748	14,835	50	533,031
All transcripts	35,909	2,924	2,377	50	98,844
mRNA	32,711	3,071	2,509	96	98,844
misc_RNA	583	2,710	2,312	81	11,938
tRNA	244	73	73	57	84
lncRNA	2,082	1,400	892	75	12,259
snoRNA	180	112	102	50	319
snRNA	83	134	117	61	199
guide_RNA	15	183	137	86	370
rRNA	11	320	119	119	1,511
Single-exon transcripts	1,134	1,413	957	81	13,088
coding transcripts (NM_/XM_)	1,133	1,414	957	96	13,088
non-coding transcripts (NR_/XR_)	1	81	81	81	81
CDSs	32,724	1,769	1,305	96	97,629
Exons	210,908	292	134	1	17,100
in coding transcripts (NM_/XM_)	204,115	288	134	1	17,100
in non-coding transcripts (NR_/XR_)	10,337	332	133	2	10,743
Introns	188,445	3,233	1,337	30	265,164
in coding transcripts (NM_/XM_)	183,614	3,169	1,326	30	265,164
in non-coding transcripts (NR_/XR_)	8,251	4,392	1,547	51	236,809

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	1.65	1	1	34
Number of exons per transcript	10.58	8	1	294

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the UniProtKB/Swiss-Prot curated proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 19,793 coding genes, 19,217 genes had a protein with an alignment covering 50% or more of the query and 11,782 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.
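
The two coverage definitions can be illustrated with a small sketch. It assumes a single contiguous alignment described by 1-based, inclusive coordinates; the report does not say how multiple alignment segments for the same protein pair are combined, and the function name and the example hit are hypothetical.

```python
def coverage(aligned_start, aligned_end, seq_length):
    """Percentage of a sequence covered by one alignment.

    Coordinates are 1-based and inclusive (an assumption of this sketch).
    """
    if seq_length <= 0:
        raise ValueError("seq_length must be positive")
    if aligned_end < aligned_start:
        raise ValueError("aligned_end must not be smaller than aligned_start")
    return 100.0 * (aligned_end - aligned_start + 1) / seq_length


# Hypothetical hit: residues 11-200 of a 200-residue annotated protein (query)
# align to residues 1-190 of a 400-residue Swiss-Prot protein (target).
query_cov = coverage(11, 200, 200)
target_cov = coverage(1, 190, 400)

# Share of coding genes above each query-coverage threshold, using the
# counts quoted in the text above.
share_50 = 100.0 * 19_217 / 19_793
share_95 = 100.0 * 11_782 / 19_793
```

In this example the query is covered at 95% while the target is covered at only 47.5%, which is why the two measures are reported separately. From the counts in the text, roughly 97% of coding genes reach the 50% query-coverage threshold and roughly 60% reach the 95% threshold.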

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: UniProtKB/Swiss-Prot curated proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker for each assembly. RepeatMasker results are only used for organisms for which a comprehensive repeat library is available.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with RepeatMasker	% Masked with WindowMasker
Python_molurus_bivittatus-5.0.2	GCF_000186305.1	3.07%	24.87%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Depending on the other evidence available, long 454 reads (with average length above 250 nt) may be aligned as traditional evidence and reported in the Transcript alignments section or aligned with RNA-Seq reads and reported in the RNA-Seq alignments section.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species known RefSeq (NM_/NR_)	9	9 (100.00%)	7 (77.78%)	99.57%	98.14%
Same-species GenBank	22	22 (100.00%)	20 (90.91%)	99.65%	93.40%
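
The percentages in the table above appear to be relative to the number of sequences retrieved from Entrez. The sketch below recomputes them from the counts under that assumption; the helper names are hypothetical and the cell format is taken from the table as shown.

```python
import re


def parse_count_percent(cell):
    """Split a cell such as '7 (77.78%)' into (count, percent)."""
    match = re.fullmatch(r"([\d,]+) \((\d+(?:\.\d+)?)%\)", cell.strip())
    if match is None:
        raise ValueError(f"unexpected cell format: {cell!r}")
    count = int(match.group(1).replace(",", ""))
    return count, float(match.group(2))


def percent_is_consistent(retrieved, cell, tolerance=0.006):
    """Check that the reported percent matches count / retrieved.

    The tolerance is half of the last reported digit plus a little slack
    for floating-point error.
    """
    count, reported = parse_count_percent(cell)
    return abs(100.0 * count / retrieved - reported) <= tolerance


# Rows copied from the transcript alignment table: (retrieved, aligned, passed).
rows = {
    "Same-species known RefSeq (NM_/NR_)": (9, "9 (100.00%)", "7 (77.78%)"),
    "Same-species GenBank": (22, "22 (100.00%)", "20 (90.91%)"),
}
all_consistent = all(
    percent_is_consistent(retrieved, aligned) and percent_is_consistent(retrieved, passed)
    for retrieved, aligned, passed in rows.values()
)
```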

RNA-Seq alignments

The following RNA-Seq reads from the Sequence Read Archive were also used for gene prediction:

Alignment statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	Aggregate of all aligned samples	1,079,112,827	73%	24%	226,927
SAMN03263920	cross-section of small intestine, 0 hrs post fed (Python bivittatus, SAMN03263920)	4,826,631	67%	28%	30,507
SAMN03263922	cross-section of small intestine, 0 hrs post fed (Python bivittatus, SAMN03263922)	1,082,950	53%	28%	34,853
SAMN03263934	cross-section of small intestine, 24 hrs post fed (Python bivittatus, SAMN03263934)	4,205,468	64%	26%	44,331
SAMN03263939	cross-section of small intestine, 96 hrs post fed (Python bivittatus, SAMN03263939)	1,539,651	56%	24%	30,608
SAMN03263940	cross-section of small intestine, 96 hrs post fed (Python bivittatus, SAMN03263940)	3,898,008	47%	22%	48,146
SAMN03263941	cross-section of small intestine, 96 hrs post fed (Python bivittatus, SAMN03263941)	1,236,788	31%	22%	17,825
SAMN05792570	Heart (Python bivittatus, SAMN05792570)	18,026,042	70%	28%	131,824
SAMN05792571	Heart (Python bivittatus, SAMN05792571)	17,749,942	66%	26%	124,019
SAMN05792572	Heart (Python bivittatus, SAMN05792572)	30,172,988	67%	28%	145,724
SAMN05792573	Liver (Python bivittatus, SAMN05792573)	19,327,920	79%	32%	115,488
SAMN05792574	Liver (Python bivittatus, SAMN05792574)	22,015,132	75%	31%	111,022
SAMN05792575	Liver (Python bivittatus, SAMN05792575)	18,327,842	80%	31%	111,478
SAMN05792576	Intestine (Python bivittatus, SAMN05792576)	21,144,500	81%	33%	128,257
SAMN05792577	Intestine (Python bivittatus, SAMN05792577)	30,076,082	82%	29%	132,802
SAMN05792578	Intestine (Python bivittatus, SAMN05792578)	18,785,808	81%	28%	129,443
SAMN05792579	Stomach (Python bivittatus, SAMN05792579)	19,390,972	69%	31%	126,585
SAMN05792580	Stomach (Python bivittatus, SAMN05792580)	18,690,666	77%	32%	132,430
SAMN05792581	Stomach (Python bivittatus, SAMN05792581)	26,284,434	75%	28%	134,405
SAMN05792582	Pancreas (Python bivittatus, SAMN05792582)	20,277,016	89%	41%	96,409
SAMN05792583	Pancreas (Python bivittatus, SAMN05792583)	21,937,940	88%	34%	81,545
SAMN05792584	Pancreas (Python bivittatus, SAMN05792584)	21,524,266	89%	38%	106,313
SAMN05792856	Heart (Python bivittatus, SAMN05792856)	24,718,134	87%	26%	149,807
SAMN05792857	Liver (Python bivittatus, SAMN05792857)	20,041,308	87%	31%	128,449
SAMN05792858	Stomach (Python bivittatus, SAMN05792858)	23,367,682	85%	30%	150,128
SAMN05792859	Intestine (Python bivittatus, SAMN05792859)	23,742,066	88%	31%	141,988
SAMN05792860	Pancreas (Python bivittatus, SAMN05792860)	19,054,532	84%	28%	130,162
SAMN05792861	Heart (Python bivittatus, SAMN05792861)	21,372,618	87%	24%	149,032
SAMN05792862	Liver (Python bivittatus, SAMN05792862)	25,216,310	88%	27%	140,537
SAMN05792863	Stomach (Python bivittatus, SAMN05792863)	26,148,614	89%	25%	158,063
SAMN05792864	Intestine (Python bivittatus, SAMN05792864)	25,079,996	89%	27%	146,696
SAMN05792865	Pancreas (Python bivittatus, SAMN05792865)	23,484,678	85%	24%	139,887
SAMN06238948	Heart from isolate AI11 at 0 hours post fed (i.e., fasted). (Python bivittatus, SAMN06238948)	1,115,236	62%	16%	25,884
SAMN06238949	Heart from isolate AI6 at 0 hours post fed (i.e., fasted). (Python bivittatus, SAMN06238949)	2,077,844	61%	19%	52,089
SAMN06238951	Heart from isolate AJ6 at 0 hours post fed (i.e., fasted). (Python bivittatus, SAMN06238951)	1,057,598	69%	14%	28,472
SAMN06238952	Heart from isolate U25 at 0 hours post fed (i.e., fasted). (Python bivittatus, SAMN06238952)	9,874,726	42%	3%	29,479
SAMN06238953	Heart from isolate Z12 at 24 hours post fed. (Python bivittatus, SAMN06238953)	1,307,135	51%	9%	20,366
SAMN06238954	Heart from isolate Z14 at 24 hours post fed. (Python bivittatus, SAMN06238954)	4,427,749	41%	19%	21,287
SAMN06238955	Heart from isolate Z18 at 24 hours post fed. (Python bivittatus, SAMN06238955)	3,473,792	21%	15%	23,646
SAMN06238956	Heart from isolate Y5 at 96 hours post fed. (Python bivittatus, SAMN06238956)	2,163,949	55%	19%	42,994
SAMN06238957	Heart from isolate Y18 at 96 hours post fed. (Python bivittatus, SAMN06238957)	975,635	62%	14%	25,246
SAMN06238958	Heart from isolate Y23 at 96 hours post fed. (Python bivittatus, SAMN06238958)	1,093,780	67%	15%	25,867
SAMN06238959	Heart from isolate Y24 at 96 hours post fed. (Python bivittatus, SAMN06238959)	6,792,842	44%	4%	26,675
SAMN06238960	Kidney from isolate AI11 at 0 hours post fed (i.e., fasted). (Python bivittatus, SAMN06238960)	4,025,320	54%	27%	39,231
SAMN06238961	Kidney from isolate AI6 at 0 hours post fed (i.e., fasted). (Python bivittatus, SAMN06238961)	273,902	65%	30%	17,947
SAMN06238963	Kidney from isolate AJ6 at 0 hours post fed (i.e., fasted). (Python bivittatus, SAMN06238963)	3,677,133	58%	26%	34,252
SAMN06238964	Kidney from isolate U25 at 0 hours post fed (i.e., fasted). (Python bivittatus, SAMN06238964)	7,394,272	41%	3%	21,396
SAMN06238965	Kidney from isolate Z12 at 24 hours post fed. (Python bivittatus, SAMN06238965)	1,419,248	63%	31%	40,301
SAMN06238967	Kidney from isolate Z18 at 24 hours post fed. (Python bivittatus, SAMN06238967)	3,299,304	32%	25%	23,537
SAMN06238968	Kidney from isolate V43 at 24 hours post fed. (Python bivittatus, SAMN06238968)	10,058,002	46%	3%	27,512
SAMN06238969	Kidney from isolate Y5 at 96 hours post fed. (Python bivittatus, SAMN06238969)	3,012,339	56%	24%	22,847
SAMN06238970	Kidney from isolate Y18 at 96 hours post fed. (Python bivittatus, SAMN06238970)	70,503	41%	26%	998
SAMN06238977	Liver from isolate Z12 at 24 hours post fed. (Python bivittatus, SAMN06238977)	5,839,466	32%	20%	29,529
SAMN06238980	Liver from isolate Y5 at 96 hours post fed. (Python bivittatus, SAMN06238980)	2,057,648	53%	17%	25,659
SAMN06238983	Liver from isolate Y24 at 96 hours post fed. (Python bivittatus, SAMN06238983)	9,522,887	52%	3%	22,729
SAMN06238984	Cross-section of small intestine from isolate Z12 at 24 hours post fed. (Python bivittatus, SAMN06238984)	1,046,308	67%	22%	39,481
SAMN06240093	Liver from isolate AI6 at 0 hours post fed (i.e., fasted). (Python bivittatus, SAMN06240093)	2,430,567	47%	20%	31,300
SAMN06704747	Python bivittatus isolate Pymo208 (Python bivittatus, SAMN06704747)	42,644,522	62%	11%	111,301
SAMN06704778	Python bivittatus isolate Pymo209 (Python bivittatus, SAMN06704778)	42,349,982	67%	13%	123,270
SAMN06704780	Python bivittatus isolate Pymo210 (Python bivittatus, SAMN06704780)	38,129,806	67%	16%	127,430
SAMN06704781	Python bivittatus isolate Pymo213 (Python bivittatus, SAMN06704781)	41,074,202	69%	16%	131,930
SAMN06704782	Python bivittatus isolate Pymo215 (Python bivittatus, SAMN06704782)	41,026,632	73%	16%	131,520
SAMN06704784	Python bivittatus isolate Pymo216 (Python bivittatus, SAMN06704784)	40,773,334	70%	17%	133,401
SAMN06704787	Python bivittatus isolate Pymo218 (Python bivittatus, SAMN06704787)	47,676,104	70%	16%	129,762
SAMN06704788	Python bivittatus isolate Pymo219 (Python bivittatus, SAMN06704788)	39,746,974	69%	16%	133,312
SAMN08137146	MIGS Eukaryotic sample from Python bivittatus (Python bivittatus, SAMN08137146)	40,512,040	67%	17%	127,007
SAMN08137147	MIGS Eukaryotic sample from Python bivittatus (Python bivittatus, SAMN08137147)	43,945,062	70%	16%	134,085

Alignment statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
SRR1746792	SRX834419	SRP051827	SAMN03263920	2,155,168	63%	28%
SRR1746793	SRX834420	SRP051827	SAMN03263920	2,671,463	70%	28%
SRR1746796	SRX834423	SRP051827	SAMN03263922	1,082,950	53%	28%
SRR1746813	SRX834440	SRP051827	SAMN03263934	1,766,164	62%	26%
SRR1746814	SRX834441	SRP051827	SAMN03263934	2,439,304	66%	26%
SRR1746823	SRX834450	SRP051827	SAMN03263939	1,539,651	56%	24%
SRR1746825	SRX834452	SRP051827	SAMN03263940	1,461,172	47%	22%
SRR1746826	SRX834453	SRP051827	SAMN03263940	2,436,836	47%	22%
SRR1746828	SRX834455	SRP051827	SAMN03263941	1,236,788	31%	22%
SRR5190735	SRX2506492	SRP051827	SAMN06238948	1,115,236	62%	16%
SRR5190733	SRX2506490	SRP051827	SAMN06238949	1,044,057	58%	24%
SRR5190734	SRX2506491	SRP051827	SAMN06238949	1,033,787	64%	14%
SRR5190731	SRX2506488	SRP051827	SAMN06238951	1,057,598	69%	14%
SRR5190730	SRX2506487	SRP051827	SAMN06238952	9,874,726	42%	3%
SRR5190729	SRX2506486	SRP051827	SAMN06238953	1,307,135	51%	9%
SRR5190727	SRX2506484	SRP051827	SAMN06238954	2,534,292	36%	23%
SRR5190728	SRX2506485	SRP051827	SAMN06238954	1,893,457	46%	15%
SRR5190726	SRX2506483	SRP051827	SAMN06238955	3,473,792	21%	15%
SRR5190724	SRX2506481	SRP051827	SAMN06238956	1,086,580	53%	23%
SRR5190725	SRX2506482	SRP051827	SAMN06238956	1,077,369	58%	14%
SRR5190723	SRX2506480	SRP051827	SAMN06238957	975,635	62%	14%
SRR5190722	SRX2506479	SRP051827	SAMN06238958	1,093,780	67%	15%
SRR5190721	SRX2506478	SRP051827	SAMN06238959	6,792,842	44%	4%
SRR5190719	SRX2506476	SRP051827	SAMN06238960	4,025,320	54%	27%
SRR5190717	SRX2506474	SRP051827	SAMN06238961	273,902	65%	30%
SRR5190713	SRX2506470	SRP051827	SAMN06238963	1,441,416	61%	26%
SRR5190714	SRX2506471	SRP051827	SAMN06238963	2,235,717	57%	26%
SRR5190712	SRX2506469	SRP051827	SAMN06238964	7,394,272	41%	3%
SRR5190710	SRX2506467	SRP051827	SAMN06238965	1,419,248	63%	31%
SRR5190704	SRX2506461	SRP051827	SAMN06238967	2,046,898	33%	24%
SRR5190705	SRX2506462	SRP051827	SAMN06238967	1,252,406	31%	25%
SRR5190703	SRX2506460	SRP051827	SAMN06238968	10,058,002	46%	3%
SRR5190700	SRX2506457	SRP051827	SAMN06238969	1,722,260	58%	24%
SRR5190701	SRX2506458	SRP051827	SAMN06238969	1,290,079	53%	23%
SRR5190697	SRX2506454	SRP051827	SAMN06238970	70,503	41%	26%
SRR5190686	SRX2506443	SRP051827	SAMN06238977	3,144,654	27%	25%
SRR5190687	SRX2506444	SRP051827	SAMN06238977	2,694,812	37%	15%
SRR5190682	SRX2506439	SRP051827	SAMN06238980	833,522	51%	22%
SRR5190683	SRX2506440	SRP051827	SAMN06238980	1,224,126	54%	13%
SRR5190679	SRX2506436	SRP051827	SAMN06238983	9,522,887	52%	3%
SRR5190677	SRX2506434	SRP051827	SAMN06238984	1,046,308	67%	22%
SRR5190691	SRX2506448	SRP051827	SAMN06240093	1,303,882	44%	25%
SRR5190692	SRX2506449	SRP051827	SAMN06240093	1,126,685	51%	15%
SRR4280472	SRX2182147	SRP090227	SAMN05792570	18,026,042	70%	28%
SRR4280492	SRX2182167	SRP090227	SAMN05792571	17,749,942	66%	26%
SRR4280474	SRX2182149	SRP090227	SAMN05792572	30,172,988	67%	28%
SRR4280473	SRX2182148	SRP090227	SAMN05792573	19,327,920	79%	32%
SRR4280493	SRX2182168	SRP090227	SAMN05792574	22,015,132	75%	31%
SRR4280475	SRX2182150	SRP090227	SAMN05792575	18,327,842	80%	31%
SRR4280490	SRX2182165	SRP090227	SAMN05792576	21,144,500	81%	33%
SRR4280495	SRX2182170	SRP090227	SAMN05792577	30,076,082	82%	29%
SRR4280477	SRX2182152	SRP090227	SAMN05792578	18,785,808	81%	28%
SRR4280484	SRX2182159	SRP090227	SAMN05792579	19,390,972	69%	31%
SRR4280494	SRX2182169	SRP090227	SAMN05792580	18,690,666	77%	32%
SRR4280476	SRX2182151	SRP090227	SAMN05792581	26,284,434	75%	28%
SRR4280491	SRX2182166	SRP090227	SAMN05792582	20,277,016	89%	41%
SRR4280496	SRX2182171	SRP090227	SAMN05792583	21,937,940	88%	34%
SRR4280478	SRX2182153	SRP090227	SAMN05792584	21,524,266	89%	38%
SRR4280479	SRX2182154	SRP090227	SAMN05792856	24,718,134	87%	26%
SRR4280480	SRX2182155	SRP090227	SAMN05792857	20,041,308	87%	31%
SRR4280481	SRX2182156	SRP090227	SAMN05792858	23,367,682	85%	30%
SRR4280482	SRX2182157	SRP090227	SAMN05792859	23,742,066	88%	31%
SRR4280483	SRX2182158	SRP090227	SAMN05792860	19,054,532	84%	28%
SRR4280485	SRX2182160	SRP090227	SAMN05792861	21,372,618	87%	24%
SRR4280486	SRX2182161	SRP090227	SAMN05792862	25,216,310	88%	27%
SRR4280487	SRX2182162	SRP090227	SAMN05792863	26,148,614	89%	25%
SRR4280488	SRX2182163	SRP090227	SAMN05792864	25,079,996	89%	27%
SRR4280489	SRX2182164	SRP090227	SAMN05792865	23,484,678	85%	24%
SRR5434345	SRX2724380	SRP103526	SAMN06704747	42,644,522	62%	11%
SRR5434344	SRX2724379	SRP103526	SAMN06704778	42,349,982	67%	13%
SRR5434343	SRX2724378	SRP103526	SAMN06704780	38,129,806	67%	16%
SRR5434342	SRX2724377	SRP103526	SAMN06704781	41,074,202	69%	16%
SRR5434341	SRX2724376	SRP103526	SAMN06704782	41,026,632	73%	16%
SRR5434340	SRX2724375	SRP103526	SAMN06704784	40,773,334	70%	17%
SRR5434339	SRX2724374	SRP103526	SAMN06704787	47,676,104	70%	16%
SRR5434338	SRX2724373	SRP103526	SAMN06704788	39,746,974	69%	16%
SRR6351163	SRX3447895	SRP103526	SAMN08137146	40,512,040	67%	17%
SRR6351164	SRX3447894	SRP103526	SAMN08137147	43,945,062	70%	16%
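
The by-sample and by-run tables describe the same reads at two levels of grouping. The sketch below shows one way the run-level rows could be rolled up to samples: read counts are summed, and the percentage of aligned reads is recomputed as a read-weighted average. The weighting is an assumption, not something the report states, and because the published percentages are already rounded, a recomputed value may differ from the by-sample table by a point for some samples.

```python
from collections import defaultdict

# (run, sample, number of reads, percent aligned), copied from the by-run table.
runs = [
    ("SRR1746792", "SAMN03263920", 2_155_168, 63),
    ("SRR1746793", "SAMN03263920", 2_671_463, 70),
    ("SRR1746825", "SAMN03263940", 1_461_172, 47),
    ("SRR1746826", "SAMN03263940", 2_436_836, 47),
]


def aggregate_by_sample(rows):
    """Sum reads per sample and compute a read-weighted percent aligned."""
    reads = defaultdict(int)
    aligned = defaultdict(float)
    for _run, sample, n_reads, pct_aligned in rows:
        reads[sample] += n_reads
        aligned[sample] += n_reads * pct_aligned / 100.0
    return {
        sample: (reads[sample], round(100.0 * aligned[sample] / reads[sample]))
        for sample in reads
    }


per_sample = aggregate_by_sample(runs)
```

For the two samples used here, the summed read counts (4,826,631 and 3,898,008) and the recomputed percentages (67% and 47%) agree with the by-sample table.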

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Pogona vitticeps high-quality model RefSeq (XP_)	13,733	13,308 (96.91%)	13,308 (96.91%)	70.59%	77.95%
Same-species GenBank	12	12 (100.00%)	12 (100.00%)	90.94%	88.11%
Same-species known RefSeq (NP_)	9	9 (100.00%)	9 (100.00%)	78.61%	80.72%
Anolis carolinensis high-quality model RefSeq (XP_)	13,146	12,544 (95.42%)	12,544 (95.42%)	68.28%	77.48%
Xenopus GenBank	31,776	29,821 (93.85%)	29,821 (93.85%)	68.76%	74.45%
Xenopus known RefSeq (NP_)	19,661	18,603 (94.62%)	18,603 (94.62%)	68.90%	74.72%
Sauropsida GenBank	21,389	18,994 (88.80%)	18,994 (88.80%)	67.95%	71.37%
Sauropsida known RefSeq (NP_)	8,113	7,719 (95.14%)	7,719 (95.14%)	71.68%	77.22%
Homo sapiens GenBank	129,209	106,101 (82.12%)	106,101 (82.12%)	66.33%	71.98%
Homo sapiens known RefSeq (NP_)	50,311	44,688 (88.82%)	44,688 (88.82%)	66.40%	71.37%

Comparison of the current and previous annotations

The annotation produced for this release (102) was compared to the annotation in the previous release (101) for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.
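
The idea of pairing features as reciprocal best matches can be sketched as follows. The report does not give the scoring formula, so the score used here (a Jaccard index over exon bases) is a stand-in chosen for illustration, and all names and coordinates are hypothetical. Exons are half-open (start, end) intervals on the same sequence and strand.

```python
def exon_overlap_score(exons_a, exons_b):
    """Jaccard index of the bases covered by two exon sets.

    Illustrative stand-in only; not the pipeline's actual score. Expanding
    intervals into sets of positions is simple but slow for long genes.
    """
    bases_a = {pos for start, end in exons_a for pos in range(start, end)}
    bases_b = {pos for start, end in exons_b for pos in range(start, end)}
    union = bases_a | bases_b
    return len(bases_a & bases_b) / len(union) if union else 0.0


def best_matches(source, targets):
    """Map each source feature to its best-scoring target, if any overlap exists."""
    result = {}
    for source_id, source_exons in source.items():
        scored = [
            (exon_overlap_score(source_exons, target_exons), target_id)
            for target_id, target_exons in targets.items()
        ]
        score, target_id = max(scored, default=(0.0, None))
        if score > 0:
            result[source_id] = target_id
    return result


def reciprocal_best_matches(current, previous):
    """Return (current_id, previous_id) pairs that are each other's best match."""
    best_for_current = best_matches(current, previous)
    best_for_previous = best_matches(previous, current)
    return sorted(
        (cur, prev)
        for cur, prev in best_for_current.items()
        if best_for_previous.get(prev) == cur
    )


current = {"geneA": [(100, 200), (300, 400)], "geneB": [(1000, 1100)]}
previous = {"old1": [(100, 200), (310, 400)], "old2": [(5000, 5100)]}
pairs = reciprocal_best_matches(current, previous)
```

In this toy example only geneA and old1 pair up. A current feature with no counterpart, such as geneB, would correspond to the "New" category, and an unmatched previous feature, such as old2, to "Deprecated".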

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides links to the details of the comparison in tabular format and in a Genome Workbench project.

	Python_molurus_bivittatus-5.0.2 (Current) to Python_molurus_bivittatus-5.0.2 (Previous)
Identical	7%
Minor changes	66%
Major changes	12%
New	14%
Deprecated	4%
Other	1%
Download the report	tabular, Genome Workbench

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 22(2):134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20
