NCBI Chrysemys picta Annotation Release 103

The RefSeq genome records for Chrysemys picta were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
BUSCO results: Annotation completeness assessed with BUSCO
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Similarity of current and previous assembly: The similarity of the current and previous assembly
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Chrysemys picta Annotation Release 103

Annotation release ID: 103
Date of Entrez queries for transcripts and proteins: Aug 4 2021
Date of submission of annotation to the public databases: Aug 10 2021
Software version: 9.0

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
Chrysemys_picta_BioNano-3.0.4	GCF_000241765.4	Painted turtle genome sequencing consortium	08-21-2020	Reference	20 assembled chromosomes; unplaced scaffolds

Gene and feature statistics

Counts and length of annotated features are provided below for each assembly.

Feature counts

Feature	Chrysemys_picta_BioNano-3.0.4
Genes and pseudogenes	26,985
protein-coding	21,498
non-coding	4,505
Transcribed pseudogenes	0
Non-transcribed pseudogenes	738
genes with variants	10,948
Immunoglobulin/T-cell receptor gene segments	244
other	0
mRNAs	57,051
fully-supported	54,077
with > 5% ab initio	1,160
partial	1,026
with filled gap(s)	3
known RefSeq (NM_)	11
model RefSeq (XM_)	57,040
non-coding RNAs	8,181
fully-supported	7,258
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	7,734
pseudo transcripts	0
fully-supported	0
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	0
CDSs	57,308
fully-supported	54,077
with > 5% ab initio	1,424
partial	1,099
with major correction(s)	797
known RefSeq (NP_)	11
model RefSeq (XP_)	57,053

Detailed reports

The counts below do not include pseudogenes.

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	26,003	45,698	16,486	61	2,476,307
All transcripts	65,232	3,731	3,053	61	114,806
mRNA	57,051	3,944	3,245	162	114,806
misc_RNA	2,498	3,475	2,932	202	27,034
tRNA	445	75	73	65	88
lncRNA	4,766	2,017	1,388	98	20,850
snoRNA	286	114	106	63	312
snRNA	154	143	141	61	199
guide_RNA	16	186	143	86	359
rRNA	16	280	153	118	1,614
Single-exon transcripts	1,692	1,430	951	243	15,003
coding transcripts (NM_/XM_ )	1,692	1,430	951	243	15,003
CDSs	57,064	2,129	1,524	96	113,562
Exons	264,491	349	142	1	25,119
in coding transcripts (NM_/XM_ )	248,133	333	141	1	25,119
in non-coding transcripts (NR_/XR_ )	33,837	389	142	2	16,209
Introns	237,170	6,624	1,747	30	1,164,492
in coding transcripts (NM_/XM_ )	225,611	6,528	1,723	30	1,164,492
in non-coding transcripts (NR_/XR_ )	28,411	7,054	2,025	30	441,283

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	2.53	1	1	50
Number of exons per transcript	12.67	10	1	353

BUSCO analysis of gene annotation

BUSCO v4.1.4 (Simão et al 2015, PMID: 26059717) was run in "protein" mode on the annotated gene set picking one longest protein per gene, and run using the sauropsida_odb10 lineage dataset. Results are reported for the gene set from the primary assembly unit, and presented in BUSCO notation (C:complete [S:single-copy, D:duplicated], F:fragmented, M:missing, n:number of genes used).

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the UniProtKB/Swiss-Prot curated proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 21485 coding genes, 20736 genes had a protein with an alignment covering 50% or more of the query and 13104 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: UniProtKB/Swiss-Prot curated proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker (if calculated), for each assembly. RepeatMasker results are only calculated for organisms with complete Dfam HMM model collections.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with WindowMasker
Chrysemys_picta_BioNano-3.0.4	GCF_000241765.4	27.08%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign, minimap2, or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species known RefSeq (NM_/NR_)	11	11 (100.00%)	11 (100.00%)	99.97%	100.00%
Same-species Genbank	65	65 (100.00%)	60 (92.31%)	99.42%	95.19%

RNA-Seq alignments

The following RNA-Seq reads from the Sequence Read Archive were also used for gene prediction:

Hide alignments statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Publication	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	NA	Aggregate of all aligned samples	6,219,165,415	63%	27%	299,717
SAMN01885560	23537068,24438258,26108489	adrenal, kidney, ovaries (Chrysemys picta bellii, female, SAMN01885560)	417,089	59%	14%	7,528
SAMN01885561	23537068,24438258,26108489	adrenal, kidney, testes (Chrysemys picta bellii, male, SAMN01885561)	491,219	66%	13%	11,458
SAMN01885562	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885562)	99,513,558	75%	13%	177,115
SAMN01885563	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885563)	104,292,134	77%	14%	190,900
SAMN01885564	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885564)	93,088,004	64%	13%	134,730
SAMN01885565	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885565)	96,367,790	66%	13%	107,454
SAMN01885566	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885566)	95,007,546	70%	15%	150,894
SAMN01885567	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885567)	101,629,050	69%	14%	172,040
SAMN01885568	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885568)	97,648,304	78%	13%	190,651
SAMN01885569	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885569)	55,762,996	58%	19%	144,930
SAMN01885570	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885570)	56,543,004	74%	17%	180,954
SAMN01885571	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885571)	55,982,650	74%	18%	173,955
SAMN01885572	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885572)	57,936,248	71%	17%	171,823
SAMN01885573	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885573)	50,234,930	75%	16%	183,043
SAMN01885574	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885574)	57,055,016	78%	17%	182,434
SAMN01885575	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885575)	57,624,054	76%	16%	182,573
SAMN01885576	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885576)	96,208,092	74%	13%	170,053
SAMN01885577	23537068,24438258,26108489	Sample from Chrysemys picta bellii (Chrysemys picta bellii, SAMN01885577)	56,176,900	71%	18%	163,988
SAMN02230628	23537068,24438258,26108489	embryos, trunk (liver, carapace) (Chrysemys picta bellii, SAMN02230628)	494,188	80%	22%	16,025
SAMN02230629	23537068,24438258,26108489	pooled embryos and hatchlings, whole head (brain) (Chrysemys picta bellii, SAMN02230629)	412,899	76%	15%	12,715
SAMN05220588	30094964	Stage 20, Dorsal scapula (Chrysemys picta, Day 34 embryo, SAMN05220588)	58,831,092	85%	18%	156,372
SAMN05220589	30094964	Stage 21, Dorsal scapula (Chrysemys picta, Day 38 embryo, SAMN05220589)	50,571,362	85%	24%	174,325
SAMN05220590	30094964	Stage 22, Dorsal scapula (Chrysemys picta, Day 44 embryo, SAMN05220590)	38,174,039	84%	28%	153,903
SAMN05220591	30094964	Stage 21, Tail (Chrysemys picta, Day 38 embryo, SAMN05220591)	71,762,124	86%	20%	181,595
SAMN06298503	28296881	trunks (Chrysemys picta, male, SAMN06298503)	30,534,060	86%	29%	186,371
SAMN06298504	28296881	adrenal-kidney-gonad complex (Chrysemys picta, male, SAMN06298504)	25,341,134	87%	31%	174,085
SAMN06298505	28296881	adrenal-kidney-gonad complex (Chrysemys picta, male, SAMN06298505)	25,498,654	85%	30%	157,606
SAMN06298506	28296881	gonad (Chrysemys picta, male, SAMN06298506)	43,394,220	87%	29%	202,586
SAMN06298507	28296881	gonad (Chrysemys picta, male, SAMN06298507)	23,068,234	82%	25%	173,027
SAMN06298508	28296881	trunks (Chrysemys picta, female, SAMN06298508)	41,461,266	87%	31%	187,194
SAMN06298509	28296881	adrenal-kidney-gonad complex (Chrysemys picta, female, SAMN06298509)	38,185,322	88%	31%	191,662
SAMN06298510	28296881	adrenal-kidney-gonad complex (Chrysemys picta, female, SAMN06298510)	24,020,390	87%	32%	173,754
SAMN06298511	28296881	gonad (Chrysemys picta, female, SAMN06298511)	23,210,262	81%	33%	177,696
SAMN06298512	28296881	gonad (Chrysemys picta, female, SAMN06298512)	31,148,928	85%	31%	182,645
SAMN06446394	NA	telencephalon (Chrysemys picta bellii, female, SAMN06446394)	17,759,968	0%	22%	95
SAMN06446396	NA	telencephalon (Chrysemys picta bellii, female, SAMN06446396)	15,254,084	0%	33%	55
SAMN06446397	NA	telencephalon (Chrysemys picta bellii, female, SAMN06446397)	28,327,108	1%	30%	142
SAMN06446398	NA	telencephalon (Chrysemys picta bellii, male, SAMN06446398)	11,356,292	1%	22%	103
SAMN06446399	NA	telencephalon (Chrysemys picta bellii, male, SAMN06446399)	10,040,333	1%	23%	91
SAMN06446400	NA	telencephalon (Chrysemys picta bellii, female, SAMN06446400)	8,409,080	0%	23%	43
SAMN06446401	NA	telencephalon (Chrysemys picta bellii, male, SAMN06446401)	17,088,505	1%	25%	156
SAMN06446402	NA	telencephalon (Chrysemys picta bellii, female, SAMN06446402)	7,929,696	1%	10%	82
SAMN06446404	NA	telencephalon (Chrysemys picta bellii, male, SAMN06446404)	17,099,017	1%	24%	95
SAMN08718446	NA	Whole Blood (Chrysemys picta, Juvenile, not determined, SAMN08718446)	47,596,716	87%	28%	112,036
SAMN11086393	31862849	ventricle (Chrysemys picta, male, SAMN11086393)	94,132,716	51%	25%	188,402
SAMN11086394	31862849	ventricle (Chrysemys picta, male, SAMN11086394)	86,702,490	55%	27%	192,070
SAMN11086395	31862849	ventricle (Chrysemys picta, male, SAMN11086395)	99,815,768	52%	24%	189,285
SAMN11086396	31862849	ventricle (Chrysemys picta, male, SAMN11086396)	104,145,964	51%	27%	195,488
SAMN11086397	31862849	ventricle (Chrysemys picta, male, SAMN11086397)	102,949,626	56%	28%	193,635
SAMN11086398	31862849	ventricle (Chrysemys picta, male, SAMN11086398)	116,312,196	48%	26%	192,538
SAMN11086399	31862849	ventricle (Chrysemys picta, male, SAMN11086399)	111,656,660	60%	23%	200,047
SAMN11086400	31862849	ventricle (Chrysemys picta, male, SAMN11086400)	89,555,690	54%	25%	184,953
SAMN11086401	31862849	ventricle (Chrysemys picta, male, SAMN11086401)	88,958,258	56%	25%	188,729
SAMN11086402	31862849	ventricle (Chrysemys picta, male, SAMN11086402)	104,085,318	53%	24%	187,663
SAMN11086403	31862849	ventricle (Chrysemys picta, male, SAMN11086403)	96,402,608	52%	28%	189,364
SAMN11086404	31862849	ventricle (Chrysemys picta, male, SAMN11086404)	106,896,830	55%	29%	199,613
SAMN11086405	31862849	ventricle (Chrysemys picta, male, SAMN11086405)	98,275,932	53%	28%	194,192
SAMN11086406	31862849	ventricle (Chrysemys picta, male, SAMN11086406)	75,439,606	48%	25%	179,904
SAMN11086407	31862849	ventricle (Chrysemys picta, male, SAMN11086407)	87,672,832	55%	26%	188,841
SAMN11086408	31862849	ventricle (Chrysemys picta, male, SAMN11086408)	101,768,382	53%	29%	195,322
SAMN11086409	31862849	ventricle (Chrysemys picta, male, SAMN11086409)	89,059,914	51%	27%	192,162
SAMN11086410	31862849	ventricle (Chrysemys picta, male, SAMN11086410)	94,410,196	52%	27%	190,800
SAMN11086411	31862849	ventricle (Chrysemys picta, male, SAMN11086411)	106,535,108	52%	30%	194,566
SAMN11086412	31862849	ventricle (Chrysemys picta, male, SAMN11086412)	81,465,012	54%	29%	191,956
SAMN11086413	31862849	ventricle (Chrysemys picta, male, SAMN11086413)	95,343,560	54%	25%	195,419
SAMN11086414	31862849	ventricle (Chrysemys picta, male, SAMN11086414)	106,651,772	56%	25%	197,636
SAMN11086415	31862849	ventricle (Chrysemys picta, male, SAMN11086415)	102,716,990	53%	24%	192,461
SAMN11086416	31862849	ventricle (Chrysemys picta, male, SAMN11086416)	101,123,298	53%	27%	195,273
SAMN13500341	NA	Embryo stage 09, trunk (Chrysemys picta, SAMN13500341)	108,130,772	65%	38%	222,139
SAMN13500342	NA	Embryo stage 09, trunk (Chrysemys picta, SAMN13500342)	116,608,094	68%	37%	221,418
SAMN13500343	NA	Embryo stage 09, trunk (Chrysemys picta, SAMN13500343)	99,408,722	70%	39%	218,577
SAMN13500344	NA	Embryo stage 09, trunk (Chrysemys picta, SAMN13500344)	101,257,808	69%	35%	213,026
SAMN13500345	NA	Embryo stage 12, Adrenal Kidney Gonad (Chrysemys picta, SAMN13500345)	95,122,662	67%	32%	222,709
SAMN13500346	NA	Embryo stage 12, Adrenal Kidney Gonad (Chrysemys picta, SAMN13500346)	92,434,516	69%	29%	220,581
SAMN13500347	NA	Embryo stage 12, Adrenal Kidney Gonad (Chrysemys picta, SAMN13500347)	99,849,652	69%	34%	222,887
SAMN13500348	NA	Embryo stage 12, Adrenal Kidney Gonad (Chrysemys picta, SAMN13500348)	74,188,382	73%	39%	220,677
SAMN13500349	NA	Embryo stage 15, Adrenal Kidney Gonad (Chrysemys picta, SAMN13500349)	102,604,530	69%	37%	230,728
SAMN13500350	NA	Embryo stage 15, Adrenal Kidney Gonad (Chrysemys picta, SAMN13500350)	101,081,126	71%	38%	230,592
SAMN13500351	NA	Embryo stage 15, Adrenal Kidney Gonad (Chrysemys picta, SAMN13500351)	100,090,060	68%	36%	227,070
SAMN13500352	NA	Embryo stage 15, Adrenal Kidney Gonad (Chrysemys picta, SAMN13500352)	107,974,934	68%	38%	227,561
SAMN13500353	NA	Embryo stage 19, Gonads (Chrysemys picta, SAMN13500353)	87,678,506	65%	32%	214,155
SAMN13500354	NA	Embryo stage 19, Gonads (Chrysemys picta, SAMN13500354)	102,122,202	66%	36%	222,074
SAMN13500355	NA	Embryo stage 19, Gonads (Chrysemys picta, SAMN13500355)	87,755,376	64%	36%	201,051
SAMN13500356	NA	Embryo stage 19, Gonads (Chrysemys picta, SAMN13500356)	95,749,922	68%	35%	221,485
SAMN13500357	NA	Embryo stage 22, Gonads (Chrysemys picta, SAMN13500357)	90,004,290	64%	32%	213,711
SAMN13500358	NA	Embryo stage 22, Gonads (Chrysemys picta, SAMN13500358)	90,939,102	65%	32%	217,290
SAMN13500359	NA	Embryo stage 22, Gonads (Chrysemys picta, SAMN13500359)	86,990,578	65%	33%	213,942
SAMN13500360	NA	Embryo stage 22, Gonads (Chrysemys picta, SAMN13500360)	98,149,898	65%	34%	216,061

Show alignments statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
SRR647676	SRX216251	SRP012057	SAMN01885560	191,958	59%	15%
SRR647677	SRX216251	SRP012057	SAMN01885560	225,131	59%	14%
SRR647678	SRX216249	SRP012057	SAMN01885561	245,863	66%	13%
SRR647679	SRX216249	SRP012057	SAMN01885561	245,356	66%	13%
SRR647699	SRX216253	SRP012057	SAMN01885562	47,760,772	75%	16%
SRR647696	SRX216256	SRP012057	SAMN01885562	51,752,786	74%	10%
SRR647693	SRX216250	SRP012057	SAMN01885563	50,474,168	77%	17%
SRR647683	SRX216259	SRP012057	SAMN01885563	53,817,966	77%	11%
SRR647690	SRX216252	SRP012057	SAMN01885564	47,863,334	64%	10%
SRR647685	SRX216255	SRP012057	SAMN01885564	45,224,670	63%	16%
SRR647681	SRX216254	SRP012057	SAMN01885565	50,840,678	66%	10%
SRR647686	SRX216265	SRP012057	SAMN01885565	45,527,112	65%	16%
SRR647698	SRX216257	SRP012057	SAMN01885566	46,025,790	70%	18%
SRR647684	SRX216261	SRP012057	SAMN01885566	48,981,756	70%	11%
SRR647682	SRX216258	SRP012057	SAMN01885567	52,654,216	69%	11%
SRR647700	SRX216270	SRP012057	SAMN01885567	48,974,834	68%	17%
SRR647688	SRX216269	SRP012057	SAMN01885568	46,968,342	78%	16%
SRR647692	SRX216273	SRP012057	SAMN01885568	50,679,962	78%	10%
SRR647702	SRX216268	SRP012057	SAMN01885569	55,762,996	58%	19%
SRR647704	SRX216272	SRP012057	SAMN01885570	56,543,004	74%	17%
SRR647689	SRX216274	SRP012057	SAMN01885571	55,982,650	74%	18%
SRR647703	SRX216271	SRP012057	SAMN01885572	57,936,248	71%	17%
SRR647691	SRX216262	SRP012057	SAMN01885573	50,234,930	75%	16%
SRR647695	SRX216266	SRP012057	SAMN01885574	57,055,016	78%	17%
SRR647694	SRX216267	SRP012057	SAMN01885575	57,624,054	76%	16%
SRR647680	SRX216263	SRP012057	SAMN01885576	49,570,878	74%	10%
SRR647687	SRX216264	SRP012057	SAMN01885576	46,637,214	74%	16%
SRR647701	SRX216260	SRP012057	SAMN01885577	56,176,900	71%	18%
SRR931736	SRX320077	SRP012057	SAMN02230628	234,322	80%	22%
SRR931737	SRX320077	SRP012057	SAMN02230628	259,866	80%	23%
SRR931738	SRX320078	SRP012057	SAMN02230629	197,320	76%	14%
SRR931739	SRX320078	SRP012057	SAMN02230629	215,579	76%	15%
SRR3715367	SRX1874750	SRP077056	SAMN05220588	17,715,742	84%	19%
SRR3715368	SRX1874751	SRP077056	SAMN05220588	20,227,797	85%	16%
SRR3715379	SRX1874762	SRP077056	SAMN05220588	20,887,553	85%	19%
SRR3715390	SRX1874773	SRP077056	SAMN05220589	23,115,270	85%	23%
SRR3715391	SRX1874774	SRP077056	SAMN05220589	21,490,744	85%	26%
SRR3715392	SRX1874775	SRP077056	SAMN05220589	5,965,348	83%	25%
SRR3715393	SRX1874776	SRP077056	SAMN05220590	22,663,523	84%	28%
SRR3715394	SRX1874777	SRP077056	SAMN05220590	8,705,599	84%	28%
SRR3715395	SRX1874778	SRP077056	SAMN05220590	6,804,917	84%	29%
SRR3715369	SRX1874752	SRP077056	SAMN05220591	28,371,148	86%	19%
SRR3715370	SRX1874753	SRP077056	SAMN05220591	24,722,970	87%	20%
SRR3715396	SRX1874779	SRP077056	SAMN05220591	18,668,006	85%	21%
SRR5242272	SRX2549120	SRP099173	SAMN06298503	30,534,060	86%	29%
SRR5242271	SRX2549119	SRP099173	SAMN06298504	25,341,134	87%	31%
SRR5242270	SRX2549118	SRP099173	SAMN06298505	25,498,654	85%	30%
SRR5242269	SRX2549117	SRP099173	SAMN06298506	43,394,220	87%	29%
SRR5242268	SRX2549116	SRP099173	SAMN06298507	23,068,234	82%	25%
SRR5242267	SRX2549115	SRP099173	SAMN06298508	41,461,266	87%	31%
SRR5242266	SRX2549114	SRP099173	SAMN06298509	38,185,322	88%	31%
SRR5242265	SRX2549113	SRP099173	SAMN06298510	24,020,390	87%	32%
SRR5242264	SRX2549112	SRP099173	SAMN06298511	23,210,262	81%	33%
SRR5242263	SRX2549111	SRP099173	SAMN06298512	31,148,928	85%	31%
SRR5296397	SRX2598671	SRP100847	SAMN06446394	17,759,968	0%	22%
SRR5296395	SRX2598669	SRP100847	SAMN06446396	15,254,084	0%	33%
SRR5296394	SRX2598668	SRP100847	SAMN06446397	28,327,108	1%	30%
SRR5296393	SRX2598667	SRP100847	SAMN06446398	11,356,292	1%	22%
SRR5296392	SRX2598666	SRP100847	SAMN06446399	10,040,333	1%	23%
SRR5296391	SRX2598665	SRP100847	SAMN06446400	8,409,080	0%	23%
SRR5296390	SRX2598664	SRP100847	SAMN06446401	17,088,505	1%	25%
SRR5296389	SRX2598663	SRP100847	SAMN06446402	7,929,696	1%	10%
SRR5296399	SRX2598673	SRP100847	SAMN06446404	17,099,017	1%	24%
SRR6841721	SRX3797545	SRP135786	SAMN08718446	47,596,716	87%	28%
SRR8718970	SRX5512790	SRP188305	SAMN11086393	94,132,716	51%	25%
SRR8718969	SRX5512791	SRP188305	SAMN11086394	86,702,490	55%	27%
SRR8718972	SRX5512788	SRP188305	SAMN11086395	99,815,768	52%	24%
SRR8718971	SRX5512789	SRP188305	SAMN11086396	104,145,964	51%	27%
SRR8718966	SRX5512794	SRP188305	SAMN11086397	102,949,626	56%	28%
SRR8718965	SRX5512795	SRP188305	SAMN11086398	116,312,196	48%	26%
SRR8718968	SRX5512792	SRP188305	SAMN11086399	111,656,660	60%	23%
SRR8718967	SRX5512793	SRP188305	SAMN11086400	89,555,690	54%	25%
SRR8718974	SRX5512786	SRP188305	SAMN11086401	88,958,258	56%	25%
SRR8718973	SRX5512787	SRP188305	SAMN11086402	104,085,318	53%	24%
SRR8718982	SRX5512778	SRP188305	SAMN11086403	96,402,608	52%	28%
SRR8718981	SRX5512779	SRP188305	SAMN11086404	106,896,830	55%	29%
SRR8718980	SRX5512780	SRP188305	SAMN11086405	98,275,932	53%	28%
SRR8718979	SRX5512781	SRP188305	SAMN11086406	75,439,606	48%	25%
SRR8718978	SRX5512782	SRP188305	SAMN11086407	87,672,832	55%	26%
SRR8718977	SRX5512783	SRP188305	SAMN11086408	101,768,382	53%	29%
SRR8718976	SRX5512784	SRP188305	SAMN11086409	89,059,914	51%	27%
SRR8718975	SRX5512785	SRP188305	SAMN11086410	94,410,196	52%	27%
SRR8718984	SRX5512776	SRP188305	SAMN11086411	106,535,108	52%	30%
SRR8718983	SRX5512777	SRP188305	SAMN11086412	81,465,012	54%	29%
SRR8718963	SRX5512797	SRP188305	SAMN11086413	95,343,560	54%	25%
SRR8718964	SRX5512796	SRP188305	SAMN11086414	106,651,772	56%	25%
SRR8718961	SRX5512799	SRP188305	SAMN11086415	102,716,990	53%	24%
SRR8718962	SRX5512798	SRP188305	SAMN11086416	101,123,298	53%	27%
SRR10674614	SRX7351871	SRP237291	SAMN13500341	108,130,772	65%	38%
SRR10674613	SRX7351872	SRP237291	SAMN13500342	116,608,094	68%	37%
SRR10674602	SRX7351883	SRP237291	SAMN13500343	99,408,722	70%	39%
SRR10674601	SRX7351884	SRP237291	SAMN13500344	101,257,808	69%	35%
SRR10674600	SRX7351885	SRP237291	SAMN13500345	95,122,662	67%	32%
SRR10674599	SRX7351886	SRP237291	SAMN13500346	92,434,516	69%	29%
SRR10674598	SRX7351887	SRP237291	SAMN13500347	99,849,652	69%	34%
SRR10674597	SRX7351888	SRP237291	SAMN13500348	74,188,382	73%	39%
SRR10674596	SRX7351889	SRP237291	SAMN13500349	102,604,530	69%	37%
SRR10674595	SRX7351890	SRP237291	SAMN13500350	101,081,126	71%	38%
SRR10674612	SRX7351873	SRP237291	SAMN13500351	100,090,060	68%	36%
SRR10674611	SRX7351874	SRP237291	SAMN13500352	107,974,934	68%	38%
SRR10674610	SRX7351875	SRP237291	SAMN13500353	87,678,506	65%	32%
SRR10674609	SRX7351876	SRP237291	SAMN13500354	102,122,202	66%	36%
SRR10674608	SRX7351877	SRP237291	SAMN13500355	87,755,376	64%	36%
SRR10674607	SRX7351878	SRP237291	SAMN13500356	95,749,922	68%	35%
SRR10674606	SRX7351879	SRP237291	SAMN13500357	90,004,290	64%	32%
SRR10674605	SRX7351880	SRP237291	SAMN13500358	90,939,102	65%	32%
SRR10674604	SRX7351881	SRP237291	SAMN13500359	86,990,578	65%	33%
SRR10674603	SRX7351882	SRP237291	SAMN13500360	98,149,898	65%	34%

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Pelodiscus sinensis high-quality model RefSeq (XP_)	10,355	10,044 (97.00%)	10,044 (97.00%)	74.99%	84.80%
Xenopus GenBank	31,757	9,381 (29.54%)	9,381 (29.54%)	68.17%	73.62%
Xenopus known RefSeq (NP_)	19,182	18,295 (95.38%)	18,295 (95.38%)	69.31%	77.79%
Sauropsida GenBank	29,933	17,735 (59.25%)	17,735 (59.25%)	68.20%	74.12%
Sauropsida known RefSeq (NP_)	8,432	7,963 (94.44%)	7,963 (94.44%)	72.47%	80.45%
Same-species GenBank	58	39 (67.24%)	39 (67.24%)	75.94%	82.67%
Same-species known RefSeq (NP_)	11	10 (90.91%)	10 (90.91%)	79.14%	86.14%
Homo sapiens GenBank	149,081	80,849 (54.23%)	80,849 (54.23%)	64.45%	78.39%
Homo sapiens known RefSeq (NP_)	62,674	43,196 (68.92%)	43,196 (68.92%)	69.51%	75.36%

Assembly-assembly alignments of current to previous assembly

When the assembly changes between two rounds of annotation, genes in the current and the previous annotation are mapped to each other using the genomic alignments of the current assembly to the previous assembly so that gene identifiers can be preserved. The success of the remapping depends largely on how well the two assembly versions align to each other.

Below are the percent coverage of one assembly by the other and the average percent identity of the alignments. The 'First pass' alignments are reciprocal best hits, while the 'Total' alignments also include 'Second pass' or non-reciprocal best alignments. For more information about the assembly-assembly alignment process, please visit the NCBI Genome Remapping Service page.

First Pass	Total
Chrysemys_picta_BioNano-3.0.4 (Current) Coverage: 99.87%	Chrysemys_picta_BioNano-3.0.4 (Current) Coverage: 99.87%
Chrysemys_picta_bellii-3.0.3 (Previous) Coverage: 99.80%	Chrysemys_picta_bellii-3.0.3 (Previous) Coverage: 99.80%
Percent Identity: 100.00%	Percent Identity: 99.99%

Comparison of the current and previous annotations

The annotation produced for this release (103) was compared to the annotation in the previous release (102) for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides links to the details of the comparison in tabular format and in a Genome Workbench project.

	Chrysemys_picta_BioNano-3.0.4 (Current) to Chrysemys_picta_bellii-3.0.3 (Previous)
Identical	10%
Minor changes	68%
Major changes	12%
New	10%
Deprecated	6%
Other	<1%
Download the report	tabular, Genome Workbench

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 2:134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20
Minimap2: Li H. Bioinformatics 2018 Sep 15;34(18):3094-3100

RefSeq

Integrated reference sequences