NCBI Myodes glareolus Annotation Release 100

The RefSeq genome records for Myodes glareolus were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
BUSCO results: Annotation completeness assessed with BUSCO
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Myodes glareolus Annotation Release 100.

Annotation release ID: 100
Date of Entrez queries for transcripts and proteins: May 25 2022
Date of submission of annotation to the public databases: May 28 2022
Software version: 9.0

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
Bank_vole1_10x	GCF_902806735.1	LIV	02-22-2020	Reference	1 assembled chromosome; unplaced scaffolds

Gene and feature statistics

Counts and lengths of annotated features are provided below for each assembly.

Feature counts

Feature	Bank_vole1_10x
Genes and pseudogenes	30,720
protein-coding	20,671
non-coding	6,628
Transcribed pseudogenes	104
Non-transcribed pseudogenes	3,084
genes with variants	10,153
Immunoglobulin/T-cell receptor gene segments	183
other	50
mRNAs	46,931
fully-supported	44,997
with > 5% ab initio	757
partial	333
with filled gap(s)	0
known RefSeq (NM_)	0
model RefSeq (XM_)	46,931
non-coding RNAs	9,767
fully-supported	7,989
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	9,375
pseudo transcripts	104
fully-supported	73
with > 5% ab initio	0
partial	0
with filled gap(s)	0
known RefSeq (NR_)	0
model RefSeq (XR_)	104
CDSs	47,127
fully-supported	44,997
with > 5% ab initio	932
partial	336
with major correction(s)	629
known RefSeq (NP_)	0
model RefSeq (XP_)	46,944
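
As a quick consistency check on the table above, the gene categories can be summed and compared with the reported "Genes and pseudogenes" total. The sketch below assumes the six categories are disjoint and that "genes with variants" is a subset of them rather than a separate category; the report does not state this explicitly, and the variable names are illustrative only.

```python
# Counts copied from the "Feature counts" table for Bank_vole1_10x.
# Assumption: these categories are disjoint; "genes with variants" is
# treated as a subset and is therefore not part of the sum.
gene_categories = {
    "protein-coding": 20_671,
    "non-coding": 6_628,
    "transcribed pseudogenes": 104,
    "non-transcribed pseudogenes": 3_084,
    "immunoglobulin/T-cell receptor gene segments": 183,
    "other": 50,
}
reported_total = 30_720  # "Genes and pseudogenes" row


def category_total(counts: dict) -> int:
    """Sum the per-category gene counts."""
    return sum(counts.values())


total = category_total(gene_categories)
print(f"sum of categories: {total:,}")
print(f"matches reported total: {total == reported_total}")
```

Under that assumption, the categories add up to the reported total for this release.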

Detailed reports

The counts below do not include pseudogenes.

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	27,349	34,431	10,629	49	2,115,447
All transcripts	56,698	3,011	2,388	32	103,307
mRNA	46,931	3,314	2,673	102	103,307
misc_RNA	1,050	2,925	2,333	155	12,023
tRNA	390	74	73	59	93
lncRNA	6,950	1,716	1,171	32	18,872
snoRNA	521	114	126	49	317
snRNA	789	120	107	61	197
rRNA	17	253	119	119	1,571
Single-exon transcripts	2,867	1,194	946	102	12,587
coding transcripts (NM_/XM_ )	2,867	1,194	946	102	12,587
CDSs	46,944	1,954	1,431	87	102,075
Exons	244,644	307	138	1	23,179
in coding transcripts (NM_/XM_ )	225,038	291	135	1	23,179
in non-coding transcripts (NR_/XR_ )	27,014	410	162	2	17,242
Introns	216,243	5,095	1,440	30	1,036,355
in coding transcripts (NM_/XM_ )	202,674	5,124	1,431	30	1,036,355
in non-coding transcripts (NR_/XR_ )	20,742	4,535	1,496	30	438,301

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	2.09	1	1	50
Number of exons per transcript	11.1	8	1	304

BUSCO analysis of gene annotation

BUSCO v4.1.4 was run in "protein" mode with the glires_odb10 lineage dataset on the annotated gene set, using the single longest protein per gene. Results are reported for the gene set from the primary assembly unit and are presented in BUSCO notation.

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the UniProtKB/Swiss-Prot curated proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 20,671 coding genes, 20,413 had a protein with an alignment covering 50% or more of the query, and 17,687 had a protein with an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.
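
The coverage definitions above can be illustrated with a minimal sketch. It assumes a single ungapped alignment interval given in 1-based inclusive coordinates, which is a simplification; the function name and the example lengths are hypothetical, and only the gene counts come from this report.

```python
def coverage_percent(aln_start: int, aln_end: int, seq_length: int) -> float:
    """Percentage of a sequence spanned by one alignment interval.

    Coordinates are assumed to be 1-based and inclusive.
    """
    if seq_length <= 0:
        raise ValueError("seq_length must be positive")
    if not 1 <= aln_start <= aln_end <= seq_length:
        raise ValueError("alignment interval must lie within the sequence")
    return 100.0 * (aln_end - aln_start + 1) / seq_length


# Hypothetical alignment: a 400-residue annotated protein (query) aligned
# over residues 11..390 to residues 1..380 of a 500-residue target.
query_cov = coverage_percent(11, 390, 400)
target_cov = coverage_percent(1, 380, 500)
print(f"query coverage:  {query_cov:.1f}%")
print(f"target coverage: {target_cov:.1f}%")

# Share of coding genes at each query-coverage threshold (counts from the report).
coding_genes = 20_671
share_50 = round(100.0 * 20_413 / coding_genes, 2)
share_95 = round(100.0 * 17_687 / coding_genes, 2)
print(f"genes with >= 50% query coverage: {share_50}%")
print(f"genes with >= 95% query coverage: {share_95}%")
```

In this toy case the query is almost fully covered while the target is not, which is why the two measures are reported separately.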

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: UniProtKB/Swiss-Prot curated proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker (if calculated), for each assembly. RepeatMasker results are only calculated for organisms with complete Dfam HMM model collections.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with WindowMasker
Bank_vole1_10x	GCF_902806735.1	30.97%
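
A masked percentage of this kind can be estimated from a sequence file, sketched here under the assumption that masked bases are represented as lowercase letters (soft-masking). The report does not say how the masked genome is encoded, so this is an illustration rather than the pipeline's method; the function name and toy sequence are hypothetical.

```python
def masked_percent(sequence: str) -> float:
    """Percentage of bases that are lowercase (assumed soft-masked)."""
    if not sequence:
        raise ValueError("sequence is empty")
    masked = sum(1 for base in sequence if base.islower())
    return 100.0 * masked / len(sequence)


# Toy sequence: 20 bases, 4 of them lowercase.
toy = "ACGTacgtNNACGTACGTAC"
print(f"{masked_percent(toy):.2f}% masked")
```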

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign, minimap2, or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species GenBank	323	195 (60.37%)	66 (20.43%)	98.72%	95.33%
Mus musculus known RefSeq (NM_/NR_)	48,908	42,246 (86.38%)	12,594 (25.75%)	89.68%	87.81%
Mus musculus GenBank	330,972	162,434 (49.08%)	64,556 (19.50%)	90.34%	92.31%
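
The percentages in the transcript alignment table appear to be expressed relative to the number of sequences retrieved from Entrez. The sketch below recomputes them from the raw counts in the table; the helper name `pct` is illustrative and the two-decimal rounding is an assumption about how the report formats its values.

```python
# (source, retrieved from Entrez, aligned by Splign, passed to Gnomon)
rows = [
    ("Same-species GenBank", 323, 195, 66),
    ("Mus musculus known RefSeq (NM_/NR_)", 48_908, 42_246, 12_594),
    ("Mus musculus GenBank", 330_972, 162_434, 64_556),
]


def pct(part: int, whole: int) -> float:
    """Percentage of `whole` represented by `part`, rounded to two decimals."""
    return round(100.0 * part / whole, 2)


for source, retrieved, aligned, passed in rows:
    print(
        f"{source}: aligned {pct(aligned, retrieved):.2f}%, "
        f"passed to Gnomon {pct(passed, retrieved):.2f}%"
    )
```

Both percentages use the retrieved count as the denominator, so the "passed to Gnomon" value is not a fraction of the aligned sequences.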

RNA-Seq alignments

The following RNA-Seq reads from the Sequence Read Archive were also used for gene prediction:

Alignment statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Publication	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	NA	Aggregate of all aligned samples	5,575,023,954	84%	32%	252,218
SAMN00012283	20565972	Heart transcriptome of the bank vole (Myodes glareolus): towards understanding the evolutionary variation in metabolic rate: unselected controls (Myodes glareolus, SAMN00012283)	571,581	55%	41%	43,549
SAMN00012348	20565972	Heart transcriptome of the bank vole (Myodes glareolus): towards understanding the evolutionary variation in metabolic rate: unselected controls (Myodes glareolus, SAMN00012348)	538,239	57%	41%	42,713
SAMN02371284	NA	liver (Myodes glareolus, 80 days, female, SAMN02371284)	21,294,910	77%	33%	112,497
SAMN02371285	NA	liver (Myodes glareolus, 80 days, male, SAMN02371285)	12,683,802	78%	33%	95,255
SAMN02371286	NA	liver (Myodes glareolus, 80 days, male, SAMN02371286)	17,498,476	78%	34%	109,586
SAMN02371287	NA	liver (Myodes glareolus, 80 days, male, SAMN02371287)	18,312,030	77%	34%	111,704
SAMN02371501	NA	liver (Myodes glareolus, 80 days, female, SAMN02371501)	18,697,650	76%	33%	108,195
SAMN02371502	NA	liver (Myodes glareolus, 80 days, female, SAMN02371502)	16,169,414	78%	33%	106,770
SAMN02371503	NA	liver (Myodes glareolus, 80 days, female, SAMN02371503)	12,184,174	77%	33%	96,680
SAMN02371504	NA	liver (Myodes glareolus, 80 days, male, SAMN02371504)	17,318,986	77%	32%	101,335
SAMN02371505	NA	liver (Myodes glareolus, 80 days, female, SAMN02371505)	11,792,596	73%	32%	95,691
SAMN02371506	NA	liver (Myodes glareolus, 80 days, male, SAMN02371506)	13,954,190	78%	32%	93,124
SAMN02371507	NA	liver (Myodes glareolus, SAMN02371507)	228,251,864	76%	33%	163,208
SAMN04099865	NA	liver (Myodes glareolus, 80 days, pooled male and female, SAMN04099865)	30,058,492	83%	30%	124,654
SAMN04099866	NA	liver (Myodes glareolus, 80 days, pooled male and female, SAMN04099866)	35,566,672	83%	30%	125,619
SAMN04099867	NA	liver (Myodes glareolus, 80 days, pooled male and female, SAMN04099867)	35,112,603	84%	30%	119,289
SAMN04099900	NA	liver (Myodes glareolus, 80 days, pooled male and female, SAMN04099900)	26,716,071	83%	30%	113,838
SAMN04099901	NA	hippocampus (Myodes glareolus, 80 days, pooled male and female, SAMN04099901)	68,114,798	83%	17%	162,468
SAMN04099902	NA	hippocampus (Myodes glareolus, 80 days, pooled male and female, SAMN04099902)	13,284,399	84%	18%	133,371
SAMN04099903	NA	hippocampus (Myodes glareolus, 80 days, pooled male and female, SAMN04099903)	35,000,000	84%	19%	152,803
SAMN04099904	NA	hippocampus (Myodes glareolus, 80 days, pooled male and female, SAMN04099904)	29,425,344	84%	17%	147,259
SAMN04099905	NA	hippocampus (Myodes glareolus, 80 days, pooled male and female, SAMN04099905)	32,612,534	83%	17%	149,130
SAMN04099906	NA	hippocampus (Myodes glareolus, 80 days, pooled male and female, SAMN04099906)	36,827,514	83%	17%	151,075
SAMN04099907	NA	hippocampus (Myodes glareolus, 80 days, pooled male and female, SAMN04099907)	31,020,071	83%	17%	146,397
SAMN04099908	NA	hippocampus (Myodes glareolus, 80 days, pooled male and female, SAMN04099908)	38,241,446	84%	17%	150,940
SAMN04324610	25141177,27427999	adult, liver (Myodes glareolus, SAMN04324610)	14,480,120	81%	36%	108,038
SAMN05860961	NA	spleen (Myodes glareolus, male, SAMN05860961)	91,983,824	83%	33%	166,986
SAMN05860962	NA	liver (Myodes glareolus, male, SAMN05860962)	80,884,120	81%	41%	143,907
SAMN05860963	NA	kidney (Myodes glareolus, male, SAMN05860963)	82,679,224	79%	32%	155,913
SAMN05860964	NA	intestine (Myodes glareolus, male, SAMN05860964)	71,342,568	76%	33%	146,855
SAMN05860965	NA	testis (Myodes glareolus, male, SAMN05860965)	70,687,536	81%	36%	166,486
SAMN05860966	NA	heart (Myodes glareolus, male, SAMN05860966)	77,182,280	73%	31%	141,398
SAMN09835396	NA	liver (Myodes glareolus, female, SAMN09835396)	38,661,430	83%	34%	133,989
SAMN09835397	NA	liver (Myodes glareolus, female, SAMN09835397)	38,292,486	84%	33%	133,183
SAMN09835398	NA	liver (Myodes glareolus, female, SAMN09835398)	32,815,032	83%	33%	129,856
SAMN09835399	NA	liver (Myodes glareolus, female, SAMN09835399)	32,889,262	85%	33%	128,727
SAMN09835400	NA	liver (Myodes glareolus, female, SAMN09835400)	32,010,406	83%	33%	126,781
SAMN09835401	NA	liver (Myodes glareolus, female, SAMN09835401)	41,750,150	84%	34%	132,227
SAMN09835403	NA	liver (Myodes glareolus, female, SAMN09835403)	39,111,320	85%	34%	128,715
SAMN09835404	NA	liver (Myodes glareolus, female, SAMN09835404)	37,570,258	84%	32%	131,582
SAMN09835405	NA	liver (Myodes glareolus, female, SAMN09835405)	30,982,940	85%	34%	128,477
SAMN09835406	NA	liver (Myodes glareolus, female, SAMN09835406)	35,065,240	83%	33%	129,434
SAMN09835407	NA	liver (Myodes glareolus, female, SAMN09835407)	40,915,352	82%	33%	131,903
SAMN09835408	NA	liver (Myodes glareolus, female, SAMN09835408)	36,617,372	82%	33%	131,141
SAMN09835409	NA	liver (Myodes glareolus, female, SAMN09835409)	39,287,878	82%	33%	129,871
SAMN09835410	NA	liver (Myodes glareolus, female, SAMN09835410)	44,052,972	82%	33%	136,828
SAMN09835411	NA	liver (Myodes glareolus, female, SAMN09835411)	39,129,356	84%	32%	144,579
SAMN09835412	NA	liver (Myodes glareolus, female, SAMN09835412)	44,817,832	84%	34%	131,364
SAMN09835413	NA	liver (Myodes glareolus, female, SAMN09835413)	33,712,734	85%	34%	138,850
SAMN09835414	NA	liver (Myodes glareolus, female, SAMN09835414)	40,199,564	82%	34%	137,421
SAMN09835415	NA	liver (Myodes glareolus, female, SAMN09835415)	49,460,964	84%	34%	135,169
SAMN09835416	NA	liver (Myodes glareolus, female, SAMN09835416)	31,418,170	83%	32%	127,818
SAMN09835417	NA	liver (Myodes glareolus, female, SAMN09835417)	35,288,456	84%	30%	150,967
SAMN09835418	NA	liver (Myodes glareolus, female, SAMN09835418)	31,185,900	85%	32%	136,611
SAMN09835419	NA	liver (Myodes glareolus, female, SAMN09835419)	34,374,758	84%	33%	128,335
SAMN09835420	NA	liver (Myodes glareolus, female, SAMN09835420)	30,297,998	83%	32%	129,033
SAMN09835421	NA	liver (Myodes glareolus, female, SAMN09835421)	36,194,126	84%	33%	130,907
SAMN09835422	NA	liver (Myodes glareolus, female, SAMN09835422)	31,807,310	85%	34%	125,949
SAMN09835423	NA	liver (Myodes glareolus, female, SAMN09835423)	36,484,244	82%	33%	133,476
SAMN09835424	NA	liver (Myodes glareolus, female, SAMN09835424)	40,238,284	83%	33%	135,253
SAMN09835425	NA	liver (Myodes glareolus, female, SAMN09835425)	42,726,558	85%	33%	133,207
SAMN09835426	NA	liver (Myodes glareolus, female, SAMN09835426)	43,142,868	85%	33%	135,394
SAMN09835427	NA	liver (Myodes glareolus, female, SAMN09835427)	31,967,664	83%	32%	131,647
SAMN09835428	NA	liver (Myodes glareolus, female, SAMN09835428)	35,538,258	85%	32%	129,590
SAMN09835429	NA	liver (Myodes glareolus, female, SAMN09835429)	32,224,036	84%	33%	132,017
SAMN09835430	NA	liver (Myodes glareolus, female, SAMN09835430)	38,268,006	85%	34%	129,804
SAMN09835431	NA	liver (Myodes glareolus, female, SAMN09835431)	36,841,082	85%	33%	131,357
SAMN09835432	NA	liver (Myodes glareolus, female, SAMN09835432)	34,178,804	83%	32%	129,217
SAMN09835433	NA	liver (Myodes glareolus, female, SAMN09835433)	33,082,156	83%	33%	130,708
SAMN09835434	NA	liver (Myodes glareolus, female, SAMN09835434)	42,779,056	83%	32%	134,408
SAMN09835435	NA	liver (Myodes glareolus, female, SAMN09835435)	41,419,354	86%	34%	133,075
SAMN09835436	NA	spleen (Myodes glareolus, female, SAMN09835436)	39,220,066	83%	26%	148,991
SAMN09835437	NA	spleen (Myodes glareolus, female, SAMN09835437)	35,353,328	86%	29%	151,215
SAMN09835438	NA	spleen (Myodes glareolus, female, SAMN09835438)	33,124,612	85%	28%	148,269
SAMN09835439	NA	spleen (Myodes glareolus, female, SAMN09835439)	29,771,326	84%	27%	145,970
SAMN09835440	NA	spleen (Myodes glareolus, female, SAMN09835440)	31,259,898	84%	27%	149,534
SAMN09835441	NA	spleen (Myodes glareolus, female, SAMN09835441)	34,476,780	85%	28%	146,843
SAMN09835442	NA	spleen (Myodes glareolus, female, SAMN09835442)	30,105,580	83%	26%	148,042
SAMN09835443	NA	spleen (Myodes glareolus, female, SAMN09835443)	34,155,290	83%	27%	150,711
SAMN09835444	NA	spleen (Myodes glareolus, female, SAMN09835444)	34,390,170	85%	28%	140,607
SAMN09835445	NA	spleen (Myodes glareolus, female, SAMN09835445)	39,401,420	85%	28%	149,580
SAMN09835446	NA	spleen (Myodes glareolus, female, SAMN09835446)	38,874,148	85%	28%	151,489
SAMN09835447	NA	spleen (Myodes glareolus, female, SAMN09835447)	39,893,674	83%	26%	152,908
SAMN09835448	NA	spleen (Myodes glareolus, female, SAMN09835448)	34,312,796	84%	27%	151,825
SAMN09835449	NA	spleen (Myodes glareolus, female, SAMN09835449)	35,460,232	86%	28%	141,745
SAMN09835450	NA	spleen (Myodes glareolus, female, SAMN09835450)	34,053,416	83%	27%	151,419
SAMN09835451	NA	spleen (Myodes glareolus, female, SAMN09835451)	33,006,586	85%	29%	138,400
SAMN09835452	NA	spleen (Myodes glareolus, female, SAMN09835452)	31,580,576	84%	28%	145,482
SAMN09835453	NA	spleen (Myodes glareolus, female, SAMN09835453)	35,434,664	86%	29%	134,204
SAMN09835454	NA	spleen (Myodes glareolus, female, SAMN09835454)	29,699,030	85%	28%	140,143
SAMN09835455	NA	spleen (Myodes glareolus, female, SAMN09835455)	38,969,958	83%	27%	151,040
SAMN09835456	NA	spleen (Myodes glareolus, female, SAMN09835456)	39,608,324	86%	28%	145,559
SAMN09835457	NA	spleen (Myodes glareolus, female, SAMN09835457)	40,528,516	84%	29%	154,702
SAMN09835458	NA	spleen (Myodes glareolus, female, SAMN09835458)	40,421,212	85%	25%	155,318
SAMN09835459	NA	spleen (Myodes glareolus, female, SAMN09835459)	34,013,240	87%	28%	140,945
SAMN09835460	NA	spleen (Myodes glareolus, female, SAMN09835460)	42,277,654	86%	27%	152,697
SAMN09835461	NA	spleen (Myodes glareolus, female, SAMN09835461)	41,008,252	85%	28%	150,574
SAMN09835462	NA	spleen (Myodes glareolus, female, SAMN09835462)	37,302,666	83%	27%	152,554
SAMN09835463	NA	spleen (Myodes glareolus, female, SAMN09835463)	40,729,544	85%	27%	148,106
SAMN09835464	NA	spleen (Myodes glareolus, female, SAMN09835464)	38,204,296	86%	28%	146,052
SAMN09835465	NA	spleen (Myodes glareolus, female, SAMN09835465)	28,759,388	85%	27%	142,969
SAMN09835466	NA	spleen (Myodes glareolus, female, SAMN09835466)	44,856,032	86%	28%	149,880
SAMN09835467	NA	spleen (Myodes glareolus, female, SAMN09835467)	46,347,188	87%	28%	151,827
SAMN09835468	NA	spleen (Myodes glareolus, female, SAMN09835468)	36,757,272	84%	26%	152,777
SAMN09835469	NA	spleen (Myodes glareolus, female, SAMN09835469)	31,168,052	85%	28%	147,386
SAMN09835470	NA	spleen (Myodes glareolus, female, SAMN09835470)	40,458,524	84%	27%	151,927
SAMN09835471	NA	spleen (Myodes glareolus, female, SAMN09835471)	38,784,686	85%	28%	145,786
SAMN09835472	NA	spleen (Myodes glareolus, female, SAMN09835472)	38,564,946	84%	27%	154,576
SAMN09835473	NA	spleen (Myodes glareolus, female, SAMN09835473)	37,845,804	85%	29%	145,862
SAMN09835474	NA	spleen (Myodes glareolus, female, SAMN09835474)	38,899,834	86%	27%	148,028
SAMN09835475	NA	spleen (Myodes glareolus, female, SAMN09835475)	33,390,558	85%	27%	147,094
SAMN12285027	NA	spleen (Myodes glareolus, male, SAMN12285027)	94,694,332	88%	36%	173,576
SAMN12285030	NA	spleen (Myodes glareolus, male, SAMN12285030)	68,784,816	83%	40%	169,508
SAMN12285031	NA	spleen (Myodes glareolus, male, SAMN12285031)	72,643,226	86%	35%	170,550
SAMN12285034	NA	spleen (Myodes glareolus, male, SAMN12285034)	71,562,664	88%	38%	169,621
SAMN12285037	NA	spleen (Myodes glareolus, male, SAMN12285037)	67,113,526	86%	39%	166,152
SAMN12285038	NA	spleen (Myodes glareolus, male, SAMN12285038)	72,114,558	88%	41%	161,385
SAMN12285041	NA	spleen (Myodes glareolus, male, SAMN12285041)	73,660,692	86%	38%	163,884
SAMN12285042	NA	spleen (Myodes glareolus, male, SAMN12285042)	74,371,418	86%	38%	168,220
SAMN12285045	NA	spleen (Myodes glareolus, male, SAMN12285045)	65,660,834	87%	39%	169,881
SAMN12285046	NA	spleen (Myodes glareolus, male, SAMN12285046)	77,420,718	89%	40%	164,900
SAMN12285048	NA	spleen (Myodes glareolus, male, SAMN12285048)	79,936,512	86%	37%	170,295
SAMN12285050	NA	spleen (Myodes glareolus, male, SAMN12285050)	92,453,130	87%	38%	174,843
SAMN12285053	NA	spleen (Myodes glareolus, male, SAMN12285053)	81,826,548	89%	41%	165,189
SAMN12285054	NA	spleen (Myodes glareolus, male, SAMN12285054)	79,052,290	87%	38%	169,309
SAMN12285057	NA	spleen (Myodes glareolus, male, SAMN12285057)	77,975,876	87%	39%	167,999
SAMN12285059	NA	spleen (Myodes glareolus, male, SAMN12285059)	83,820,160	88%	40%	161,479
SAMN12285060	NA	spleen (Myodes glareolus, male, SAMN12285060)	73,850,708	88%	40%	154,796
SAMN12285061	NA	spleen (Myodes glareolus, male, SAMN12285061)	68,325,244	87%	40%	161,020
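
The by-sample table is tab-delimited, with thousands separators in the counts and percent signs in the rates. A minimal sketch of parsing one row into typed fields follows, assuming the seven-column layout shown in the header above; the `SampleStats` class and `parse_row` function are hypothetical names, not part of any NCBI tool.

```python
from dataclasses import dataclass


@dataclass
class SampleStats:
    sample_id: str
    reads: int
    pct_aligned: int
    pct_with_introns: int
    introns: int


def parse_row(line: str) -> SampleStats:
    """Parse one tab-delimited row of the by-sample RNA-Seq table."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 7:
        raise ValueError(f"expected 7 tab-separated fields, got {len(fields)}")
    sample_id, _publication, _track, reads, aligned, with_introns, introns = fields
    return SampleStats(
        sample_id=sample_id,
        reads=int(reads.replace(",", "")),
        pct_aligned=int(aligned.rstrip("%")),
        pct_with_introns=int(with_introns.rstrip("%")),
        introns=int(introns.replace(",", "")),
    )


# Row for SAMN04324610, rebuilt with explicit tab separators.
row = "\t".join([
    "SAMN04324610",
    "25141177,27427999",
    "adult, liver (Myodes glareolus, SAMN04324610)",
    "14,480,120",
    "81%",
    "36%",
    "108,038",
])
stats = parse_row(row)
# Rough number of aligned reads implied by the rounded percentage.
approx_aligned = stats.reads * stats.pct_aligned // 100
print(stats)
print(f"approximately {approx_aligned:,} aligned reads")
```

Because the table reports percentages rounded to whole numbers, any read count derived this way is only approximate.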

Alignment statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
SRR042424	SRX020000	SRP002377	SAMN00012283	571,581	55%	41%
SRR042425	SRX020002	SRP002377	SAMN00012348	538,239	57%	41%
SRR1010537	SRX363393	SRP030778	SAMN02371284	21,294,910	77%	33%
SRR1010834	SRX363397	SRP030778	SAMN02371285	12,683,802	78%	33%
SRR1010835	SRX363396	SRP030778	SAMN02371286	17,498,476	78%	34%
SRR1010837	SRX363394	SRP030778	SAMN02371287	18,312,030	77%	34%
SRR1010631	SRX363401	SRP030778	SAMN02371501	18,697,650	76%	33%
SRR1010832	SRX363399	SRP030778	SAMN02371502	16,169,414	78%	33%
SRR1010630	SRX363402	SRP030778	SAMN02371503	12,184,174	77%	33%
SRR1010629	SRX363403	SRP030778	SAMN02371504	17,318,986	77%	32%
SRR1010833	SRX363398	SRP030778	SAMN02371505	11,792,596	73%	32%
SRR1010836	SRX363395	SRP030778	SAMN02371506	13,954,190	78%	32%
SRR1010821	SRX363400	SRP030778	SAMN02371507	228,251,864	76%	33%
SRR2980764	SRX1470201	SRP042651	SAMN04324610	14,480,120	81%	36%
SRR3659016	SRX1838076	SRP064032	SAMN04099865	30,058,492	83%	30%
SRR3659017	SRX1838077	SRP064032	SAMN04099866	35,566,672	83%	30%
SRR3659018	SRX1838078	SRP064032	SAMN04099867	35,112,603	84%	30%
SRR3659019	SRX1838079	SRP064032	SAMN04099900	26,716,071	83%	30%
SRR3658988	SRX1838047	SRP064032	SAMN04099901	68,114,798	83%	17%
SRR3658989	SRX1838048	SRP064032	SAMN04099902	13,284,399	84%	18%
SRR3658990	SRX1838050	SRP064032	SAMN04099903	35,000,000	84%	19%
SRR3659003	SRX1838051	SRP064032	SAMN04099904	29,425,344	84%	17%
SRR3658984	SRX1838042	SRP064032	SAMN04099905	32,612,534	83%	17%
SRR3658985	SRX1838044	SRP064032	SAMN04099906	36,827,514	83%	17%
SRR3658986	SRX1838045	SRP064032	SAMN04099907	31,020,071	83%	17%
SRR3658987	SRX1838046	SRP064032	SAMN04099908	38,241,446	84%	17%
SRR4342180	SRX2209235	SRP081142	SAMN05860961	91,983,824	83%	33%
SRR4342181	SRX2209236	SRP081142	SAMN05860962	80,884,120	81%	41%
SRR4342182	SRX2209237	SRP081142	SAMN05860963	82,679,224	79%	32%
SRR4342183	SRX2209238	SRP081142	SAMN05860964	71,342,568	76%	33%
SRR4342184	SRX2209239	SRP081142	SAMN05860965	70,687,536	81%	36%
SRR4342185	SRX2209240	SRP081142	SAMN05860966	77,182,280	73%	31%
SRR7695594	SRX4553909	SRP157977	SAMN09835396	38,661,430	83%	34%
SRR7695593	SRX4553910	SRP157977	SAMN09835397	38,292,486	84%	33%
SRR7695596	SRX4553907	SRP157977	SAMN09835398	32,815,032	83%	33%
SRR7695595	SRX4553908	SRP157977	SAMN09835399	32,889,262	85%	33%
SRR7695590	SRX4553913	SRP157977	SAMN09835400	32,010,406	83%	33%
SRR7695550	SRX4553953	SRP157977	SAMN09835401	41,750,150	84%	34%
SRR7695592	SRX4553911	SRP157977	SAMN09835403	39,111,320	85%	34%
SRR7695591	SRX4553912	SRP157977	SAMN09835404	37,570,258	84%	32%
SRR7695560	SRX4553943	SRP157977	SAMN09835405	30,982,940	85%	34%
SRR7695587	SRX4553916	SRP157977	SAMN09835406	35,065,240	83%	33%
SRR7695585	SRX4553918	SRP157977	SAMN09835407	40,915,352	82%	33%
SRR7695584	SRX4553919	SRP157977	SAMN09835408	36,617,372	82%	33%
SRR7695583	SRX4553920	SRP157977	SAMN09835409	39,287,878	82%	33%
SRR7695582	SRX4553921	SRP157977	SAMN09835410	44,052,972	82%	33%
SRR7695581	SRX4553922	SRP157977	SAMN09835411	39,129,356	84%	32%
SRR7695580	SRX4553923	SRP157977	SAMN09835412	44,817,832	84%	34%
SRR7695579	SRX4553924	SRP157977	SAMN09835413	33,712,734	85%	34%
SRR7695578	SRX4553925	SRP157977	SAMN09835414	40,199,564	82%	34%
SRR7695577	SRX4553926	SRP157977	SAMN09835415	49,460,964	84%	34%
SRR7695576	SRX4553927	SRP157977	SAMN09835416	31,418,170	83%	32%
SRR7695620	SRX4553883	SRP157977	SAMN09835417	35,288,456	84%	30%
SRR7695621	SRX4553882	SRP157977	SAMN09835418	31,185,900	85%	32%
SRR7695618	SRX4553885	SRP157977	SAMN09835419	34,374,758	84%	33%
SRR7695619	SRX4553884	SRP157977	SAMN09835420	30,297,998	83%	32%
SRR7695616	SRX4553887	SRP157977	SAMN09835421	36,194,126	84%	33%
SRR7695617	SRX4553886	SRP157977	SAMN09835422	31,807,310	85%	34%
SRR7695614	SRX4553889	SRP157977	SAMN09835423	36,484,244	82%	33%
SRR7695615	SRX4553888	SRP157977	SAMN09835424	40,238,284	83%	33%
SRR7695623	SRX4553880	SRP157977	SAMN09835425	42,726,558	85%	33%
SRR7695624	SRX4553879	SRP157977	SAMN09835426	43,142,868	85%	33%
SRR7695575	SRX4553928	SRP157977	SAMN09835427	31,967,664	83%	32%
SRR7695574	SRX4553929	SRP157977	SAMN09835428	35,538,258	85%	32%
SRR7695605	SRX4553898	SRP157977	SAMN09835429	32,224,036	84%	33%
SRR7695589	SRX4553914	SRP157977	SAMN09835430	38,268,006	85%	34%
SRR7695571	SRX4553932	SRP157977	SAMN09835431	36,841,082	85%	33%
SRR7695570	SRX4553933	SRP157977	SAMN09835432	34,178,804	83%	32%
SRR7695573	SRX4553930	SRP157977	SAMN09835433	33,082,156	83%	33%
SRR7695572	SRX4553931	SRP157977	SAMN09835434	42,779,056	83%	32%
SRR7695556	SRX4553947	SRP157977	SAMN09835435	41,419,354	86%	34%
SRR7695610	SRX4553893	SRP157977	SAMN09835436	39,220,066	83%	26%
SRR7695551	SRX4553952	SRP157977	SAMN09835437	35,353,328	86%	29%
SRR7695611	SRX4553892	SRP157977	SAMN09835438	33,124,612	85%	28%
SRR7695612	SRX4553891	SRP157977	SAMN09835439	29,771,326	84%	27%
SRR7695613	SRX4553890	SRP157977	SAMN09835440	31,259,898	84%	27%
SRR7695606	SRX4553897	SRP157977	SAMN09835441	34,476,780	85%	28%
SRR7695607	SRX4553896	SRP157977	SAMN09835442	30,105,580	83%	26%
SRR7695608	SRX4553895	SRP157977	SAMN09835443	34,155,290	83%	27%
SRR7695609	SRX4553894	SRP157977	SAMN09835444	34,390,170	85%	28%
SRR7695602	SRX4553901	SRP157977	SAMN09835445	39,401,420	85%	28%
SRR7695603	SRX4553900	SRP157977	SAMN09835446	38,874,148	85%	28%
SRR7695554	SRX4553949	SRP157977	SAMN09835447	39,893,674	83%	26%
SRR7695599	SRX4553904	SRP157977	SAMN09835448	34,312,796	84%	27%
SRR7695564	SRX4553939	SRP157977	SAMN09835449	35,460,232	86%	28%
SRR7695561	SRX4553942	SRP157977	SAMN09835450	34,053,416	83%	27%
SRR7695558	SRX4553945	SRP157977	SAMN09835451	33,006,586	85%	29%
SRR7695588	SRX4553915	SRP157977	SAMN09835452	31,580,576	84%	28%
SRR7695586	SRX4553917	SRP157977	SAMN09835453	35,434,664	86%	29%
SRR7695552	SRX4553951	SRP157977	SAMN09835454	29,699,030	85%	28%
SRR7695604	SRX4553899	SRP157977	SAMN09835455	38,969,958	83%	27%
SRR7695598	SRX4553905	SRP157977	SAMN09835456	39,608,324	86%	28%
SRR7695546	SRX4553957	SRP157977	SAMN09835457	40,528,516	84%	29%
SRR7695549	SRX4553954	SRP157977	SAMN09835458	40,421,212	85%	25%
SRR7695548	SRX4553955	SRP157977	SAMN09835459	34,013,240	87%	28%
SRR7695622	SRX4553881	SRP157977	SAMN09835460	42,277,654	86%	27%
SRR7695555	SRX4553948	SRP157977	SAMN09835461	41,008,252	85%	28%
SRR7695557	SRX4553946	SRP157977	SAMN09835462	37,302,666	83%	27%
SRR7695597	SRX4553906	SRP157977	SAMN09835463	40,729,544	85%	27%
SRR7695553	SRX4553950	SRP157977	SAMN09835464	38,204,296	86%	28%
SRR7695600	SRX4553903	SRP157977	SAMN09835465	28,759,388	85%	27%
SRR7695601	SRX4553902	SRP157977	SAMN09835466	44,856,032	86%	28%
SRR7695563	SRX4553940	SRP157977	SAMN09835467	46,347,188	87%	28%
SRR7695562	SRX4553941	SRP157977	SAMN09835468	36,757,272	84%	26%
SRR7695565	SRX4553938	SRP157977	SAMN09835469	31,168,052	85%	28%
SRR7695547	SRX4553956	SRP157977	SAMN09835470	40,458,524	84%	27%
SRR7695567	SRX4553936	SRP157977	SAMN09835471	38,784,686	85%	28%
SRR7695566	SRX4553937	SRP157977	SAMN09835472	38,564,946	84%	27%
SRR7695569	SRX4553934	SRP157977	SAMN09835473	37,845,804	85%	29%
SRR7695568	SRX4553935	SRP157977	SAMN09835474	38,899,834	86%	27%
SRR7695559	SRX4553944	SRP157977	SAMN09835475	33,390,558	85%	27%
SRR9990613	SRX6729984	SRP218649	SAMN12285027	94,694,332	88%	36%
SRR9990612	SRX6729985	SRP218649	SAMN12285030	68,784,816	83%	40%
SRR9990609	SRX6729988	SRP218649	SAMN12285031	72,643,226	86%	35%
SRR9990608	SRX6729989	SRP218649	SAMN12285034	71,562,664	88%	38%
SRR9990596	SRX6730001	SRP218649	SAMN12285037	67,113,526	86%	39%
SRR9990597	SRX6730000	SRP218649	SAMN12285038	72,114,558	88%	41%
SRR9990592	SRX6730005	SRP218649	SAMN12285041	73,660,692	86%	38%
SRR9990593	SRX6730004	SRP218649	SAMN12285042	74,371,418	86%	38%
SRR9990603	SRX6729994	SRP218649	SAMN12285045	65,660,834	87%	39%
SRR9990604	SRX6729993	SRP218649	SAMN12285046	77,420,718	89%	40%
SRR9990623	SRX6729974	SRP218649	SAMN12285048	79,936,512	86%	37%
SRR9990621	SRX6729976	SRP218649	SAMN12285050	92,453,130	87%	38%
SRR9990618	SRX6729979	SRP218649	SAMN12285053	81,826,548	89%	41%
SRR9990617	SRX6729980	SRP218649	SAMN12285054	79,052,290	87%	38%
SRR9990598	SRX6729999	SRP218649	SAMN12285057	77,975,876	87%	39%
SRR9990600	SRX6729997	SRP218649	SAMN12285059	83,820,160	88%	40%
SRR9990601	SRX6729996	SRP218649	SAMN12285060	73,850,708	88%	40%
SRR9990602	SRX6729995	SRP218649	SAMN12285061	68,325,244	87%	40%

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Mus musculus known RefSeq (NP_)	40,622	39,046 (96.12%)	39,046 (96.12%)	77.51%	87.42%
Rattus norvegicus known RefSeq (NP_)	19,648	19,074 (97.08%)	19,074 (97.08%)	76.28%	89.09%
Same-species GenBank	30	28 (93.33%)	28 (93.33%)	79.79%	94.44%
Homo sapiens known RefSeq (NP_)	64,649	60,891 (94.19%)	60,891 (94.19%)	77.94%	84.92%

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
BUSCO: Manni M, Berkeley MR, Seppey M, Simão FA, Zdobnov EM. Molecular Biology and Evolution 2021, 38(10):4647-54
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 22(2):134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20
Minimap2: Li H. Bioinformatics 2018, 34(18):3094-100
