NCBI Biomphalaria glabrata Annotation Release GCF_947242115.1-RS_2023_05

The genome sequence records for Biomphalaria glabrata RefSeq assembly GCF_947242115.1 (xgBioGlab47.1) were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
BUSCO results: Annotation completeness assessed with BUSCO
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction
Similarity of current and previous assembly: The similarity of the current and previous assembly
Comparison of the current and previous annotations: What proportion of the genes changed in this annotation

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as "GCF_947242115.1-RS_2023_05".

Date of Entrez queries for transcripts and proteins: May 1 2023
Date of submission of annotation to the public databases: May 10 2023
Software version: 10.1

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
xgBioGlab47.1	GCF_947242115.1	WELLCOME SANGER INSTITUTE	10-29-2022	Reference	18 assembled chromosomes; unplaced scaffolds

Gene and feature statistics

Counts and lengths of annotated features are provided below for each assembly.

Feature counts

Feature	xgBioGlab47.1
Genes and pseudogenes	26,406
  protein-coding	22,006
  non-coding	4,181
  Transcribed pseudogenes	2
  Non-transcribed pseudogenes	217
  genes with variants	11,587
  Immunoglobulin/T-cell receptor gene segments	0
  other	0
mRNAs	53,149
  fully-supported	50,642
  with > 5% ab initio	2,052
  partial	63
  with filled gap(s)	1
  known RefSeq (NM_)	47
  model RefSeq (XM_)	53,102
non-coding RNAs	7,845
  fully-supported	6,897
  with > 5% ab initio	0
  partial	0
  with filled gap(s)	0
  known RefSeq (NR_)	0
  model RefSeq (XR_)	7,280
pseudo transcripts	2
  fully-supported	2
  with > 5% ab initio	0
  partial	0
  with filled gap(s)	0
  known RefSeq (NR_)	0
  model RefSeq (XR_)	2
CDSs	53,149
  fully-supported	50,642
  with > 5% ab initio	2,116
  partial	62
  with major correction(s)	59
  known RefSeq (NP_)	47
  model RefSeq (XP_)	53,102
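
The totals in the table above can be cross-checked against their sub-categories. The following is a minimal sketch only: the dictionary keys and variable names are hypothetical, and the values are copied from the table.

```python
# Gene classes copied from the feature counts table for xgBioGlab47.1.
# The key names are illustrative and are not identifiers used by the pipeline.
gene_classes = {
    "protein_coding": 22_006,
    "non_coding": 4_181,
    "transcribed_pseudogenes": 2,
    "non_transcribed_pseudogenes": 217,
}
reported_genes_and_pseudogenes = 26_406

# The four gene classes should add up to the "Genes and pseudogenes" row.
gene_total = sum(gene_classes.values())
print("genes and pseudogenes:", gene_total,
      "matches report:", gene_total == reported_genes_and_pseudogenes)

# mRNAs are split into known (NM_) and model (XM_) RefSeq records.
known_mrna, model_mrna = 47, 53_102
mrna_total = known_mrna + model_mrna
print("mRNAs:", mrna_total)
```

Note that "genes with variants" is not part of the sum: it counts genes that have more than one transcript and overlaps the other classes.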

Detailed reports

The counts below do not include pseudogenes.

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	26,187	23,188	12,381	62	641,611
All transcripts	60,994	3,981	3,129	62	90,247
  mRNA	53,149	4,222	3,358	99	90,247
  misc_RNA	1,746	4,613	3,111	240	43,043
  tRNA	565	74	73	69	87
  lncRNA	5,152	1,981	1,455	99	26,734
  snoRNA	64	111	74	62	262
  snRNA	147	146	139	101	193
  rRNA	171	526	119	119	3,790
Single-exon transcripts	1,376	1,452	1,050	285	18,669
  coding transcripts (NM_/XM_)	1,376	1,452	1,050	285	18,669
CDSs	53,149	2,147	1,467	99	87,618
Exons	262,611	366	142	3	42,794
  in coding transcripts (NM_/XM_)	246,359	350	141	3	42,794
  in non-coding transcripts (NR_/XR_)	23,957	488	156	10	42,740
Introns	235,793	3,092	1,021	30	430,080
  in coding transcripts (NM_/XM_)	224,087	3,097	1,020	30	430,080
  in non-coding transcripts (NR_/XR_)	19,085	2,910	1,043	30	249,067

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	2.36	1	1	50
Number of exons per transcript	11.75	8	1	302

BUSCO analysis of gene annotation

BUSCO v4.1.4 was run in "protein" mode on the annotated gene set, using the longest protein per gene and the mollusca_odb10 lineage dataset. Results are reported for the gene set from the primary assembly unit, and presented in BUSCO notation.

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the UniProtKB/Swiss-Prot curated proteins, using the annotated proteins as the query and the high-quality proteins as the target. Out of 22,006 coding genes, 13,637 genes had a protein with an alignment covering 50% or more of the query and 2,869 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.
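
The coverage definitions above can be illustrated with a short sketch. It assumes a single contiguous alignment described by 1-based inclusive coordinates and ignores gaps; the exact calculation used by the pipeline is not stated in this report, and the function name and the example alignment are hypothetical. The last lines express the gene counts quoted above as fractions of the 22,006 coding genes.

```python
def coverage_percent(aligned_start: int, aligned_end: int, seq_length: int) -> float:
    """Percentage of a sequence included in an alignment.

    Coordinates are 1-based and inclusive. This is a simplification that
    treats the alignment as one contiguous span on the sequence.
    """
    if seq_length <= 0:
        raise ValueError("seq_length must be positive")
    aligned_length = aligned_end - aligned_start + 1
    return 100.0 * aligned_length / seq_length


# Hypothetical alignment: residues 11-400 of a 500-residue annotated protein
# (query) aligned to residues 1-390 of a 400-residue Swiss-Prot protein (target).
query_cov = coverage_percent(11, 400, 500)
target_cov = coverage_percent(1, 390, 400)
print(f"query coverage:  {query_cov:.1f}%")
print(f"target coverage: {target_cov:.1f}%")

# Gene counts quoted in this section, as percentages of all coding genes.
coding_genes = 22_006
pct_genes_cov50 = round(100.0 * 13_637 / coding_genes, 2)
pct_genes_cov95 = round(100.0 * 2_869 / coding_genes, 2)
print(f"genes with >= 50% query coverage: {pct_genes_cov50}%")
print(f"genes with >= 95% query coverage: {pct_genes_cov95}%")
```

In this example the query coverage passes the 50% threshold but not the 95% threshold, while the target coverage passes both.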

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: UniProtKB/Swiss-Prot curated proteins

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker (if calculated), for each assembly. RepeatMasker results are only calculated for organisms with complete Dfam HMM model collections.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with WindowMasker
xgBioGlab47.1	GCF_947242115.1	45.01%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez Nucleotide, Entrez Protein, and SRA, and aligned to the genome.

Transcript alignments

The alignments of the following transcripts with Splign were used for gene prediction:

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species known RefSeq (NM_/NR_)	52	52 (100.00%)	46 (88.46%)	99.42%	99.82%
Same-species Genbank	510	496 (97.25%)	426 (83.53%)	98.79%	98.99%
Same-species TSA	84	80 (95.24%)	47 (55.95%)	98.27%	99.30%
Same-species EST	54,364	42,915 (78.94%)	34,489 (63.44%)	98.11%	98.93%
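
The percentages in the table above can be recomputed from the raw counts. The sketch below is illustrative only (the names `rows` and `percent` are hypothetical); it suggests that both percentage columns are expressed relative to the number of sequences retrieved from Entrez, not relative to the number aligned.

```python
# (retrieved from Entrez, aligned by Splign, passed to Gnomon), copied from the table.
rows = {
    "Same-species known RefSeq (NM_/NR_)": (52, 52, 46),
    "Same-species Genbank": (510, 496, 426),
    "Same-species TSA": (84, 80, 47),
    "Same-species EST": (54_364, 42_915, 34_489),
}


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100.0 * part / whole, 2)


for source, (retrieved, aligned, passed) in rows.items():
    print(
        f"{source}: aligned {percent(aligned, retrieved):.2f}%, "
        f"passed to Gnomon {percent(passed, retrieved):.2f}%"
    )
```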

RNA-Seq alignments

The alignments of the following RNA-Seq reads with STAR were also used for gene prediction:

Alignment statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Publication	Track name	Number of reads	Percent aligned reads	Percent of aligned reads with introns	Number of introns
All	NA	Aggregate of all aligned samples	4,939,677,663	67%	29%	291,318
SAMEA10417723	NA	WHOLE ORGANISM (Biomphalaria glabrata, SAMEA10417723)	44,373,416	73%	32%	196,898
SAMN02261944	NA	Whole organism (Biomphalaria glabrata, 3-6 months, hermaphrodite, SAMN02261944)	903,781,752	86%	17%	263,319
SAMN02905163	28508897,15562597	Sample from Biomphalaria glabrata (Biomphalaria glabrata, SAMN02905163)	40,515,146	58%	19%	169,698
SAMN02905165	28508897,15562597	Sample from Biomphalaria glabrata (Biomphalaria glabrata, SAMN02905165)	55,201,758	55%	18%	160,344
SAMN02905166	28508897,15562597	Sample from Biomphalaria glabrata (Biomphalaria glabrata, SAMN02905166)	58,428,384	61%	28%	137,658
SAMN02905167	28508897,15562597	Sample from Biomphalaria glabrata (Biomphalaria glabrata, SAMN02905167)	26,624,570	58%	18%	139,348
SAMN02905168	28508897,15562597	Sample from Biomphalaria glabrata (Biomphalaria glabrata, SAMN02905168)	29,748,868	55%	18%	146,991
SAMN02905169	28508897,15562597	Sample from Biomphalaria glabrata (Biomphalaria glabrata, SAMN02905169)	36,784,004	50%	18%	147,149
SAMN02905171	28508897,15562597	Sample from Biomphalaria glabrata (Biomphalaria glabrata, SAMN02905171)	45,486,220	58%	18%	159,307
SAMN02905172	28508897,15562597	Sample from Biomphalaria glabrata (Biomphalaria glabrata, SAMN02905172)	59,063,042	55%	16%	137,643
SAMN02905173	28508897,15562597	Sample from Biomphalaria glabrata (Biomphalaria glabrata, SAMN02905173)	73,130,492	51%	18%	163,630
SAMN02905174	28508897,15562597	Sample from Biomphalaria glabrata (Biomphalaria glabrata, SAMN02905174)	86,491,800	49%	14%	157,507
SAMN03112928	NA	whole body, R haplotype, no exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03112928)	394,228,523	75%	32%	247,240
SAMN03112929	NA	whole body, S1 haplotype, no exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03112929)	187,890,152	75%	31%	227,765
SAMN03112930	NA	whole body, S2 haplotype, no exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03112930)	192,312,920	75%	33%	236,369
SAMN03264585	NA	whole body, fam 1d3, R haplotype, 2 hr after exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03264585)	15,105,626	67%	25%	128,885
SAMN03264586	NA	whole body, fam 1d3, R haplotype, 6 hr after exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03264586)	17,649,218	55%	25%	125,418
SAMN03264587	NA	whole body, fam 9a1, R haplotype, 2hr after exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03264587)	15,917,381	63%	29%	115,666
SAMN03264588	NA	whole body, fam 9a1, R haplotype, 6 hr after exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03264588)	14,660,058	49%	27%	121,600
SAMN03264589	NA	whole body, fam A01, R haplotype, 2 hr after exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03264589)	16,383,793	62%	28%	125,715
SAMN03264590	NA	whole body, fam A01, R haplotype, 6 hr after exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03264590)	15,295,285	59%	26%	133,915
SAMN03264591	NA	whole body, fam c59, S1 haplotype, 2 hr after exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03264591)	15,064,426	52%	26%	138,178
SAMN03264592	NA	whole body, fam c59, S1 haplotype, 6 hr after exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03264592)	14,706,170	55%	24%	141,762
SAMN03264593	NA	whole body, fam c15, S2 haplotype, 2 hr after exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03264593)	14,045,984	64%	26%	137,283
SAMN03264594	NA	whole body, fam c15, S2 haplotype, 6 hr after exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03264594)	14,719,725	54%	25%	135,395
SAMN03264595	NA	whole body, fam 18d, S2 haplotype, 2 hr after exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03264595)	14,900,609	59%	27%	134,756
SAMN03264596	NA	whole body, fam 18d, S2 haplotype, 6 hr after exposure (Biomphalaria glabrata, juvenile, hermaphrodite, SAMN03264596)	18,310,412	52%	26%	142,911
SAMN03568798	NA	tentacle (Biomphalaria glabrata, SAMN03568798)	51,206,204	77%	19%	149,394
SAMN04359736	NA	mixed (Biomphalaria glabrata, SAMN04359736)	105,296,148	77%	30%	200,411
SAMN13390092	NA	whole snail (Biomphalaria glabrata, SAMN13390092)	29,758,242	68%	38%	185,179
SAMN13390093	NA	whole snail (Biomphalaria glabrata, SAMN13390093)	28,884,538	70%	38%	187,621
SAMN13390094	NA	whole snail (Biomphalaria glabrata, SAMN13390094)	28,940,484	69%	38%	191,517
SAMN13390095	NA	whole snail (Biomphalaria glabrata, SAMN13390095)	28,933,058	70%	38%	188,298
SAMN13390096	NA	whole snail (Biomphalaria glabrata, SAMN13390096)	27,601,658	69%	38%	187,702
SAMN13390097	NA	whole snail (Biomphalaria glabrata, SAMN13390097)	27,071,700	67%	37%	186,321
SAMN13390098	NA	whole snail (Biomphalaria glabrata, SAMN13390098)	28,430,420	68%	37%	191,142
SAMN13390099	NA	whole snail (Biomphalaria glabrata, SAMN13390099)	26,871,852	66%	37%	183,028
SAMN13390100	NA	whole snail (Biomphalaria glabrata, SAMN13390100)	28,029,312	72%	37%	203,746
SAMN13390101	NA	whole snail (Biomphalaria glabrata, SAMN13390101)	27,177,978	72%	39%	188,531
SAMN13390102	NA	whole snail (Biomphalaria glabrata, SAMN13390102)	28,621,248	72%	36%	201,764
SAMN13390103	NA	whole snail (Biomphalaria glabrata, SAMN13390103)	27,167,794	71%	36%	204,664
SAMN13390104	NA	whole snail (Biomphalaria glabrata, SAMN13390104)	28,924,104	66%	35%	188,860
SAMN13390105	NA	whole snail (Biomphalaria glabrata, SAMN13390105)	28,796,624	65%	36%	184,521
SAMN13390106	NA	whole snail (Biomphalaria glabrata, SAMN13390106)	28,513,764	64%	35%	175,231
SAMN13390107	NA	whole snail (Biomphalaria glabrata, SAMN13390107)	28,702,514	67%	36%	185,100
SAMN13390108	NA	whole snail (Biomphalaria glabrata, SAMN13390108)	27,429,008	67%	35%	193,591
SAMN13390109	NA	whole snail (Biomphalaria glabrata, SAMN13390109)	28,372,002	67%	37%	194,143
SAMN13390110	NA	whole snail (Biomphalaria glabrata, SAMN13390110)	28,551,888	66%	36%	182,471
SAMN13390111	NA	whole snail (Biomphalaria glabrata, SAMN13390111)	27,517,736	67%	37%	191,415
SAMN13390112	NA	whole snail (Biomphalaria glabrata, SAMN13390112)	28,927,364	55%	29%	168,444
SAMN13390113	NA	whole snail (Biomphalaria glabrata, SAMN13390113)	27,251,036	68%	34%	185,199
SAMN13390114	NA	whole snail (Biomphalaria glabrata, SAMN13390114)	28,468,822	66%	37%	190,422
SAMN13390115	NA	whole snail (Biomphalaria glabrata, SAMN13390115)	27,609,080	65%	39%	190,506
SAMN13390116	NA	whole snail (Biomphalaria glabrata, SAMN13390116)	27,991,994	59%	36%	183,521
SAMN13390117	NA	whole snail (Biomphalaria glabrata, SAMN13390117)	28,396,938	61%	34%	176,037
SAMN13390118	NA	whole snail (Biomphalaria glabrata, SAMN13390118)	28,010,358	59%	40%	180,590
SAMN13390119	NA	whole snail (Biomphalaria glabrata, SAMN13390119)	28,674,016	49%	39%	180,251
SAMN13390120	NA	whole snail (Biomphalaria glabrata, SAMN13390120)	28,045,372	69%	40%	188,952
SAMN13390121	NA	whole snail (Biomphalaria glabrata, SAMN13390121)	28,024,892	67%	36%	192,496
SAMN13390122	NA	whole snail (Biomphalaria glabrata, SAMN13390122)	29,351,260	69%	39%	190,296
SAMN13390123	NA	whole snail (Biomphalaria glabrata, SAMN13390123)	28,384,840	68%	40%	188,164
SAMN13390124	NA	whole snail (Biomphalaria glabrata, SAMN13390124)	27,641,806	70%	39%	201,200
SAMN13390125	NA	whole snail (Biomphalaria glabrata, SAMN13390125)	28,269,096	70%	39%	190,364
SAMN13390126	NA	whole snail (Biomphalaria glabrata, SAMN13390126)	28,444,090	69%	40%	193,390
SAMN13390127	NA	whole snail (Biomphalaria glabrata, SAMN13390127)	27,790,778	69%	40%	194,286
SAMN13390128	NA	whole snail (Biomphalaria glabrata, SAMN13390128)	29,144,852	66%	33%	181,894
SAMN13390129	NA	whole snail (Biomphalaria glabrata, SAMN13390129)	28,871,320	65%	36%	181,239
SAMN13390130	NA	whole snail (Biomphalaria glabrata, SAMN13390130)	30,036,762	67%	37%	188,270
SAMN13390131	NA	whole snail (Biomphalaria glabrata, SAMN13390131)	28,091,300	64%	38%	178,030
SAMN13390132	NA	whole snail (Biomphalaria glabrata, SAMN13390132)	28,749,504	67%	38%	188,908
SAMN13390133	NA	whole snail (Biomphalaria glabrata, SAMN13390133)	29,258,626	62%	33%	186,441
SAMN13390134	NA	whole snail (Biomphalaria glabrata, SAMN13390134)	29,081,290	65%	39%	175,723
SAMN13390135	NA	whole snail (Biomphalaria glabrata, SAMN13390135)	29,334,272	69%	39%	199,528
SAMN13390136	NA	whole snail (Biomphalaria glabrata, SAMN13390136)	29,463,540	69%	40%	183,988
SAMN13390137	NA	whole snail (Biomphalaria glabrata, SAMN13390137)	29,056,760	68%	36%	194,950
SAMN13390138	NA	whole snail (Biomphalaria glabrata, SAMN13390138)	28,060,858	68%	38%	183,269
SAMN13390139	NA	whole snail (Biomphalaria glabrata, SAMN13390139)	27,408,712	69%	38%	200,463
SAMN13390140	NA	whole snail (Biomphalaria glabrata, SAMN13390140)	28,052,716	70%	37%	198,169
SAMN13390141	NA	whole snail (Biomphalaria glabrata, SAMN13390141)	27,838,002	69%	37%	197,046
SAMN13390142	NA	whole snail (Biomphalaria glabrata, SAMN13390142)	26,562,372	71%	39%	198,055
SAMN13390143	NA	whole snail (Biomphalaria glabrata, SAMN13390143)	28,436,454	66%	36%	185,364
SAMN13390144	NA	whole snail (Biomphalaria glabrata, SAMN13390144)	26,737,250	67%	38%	195,562
SAMN13390145	NA	whole snail (Biomphalaria glabrata, SAMN13390145)	27,133,626	69%	37%	192,270
SAMN13390146	NA	whole snail (Biomphalaria glabrata, SAMN13390146)	28,582,114	66%	38%	178,897
SAMN13390147	NA	whole snail (Biomphalaria glabrata, SAMN13390147)	28,794,540	64%	36%	190,521
SAMN13880206	NA	hepatopancreas (Biomphalaria glabrata, SAMN13880206)	264,996,554	29%	30%	219,064
SAMN15221695	NA	Whole body (Biomphalaria glabrata, SAMN15221695)	66,521,625	66%	37%	186,467
SAMN15221696	NA	Whole body (Biomphalaria glabrata, SAMN15221696)	32,289,202	61%	36%	185,703
SAMN15221697	NA	Whole body (Biomphalaria glabrata, SAMN15221697)	57,931,862	63%	39%	196,344
SAMN17205588	NA	whole snail (Biomphalaria glabrata, F2 juvenile, SAMN17205588)	31,676,632	70%	30%	201,849
SAMN17205589	NA	whole snail (Biomphalaria glabrata, F2 juvenile, SAMN17205589)	52,121,718	74%	31%	213,474
SAMN19243680	NA	Neural tissue (Biomphalaria glabrata, .5-1 yr, hermaphrodite, SAMN19243680)	35,306,892	67%	36%	170,031
SAMN19243681	NA	Neural tissue (Biomphalaria glabrata, .5-1 yr, hermaphrodite, SAMN19243681)	36,781,006	62%	41%	171,231
SAMN19243682	NA	Neural tissue (Biomphalaria glabrata, .5-1 yr, hermaphrodite, SAMN19243682)	35,099,824	63%	38%	155,412
SAMN27734799	NA	Whole body (Biomphalaria glabrata, SAMN27734799)	61,592,014	60%	29%	179,742

Alignment statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent of aligned reads with introns
ERR9682492	ERX9231483	ERP137314	SAMEA10417723	44,373,416	73%	32%
SRR1509466	SRX648260	SRP000005	SAMN02905163	40,515,146	58%	19%
SRR1509467	SRX648262	SRP000005	SAMN02905165	55,201,758	55%	18%
SRR1509468	SRX648263	SRP000005	SAMN02905166	58,428,384	61%	28%
SRR1509459	SRX648264	SRP000005	SAMN02905167	26,624,570	58%	18%
SRR1509460	SRX648265	SRP000005	SAMN02905168	29,748,868	55%	18%
SRR1509461	SRX648266	SRP000005	SAMN02905169	36,784,004	50%	18%
SRR1509464	SRX648268	SRP000005	SAMN02905171	45,486,220	58%	18%
SRR1509469	SRX648269	SRP000005	SAMN02905172	59,063,042	55%	16%
SRR1509473	SRX648270	SRP000005	SAMN02905173	73,130,492	51%	18%
SRR1509470	SRX648271	SRP000005	SAMN02905174	86,491,800	49%	14%
SRR15304978	SRX11609500	SRP028164	SAMN02261944	4,000,000	142%	19%
SRR15304977	SRX11609501	SRP028164	SAMN02261944	4,000,000	142%	19%
SRR15304976	SRX11609502	SRP028164	SAMN02261944	4,000,000	136%	18%
SRR15304975	SRX11609503	SRP028164	SAMN02261944	4,000,000	141%	19%
SRR15304974	SRX11609504	SRP028164	SAMN02261944	4,000,000	142%	19%
SRR15304973	SRX11609505	SRP028164	SAMN02261944	4,000,000	136%	18%
SRR15304972	SRX11609506	SRP028164	SAMN02261944	4,000,000	141%	19%
SRR15304971	SRX11609507	SRP028164	SAMN02261944	4,000,000	143%	19%
SRR15304970	SRX11609508	SRP028164	SAMN02261944	4,000,000	138%	18%
SRR15304969	SRX11609509	SRP028164	SAMN02261944	4,000,000	142%	18%
SRR15304961	SRX11609517	SRP028164	SAMN02261944	529,068	143%	19%
SRR15304960	SRX11609518	SRP028164	SAMN02261944	4,000,000	145%	19%
SRR15304959	SRX11609519	SRP028164	SAMN02261944	4,000,000	147%	19%
SRR15304958	SRX11609520	SRP028164	SAMN02261944	4,000,000	139%	19%
SRR15304954	SRX11609524	SRP028164	SAMN02261944	1,427,771	140%	18%
SRR15304947	SRX11609531	SRP028164	SAMN02261944	745,711	142%	13%
SRR15304946	SRX11609532	SRP028164	SAMN02261944	4,000,000	144%	18%
SRR15304945	SRX11609533	SRP028164	SAMN02261944	4,000,000	147%	13%
SRR15304944	SRX11609534	SRP028164	SAMN02261944	4,000,000	146%	13%
SRR15304943	SRX11609535	SRP028164	SAMN02261944	4,000,000	147%	13%
SRR15304942	SRX11609536	SRP028164	SAMN02261944	4,000,000	145%	13%
SRR15304941	SRX11609537	SRP028164	SAMN02261944	4,000,000	147%	13%
SRR15304940	SRX11609538	SRP028164	SAMN02261944	4,000,000	146%	13%
SRR15304939	SRX11609539	SRP028164	SAMN02261944	4,000,000	148%	13%
SRR15304938	SRX11609540	SRP028164	SAMN02261944	4,000,000	144%	13%
SRR15304937	SRX11609541	SRP028164	SAMN02261944	4,000,000	148%	13%
SRR15304936	SRX11609542	SRP028164	SAMN02261944	4,000,000	144%	13%
SRR15304935	SRX11609543	SRP028164	SAMN02261944	4,000,000	143%	18%
SRR15304934	SRX11609544	SRP028164	SAMN02261944	4,000,000	149%	13%
SRR15304933	SRX11609545	SRP028164	SAMN02261944	2,823,293	117%	14%
SRR15304932	SRX11609546	SRP028164	SAMN02261944	4,000,000	121%	14%
SRR15304931	SRX11609547	SRP028164	SAMN02261944	4,000,000	122%	14%
SRR15304930	SRX11609548	SRP028164	SAMN02261944	4,000,000	117%	14%
SRR15304929	SRX11609549	SRP028164	SAMN02261944	4,000,000	120%	14%
SRR15304928	SRX11609550	SRP028164	SAMN02261944	4,000,000	120%	14%
SRR15304927	SRX11609551	SRP028164	SAMN02261944	4,000,000	143%	18%
SRR15304926	SRX11609552	SRP028164	SAMN02261944	4,000,000	145%	18%
SRR15304925	SRX11609553	SRP028164	SAMN02261944	4,000,000	107%	17%
SRR15304924	SRX11609554	SRP028164	SAMN02261944	4,000,000	109%	17%
SRR15304923	SRX11609555	SRP028164	SAMN02261944	4,000,000	110%	17%
SRR15304922	SRX11609556	SRP028164	SAMN02261944	4,000,000	144%	18%
SRR15304921	SRX11609557	SRP028164	SAMN02261944	4,000,000	107%	17%
SRR15304920	SRX11609558	SRP028164	SAMN02261944	4,000,000	110%	17%
SRR15304919	SRX11609559	SRP028164	SAMN02261944	4,000,000	109%	17%
SRR15304918	SRX11609560	SRP028164	SAMN02261944	4,000,000	108%	17%
SRR15304917	SRX11609561	SRP028164	SAMN02261944	4,000,000	110%	17%
SRR15304916	SRX11609562	SRP028164	SAMN02261944	4,000,000	107%	17%
SRR15304915	SRX11609563	SRP028164	SAMN02261944	4,000,000	110%	17%
SRR15304914	SRX11609564	SRP028164	SAMN02261944	4,000,000	152%	15%
SRR15304913	SRX11609565	SRP028164	SAMN02261944	4,000,000	153%	15%
SRR15304912	SRX11609566	SRP028164	SAMN02261944	4,000,000	153%	15%
SRR15304911	SRX11609567	SRP028164	SAMN02261944	4,000,000	150%	15%
SRR15304910	SRX11609568	SRP028164	SAMN02261944	4,000,000	152%	15%
SRR15304909	SRX11609569	SRP028164	SAMN02261944	4,000,000	152%	15%
SRR15304908	SRX11609570	SRP028164	SAMN02261944	4,000,000	154%	15%
SRR15304907	SRX11609571	SRP028164	SAMN02261944	4,000,000	152%	15%
SRR15304906	SRX11609572	SRP028164	SAMN02261944	4,000,000	154%	15%
SRR15304900	SRX11609578	SRP028164	SAMN02261944	4,000,000	119%	14%
SRR15304899	SRX11609579	SRP028164	SAMN02261944	4,000,000	122%	14%
SRR15304898	SRX11609580	SRP028164	SAMN02261944	4,000,000	118%	14%
SRR15304897	SRX11609581	SRP028164	SAMN02261944	4,000,000	119%	14%
SRR15304896	SRX11609582	SRP028164	SAMN02261944	4,000,000	121%	14%
SRR15304895	SRX11609583	SRP028164	SAMN02261944	4,000,000	118%	14%
SRR15304894	SRX11609584	SRP028164	SAMN02261944	4,000,000	121%	14%
SRR15304893	SRX11609585	SRP028164	SAMN02261944	4,000,000	121%	14%
SRR15304892	SRX11609586	SRP028164	SAMN02261944	2,933,457	114%	17%
SRR15304891	SRX11609587	SRP028164	SAMN02261944	4,000,000	116%	17%
SRR15304890	SRX11609588	SRP028164	SAMN02261944	4,000,000	140%	18%
SRR15304889	SRX11609589	SRP028164	SAMN02261944	4,000,000	114%	17%
SRR15304888	SRX11609590	SRP028164	SAMN02261944	4,000,000	116%	17%
SRR15304887	SRX11609591	SRP028164	SAMN02261944	4,000,000	114%	17%
SRR15304886	SRX11609592	SRP028164	SAMN02261944	4,000,000	113%	17%
SRR15304885	SRX11609593	SRP028164	SAMN02261944	4,000,000	114%	17%
SRR15304884	SRX11609594	SRP028164	SAMN02261944	4,000,000	113%	17%
SRR15304883	SRX11609595	SRP028164	SAMN02261944	4,000,000	117%	17%
SRR15304882	SRX11609596	SRP028164	SAMN02261944	4,000,000	116%	17%
SRR15304881	SRX11609597	SRP028164	SAMN02261944	4,000,000	117%	17%
SRR15304880	SRX11609598	SRP028164	SAMN02261944	2,681,104	107%	17%
SRR15304879	SRX11609599	SRP028164	SAMN02261944	4,000,000	109%	17%
SRR15304878	SRX11609600	SRP028164	SAMN02261944	4,000,000	110%	17%
SRR15304877	SRX11609601	SRP028164	SAMN02261944	4,000,000	107%	17%
SRR15304876	SRX11609602	SRP028164	SAMN02261944	4,000,000	109%	17%
SRR15304874	SRX11609604	SRP028164	SAMN02261944	4,000,000	140%	19%
SRR15304870	SRX11609608	SRP028164	SAMN02261944	4,000,000	142%	19%
SRR15304859	SRX11609619	SRP028164	SAMN02261944	4,000,000	139%	19%
SRR15304851	SRX11609627	SRP028164	SAMN02261944	4,000,000	145%	19%
SRR15304850	SRX11609628	SRP028164	SAMN02261944	4,000,000	146%	19%
SRR15304849	SRX11609629	SRP028164	SAMN02261944	4,000,000	146%	19%
SRR15304848	SRX11609630	SRP028164	SAMN02261944	4,000,000	143%	19%
SRR15304847	SRX11609631	SRP028164	SAMN02261944	4,000,000	146%	19%
SRR15304846	SRX11609632	SRP028164	SAMN02261944	4,000,000	147%	19%
SRR15304845	SRX11609633	SRP028164	SAMN02261944	4,000,000	144%	19%
SRR15304844	SRX11609634	SRP028164	SAMN02261944	4,000,000	146%	19%
SRR15304843	SRX11609635	SRP028164	SAMN02261944	4,000,000	147%	19%
SRR15304842	SRX11609636	SRP028164	SAMN02261944	4,000,000	143%	19%
SRR15304841	SRX11609637	SRP028164	SAMN02261944	4,000,000	143%	19%
SRR15304840	SRX11609638	SRP028164	SAMN02261944	4,000,000	146%	19%
SRR15304839	SRX11609639	SRP028164	SAMN02261944	4,000,000	146%	19%
SRR15304838	SRX11609640	SRP028164	SAMN02261944	4,000,000	142%	19%
SRR15304837	SRX11609641	SRP028164	SAMN02261944	4,000,000	146%	19%
SRR15304836	SRX11609642	SRP028164	SAMN02261944	4,000,000	147%	19%
SRR15304830	SRX11609648	SRP028164	SAMN02261944	4,000,000	143%	19%
SRR15304819	SRX11609659	SRP028164	SAMN02261944	4,000,000	118%	14%
SRR15304818	SRX11609660	SRP028164	SAMN02261944	4,000,000	121%	14%
SRR15304817	SRX11609661	SRP028164	SAMN02261944	4,000,000	119%	14%
SRR15304816	SRX11609662	SRP028164	SAMN02261944	4,000,000	139%	18%
SRR15304815	SRX11609663	SRP028164	SAMN02261944	4,000,000	112%	17%
SRR15304814	SRX11609664	SRP028164	SAMN02261944	4,000,000	116%	17%
SRR15304813	SRX11609665	SRP028164	SAMN02261944	4,000,000	113%	17%
SRR15304812	SRX11609666	SRP028164	SAMN02261944	4,000,000	139%	18%
SRR15304811	SRX11609667	SRP028164	SAMN02261944	4,000,000	110%	17%
SRR15304810	SRX11609668	SRP028164	SAMN02261944	1,800,700	152%	15%
SRR15304809	SRX11609669	SRP028164	SAMN02261944	4,000,000	153%	15%
SRR15304808	SRX11609670	SRP028164	SAMN02261944	4,000,000	142%	18%
SRR15304807	SRX11609671	SRP028164	SAMN02261944	4,000,000	151%	15%
SRR15304806	SRX11609672	SRP028164	SAMN02261944	4,000,000	153%	15%
SRR15304805	SRX11609673	SRP028164	SAMN02261944	4,000,000	151%	15%
SRR15304804	SRX11609674	SRP028164	SAMN02261944	4,000,000	144%	18%
SRR15304800	SRX11609678	SRP028164	SAMN02261944	4,000,000	141%	18%
SRR15304793	SRX11609685	SRP028164	SAMN02261944	478,770	133%	18%
SRR15304792	SRX11609686	SRP028164	SAMN02261944	4,000,000	140%	19%
SRR15304791	SRX11609687	SRP028164	SAMN02261944	4,000,000	142%	19%
SRR15304790	SRX11609688	SRP028164	SAMN02261944	4,000,000	143%	19%
SRR15304789	SRX11609689	SRP028164	SAMN02261944	4,000,000	144%	18%
SRR15304788	SRX11609690	SRP028164	SAMN02261944	4,000,000	137%	18%
SRR942795	SRX327185	SRP028164	SAMN02261944	172,317,158	79%	18%
SRR1617465	SRX736535	SRP049070	SAMN03112928	20,246,449	74%	31%
SRR1617490	SRX736539	SRP049070	SAMN03112928	15,723,178	77%	33%
SRR1617492	SRX736540	SRP049070	SAMN03112928	17,056,226	76%	32%
SRR1617494	SRX736542	SRP049070	SAMN03112928	15,160,382	75%	31%
SRR1617496	SRX736543	SRP049070	SAMN03112928	16,434,052	76%	33%
SRR1617498	SRX736544	SRP049070	SAMN03112928	16,032,818	78%	34%
SRR1617499	SRX736545	SRP049070	SAMN03112928	16,479,998	77%	30%
SRR1617501	SRX736546	SRP049070	SAMN03112928	17,443,088	76%	33%
SRR1617503	SRX736547	SRP049070	SAMN03112928	16,766,444	72%	31%
SRR1617504	SRX736548	SRP049070	SAMN03112928	17,784,130	73%	33%
SRR1617506	SRX736549	SRP049070	SAMN03112928	16,223,760	74%	32%
SRR1617508	SRX736550	SRP049070	SAMN03112928	18,637,606	72%	33%
SRR1617536	SRX736552	SRP049070	SAMN03112928	35,197,960	76%	31%
SRR1617537	SRX736554	SRP049070	SAMN03112928	30,372,764	74%	34%
SRR1617538	SRX736556	SRP049070	SAMN03112928	28,811,490	74%	32%
SRR1617539	SRX736567	SRP049070	SAMN03112928	30,439,124	70%	29%
SRR1617540	SRX736570	SRP049070	SAMN03112928	34,590,406	75%	34%
SRR1617541	SRX736571	SRP049070	SAMN03112928	30,828,648	77%	32%
SRR1617586	SRX736587	SRP049070	SAMN03112929	15,124,050	74%	32%
SRR1617587	SRX736588	SRP049070	SAMN03112929	18,696,662	74%	26%
SRR1617589	SRX736589	SRP049070	SAMN03112929	16,480,683	76%	32%
SRR1617590	SRX736590	SRP049070	SAMN03112929	16,781,163	76%	32%
SRR1617592	SRX736591	SRP049070	SAMN03112929	16,012,269	78%	31%
SRR1617593	SRX736592	SRP049070	SAMN03112929	17,053,245	77%	31%
SRR1617595	SRX736593	SRP049070	SAMN03112929	26,250,130	76%	33%
SRR1617596	SRX736594	SRP049070	SAMN03112929	29,155,420	74%	30%
SRR1617599	SRX736595	SRP049070	SAMN03112929	32,336,530	75%	32%
SRR1617616	SRX736572	SRP049070	SAMN03112930	16,152,939	77%	33%
SRR1617618	SRX736573	SRP049070	SAMN03112930	16,747,609	76%	32%
SRR1617619	SRX736574	SRP049070	SAMN03112930	15,678,473	75%	30%
SRR1617624	SRX736575	SRP049070	SAMN03112930	16,997,034	74%	32%
SRR1617625	SRX736576	SRP049070	SAMN03112930	16,304,586	77%	30%
SRR1617626	SRX736577	SRP049070	SAMN03112930	14,701,175	74%	30%
SRR1617627	SRX736578	SRP049070	SAMN03112930	30,760,146	74%	32%
SRR1617629	SRX736580	SRP049070	SAMN03112930	32,334,472	72%	35%
SRR1617630	SRX736581	SRP049070	SAMN03112930	32,636,486	75%	36%
SRR1708818	SRX807920	SRP049070	SAMN03264585	15,105,626	67%	25%
SRR1708819	SRX807922	SRP049070	SAMN03264586	17,649,218	55%	25%
SRR1708821	SRX807924	SRP049070	SAMN03264587	15,917,381	63%	29%
SRR1708822	SRX807925	SRP049070	SAMN03264588	14,660,058	49%	27%
SRR1708825	SRX807927	SRP049070	SAMN03264589	16,383,793	62%	28%
SRR1708827	SRX807928	SRP049070	SAMN03264590	15,295,285	59%	26%
SRR1708828	SRX807929	SRP049070	SAMN03264591	15,064,426	52%	26%
SRR1708830	SRX807930	SRP049070	SAMN03264592	14,706,170	55%	24%
SRR1708832	SRX807931	SRP049070	SAMN03264593	14,045,984	64%	26%
SRR1708833	SRX807932	SRP049070	SAMN03264594	14,719,725	54%	25%
SRR1708835	SRX807933	SRP049070	SAMN03264595	14,900,609	59%	27%
SRR1708836	SRX807934	SRP049070	SAMN03264596	18,310,412	52%	26%
SRR2004377	SRX1015104	SRP057675	SAMN03568798	51,206,204	77%	19%
SRR3039143	SRX1494899	SRP067658	SAMN04359736	105,296,148	77%	30%
SRR10558621	SRX7240300	SRP233900	SAMN13390092	29,758,242	68%	38%
SRR10558620	SRX7240301	SRP233900	SAMN13390093	28,884,538	70%	38%
SRR10558609	SRX7240312	SRP233900	SAMN13390094	28,940,484	69%	38%
SRR10558598	SRX7240323	SRP233900	SAMN13390095	28,933,058	70%	38%
SRR10558587	SRX7240334	SRP233900	SAMN13390096	27,601,658	69%	38%
SRR10558576	SRX7240345	SRP233900	SAMN13390097	27,071,700	67%	37%
SRR10558569	SRX7240352	SRP233900	SAMN13390098	28,430,420	68%	37%
SRR10558568	SRX7240353	SRP233900	SAMN13390099	26,871,852	66%	37%
SRR10558567	SRX7240354	SRP233900	SAMN13390100	28,029,312	72%	37%
SRR10558566	SRX7240355	SRP233900	SAMN13390101	27,177,978	72%	39%
SRR10558619	SRX7240302	SRP233900	SAMN13390102	28,621,248	72%	36%
SRR10558618	SRX7240303	SRP233900	SAMN13390103	27,167,794	71%	36%
SRR10558617	SRX7240304	SRP233900	SAMN13390104	28,924,104	66%	35%
SRR10558616	SRX7240305	SRP233900	SAMN13390105	28,796,624	65%	36%
SRR10558615	SRX7240306	SRP233900	SAMN13390106	28,513,764	64%	35%
SRR10558614	SRX7240307	SRP233900	SAMN13390107	28,702,514	67%	36%
SRR10558613	SRX7240308	SRP233900	SAMN13390108	27,429,008	67%	35%
SRR10558612	SRX7240309	SRP233900	SAMN13390109	28,372,002	67%	37%
SRR10558611	SRX7240310	SRP233900	SAMN13390110	28,551,888	66%	36%
SRR10558610	SRX7240311	SRP233900	SAMN13390111	27,517,736	67%	37%
SRR10558608	SRX7240313	SRP233900	SAMN13390112	28,927,364	55%	29%
SRR10558607	SRX7240314	SRP233900	SAMN13390113	27,251,036	68%	34%
SRR10558606	SRX7240315	SRP233900	SAMN13390114	28,468,822	66%	37%
SRR10558605	SRX7240316	SRP233900	SAMN13390115	27,609,080	65%	39%
SRR10558604	SRX7240317	SRP233900	SAMN13390116	27,991,994	59%	36%
SRR10558603	SRX7240318	SRP233900	SAMN13390117	28,396,938	61%	34%
SRR10558602	SRX7240319	SRP233900	SAMN13390118	28,010,358	59%	40%
SRR10558601	SRX7240320	SRP233900	SAMN13390119	28,674,016	49%	39%
SRR10558600	SRX7240321	SRP233900	SAMN13390120	28,045,372	69%	40%
SRR10558599	SRX7240322	SRP233900	SAMN13390121	28,024,892	67%	36%
SRR10558597	SRX7240324	SRP233900	SAMN13390122	29,351,260	69%	39%
SRR10558596	SRX7240325	SRP233900	SAMN13390123	28,384,840	68%	40%
SRR10558595	SRX7240326	SRP233900	SAMN13390124	27,641,806	70%	39%
SRR10558594	SRX7240327	SRP233900	SAMN13390125	28,269,096	70%	39%
SRR10558593	SRX7240328	SRP233900	SAMN13390126	28,444,090	69%	40%
SRR10558592	SRX7240329	SRP233900	SAMN13390127	27,790,778	69%	40%
SRR10558591	SRX7240330	SRP233900	SAMN13390128	29,144,852	66%	33%
SRR10558590	SRX7240331	SRP233900	SAMN13390129	28,871,320	65%	36%
SRR10558589	SRX7240332	SRP233900	SAMN13390130	30,036,762	67%	37%
SRR10558588	SRX7240333	SRP233900	SAMN13390131	28,091,300	64%	38%
SRR10558586	SRX7240335	SRP233900	SAMN13390132	28,749,504	67%	38%
SRR10558585	SRX7240336	SRP233900	SAMN13390133	29,258,626	62%	33%
SRR10558584	SRX7240337	SRP233900	SAMN13390134	29,081,290	65%	39%
SRR10558583	SRX7240338	SRP233900	SAMN13390135	29,334,272	69%	39%
SRR10558582	SRX7240339	SRP233900	SAMN13390136	29,463,540	69%	40%
SRR10558581	SRX7240340	SRP233900	SAMN13390137	29,056,760	68%	36%
SRR10558580	SRX7240341	SRP233900	SAMN13390138	28,060,858	68%	38%
SRR10558579	SRX7240342	SRP233900	SAMN13390139	27,408,712	69%	38%
SRR10558578	SRX7240343	SRP233900	SAMN13390140	28,052,716	70%	37%
SRR10558577	SRX7240344	SRP233900	SAMN13390141	27,838,002	69%	37%
SRR10558575	SRX7240346	SRP233900	SAMN13390142	26,562,372	71%	39%
SRR10558574	SRX7240347	SRP233900	SAMN13390143	28,436,454	66%	36%
SRR10558573	SRX7240348	SRP233900	SAMN13390144	26,737,250	67%	38%
SRR10558572	SRX7240349	SRP233900	SAMN13390145	27,133,626	69%	37%
SRR10558571	SRX7240350	SRP233900	SAMN13390146	28,582,114	66%	38%
SRR10558570	SRX7240351	SRP233900	SAMN13390147	28,794,540	64%	36%
SRR11248244	SRX7858653	SRP251773	SAMN13880206	264,996,554	29%	30%
SRR12001323	SRX8534559	SRP267019	SAMN15221695	66,521,625	66%	37%
SRR12001322	SRX8534560	SRP267019	SAMN15221696	32,289,202	61%	36%
SRR12001321	SRX8534561	SRP267019	SAMN15221697	57,931,862	63%	39%
SRR13356422	SRX9780744	SRP300347	SAMN17205588	31,676,632	70%	30%
SRR13356421	SRX9780745	SRP300347	SAMN17205589	52,121,718	74%	31%
SRR14581508	SRX10929104	SRP320397	SAMN19243680	35,306,892	67%	36%
SRR14581507	SRX10929105	SRP320397	SAMN19243681	36,781,006	62%	41%
SRR14581506	SRX10929106	SRP320397	SAMN19243682	35,099,824	63%	38%
SRR18863902	SRX14961650	SRP371616	SAMN27734799	61,592,014	60%	29%

Protein alignments

The alignments of the following proteins with ProSplign were used for gene prediction:

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Octopus sinensis high-quality model RefSeq (XP_)	13,472	9,228 (68.50%)	9,228 (68.50%)	67.59%	62.04%
Crassostrea gigas high-quality model RefSeq (XP_)	28,029	14,637 (52.22%)	14,637 (52.22%)	60.19%	46.61%
Mollusca GenBank	19,508	8,719 (44.69%)	8,719 (44.69%)	71.61%	74.75%
Mollusca known RefSeq (NP_)	432	383 (88.66%)	383 (88.66%)	74.96%	79.16%
Aplysia californica high-quality model RefSeq (XP_)	9,849	8,281 (84.08%)	8,281 (84.08%)	67.44%	69.17%
Same-species GenBank	434	427 (98.39%)	427 (98.39%)	71.90%	85.90%
Same-species known RefSeq (NP_)	52	52 (100.00%)	52 (100.00%)	78.56%	86.56%
Pecten maximus high-quality model RefSeq (XP_)	18,685	11,825 (63.29%)	11,825 (63.29%)	62.89%	54.69%
Homo sapiens known RefSeq (NP_)	66,931	40,067 (59.86%)	40,067 (59.86%)	61.66%	50.18%
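
The percentages in the table above follow directly from the counts: the number of sequences aligned by ProSplign divided by the number retrieved from Entrez. The short sketch below recomputes them as a consistency check. The counts and reported percentages are copied from the table; the names ROWS and percent_aligned are illustrative and are not part of the annotation pipeline.

```python
# Rows copied from the protein alignment table:
# (source, retrieved from Entrez, aligned by ProSplign, reported % aligned)
ROWS = [
    ("Octopus sinensis high-quality model RefSeq (XP_)", 13472, 9228, 68.50),
    ("Crassostrea gigas high-quality model RefSeq (XP_)", 28029, 14637, 52.22),
    ("Mollusca GenBank", 19508, 8719, 44.69),
    ("Mollusca known RefSeq (NP_)", 432, 383, 88.66),
    ("Aplysia californica high-quality model RefSeq (XP_)", 9849, 8281, 84.08),
    ("Same-species GenBank", 434, 427, 98.39),
    ("Same-species known RefSeq (NP_)", 52, 52, 100.00),
    ("Pecten maximus high-quality model RefSeq (XP_)", 18685, 11825, 63.29),
    ("Homo sapiens known RefSeq (NP_)", 66931, 40067, 59.86),
]


def percent_aligned(retrieved, aligned):
    """Return aligned/retrieved as a percentage (hypothetical helper)."""
    return 100.0 * aligned / retrieved


# Collect any row whose recomputed value, formatted to two decimals,
# differs from the percentage reported in the table.
mismatches = [
    source
    for source, retrieved, aligned, reported in ROWS
    if f"{percent_aligned(retrieved, aligned):.2f}" != f"{reported:.2f}"
]

for source, retrieved, aligned, _ in ROWS:
    print(f"{source}\t{percent_aligned(retrieved, aligned):.2f}%")
print("rows that disagree with the table:", mismatches)
```

In this release the aligned and passed-to-Gnomon counts are the same for every source, so the same calculation applies to both columns.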

Assembly-assembly alignments of current to previous assembly

When the assembly changes between two rounds of annotation, genes in the current and the previous annotation are mapped to each other using the genomic alignments of the current assembly to the previous assembly so that gene identifiers can be preserved. The success of the remapping depends largely on how well the two assembly versions align to each other.

The table below gives the percent coverage of each assembly by the other and the average percent identity of the alignments. The 'First pass' alignments are reciprocal best hits, while the 'Total' alignments also include the 'Second pass' (non-reciprocal best) alignments. For more information about the assembly-assembly alignment process, please visit the NCBI Genome Remapping Service page.

	First Pass	Total
xgBioGlab47.1 (Current) Coverage	70.38%	71.55%
ASM45736v1 (Previous) Coverage	66.58%	70.21%
Percent Identity	85.32%	85.71%
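
To illustrate what "reciprocal best hits" means for the 'First pass' alignments, here is a minimal sketch on toy data. It is not the NCBI remapping implementation: the region names, the scores, and the function reciprocal_best_hits are hypothetical, and real assembly alignments are scored and filtered in ways this example does not model.

```python
def reciprocal_best_hits(alignments):
    """Keep pairs that are each other's best-scoring partner.

    alignments: iterable of (current_region, previous_region, score).
    Returns a sorted list of (current_region, previous_region, score).
    """
    best_for_current = {}   # current region  -> (previous region, score)
    best_for_previous = {}  # previous region -> (current region, score)
    for cur, prev, score in alignments:
        if cur not in best_for_current or score > best_for_current[cur][1]:
            best_for_current[cur] = (prev, score)
        if prev not in best_for_previous or score > best_for_previous[prev][1]:
            best_for_previous[prev] = (cur, score)

    first_pass = []
    for cur, (prev, score) in best_for_current.items():
        # Reciprocal: the best partner of `prev` must be `cur` as well.
        if best_for_previous[prev][0] == cur:
            first_pass.append((cur, prev, score))
    return sorted(first_pass)


# Toy data (hypothetical): cur_B's best hit is prev_1, but prev_1's best
# hit is cur_A, so only the cur_A/prev_1 pair is reciprocal.
toy = [
    ("cur_A", "prev_1", 95),
    ("cur_A", "prev_2", 80),
    ("cur_B", "prev_1", 90),
    ("cur_B", "prev_3", 70),
]
first_pass = reciprocal_best_hits(toy)
print(first_pass)
```

In this toy case the cur_B to prev_1 alignment is a best hit in one direction only, which is the kind of alignment the report counts under 'Second pass'.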

Comparison of the current and previous annotations

The annotations produced for this release were compared to the annotations in the previous release for each assembly annotated in both releases. Scores for current and previous gene and transcript features were calculated based on overlap in exon sequence and matches in exon boundaries. Pairs of current and previous features were categorized based on these scores, whether they are reciprocal best matches, and changes in attributes (gene biotype, completeness, etc.). If the assembly was updated between the two releases, alignments between the current and the previous assembly were used to match the current and previous gene and transcript features in mapped regions.
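
The report does not give the scoring formulas, so the following is only a sketch of how scores "based on overlap in exon sequence and matches in exon boundaries" could be computed for one pair of features. Both measures (a Jaccard-style ratio over exon bases and over exon boundary coordinates), the function names, and the coordinates are assumptions made for illustration.

```python
def exon_bases(exons):
    """Expand inclusive (start, end) exon intervals into a set of positions."""
    return {pos for start, end in exons for pos in range(start, end + 1)}


def overlap_score(current, previous):
    """Shared exon bases divided by all exon bases (assumed measure)."""
    a, b = exon_bases(current), exon_bases(previous)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def boundary_score(current, previous):
    """Shared exon boundary coordinates divided by all boundaries (assumed)."""
    a = {coord for exon in current for coord in exon}
    b = {coord for exon in previous for coord in exon}
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hypothetical two-exon feature whose last exon is 50 bases longer
# in the previous annotation.
current_exons = [(100, 199), (300, 399)]
previous_exons = [(100, 199), (300, 449)]

print(overlap_score(current_exons, previous_exons))
print(boundary_score(current_exons, previous_exons))
```

A pair with both scores equal to 1.0 would be identical under these assumed measures; lower values would push the pair toward the changed categories.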

The table below summarizes the changes in the gene set for each assembly as a percent of the number of genes in the current annotation release, and provides links to the details of the comparison in tabular format and in a Genome Workbench project.

	xgBioGlab47.1 (Current) to ASM45736v1 (Previous)
Identical	1%
Minor changes	37%
Major changes	33%
New	26%
Deprecated	43%
Other	3%
Download the report	tabular, Genome Workbench
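
All percentages in the table use the number of genes in the current annotation as the denominator. The sketch below checks the arithmetic under one assumption that the report does not state explicitly: that "Deprecated" describes genes from the previous annotation, so it is excluded when summing the categories that describe current genes. The variable names are illustrative.

```python
# Percentages copied from the comparison table.
CHANGES = {
    "Identical": 1,
    "Minor changes": 37,
    "Major changes": 33,
    "New": 26,
    "Deprecated": 43,
    "Other": 3,
}

# Assumption: "Deprecated" refers to previous-release genes and is
# therefore left out of the sum over current-release genes.
current_side_total = sum(
    pct for category, pct in CHANGES.items() if category != "Deprecated"
)
print("sum of categories excluding Deprecated:", current_side_total)
```

Under that assumption the remaining categories account for every gene in the current annotation, which is consistent with the low assembly-to-assembly coverage and identity reported above.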

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
BUSCO: Manni M, Berkeley MR, Seppey M, Simão FA, Zdobnov EM. Molecular Biology and Evolution 2021, 38(10):4647-4654
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 22(2):134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20
STAR: Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. Bioinformatics 2013, 29(1):15-21
Minimap2: Li H. Bioinformatics 2018, 34(18):3094-3100
