NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE2967 Query DataSets for GSE2967
Status Public on Jul 20, 2005
Title LongSAGE analysis significantly improves genome annotation
Organism Mus musculus
Experiment type Expression profiling by SAGE
Summary Owing to its increased tag length, LongSAGE tags are expected to be more reliable in direct assignment to genome sequences. Therefore, we evaluated the use of LongSAGE data in genome annotation by using our LongSAGE dataset of 202 015 tags (consisting of 41 718 unique tags), experimentally generated from mouse embryonic tail libraries. RESULTS: A fraction of LongSAGE tags could not be unambiguously assigned to its gene, due to the presence of widely conserved sequences downstream of particular CATG anchor sites. The presence of alternative forms of transcripts was confirmed in 45% of all detected genes. Surprisingly, a large fraction of LongSAGE tags with hits to the genome (66%) could not be assigned to any gene annotated in EnsEMBL. Among such cases, 2098 LongSAGE tags fell into a region containing a putative gene predicted by GenScan, providing experimental evidence for the presence of real genes, while 9112 genes were found out to be left out or wrongly annotated by the EnsEMBL pipeline. CONCLUSIONS: LongSAGE transcriptome data can significantly improve the genome annotation by identifying novel genes and alternative transcripts, even in the case of thus far best-characterized organisms like the mouse.
Keywords: other
 
Overall design Owing to its increased tag length, LongSAGE tags are expected to be more reliable in direct assignment to genome sequences. Therefore, we evaluated the use of LongSAGE data in genome annotation by using our LongSAGE dataset of 202 015 tags (consisting of 41 718 unique tags), experimentally generated from mouse embryonic tail libraries.
 
Citation(s) 15591356, 15585522
Submission date Jul 20, 2005
Last update date Mar 16, 2012
Contact name Matthias Wahl
E-mail(s) mbw@stowers-institute.org
Phone ++49 89 3187 4117
Fax ++49 89 3187 3099
Organization name GSF - National Research Center for Environment and Health
Department Institute of Developmental Genetics
Street address Ingolstaedter Landstrasse 1
City Neuherberg
ZIP/Postal code D-85764
Country Germany
 
Platforms (1)
GPL1349 SAGE:17:NlaIII:Mus musculus
Samples (1)
GSM26978 LongSAGE of mouse tails
Relations
BioProject PRJNA91953

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary data files not provided

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap