NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE19289 Query DataSets for GSE19289
Status Public on May 18, 2010
Title Most “dark matter” transcripts are associated with known genes
Organisms Homo sapiens; Mus musculus
Experiment type Expression profiling by genome tiling array
Summary A series of reports over the last few years have indicated that a much larger portion of the mammalian genome is transcribed than can be accounted for by currently annotated genes, but the quantity and nature of these additional transcripts remains unclear. Here, we have used data from single- and paired-end RNA-Seq and tiling arrays to assess the quantity and composition of transcripts in PolyA+ RNA from human and mouse tissues. Relative to tiling arrays, RNA-Seq identifies many fewer transcribed regions (‘‘seqfrags’’) outside known exons and ncRNAs. Most nonexonic seqfrags are in introns, raising the possibility that they are fragments of pre-mRNAs. The chromosomal locations of the majority of intergenic seqfrags in RNA-Seq data are near known genes, consistent with alternative cleavage and polyadenylation site usage, promoter- and terminator-associated transcripts, or new alternative exons; indeed, reads that bridge splice sites identified 4,544 new exons, affecting 3,554 genes. Most of the remaining seqfrags correspond to either single reads that display characteristics of random sampling from a low-level background or several thousand small transcripts (median length = 111 bp) present at higher levels, which also tend to display sequence conservation and originate from regions with open chromatin. We conclude that, while there are bona fide new intergenic transcripts, their number and abundance is generally low in comparison to known exons, and the genome is not as pervasively transcribed as previously reported.
 
Overall design Genome-wide expression profiling of human and mouse total and polyA+ RNA from four different tissue sources (brain, heart, liver, testes) was performed using Affymetrix tiling microarrays (Human & Mouse tiling 2.0R array set; 7 arrays per genome).

BAR files: HBP - Human Brain PolyA+ RNA; HBT - Human Brain Total RNA; HLP - Human Liver PolyA+ RNA; HLT - Human Liver Total RNA; HHP - Human Heart PolyA+ RNA; HHT - Human Heart Total RNA; HTP - Human Testes PolyA+ RNA; HTT - Human Testes Total RNA; MBP - Mouse Brain PolyA+ RNA; MBT - Mouse Brain Total RNA; MLP - Mouse Liver PolyA+ RNA; MLT - Mouse Liver Total RNA; MHP - Mouse Heart PolyA+ RNA; MHT - Mouse Heart Total RNA; MTP - Mouse Testes PolyA+ RNA; MTT - Mouse Testes Total RNA
Web link http://hugheslab.ccbr.utoronto.ca/supplementary-data/hm_transcriptome/
 
Contributor(s) vanBakel H, Nislow C, Blencowe BJ, Hughes TR
Citation(s) 20502517
Submission date Dec 02, 2009
Last update date Mar 21, 2017
Contact name Harm van Bakel
E-mail(s) harm.vanbakel@mssm.edu
Organization name Mount Sinai School of Medicine
Department Genetics and Genomic Sciences
Lab Bakel Lab
Street address One Gustave L. Levy Place, Box 1498
City New York
State/province New York
ZIP/Postal code 10029
Country USA
 
Platforms (14)
GPL4910 [Hs35b_P01R] Affymetrix Human Tiling 2.0R Set, Array 1
GPL4911 [Hs35b_P02R] Affymetrix Human Tiling 2.0R Set, Array 2
GPL4912 [Hs35b_P03R] Affymetrix Human Tiling 2.0R Set, Array 3
Samples (109)
GSM478708 Human Brain PolyA+ RNA, 1
GSM478709 Human Brain PolyA+ RNA, 2
GSM478710 Human Brain PolyA+ RNA, 3
Relations
BioProject PRJNA120373

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE19289_RAW.tar 9.2 Gb (http)(custom) TAR (of BAR, CEL)
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap