GSM1357913: TK_wt_exp_minus_TEX; Thermococcus kodakarensis KOD1; RNA-... - SRA

Links from BioSample

SRX501746: GSM1357913: TK_wt_exp_minus_TEX; Thermococcus kodakarensis KOD1; RNA-Seq
1 ILLUMINA (Illumina HiSeq 2000) run: 8.1M spots, 760.2M bases, 462.7Mb downloads

Submitted by: Gene Expression Omnibus (GEO)

Study: Primary transcriptome map of the hyperthermophilic archaeon Thermococcus kodakarensis

PRJNA242777 • SRP040604 • All experiments • All runs

show Abstracthide Abstract

Background: Prokaryotes have relatively small genomes, densely-packed and apparently dominated by protein-encoding sequences. However, data now generated by high throughput RNA sequencing (RNA-seq) reveal surprisingly more-complex transcriptomes with many previously unrecognized and unanticipated non-coding small and antisense transcripts. To date, such studies have investigated primarily Bacteria. Here, we report the transcripts present in Thermococcus kodakarensis, a model hyperthermophilic Archaeon, synthesized under different growth and metabolic conditions. Results: cDNA libraries, generated from RNA preparations isolated from cells growing in media with sulfur or pyruvate, with sulfur to stationary phase, and growing with pyruvate but with sulfur added 20 min before RNA isolation, have been deep-sequenced. The results identify >2,700 sites of transcription initiation, establish a genome-wide map of transcripts, and consensus sequences for transcription initiation and post-transcription regulatory elements in T. kodakarensis. Primary transcription start sites (TSS) are identified upstream of 1,254 annotated genes, including ~78 % of those predicted by promoter locations, and an additional 644 primary TSS and their promoters have been identified within genes. Most of the mRNAs have a 5''-untranslated region (5''-UTR) between 10 and 50 nt long (median length = 16 nt), ~20 % have 5''-UTRs from 50 to 300 nt long, ~14 % are leaderless with 5''-UTRs =8 nt, and ~50% contain a consensus ribosome binding sequence. The results also identify TSS for 1,018 antisense transcripts, most with sequences complementary to either the 5''- or 3''-region of a sense mRNA. The data confirm the presence of transcripts from all three CRISPR loci, the RNase P and 7S RNAs, all tRNAs and rRNAs and 69 snoRNAs predicted to be encoded in the T. kodakarensis genome. Two transcripts, putatively identified as riboswitches, were present in RNA preparations isolated from growing but not from stationary phase cells. The procedure used is designed to identify TSS but, assuming that the number of cDNA reads correlates with transcript abundance, the data obtained also provide a semi-quantitative overview of global operon expression. They document substantial differences in gene expression under different physiological conditions and are consistent with previous observations of substrate-dependent specific gene expression. Many previously unrecognized and unanticipated small RNAs have been identified, some with relative low GC contents (=50%) and sequences that do not fold readily into base-paired secondary structures, contrary to the classical expectations for non-coding RNAs in a hyperthermophile. Conclusion: We have identified >2,700 TSS that include almost all of the primary sites of transcription initiation upstream of annotated genes, and also many secondary sites, sites within genes and sites resulting in antisense transcripts. The T. kodakarensis genome is small (~2.1 Mbp) and tightly packed with protein-encoding genes, but the results reveal the presence of many non-coding RNAs and predict extensive RNA-based regulation in T. kodakarensis. Overall design: cDNA libraries were generated and sequenced from RNA isolated from T. kodakarensis cells growing exponentially (Sexp) and to stationary phase (Sstat) in ASW-YT medium with sulfur, growing exponentially in ASW-YT with pyruvate (Pexp), and from cells growing exponentially in pyruvate but 20 min after sulfur addition (PS). The cDNAs were generated after first incubating the RNA preparations with terminator exonuclease (TEX). TEX does not degrade primary transcripts with a 5''-triphosphate (Sharma et al., 2010) but does digest RNAs generated by transcript processing that have a 5''-monophosphate. As a control and to fully document all transcripts, a cDNA library (C) was also generated and sequenced from an aliquot of an RNA preparation isolated from the cells growing exponentially with sulfur that was not exposed to TEX digestion.

Sample: TK_wt_exp_minus_TEX

SAMN02708927 • SRS582797 • All experiments • All runs

Organism: Thermococcus kodakarensis KOD1

Library:

Instrument: Illumina HiSeq 2000

Strategy: RNA-Seq

Source: TRANSCRIPTOMIC

Selection: cDNA

Layout: SINGLE

Construction protocol: RNA was extracted using TRIzol (Invitrogen) according to the manufacturer Total RNA was freed of residual genomic DNA by DNase I treatment. For depletion of processed transcripts, equal amounts of T. kodakarensis RNA were incubated with TerminatorTM 5'-phosphate-dependent exonuclease (TEX) (Epicentre #TER51020) as previously described (Sharma et al., 2010). Libraries for Solexa sequencing (HiSeq) of cDNA were constructed by vertis Biotechnology AG, Germany (http://www.vertis-biotech.com/), as described previously for eukaryotic microRNA (Berezikov et al., 2006) but omitting the RNA size-fractionation step prior to cDNA synthesis

Experiment attributes:

GEO Accession: GSM1357913

Links:

External link: GEO Sample GSM1357913

NCBI link: NCBI Entrez (gds)

Runs: 1 run, 8.1M spots, 760.2M bases, 462.7Mb

Run	# of Spots	# of Bases	Size	Published
SRR1205928	8,087,386	760.2M	462.7Mb	2014-08-19

ID:: 699642

SRA

Sequence Read Archive

Result Filters

Send to:

Links from BioSample

Supplemental Content

Related information

Recent activity