GEO Accession viewer

NCBI > GEO > Accession Display

Not logged in | Login

GEO help: Mouse over screen elements for information.

Sample GSM4725190

Query DataSets for GSM4725190

Status

Public on Oct 27, 2020

Title

Long read Control-AA rep2

Sample type

SRA

Source name

MATa tor1-1 fpr1Δ::loxP-LEU2-LoxP RPL13A-2xFKBP12::LoxP-TRP1-LoxP

Organism

Saccharomyces cerevisiae

Characteristics

genotype: MATa tor1-1 fpr1{delta}::loxP-LEU2-LoxP RPL13A-2xFKBP12::LoxP-TRP1-LoxP
treatment: rapamycin (1 ug/ml) for 10 min
molecule subtype: nascent RNA

Treatment protocol

Rapamycin was added to exponentially growing cells to a final concentration of 1 ug/ml for either 10 or 30 minutes.

Growth protocol

Yeast cells were grown in YPAD medium at 30°C into log phase (OD600 = 0.5-0.8). For Nab2-AA and Control-AA strains, rapamycin was added at a final concentration of 1 ug/ml for either 10 or 30 minutes. Cells were pelleted at 4°C and washed with cold 1x PBS before snap freezing.

Extracted molecule

total RNA

Extraction protocol

Nascent RNA was extracted as previously described (Carrillo et al 2010). Briefly, chromatin isolated from frozen cell pellets is incubated with phenol-chloroform to extract nucleotides. DNA is digested with DNase and polyA sequences are depleted using oligo(dT) beads. For long read sequencing, ribosomal RNA was depleted using a cap-dependent exonuclease.
SMIT library construction was performed as described previously (Carrillo et al. 2016). Briefly, a 3′ end adapter (/5rApp/NNNNNCTGTAGGCACCATCAAT/3ddC/) was ligated to nascent RNA and samples were reverse transcribed. Gene-specific forward primers were used to amplify the cDNA and a second PCR added Illumina library adapters. Long read sequencing utilized the same adapter ligation strategy used in SMIT. Strand-switching reverse transcription created full length double-stranded cDNA molecules with common 5′ and 3′ ends. An initial PCR of 12-15 cycles amplified product for input into the ONT PCR Barcoding kit (SQK-PBK004) which includes dA-tailing and barcode adapter ligation. A final PCR of 8-10 cycles enriched for successful ligation and AMPure XP bead purification removes products under 250 bp. The final library was loaded onto an Oxford Nanopore MinION flow cell and sequenced for 48 hours.

Library strategy

OTHER

Library source

transcriptomic

Library selection

other

Instrument model

MinION

Description

ControlAA
processed data file: longread_fractionspliced.txt

Data processing

library strategy: SMIT and long read sequencing of nascent RNA
For SMIT, data processing was completed as previously published (Carrillo et al. 2016) with the exception that reads were mapped with Hisat2 (v2.1.0). First, fastq files are filtered for quality with the Fastx Toolkit (v 0.0.14) fastq_quality_filter function (-Q 33 -q 20 -p 90). 3′ end adapter and Illumina adapter sequences were trimmed using Cutadapt (v1.9.1). For R1: -g CATTGATGGTGCCTACAG -a AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC -n 2 -O 18 -m 23 -e 0.11 --match-read-wildcards --discard-untrimmed. For R2: -a CTGTAGGCACCATCAATG -a AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCGGTGGTCGCCGTATCATT -n 2 -m 28 -M 76 -e 0.11 --match-read-wildcards. Prinseq-lite (v0.20.4) removed PCR duplicates (-out_format 3 -qual_noscale -ns_max_p 1 -derep 1). The resulting reads had a remaining 5 N bases from the SMIT primer which aid in deduplication. These bases were trimmed with the FastX Toolkit fastx_trimmer tool (for R1: -Q 33 -f 6; for R2: -Q 33 -t 5). Reads were then mapped to the SacCer3 genome annotation using Hisat2 (v2.1.0) (--phred33 --no-mixed --min-intronlen 30 --max-intronlen 1010 -p 5). PolyA sequences were removed from the SAM file with a custom script. Custom R scripts compiled the number of spliced and unspliced reads for every position in the second exon of intron-containing genes. Fraction spliced values were normalized to correct for an insert length bias introduced by the PCR (https://github.com/carrillo/SMITproject/blob/master/smit_r/insert_length_model.R). Fraction spliced values were modeled with ggplot2 (v 3.2.1) geom_smooth function using the loess method (span = 0.15). The distance matrix between the fraction spliced of Control and Sample (considering only the first 300 nt after the 3′ SS) was generated for each gene using the R stats dist function (v3.6.1) using the Euclidean method. The matrix was collapsed into a single value by finding the median which is assigned as the ΔSMIT value for that gene.
For long read sequencing, the raw fast5 files were basecalled with Guppy (v3.3.0) with default quality filter parameters and basecalled reads were demultiplexed with Qcat (v1.1.0). Nanopore adapters were trimmed with Cutadapt (v1.9.1) and 3′ end adapter sequence was removed with the following parameters (cutadapt -a CTGTAGGCACCATCAATG -e 0.17 -O 6 -m 5 --match-read-wildcards --discard-untrimmed). Reads which had 3′ end adapter on 5′ end (cutadapt -g AAGCAGTGGTATCAACGCAGAGTAC -e 0.17 -O 6 -m 5 --match-read-wildcards) were reverse complemented using a custom script. Trimmed reads were then mapped to the sacCer3 genome with minimap2 (v2.17) (-ax splice -k 10 -G 2000 –secondary=no). Reads with polyA tails were removed using a custom script. Reads with 30 or more soft-clipped bases which contain a stretch of 10 A’s were removed. Reads with less than 30 soft-clipped bases which contain 6 A’s were removed. Bam and bed files were generated using Samtools (v1.9) and BEDTools (v 2.29.2) respectively. Custom R scripts were written identify intrusive and readthrough reads using transcript start and end positions defined by Nagalakshmi et al 2008. Fraction spliced of reads with 3′ ends downstream of the 3′ SS was determined using the DescTools (v0.99.30) Overlap function.
Genome_build: sacCer3
Supplementary_files_format_and_content: tab-delimited text files with fraction spliced for each gene
Supplementary_files_format_and_content: tab-delimited text files with the deltaSMIT value for each gene
Supplementary_files_format_and_content: .bed file for mapped long read data filtered for reads overlapping intron-containing genes only; BED12 file with extra column to identify gene.
Supplementary_files_format_and_content: longread_fractionspliced.txt

Submission date

Aug 12, 2020

Last update date

Oct 28, 2020

Contact name

Tara Diane Alpert

E-mail(s)

tara.alpert@yale.edu

Organization name

Yale University

Department

Molecular Biophysics and Biochemistry

Lab

Karla Neugebauer

Street address

333 Cedar St.

City

New Haven

State/province

ZIP/Postal code

06511

Country

USA

Platform ID

GPL25739

Series (1)

GSE156133

Widespread transcriptional readthrough caused by Nab2 depletion leads to chimeric transcripts with retained introns

Relations

BioSample

SAMN15804191

SRA

SRX8934928

Supplementary file	Size	Download	File type/resource
GSM4725190_control2_ic.bed.gz	455.3 Kb	(ftp)(http)	BED
SRA Run Selector
Raw data are available in SRA
Processed data provided as supplementary file
Processed data are available on Series record