NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM4725190 Query DataSets for GSM4725190
Status Public on Oct 27, 2020
Title Long read Control-AA rep2
Sample type SRA
 
Source name MATa tor1-1 fpr1Δ::loxP-LEU2-LoxP RPL13A-2xFKBP12::LoxP-TRP1-LoxP
Organism Saccharomyces cerevisiae
Characteristics genotype: MATa tor1-1 fpr1{delta}::loxP-LEU2-LoxP RPL13A-2xFKBP12::LoxP-TRP1-LoxP
treatment: rapamycin (1 ug/ml) for 10 min
molecule subtype: nascent RNA
Treatment protocol Rapamycin was added to exponentially growing cells to a final concentration of 1 ug/ml for either 10 or 30 minutes.
Growth protocol Yeast cells were grown in YPAD medium at 30°C into log phase (OD600 = 0.5-0.8). For Nab2-AA and Control-AA strains, rapamycin was added at a final concentration of 1 ug/ml for either 10 or 30 minutes. Cells were pelleted at 4°C and washed with cold 1x PBS before snap freezing.
Extracted molecule total RNA
Extraction protocol Nascent RNA was extracted as previously described (Carrillo et al 2010). Briefly, chromatin isolated from frozen cell pellets is incubated with phenol-chloroform to extract nucleotides. DNA is digested with DNase and polyA sequences are depleted using oligo(dT) beads. For long read sequencing, ribosomal RNA was depleted using a cap-dependent exonuclease.
SMIT library construction was performed as described previously (Carrillo et al. 2016). Briefly, a 3′ end adapter (/5rApp/NNNNNCTGTAGGCACCATCAAT/3ddC/) was ligated to nascent RNA and samples were reverse transcribed. Gene-specific forward primers were used to amplify the cDNA and a second PCR added Illumina library adapters. Long read sequencing utilized the same adapter ligation strategy used in SMIT. Strand-switching reverse transcription created full length double-stranded cDNA molecules with common 5′ and 3′ ends. An initial PCR of 12-15 cycles amplified product for input into the ONT PCR Barcoding kit (SQK-PBK004) which includes dA-tailing and barcode adapter ligation. A final PCR of 8-10 cycles enriched for successful ligation and AMPure XP bead purification removes products under 250 bp. The final library was loaded onto an Oxford Nanopore MinION flow cell and sequenced for 48 hours.
 
Library strategy OTHER
Library source transcriptomic
Library selection other
Instrument model MinION
 
Description ControlAA
processed data file: longread_fractionspliced.txt
Data processing library strategy: SMIT and long read sequencing of nascent RNA
For SMIT, data processing was completed as previously published (Carrillo et al. 2016) with the exception that reads were mapped with Hisat2 (v2.1.0). First, fastq files are filtered for quality with the Fastx Toolkit (v 0.0.14) fastq_quality_filter function (-Q 33 -q 20 -p 90). 3′ end adapter and Illumina adapter sequences were trimmed using Cutadapt (v1.9.1). For R1: -g CATTGATGGTGCCTACAG -a AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC -n 2 -O 18 -m 23 -e 0.11 --match-read-wildcards --discard-untrimmed. For R2: -a CTGTAGGCACCATCAATG -a AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCGGTGGTCGCCGTATCATT -n 2 -m 28 -M 76 -e 0.11 --match-read-wildcards. Prinseq-lite (v0.20.4) removed PCR duplicates (-out_format 3 -qual_noscale -ns_max_p 1 -derep 1). The resulting reads had a remaining 5 N bases from the SMIT primer which aid in deduplication. These bases were trimmed with the FastX Toolkit fastx_trimmer tool (for R1: -Q 33 -f 6; for R2: -Q 33 -t 5). Reads were then mapped to the SacCer3 genome annotation using Hisat2 (v2.1.0) (--phred33 --no-mixed --min-intronlen 30 --max-intronlen 1010 -p 5). PolyA sequences were removed from the SAM file with a custom script. Custom R scripts compiled the number of spliced and unspliced reads for every position in the second exon of intron-containing genes. Fraction spliced values were normalized to correct for an insert length bias introduced by the PCR (https://github.com/carrillo/SMITproject/blob/master/smit_r/insert_length_model.R). Fraction spliced values were modeled with ggplot2 (v 3.2.1) geom_smooth function using the loess method (span = 0.15). The distance matrix between the fraction spliced of Control and Sample (considering only the first 300 nt after the 3′ SS) was generated for each gene using the R stats dist function (v3.6.1) using the Euclidean method. The matrix was collapsed into a single value by finding the median which is assigned as the ΔSMIT value for that gene.
For long read sequencing, the raw fast5 files were basecalled with Guppy (v3.3.0) with default quality filter parameters and basecalled reads were demultiplexed with Qcat (v1.1.0). Nanopore adapters were trimmed with Cutadapt (v1.9.1) and 3′ end adapter sequence was removed with the following parameters (cutadapt -a CTGTAGGCACCATCAATG -e 0.17 -O 6 -m 5 --match-read-wildcards --discard-untrimmed). Reads which had 3′ end adapter on 5′ end (cutadapt -g AAGCAGTGGTATCAACGCAGAGTAC -e 0.17 -O 6 -m 5 --match-read-wildcards) were reverse complemented using a custom script. Trimmed reads were then mapped to the sacCer3 genome with minimap2 (v2.17) (-ax splice -k 10 -G 2000 –secondary=no). Reads with polyA tails were removed using a custom script. Reads with 30 or more soft-clipped bases which contain a stretch of 10 A’s were removed. Reads with less than 30 soft-clipped bases which contain 6 A’s were removed. Bam and bed files were generated using Samtools (v1.9) and BEDTools (v 2.29.2) respectively. Custom R scripts were written identify intrusive and readthrough reads using transcript start and end positions defined by Nagalakshmi et al 2008. Fraction spliced of reads with 3′ ends downstream of the 3′ SS was determined using the DescTools (v0.99.30) Overlap function.
Genome_build: sacCer3
Supplementary_files_format_and_content: tab-delimited text files with fraction spliced for each gene
Supplementary_files_format_and_content: tab-delimited text files with the deltaSMIT value for each gene
Supplementary_files_format_and_content: .bed file for mapped long read data filtered for reads overlapping intron-containing genes only; BED12 file with extra column to identify gene.
Supplementary_files_format_and_content: longread_fractionspliced.txt
 
Submission date Aug 12, 2020
Last update date Oct 28, 2020
Contact name Tara Diane Alpert
E-mail(s) tara.alpert@yale.edu
Organization name Yale University
Department Molecular Biophysics and Biochemistry
Lab Karla Neugebauer
Street address 333 Cedar St.
City New Haven
State/province CT
ZIP/Postal code 06511
Country USA
 
Platform ID GPL25739
Series (1)
GSE156133 Widespread transcriptional readthrough caused by Nab2 depletion leads to chimeric transcripts with retained introns
Relations
BioSample SAMN15804191
SRA SRX8934928

Supplementary file Size Download File type/resource
GSM4725190_control2_ic.bed.gz 455.3 Kb (ftp)(http) BED
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap