NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM8302351 Query DataSets for GSM8302351
Status Public on Jul 11, 2024
Title miCLIP_mESC_NSUN5_Eluent_rep1
Sample type SRA
 
Source name cell line
Organism Mus musculus
Characteristics tissue: cell line
cell line: mESC
cell type: Mouse embryonic stem cells
genotype: NSUN5 C308A overexpression
treatment: Cells were lysed and followed by partial RNA digestion with RNase I. Then, FLAG immunoprecipitation was performed using anti-FLAG magnetic beads. The protein-RNA complex was eluted and a second biotin immunoprecipitation was conducted with streptavidin beads. The RNA was purified by RNA Isolater and subjected to library construction.
Growth protocol The mESC cell line was cultured in a complete ESC culture medium, comprising Dulbecco's modified Eagle's medium (DMEM) supplemented with 15% heat-inactivated fetal bovine serum (FBS), 0.1 mM 2-mercaptoethanol, 2 mM L-glutamine, 0.1 mM non-essential amino acids, 1000 U/mL recombinant leukemia inhibitory factor (LIF), and 100 U/mL penicillin/streptomycin. For differentiation assays, mESC cells were cultured without LIF for six days. HeLa and HEK 293T cell lines were cultured in DMEM supplemented with 10% FBS and 100 U/mL penicillin/streptomycin.
Extracted molecule total RNA
Extraction protocol Total RNA extraction from the cell lines was accomplished using the RNA isolater total RNA extraction reagent (Vazyme, R401-01). Prior to size selection, RNA underwent DNase I treatment at 37 ℃ for 15 min and subsequently was purified using RNA isolater extraction and ethanol precipitation (Vazyme, EN401-01). The small RNA fraction, characterized by a size of less than 200nt, was enriched from the total RNA utilizing the RNA Clean & Concentrator-25 Kit (Zymo, R1018). For mRNA purification, two successive rounds of poly(A)+ selection were performed employing oligo(dT)25 dynabeads (Life Technologies, 61005). Additionally, chromatin-associated RNA was isolated following subcellular fractionation assay, DNase I treatment, and rRNA depletion (Vazyme, N406-01). Modification-free control RNAs were prepared from HeLa or mESC mRNA, adhering to a previously published protocol (PMID: 34594034, 36593412).
The RNA samples were initially dephosphorylated using T4 polynucleotide kinase (Vazyme, N102-01) at 37℃ for 1 hour, followed by ethanol precipitation. Subsequently, they were ligated to the 3' RNA linker using T4 RNA ligase 2, truncated at 25°C for 2 hours, followed by an overnight incubation at 16°C (Beyotime, R0635L). To remove excess adaptors, the samples underwent treatment with 5´ Deadenylase (NEB, M0331S), RecJf (NEB, M0264S), and Proteinase K (NEB, P8102). RNA samples intended for group comparisons were pooled and then purified through Tris-phenol extraction followed by ethanol precipitation. The barcoded and pooled samples were labeled by 5-(2-azidoethyl)-1,3-indandione (AI) and subsequently subjected to a click reaction. The m5C-modified RNA was subjected to RNA pull-down to be enriched, followed by reverse transcription using the HiScript III RT SuperMix (Vazyme, R323-01). The excess reverse transcription primer was digested with exonuclease I (NEB, M0293S). The cDNA was purified by silane beads and then performed to 5′ adapter ligation on beads with T4 RNA ligase 1, high concentration (NEB, M0437M). cDNA was purified using silane beads (Invitrogen, 37002D), and subsequently, the purified cDNA was amplified using the KAPA HiFi HotStart ReadyMix (Roche, KK2602). Finally, PCR products were purified using 1.8X VAHTS DNA Clean Beads (Vazyme, N411-02) and 7% TBE gel.
 
Library strategy OTHER
Library source transcriptomic
Library selection other
Instrument model Illumina NovaSeq 6000
 
Data processing For batch paired-end reads of m5C-seq libraries, a sample-barcoded-sequence (Pos 1-6) and random-4-base (pos7-10) in Read1 and a 10nt unique molecular identifier (UMI) was included in the adaptor that ligates to the 3’end of cDNA so it locates in Read2 (Pos1-10). Firstly, m5C-seq batch libraries were demultiplexed into individual samples, using fastq-multx(with -m 1 parameters)that distributes reads into a sample FASTQ file(Read1.fq file and Read2.fq file) according to the sample-barcoded-sequence in Read1 (Pos 1-6). For every individual sample, only Read2 data with UMI was used for subsequent analyses. Illumina sequencing reads were first treated with trim_galore(version 0.6.4) for adapter removal and quality trimming. Work commands are as follows: trim_galore --fastqc --phred33 --length 30. We then used Seqkit (version 0.10.0) to deduplicate based on 10 bp UMI at the 5′ end of reads R2, key process parameters are as follows: seqkit rmdup -s. Finally, Fastp (version 0.23.2) was used to remove the 10 bp UMI in the deduplication read. 10 bases at the 3′ end of inserted sequences were also removed, using the key parameters: fastp -f 10 -t 10 -l 20.
Processed reads were mapped to reference genome (human GRCh19 reference / mouse mm10 reference) using HISAT2-3N (version 2.2.1-3n-0.0.3). The key parameters are as follows: hisat3n --base-change C,T --directional-mapping --repeat -p 20 -k 1. Considering a creditable m5C signal, the alignments are further filtering using python house-scripts, the alignments hosts >=3 C-to-T mismatches or >= 3 other type mismatches or the length of alignments < 20nt are filtered. Next, we parse BAM file using pysamstats (version 1.1.2), with commands as follow: pysamstats -D 100000 -S "nofilter" -t variation.
To identify the methyltransferase-dependent m5C sites, we calculated the C-to-T ratios for the m5C-TACseq-identified m5C sites in WT and all knockout/knockdown samples. Then, the fold change and difference of C-to-T ratio between KO/KD samples and WT/Control samples were calculated. For the identification of enzyme-dependent sites by comparing the KO and WT samples, the fold change in C-to-T ratio (KO sample/WT sample) was required to be less than 60% in two replicates, and the difference in C-to-T ratio (WT sample - KO sample) was required to be > 1.5%. Considering the presence of residual enzymes upon knockdown, to identify enzyme-dependent sites by comparing the KD and Control samples, the fold change in C-to-T ratio (KD sample/Control sample) was required to be less than 60% in at least one replicate.
The analysis of CLIP-seq data was performed using CLIP Tool Kit (CTK, v1.1.3). The processed reads were mapped to the mouse genome (Ensembl annotation of mm10) using hisat2 v2.2.1 with default settings.
Assembly: hg19(UCSC)/mm10(UCSC)/tRNA-referenece(tRNAdb)
Supplementary files format and content: tab-delimited text file includes every C base coverage and C-to-T signal
Supplementary files format and content: excel file,in xls file format
Library strategy: m5C-seq
 
Submission date Jun 03, 2024
Last update date Jul 11, 2024
Contact name Xiaoyu Li
E-mail(s) xiaoyu_li@zju.edu.cn
Organization name Zhejiang University
Department SCHOOL OF BASIC MEDICAL SCIENCES
Lab Li Lab
Street address 866 Yuhangtang Road
City Hangzhou
State/province Zhejiang
ZIP/Postal code 310058
Country China
 
Platform ID GPL24247
Series (1)
GSE242724 Base-resolution m5C profiling across the mammalian transcriptome by bisulfite-free enzyme-assisted chemical labeling approach
Relations
BioSample SAMN41656085
SRA SRX24781297

Supplementary file Size Download File type/resource
GSM8302351_NSUN5-Eluent-rep1.F.bw 394.0 Kb (ftp)(http) BW
GSM8302351_NSUN5-Eluent-rep1.R.bw 357.1 Kb (ftp)(http) BW
SRA Run SelectorHelp
Raw data are available in SRA

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap