|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Nov 16, 2020 |
Title |
frag_mRNA_input_control |
Sample type |
SRA |
|
|
Source name |
S2R+ cells
|
Organism |
Drosophila melanogaster |
Characteristics |
condition: fragmented mRNA input control miclip antibody: none
|
Treatment protocol |
no specific treatment
|
Growth protocol |
S2R+ cells were grown at room temperature in Schneider's medium (Gibco) supplemented with 10% FBS (Sigma) and 1% penicilin-streptomycin (sigma)
|
Extracted molecule |
polyA RNA |
Extraction protocol |
RNA were harvested using Trizol reagent and mRNA were isolated by two rounds of binding to oligo (dt)25 magnetic beads (NEB). mRNA was fragmented with RNA fragmentation solution (Ambion). Samples IP1, IP2, IP3, IP4 and ctr were immunoprecipitated using anti-m6A antibody. Sample frag-mRNA was purified with RNAClean XP beads (Beckman Coulter). Library preparation for IP1, IP2, IP3, IP4 and ctr samples was performed as previously described in (Sutandy et al., 2016). Libraries were sequenced on an Illumina NextSeq500. Library preparation for frag_mRNA was performed using the NEBNext Ultra Directional RNA Library Prep Kit.
|
|
|
Library strategy |
RIP-Seq |
Library source |
transcriptomic |
Library selection |
other |
Instrument model |
Illumina NextSeq 500 |
|
|
Description |
fragmented mRNA input control for comparative analysis with IP samples
|
Data processing |
Individual samples were processed using the CLIP Tool Kit (CTK) v1.0.9. (Shah et al. 2017). We largely followed recommended user guide lines specific to CTK iCLIP data analysis as described here (https://zhanglab.c2b2.columbia.edu/index.php/ICLIP_data_analysis_using_CTK). Briefly, 3’adapter sequences [AGATCGGAAGAGCGGTTCAG] were trimmed using cutadapt v1.8. [--overlap = 5 -m 29]. PCR duplicates were removed using a custom perl script, followed by the extraction of the 9 nucleotide miCLIP barcode and its addition to the read name. All cDNA libraries were filtered for common Drosophila virus sequences using bowtie v1.1.2 [-p 4 -q (-X 1000) --fr --best]. Next, to avoid sequencing read alignment software biases, we decided to map sequencing reads to the Drosophila melanogaster dm6 genome assembly (ensemble v81) using novoalign, bwa and STAR v2.4.2a. For STAR alignments, we used a custom python script to transform sam alignment files into the expected format for downstream CITS identification. For STAR alignments, we did not consider spliced reads, soft-clipped reads, mismatches and indels near read start and read end, and reads with more than one indel or mismatch. Then, unique tags were identified using parseAlignment.pl [-v --map-qual 1 --min-len 18 --indel-to-end 2] to extract unique tags, followed by read collapsing using tag2collapse.pl [-v -big --random-barcode -EM 30 --seq-error-model alignment -weight --weight-in-name --keep-max-score --keep-tag-name]. Crosslinking induced mutation sites (CIMS) indicative for the antibody-m6A interaction were identified running CIMS.pl [-big -n 10], and CIMS with FDR < 0.001 were retained. Crosslinking induced truncations sites were identified using CITS.pl [-big -p 0.001 --gap 25]. Sites spanning more than 1 nucleotide were removed. We further filtered identified CIMS and CITS to be reproducible in at least 2 out of 4 replicate m6A-immunoprecipitation samples and not present in identified CITS from the input control sample for each aligner separately. Moreover, CIMS were filtered to have a minimum of 6 unique tags [k > 5], at least three unique substitutions [m > 2], and be prevalent in less than 20% of the coverage [m/k < 0.2] to avoid calling homozygous and heterozygous single nucleotide variants. CIMS sites were found to be almost exclusive C-to-T conversions (n = 6225, 88% ± 5.8%) independent of the alignment software used (3 aligners n = 2677, 2 n = 2411, 1 n = 1137). For a stringent CITS set, we filtered CITS sites (n = 22917) that mostly truncated at A residues ( n = 11897, 52%), and were followed by C residues (CITS-AC n = 6799, 57%). Two thousand three hundred and two (37%) of C-to-T CIMS overlapped within a 1nt window to CITS-AC sites, suggesting that in many cases the same nucleotide was identified. Together, we considered a set of 13024 C-to-T conversions CIMS and AC truncation CITS across 2464 genes for our final S2 cell miCLIP data set. CITS site were annotated as described before (Wessels et al. 2019). For representation purposes we simplified the annotation categories. All CITS not annotated to 5’UTR, CDS, 3’UTR or intron were summarized in the category ‘other’. Enrichments were calculated relative to median feature proportions (5'UTR = 0.08 (131nt), CDS = 0.78 (1309.5nt), 3'UTR = 0.14 (234nt)) determined previously for S2 cells (Wessels et al. 2019). Enrichments for sites annotated as intronic or other were set to 1. Alignment parameters: Novoalign: [-t 85 -l 20 -s 1 -o Native -r None] BWA: [-t 4 -n 0.06 -q 20] STAR: [--alignEndsType EndToEnd --runThreadN 4 --outFilterMultimapNmax 10 --outSAMattributes All --outFilterIntronMotifs RemoveNoncanonical --outReadsUnmapped Fastx --alignSJoverhangMin 12 --outFilterMatchNmin 15 --outFilterMismatchNmax 1 --outFilterMismatchNoverLmax 0.05 --outFilterMultimapScoreRange 3 --alignIntronMax 20000 --seedMultimapNmax 200000 --seedPerReadNmax 30000 --outSAMtype BAM SortedByCoordinate] Genome_build: dm6
|
|
|
Submission date |
Feb 14, 2020 |
Last update date |
Nov 16, 2020 |
Contact name |
Jean-Yves Roignant |
Organization name |
Institute of molecular Biology
|
Lab |
Roignant
|
Street address |
Ackermannweg 4
|
City |
Mainz |
ZIP/Postal code |
55128 |
Country |
Germany |
|
|
Platform ID |
GPL19132 |
Series (1) |
GSE145342 |
The m6A reader Ythdf restricts axonal growth in Drosophila through target selection modulation of the Fragile X mental retardation protein |
|
Relations |
BioSample |
SAMN14113727 |
SRA |
SRX7730212 |
Supplementary data files not provided |
SRA Run Selector |
Raw data are available in SRA |
Processed data are available on Series record |
|
|
|
|
|