GEO Accession viewer

NCBI > GEO > Accession Display

Not logged in | Login

GEO help: Mouse over screen elements for information.

Sample GSM3243330

Query DataSets for GSM3243330

Status

Public on Jul 04, 2018

Title

Cp_immature_DNase-seq

Sample type

SRA

Source name

immature fruit flesh

Organism

Carica papaya

Characteristics

cultivar: Tainong1
tissue: fruit flesh
developmental stage: immature

Extracted molecule

genomic DNA

Extraction protocol

DNase-Seq was performed as described in 1. Grøntved, L. et al., Epigenetics Chromatin 5, 10 (2012). The purified nuclei were suspended in 500 ul of nuclease digestion buffer (30 mM Tris-HCl pH 8.0, 14 mM MgCl2, 0.5% NP-40, 0.2% BSA) containing different concentration of DNase (Cyanase, RiboSolutions). After a short incubation at 37°C for 3 min, the digestion was immediately terminated by adding EDTA and SDS. RNase A was then added to digest the RNA. The DNA was purified by Phenol/Chloroform extraction and ethanol precipitation. The purified total DNA was run on a 2% agarose gel and the small fragments released by the DNase digestion were purified and converted to Illumina TruSeq libraries as described in He, H.H. et al., Nat. Methods, 2014.

Library strategy

DNase-Hypersensitivity

Library source

genomic

Library selection

DNAse

Instrument model

Illumina HiSeq 4000

Description

Cp_immature
sequencing instrument(s): Illumina HiSeq 4000,HiSeq X Ten

Data processing

Most of the sequencing and base-calling services were provide by CloudHealth Genomics, Ltd. (Shanghai).
For RNA-Seq dataset, raw data was aligned to a home maintained rDNA database using Bowtie2 (version 2.1.0) with default parameters to filter out any rRNA sequence. Retained reads were then mapped to genome reference using TopHat2 (v2.1.1) with parameters: --read-realign-edit-dist 0 --max-intron-length 10000 --max-segment-intron 10000 --prefilter-multihits. FPKM value of each gene was calculated using StringTie (v1.3.1c) with parameters: -e -j 3 -f 0.2 -A. Raw read counts of genes were calculated using HTSeq (version 0.9.1) with parameters: --nonunique all.
For DNase-Seq and ChIP-Seq dataset, raw reads were mapped to the genome reference using Bowtie2 (version 2.1.0) with option: -3 100. For DNase-Seq and transcription factor ChIP-Seq datasets, peaks were called using MACS (ver 2.1.1.20160309), with the parameters: --nomodel -q 0.01 and -q 0.01, respectively. For H3K27me3 ChIP-Seq dataset, peaks were called using SICER (version 1.1) with the settings: window size 150, gap size 300, FDR 0.01. H3K27me3 candidate peaks were filtered further by their length, fold of enrichment and FDR: 1. peaks shorter than 300 bp was discarded; 2. for peaks shorter than 1kb, fold of enrichment and FDR requirement cutoff were setted as 5 and 1e-50, respectively for apple and tomato datasets, 4 and 1e-50 repectively for banana, grape and pear datasets, 3 and 1e-5 for datasets of other species; 3. for peaks longer than 1kb, fold of enrichment cutoff were setted as 4 for apple and tomato datasets, 3 for banana, grape and pear datasets, 2 for datasets of other species; Alignments with mapping score less than 10 and PCR duplicates were discarded using samtools before peak calling. bedgraph file generated by MACS with --SPMR enabled was converted to bw file using bedGraphToBigWig.
For Bisulfite-Seq dataset, raw data was mapped to genome reference using BSMAP (version 2.90) with the parameters: -A AGATCGGAAGAGC -w 100 -r 0 -q 10. Methylation ratio of each cytosine was calculated using methratio.py with the parameters: -z -r. The result file was compressed using bgzip and index using tabix.
Genome_build: Arabidopsis thaliana: genome release 9 (assembly), TAIR10 (annotation), https://genome.jgi.doe.gov/pages/dynamicOrganismDownload.jsf?organism=Athaliana
Genome_build: Apple: GDDH13 Version 1.1 (assembly), GDDH13 Version 1.1 (annotation), https://iris.angers.inra.fr/gddh13/
Genome_build: Banana: DH-Pahang v2 (assembly), DH-Pahang v2 (annotation), http://banana-genome.cirad.fr/content/download-dh-pahang
Genome_build: Cucumber: Chinese long v2 (assembly), Chinese long v2 (annotation), ftp://www.icugi.org/pub/genome/cucumber/Chinese_long/v2/
Genome_build: Grape: Genoscope (12X) (assembly), CRIBI V2.1 (annotation), http://genomes.cribi.unipd.it/DATA/V2/V2.1/
Genome_build: Melon: v3.5 (assembly), v3.5 (annotation), https://melonomics.net/files/Genome/Melon_genome_v3.5_Garcia-Mas_et_al_2012/
Genome_build: Papaya: ASGPBv0.4 (assembly), ASGPBv0.4 (annotation), http://phytozome.jgi.doe.gov/pz/portal.html#!info?alias=Org_Cpapaya
Genome_build: Peach: v1.0 (assembly), v1.0 (annotation), http://phytozome.jgi.doe.gov/pz/portal.html#!info?alias=Org_Ppersica
Genome_build: Pear: Pbr_v1.0 (assembly), NCBI Annotation Release 101 (annotation), https://www.ncbi.nlm.nih.gov/genome/12793
Genome_build: Strawberry: v1.1 (assembly), v1.1 (annotation), http://phytozome.jgi.doe.gov/pz/portal.html#!info?alias=Org_Fvesca
Genome_build: Watermelon: v1 (assembly), v1 (annotation), ftp://www.icugi.org/pub/genome/watermelon/97103/v1/
Genome_build: Tomato: v2.5 (assembly), ITAG2.4 (annotation), ftp://ftp.solgenomics.net/tomato_genome/assembly/build_2.50/
Genome_build: Rice: Os-Nipponbare-Reference-IRGSP-1.0 (assembly), MSU release 7.0 (annotation), https://genome.jgi.doe.gov/pages/dynamicOrganismDownload.jsf?organism=Osativa
Supplementary_files_format_and_content: For RNA-Seq dataset, tab delimited abundance file gives FPKM of each gene, tab delimited count file gives read counts of each gene
Supplementary_files_format_and_content: For Dnase-Seq dataset, bw file shows the read density across the entile genome, tab delimited file in narrowPeak format shows peak calling information generated by MACS2
Supplementary_files_format_and_content: For Bisulfite-Seq dataset, bgzip compressed tab delimited file gives all and mC counts of each genomic cytosine across the entile genome. Index file are also provided, so one can fast retrieve the data randomly using tabix.
Supplementary_files_format_and_content: For ChIP-Seq dataset, bw file shows the read density across the entile genome. For transcription factor ChIP-Seq dataset, tab delimited file in narrowPeak format shows peak calling information generated by MACS2. For H3K27me3 ChIP-Seq dataset, tab delimited file shows peak calling information generated by SICER.

Submission date

Jul 03, 2018

Last update date

Jul 04, 2018

Contact name

Silin Zhong

E-mail(s)

zhonglab.cuhk@gmail.com

Organization name

CUHK

Department

School of Life Sciences

Street address

Shatin, N.T.

City

Hong Hong

ZIP/Postal code

Country

China

Platform ID

GPL25274

Series (1)

GSE116581

fruitENCODE: An encyclopedia of DNA elements for fruit ripening

Relations

BioSample

BioSample

SRA

SRA

Supplementary file	Size	Download	File type/resource
GSM3243330_Cp_immature_DNase-seq.bw	55.8 Mb	(ftp)(http)	BW
GSM3243330_Cp_immature_DNase-seq.narrowPeak.gz	545.2 Kb	(ftp)(http)	NARROWPEAK
SRA Run Selector
Raw data are available in SRA
Processed data provided as supplementary file