NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM3243330 Query DataSets for GSM3243330
Status Public on Jul 04, 2018
Title Cp_immature_DNase-seq
Sample type SRA
 
Source name immature fruit flesh
Organism Carica papaya
Characteristics cultivar: Tainong1
tissue: fruit flesh
developmental stage: immature
Extracted molecule genomic DNA
Extraction protocol DNase-Seq was performed as described in 1. Grøntved, L. et al., Epigenetics Chromatin 5, 10 (2012). The purified nuclei were suspended in 500 ul of nuclease digestion buffer (30 mM Tris-HCl pH 8.0, 14 mM MgCl2, 0.5% NP-40, 0.2% BSA) containing different concentration of DNase (Cyanase, RiboSolutions). After a short incubation at 37°C for 3 min, the digestion was immediately terminated by adding EDTA and SDS. RNase A was then added to digest the RNA. The DNA was purified by Phenol/Chloroform extraction and ethanol precipitation. The purified total DNA was run on a 2% agarose gel and the small fragments released by the DNase digestion were purified and converted to Illumina TruSeq libraries as described in He, H.H. et al., Nat. Methods, 2014.
 
Library strategy DNase-Hypersensitivity
Library source genomic
Library selection DNAse
Instrument model Illumina HiSeq 4000
 
Description Cp_immature
sequencing instrument(s): Illumina HiSeq 4000,HiSeq X Ten
Data processing Most of the sequencing and base-calling services were provide by CloudHealth Genomics, Ltd. (Shanghai).
For RNA-Seq dataset, raw data was aligned to a home maintained rDNA database using Bowtie2 (version 2.1.0) with default parameters to filter out any rRNA sequence. Retained reads were then mapped to genome reference using TopHat2 (v2.1.1) with parameters: --read-realign-edit-dist 0 --max-intron-length 10000 --max-segment-intron 10000 --prefilter-multihits. FPKM value of each gene was calculated using StringTie (v1.3.1c) with parameters: -e -j 3 -f 0.2 -A. Raw read counts of genes were calculated using HTSeq (version 0.9.1) with parameters: --nonunique all.
For DNase-Seq and ChIP-Seq dataset, raw reads were mapped to the genome reference using Bowtie2 (version 2.1.0) with option: -3 100. For DNase-Seq and transcription factor ChIP-Seq datasets, peaks were called using MACS (ver 2.1.1.20160309), with the parameters: --nomodel -q 0.01 and -q 0.01, respectively. For H3K27me3 ChIP-Seq dataset, peaks were called using SICER (version 1.1) with the settings: window size 150, gap size 300, FDR 0.01. H3K27me3 candidate peaks were filtered further by their length, fold of enrichment and FDR: 1. peaks shorter than 300 bp was discarded; 2. for peaks shorter than 1kb, fold of enrichment and FDR requirement cutoff were setted as 5 and 1e-50, respectively for apple and tomato datasets, 4 and 1e-50 repectively for banana, grape and pear datasets, 3 and 1e-5 for datasets of other species; 3. for peaks longer than 1kb, fold of enrichment cutoff were setted as 4 for apple and tomato datasets, 3 for banana, grape and pear datasets, 2 for datasets of other species; Alignments with mapping score less than 10 and PCR duplicates were discarded using samtools before peak calling. bedgraph file generated by MACS with --SPMR enabled was converted to bw file using bedGraphToBigWig.
For Bisulfite-Seq dataset, raw data was mapped to genome reference using BSMAP (version 2.90) with the parameters: -A AGATCGGAAGAGC -w 100 -r 0 -q 10. Methylation ratio of each cytosine was calculated using methratio.py with the parameters: -z -r. The result file was compressed using bgzip and index using tabix.
Genome_build: Arabidopsis thaliana: genome release 9 (assembly), TAIR10 (annotation), https://genome.jgi.doe.gov/pages/dynamicOrganismDownload.jsf?organism=Athaliana
Genome_build: Apple: GDDH13 Version 1.1 (assembly), GDDH13 Version 1.1 (annotation), https://iris.angers.inra.fr/gddh13/
Genome_build: Banana: DH-Pahang v2 (assembly), DH-Pahang v2 (annotation), http://banana-genome.cirad.fr/content/download-dh-pahang
Genome_build: Cucumber: Chinese long v2 (assembly), Chinese long v2 (annotation), ftp://www.icugi.org/pub/genome/cucumber/Chinese_long/v2/
Genome_build: Grape: Genoscope (12X) (assembly), CRIBI V2.1 (annotation), http://genomes.cribi.unipd.it/DATA/V2/V2.1/
Genome_build: Melon: v3.5 (assembly), v3.5 (annotation), https://melonomics.net/files/Genome/Melon_genome_v3.5_Garcia-Mas_et_al_2012/
Genome_build: Papaya: ASGPBv0.4 (assembly), ASGPBv0.4 (annotation), http://phytozome.jgi.doe.gov/pz/portal.html#!info?alias=Org_Cpapaya
Genome_build: Peach: v1.0 (assembly), v1.0 (annotation), http://phytozome.jgi.doe.gov/pz/portal.html#!info?alias=Org_Ppersica
Genome_build: Pear: Pbr_v1.0 (assembly), NCBI Annotation Release 101 (annotation), https://www.ncbi.nlm.nih.gov/genome/12793
Genome_build: Strawberry: v1.1 (assembly), v1.1 (annotation), http://phytozome.jgi.doe.gov/pz/portal.html#!info?alias=Org_Fvesca
Genome_build: Watermelon: v1 (assembly), v1 (annotation), ftp://www.icugi.org/pub/genome/watermelon/97103/v1/
Genome_build: Tomato: v2.5 (assembly), ITAG2.4 (annotation), ftp://ftp.solgenomics.net/tomato_genome/assembly/build_2.50/
Genome_build: Rice: Os-Nipponbare-Reference-IRGSP-1.0 (assembly), MSU release 7.0 (annotation), https://genome.jgi.doe.gov/pages/dynamicOrganismDownload.jsf?organism=Osativa
Supplementary_files_format_and_content: For RNA-Seq dataset, tab delimited abundance file gives FPKM of each gene, tab delimited count file gives read counts of each gene
Supplementary_files_format_and_content: For Dnase-Seq dataset, bw file shows the read density across the entile genome, tab delimited file in narrowPeak format shows peak calling information generated by MACS2
Supplementary_files_format_and_content: For Bisulfite-Seq dataset, bgzip compressed tab delimited file gives all and mC counts of each genomic cytosine across the entile genome. Index file are also provided, so one can fast retrieve the data randomly using tabix.
Supplementary_files_format_and_content: For ChIP-Seq dataset, bw file shows the read density across the entile genome. For transcription factor ChIP-Seq dataset, tab delimited file in narrowPeak format shows peak calling information generated by MACS2. For H3K27me3 ChIP-Seq dataset, tab delimited file shows peak calling information generated by SICER.
 
Submission date Jul 03, 2018
Last update date Jul 04, 2018
Contact name Silin Zhong
E-mail(s) zhonglab.cuhk@gmail.com
Organization name CUHK
Department School of Life Sciences
Street address Shatin, N.T.
City Hong Hong
ZIP/Postal code NA
Country China
 
Platform ID GPL25274
Series (1)
GSE116581 fruitENCODE: An encyclopedia of DNA elements for fruit ripening
Relations
BioSample SAMN06673735
BioSample SAMN06673734
SRA SRX2697833
SRA SRX2697834

Supplementary file Size Download File type/resource
GSM3243330_Cp_immature_DNase-seq.bw 55.8 Mb (ftp)(http) BW
GSM3243330_Cp_immature_DNase-seq.narrowPeak.gz 545.2 Kb (ftp)(http) NARROWPEAK
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap