NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM1228844 Query DataSets for GSM1228844
Status Public on Sep 12, 2013
Title Dmel_Input_2
Sample type SRA
 
Source name Blastoderm embryos
Organism Drosophila melanogaster
Characteristics developmental stage: Blastoderm
strains: Oregon R
chip antibody: none
Growth protocol Embryos collected from large population cages for 1 hour, aged to appropriate stage, fixed in formaldehyde.
Extracted molecule genomic DNA
Extraction protocol Chromatin was isolated by CsCl gradient ultracentrifugation, fragmented and immunoprecipitated with affnity-purified antibody
Libraries were prepared according to Illumina's instructions accompanying the DNA Sample Kit (Part# 0801-0303). Briefly DNA fragments were converted to phosphorylated blunt ends using T4 DNA polymerase, Klenow DNA polymerase, and T4 polymerase kinase, a 3' A base overhang was added using Klenow DNA polymerase exo- (3' to 5' exo minus), and Illumina adapters were ligated to the fragments. We carried out the PCR step for enrichment of adapter-modified DNA prior to the library size selection, and limited the amplification to 15 cycles. After the amplification step, we size-selected DNA fragments of 150-250 bp.
 
Library strategy ChIP-Seq
Library source genomic
Library selection ChIP
Instrument model Illumina Genome Analyzer II
 
Data processing Basecalls performed using CASAVA version 1.4
We used the Apr. 2006 assembly (Flybase Release 5) of the D. melanogaster genome, the February 2006 assembly (Flybase release 2) of the D.pseudoobscura genome and the February 2006 assembly (Flybase release 1) of the D.virilis genome. We trimmed all sequenced tags so that their average quality was above 30 and mapped the tags to the genomes using Bowtie v0.12.7 {Langmead:2009fv} with command-line options ‘-v 1 -m 1’ for small reads (length below 35bp) and ‘-v 1 –m 3 for long reads (length above 70bp), thereby keeping only tags that mapped uniquely to the genome with at most one or three mismatch. ChIP data was parsed independently for each experiment using two separate peak callers. First, we used MACS (version 1.4) {Zhang:2008gm}, with the following parameter “-g dm --off-auto --nomodel --pvalue=1e-2” and “--shiftsize=110 --mfold=10,10000 --slocal=2000 --llocal=20000"" or ” --shiftsize=60 --mfold=4,10000"" depending on the length distribution for DNA fragment sizes prior to sequencing. We also called peaks using Grizzly Peak fitting program (Capaldi et al., 2008, Harrison et al., 2011) with estimated DNA fragment length of 150 or 250.
We then intersected the two sets of peaks, and filtered out all peaks not supported by both methods. To account for low complexity peaks and possible PCR artifacts, we further removed peaks with negative correlation (<-0.1) among the Forward and Reverse reads, peaks where 60% of the reads mapped to less than 1% of the positions, and peaks whose height was less than three times the height of Input reads in the same locus.
ChIP data was parsed independently for each experiment using two separate peak callers. First, we used MACS (version 1.4) {Zhang:2008gm}, with the following parameter “-g dm --off-auto --nomodel --pvalue=1e-2” and “--shiftsize=110 --mfold=10,10000 --slocal=2000 --llocal=20000" or ” --shiftsize=60 --mfold=4,10000" depending on the length distribution for DNA fragment sizes prior to sequencing. We also called peaks using Grizzly Peak fitting program {Capaldi:2008ff, Harrison:2011iy} with estimated DNA fragment length of 150 or 250.
We then intersected the two sets of peaks, and filtered out all peaks not supported by both methods. To account for low complexity peaks and possible PCR artifacts, we further removed peaks with negative correlation (<-0.1) among the Forward and Reverse reads, peaks where 60% of the reads mapped to less than 1% of the positions, and peaks whose height was less than three times the height of Input reads in the same locus.
We took as an initial dataset the union of all bound regions in the different replicates
Genome_build: Apr. 2006 assembly (Flybase Release 5) of the D. melanogaster genome, the February 2006 assembly (Flybase release 2) of the D.pseudoobscura genome and the February 2006 assembly (Flybase release 1) of the D.virilis genome
Supplementary_files_format_and_content: bwt files were generated using bowtie. They indicate the coordinates of mapped reads.
 
Submission date Sep 11, 2013
Last update date May 15, 2019
Contact name Mathilde PARIS
E-mail(s) thildeparis@gmail.com
Organization name CNRS
Department Institut de Genomique Fonctionnelle de Lyon
Street address 32-34 avenue Tony Garnier
City Lyon
ZIP/Postal code 69006
Country France
 
Platform ID GPL9061
Series (2)
GSE50771 Extensive divergence of transcription factor binding in Drosophila embryos with highly conserved gene expression (part 2)
GSE50773 Extensive divergence of transcription factor binding in Drosophila embryos with highly conserved gene expression
Relations
BioSample SAMN02353807
SRA SRX348458

Supplementary data files not provided
SRA Run SelectorHelp
Raw data are available in SRA
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap