NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM3444473 Query DataSets for GSM3444473
Status Public on Nov 05, 2018
Title AK124_H3K9me3_ChIP-seq
Sample type SRA
 
Source name adult glioblastoma tumour
Organism Homo sapiens
Characteristics subtype: IDH
gender: female
age: 32
survival_status: alive
os (overall survival, month): 64
progression: 0
pfs (progression-free survival, month): 64
chr7_gain: 0
chr10_loss: 0
chr19_gain: 0
chr20_gain: 0
idh1 mutation status: R132H
idh2 mutation status: wt
egfr_amplification: 0
pten_deletion: 0
mdm2_amplification: 0
mdm4_amplification: 0
pdgfra_amplification: 0
cdkn2a_b_deletion: 0
cdk4_amplification: 0
met_amplification: 0
molecule subtype: genomic DNA and associated chromatin
chip antibody: H3K9me3
chip antibody vendor: Active Motif
chip antibody cat.#: AM#39161
chip antibody lot #: 3
Extracted molecule genomic DNA
Extraction protocol H3K27Ac (AM#39133, Active Motif), H3K4me1 (AM#38297, Active Motif), H3K4me3 (AM#39159, Active Motif), H3K9me3 (AM#39161, Active Motif), H3K27me3 (#07-449, Millipore) and H3K36me3 (AM#61101, Active Motif) ChIP library preparation of GB samples was performed at Active Motif according to proprietary methods.
Libraries were multiplexed so that all libraries for each individual IP were sequenced on 1-4 lanes using the Illumina HiSeq 2000 platform.
 
Library strategy ChIP-Seq
Library source genomic
Library selection ChIP
Instrument model Illumina HiSeq 2000
 
Description processed data file:
AK124_ChromHMM_18states.bed
AK124_ChromHMM_simplified_7states.bed
MES_ChromHMM_18states.bed
MES_ChromHMM_simplified_7states.bed
Data processing ChIP-seq datasets were processed using a custom pipeline implemented in Snakemake. Briefly, reads were trimmed using the Trimgalore tool (https://github.com/FelixKrueger/TrimGalore) and aligned using Bowtie with standard parameters. Duplicates and multi-mapping reads were removed using the samtools package and the XS flag in the bam files. Input control (WGS) and corresponding IP datasets were scaled using the SES method and converted into a bigwig track using the bamCompare tool of the deepTools2 suite. Peaks were called using the callpeak mode in MACS2 (https://github.com/taoliu/MACS) for broad and narrow peaks. In addition, SICER was used to call peaks using the gap 600 and window 200 parameters. Various QC parameters (FRiP, PCR bottleneck coefficient, cross-strand correlation) were determined according to the ENCODE guidelines. In addition, visual QC was performed using the signal profile at TSS of annotated genes and the fingerprint method from the DeepTools2 suite.
Chromatin segmentation was defined using the ChromHMM tool. ChIP-seq data (H3K27ac, H3K27me3, H3K36me3, H3K4me1, H3K4me3, H3K9me3) and WGS data (input control) were binarized using ChromHMM’s “BinarizeBam” command. The Epigenome Roadmap 18-state model was used to segment the genome of each sample. The consensus state for a subtype is defined as the state with the highest frequency in a given segment, and a minimum frequency of 50%.
The simplified 7-state model was defined by mapping the 18 states as follows: 1-4 to TSS (Tss), 5-6 to Transcription (Tx), 7-11 and 15 to Enhancer (Enh), 12-13 to Heterochromatin/Repeats (Het/Rpts), 14 to Bivalent TSS (TSSBiv), 16-17 to Polycomb Repressed (ReprPC), 18 to Quiescent (Qui).
Subtype superenhancer analysis: the union of H3K27ac peaks for each subtype’s samples were used as input regions for the ROSE2 superenhancer analysis pipeline, stitching together regions within 12.5kbp. Sample H3K27ac signal was calculated using ‘bigWigAverageOverBed’ (v2), and enhancers were ranked by the subtype average enrichment. SEs were defined using the default parameters for ROSE2.
GBM superenhancer landscape: the pan-GBM SE landscape was defined as the reduced union of the 4 subtype SE lists. Enrichment of H3K27ac for each sample was calculated with ‘bigWigAverageOverBed’. z-scores of the log10-transformed H3K27ac signal was used to hierarchically cluster SEs using 1-Pearson correlation as the distance and the ‘Ward.D2’ method. Target genes were assigned as follows: for each SE, we computed the Spearman correlation of the H3K27ac signal in the SE regions across all samples with the expression of all genes within a region of +/- 500kb, and selected the gene showing the highest correlation as being the most likely target gene. We additionally filtered using TADs by requiring the SE and the predicted target gene to be within a common TAD.
Subtype enhancer analysis: subtype enhancers were defined as regions where a H3K4me1 peak was called in at least 2 samples in that subtype falling more than 2kbp from a H3K4me3 peak, or a promoter otherwise. Regions closer than 10bp were merged, and those longer than 500bp were retained. Active enhancers were defined as regions where a H3K27ac peak was also called, otherwise the region was defined as a poised enhancer. Intensities of H3K27ac ChIPseq signal was calculated using ‘bigWigAverageOverBed’.
Identification of the GBM active enhancer landscape: The GBM active enhancer landscape was defined by taking the union of the 4 subtype active enhancer sets. The activity of enhancers was visualised in a heatmap using ‘ComplexHeatmap’ of the z-scores of the log2-transformed H3K27ac signal. Active enhancers were clustered on the average log2 H3K27ac signal in each subtype, using 1-Pearson correlation as the distance and the ‘Ward.D2’ hierarchical clustering method.
Subtype-activated enhancer analysis: subtype-activated active enhancers were identified using an ANOVA test on the average subtype log2-transformed H3K27ac signal. Enhancers with a BH-adjusted ANOVA p-value < 0.1 and a log2FC of H3K27ac signal > 1 were defined as activated in a subtype.
Assignment of target genes to enhancers: The ‘InTAD’ R package (unpub., v0.1.1) was used to identify target genes of active enhancers. Briefly, the correlation of log2-transformed H3K27ac signal and protein-coding gene (Gencode v19 annotation) expression (log2 RPKM+1) was calculated within ESC-defined topologically associated domains for all possible enhancer-gene interactions. Genes were allowed to overlap multiple TADs (option “selMaxTadOvlp=TRUE” in the combineInTAD function). The target for each enhancer was defined as follows: the closest gene (distance to TSS) with correlation > 0.4; otherwise, the gene with the highest correlation.
genome build: hg19
processed data files format and content: ChromHMM results for each sample are contained in two bedfiles: the full 18-state and the reduced 7-state model. For each IPed factor, 3 files are included: a bigWig coverage file and two peak call bedfiles. Additionally, subtype consensus files are included for the ChromHMM models (both 18 and 7 states). Superenhancer analysis: the individual subtype SE lists (SUBTYPE_subtype_H3K27ac_superenhancers.bed) as well as the full landscape (GB_H3K27ac_superenhancers.bed), with subtype-specific SEs. Active enhancer analysis: individual subtype active enhancer lists (SUBTYPE_enhancers_annotated_with_histone_signal) are indluced, as well as the full landscape (GB_active_enhancers.bed) as well as subtype-specific lists. The full enhancer matrix, consisting of this full landscape with annotated targeted genes, is also included (GB_active_enhancers_matrix_with_target_gene_predictions.csv).
 
Submission date Oct 24, 2018
Last update date Nov 15, 2018
Contact name Bernhard Radlwimmer
E-mail(s) b.radlwimmer@dkfz-heidelberg.de
Organization name Deutsches Krebsforschungszentrum / German National Cancer Research Centre
Department Department of Molecular Genetics
Street address Im Neuenheimer Feld 280
City Heidelberg
ZIP/Postal code 69120
Country Germany
 
Platform ID GPL11154
Series (2)
GSE121719 Glioblastoma epigenome profiling identifies SOX10 as a master regulator of molecular tumour subtype - tumour ChIPseq data
GSE121723 Glioblastoma epigenome profiling identifies SOX10 as a master regulator of molecular tumour subtype
Relations
BioSample SAMN10285453

Supplementary file Size Download File type/resource
GSM3444473_AK124_H3K9me3_W200-G600-FDR0.01-island.bed.gz 199.7 Kb (ftp)(http) BED
GSM3444473_AK124_H3K9me3_coverage_SES_subtract.bigWig 115.8 Mb (ftp)(http) BIGWIG
GSM3444473_AK124_H3K9me3_peaks.narrowPeak.bed.gz 105.5 Kb (ftp)(http) BED
Raw data not provided for this record
Processed data provided as supplementary file
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap