NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE137435 Query DataSets for GSE137435
Status Public on Jul 08, 2020
Title DeepC: Predicting chromatin interactions using megabase scaled deep neural networks and transfer learning (NG Capture-C)
Organism Homo sapiens
Experiment type Other
Summary Understanding 3D genome structure requires high throughput, genome-wide approaches. However, assays for all vs. all chromatin interaction mapping are expensive and time consuming, which severely restricts their usage for large-scale mutagenesis screens or for mapping the impact of sequence variants. Computational models sophisticated enough to grasp the determinants of chromatin folding provide a unique window into the functional determinants of 3D genome structure as well as the effects of genome variation. A chromatin interaction predictor should work at the base pair level but also incorporate large-scale genomic context to simultaneously capture the large scale and intricate structures of chromatin architecture. Similarly, to be a flexible and generalisable approach it should also be applicable to data it has not been explicitly trained on. To develop a model with these properties, we designed a deep neuronal network (deepC) that utilizes transfer learning to accurately predict chromatin interactions from DNA sequence at megabase scale. The model generalizes well to unseen chromosomes and works across cell types, Hi-C data resolutions and a range of sequencing depths. DeepC integrates DNA sequence context on an unprecedented scale, bridging the different levels of resolution from base pairs to TADs. We demonstrate how this model allows us to investigate sequence determinants of chromatin folding at genome-wide scale and to predict the importance of regulatory elements and the impact of sequence variations.
 
Overall design To validate in silico predictions of chromatin interactions at high resolution and scale, we performed NG Capture-C (Davies 2016) from 220 viewpoints in two cell lines (K562 (WIMM transgenics facility) and GM12878 - LCLs (Coriell)), from which predicted chromatin interactions have been generated. These viewpoints comprise 81 CTCF sites and 139 intra domain viewpoints designed to avoid active element overlap. Library preparation and NG Capture-C was performed in biological triplicates with four unique adapters being used for each replicate to increase sequencing depth and minimize PCR duplicates. These were pooled for maximum resolution. Capture was performed with biotinylated oligonucleotides targeting sequences adjacent to DpnII sites at the viewpoints of interest.
 
Contributor(s) Schwessinger R, Downes DJ, Gosden M, Hughes JR
Citation(s) 33046896
Submission date Sep 13, 2019
Last update date Oct 13, 2020
Contact name Ron Schwessinger
E-mail(s) ron.schwessinger@msdtc.ox.ac.uk
Phone 00441865222153
Organization name University of Oxford
Department MRC Weatherall Institute if Molecular Medicine
Lab Hughes Lab
Street address MRC WIMM, John Radcliffe Hospital, Headington
City Oxford
ZIP/Postal code OX3 9DS
Country United Kingdom
 
Platforms (1)
GPL18573 Illumina NextSeq 500 (Homo sapiens)
Samples (4)
GSM4079317 K562_CTCF
GSM4079318 K562_intra_domain
GSM4079319 LCL_CTCF
This SubSeries is part of SuperSeries:
GSE137437 DeepC: predicting 3D genome folding using megabase-scale transfer learning
Relations
BioProject PRJNA565432
SRA SRP221613

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE137435_RAW.tar 341.8 Mb (http)(custom) TAR (of TXT)
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap