|
Status |
Public on Jun 29, 2011 |
Title |
Open chromatin defined by DNaseI and FAIRE identifies regulatory elements that shape cell-type identity [FAIRE_seq] |
Project |
ENCODE
|
Organism |
Homo sapiens |
Experiment type |
Genome binding/occupancy profiling by high throughput sequencing
|
Summary |
The human body contains thousands of unique cell types, each with specialized functions. Cell identity is governed in large part by gene transcription programs, which are determined by regulatory elements encoded in DNA. To identify regulatory elements active in seven cell lines representative of diverse human cell types, we used DNase-seq and FAIRE-seq to map âopen chromatinâ. Over 870,000 DNaseI or FAIRE sites, which correspond largely to nucleosome depleted regions (NDRs), were identified across the seven cell lines, covering nearly 9% of the genome. The combination of DNaseI and FAIRE is more effective than either assay alone in identifying likely regulatory elements, as judged by coincidence with transcription factor binding locations determined in the same cells. Open chromatin common to all seven cell types tended to be at or near transcription start sites and encompassed more CTCF binding sites, while open chromatin sites found in only one cell type were typically located away from transcription start sites, and contained DNA motifs recognized by regulators of cell-type identity. As one example of its ability to identify functional DNA, we show that open chromatin regions bound by CTCF are potent insulators. We identified clusters of open regulatory elements (COREs) that were physically near each other and whose appearance was coordinated among one or more cell types. Gene expression and RNA Pol II binding data support the hypothesis that COREs control gene activity required for the maintenance of cell-type identity. This publicly available atlas of regulatory elements may prove valuable in identifying non-coding DNA sequence variants that are causally linked to human disease.
For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf
|
|
|
Overall design |
DNase-seq, FAIRE-seq, and ChIP-seq were performed on seven human cell lines: GM12878 (lymphoblastoid), K562 (leukemia), HepG2 (hepatocellular carcinoma), HelaS3 (cervical carcinoma), HUVEC (human umbilical vein endothelial cells), NHEK (keratinocytes), and H1-ES (embryonic stem cells). For each cell line, two or three replicates were independently grown and split into three, one for each of the three experimental methods. Control ChIP experiments were performed on five of the cell lines with NHEK and H1-ES being excluded due to lack of material.
|
Web link |
http://www.ncbi.nlm.nih.gov/geo/info/ENCODE.html
|
|
|
Contributor(s) |
Furey TS, Lieb JD, Crawford GE |
Citation(s) |
21750106 |
BioProject |
PRJNA63443 |
|
Submission date |
Jun 27, 2011 |
Last update date |
May 15, 2019 |
Contact name |
Terry Furey |
E-mail(s) |
terry.furey@unc.edu
|
Organization name |
UNC-Chapel Hill
|
Department |
Genetics
|
Lab |
Furey
|
Street address |
5022 Genetic Medicine Building, CB#7264
|
City |
Chapel Hill |
State/province |
NC |
ZIP/Postal code |
27599 |
Country |
USA |
|
|
Platforms (1) |
GPL9052 |
Illumina Genome Analyzer (Homo sapiens) |
|
Samples (16)
|
|
This SubSeries is part of SuperSeries: |
GSE30227 |
Open chromatin defined by DNaseI and FAIRE identifies regulatory elements that shape cell-type identity |
|
Relations |
SRA |
SRP007348 |
Supplementary file |
Size |
Download |
File type/resource |
GSE30225_GM12878_UNCFAIRESeq.narrowPeak.gz |
2.0 Mb |
(ftp)(http) |
NARROWPEAK |
GSE30225_H1_ES_UNCFAIRESeq.narrowPeak.gz |
1.7 Mb |
(ftp)(http) |
NARROWPEAK |
GSE30225_HUVEC_UNCFAIRESeq.narrowPeak.gz |
3.2 Mb |
(ftp)(http) |
NARROWPEAK |
GSE30225_HelaS3_UNCFAIRESeq.narrowPeak.gz |
1.8 Mb |
(ftp)(http) |
NARROWPEAK |
GSE30225_HepG2_UNCFAIRESeq.narrowPeak.gz |
1.7 Mb |
(ftp)(http) |
NARROWPEAK |
GSE30225_K562_UNCFAIRESeq.narrowPeak.gz |
2.6 Mb |
(ftp)(http) |
NARROWPEAK |
GSE30225_NHEK_UNCFAIRESeq.narrowPeak.gz |
2.9 Mb |
(ftp)(http) |
NARROWPEAK |
GSE30225_RAW.tar |
10.3 Gb |
(http)(custom) |
TAR (of BED) |
SRA Run Selector |
Raw data are available in SRA |
Processed data provided as supplementary file |
Processed data are available on Series record |