|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Oct 11, 2019 |
Title |
Orangutan replicate 2 |
Sample type |
SRA |
|
|
Source name |
Skin fibroblast cells
|
Organism |
Pongo pygmaeus |
Characteristics |
coriell id: PR01107
|
Growth protocol |
Growth media consisted of Gibco's MEM (10370-021), L-Glutamine (25030-081), Pen/Strep (15140-122), and 10% FBS (Hyclone SH30070).
|
Extracted molecule |
genomic DNA |
Extraction protocol |
Cells were grown in culture then pelleted before immediately proceeding to DNase-seq library preparations. All DNase-Seq libraries were prepared according to (Song L, et al. Genome Res. 2011; PMID 21750106). Briefly, cells were lysed and incubated at 37¡C for various times (5 min to 1 hour) and optimal DNase digestion was confirmed by pulsed field gel electrophoresis (Song L and Crawford GE. Cold Spring Harbor protocols 2010(2):pdb prot5384). DNase-seq libraries were prepared according to (Song L, et al. Genome Res. 2011; PMID 21750106).
|
|
|
Library strategy |
DNase-Hypersensitivity |
Library source |
genomic |
Library selection |
DNAse |
Instrument model |
Illumina HiSeq 2000 |
|
|
Data processing |
Due to the use of MmeI to generate DNase-seq libraries, sequencing reads were trimmed to 20 bases using a custom perl script. Reads for each species were mapped to its native genome using Bowtie version 0.12.9 (parameters: -trim5 0 --trim3 0 -m 1 -l 20) as part of a custom two-step pipeline. In the first step (“tier 1”), reads were required to match to a unique location with no mismatches (parameter: -n 0). In the second step (“tier 2”), unmapped reads from step one were re-mapped with a relaxed mismatch parameter of one mismatch (parameter: -n 1). Reads that mapped to multiple locations or had more than one mismatch were discarded. Samtools version 0.1.19-44428cd was used to convert the sam files from each step to bam files, merge them into one file, and remove duplicate reads (defined as having the same chromosomal position). Bedtools v2.17.0 was used to convert the bam files to bed files. Reads from the non-human samples were converted from their native genome coordinates to the human hg19 genome coordinates using a three-step process that removed reads that did not have a one-to-one relationship between the genomes. In each step, read coordinates were converted from one genome to the other using the UCSC liftOver software with a minMatch parameter of 0.8, which requires that 80% of the read maps to the new genome. Note that this parameter filters only on genomic coverage, not sequence identity. In the first step, read coordinates were converted from their native genome to hg19. Read coordinates that successfully lifted to hg19 were then lifted back to the native genome. Read coordinates that did not lift back to the same coordinates on the native genome were removed. Reads that did lift back to the same coordinates were lifted back to hg19. An additional filtering step was added to ensure the reads did not map to a duplicated region. In that step, overlapping reads were merged into a region, and that region was lifted from the native genome to hg19. Regions that failed to lift uniquely to hg19 were flagged and reads that overlapped them were removed. Because some of the samples were from males and some were from females, we removed reads that mapped or lifted over to the human X or Y chromosomes to eliminate any sex-specific bias. Genome_build: hg19 for human, panTro4 for chimpanzee, gorGor3 for gorilla, ponAbe2 for orangutan, rheMac3 for macaque Supplementary_files_format_and_content: bed files contain hg19 coordinates for the reads
|
|
|
Submission date |
Mar 29, 2019 |
Last update date |
Oct 11, 2019 |
Contact name |
Gregory Crawford |
E-mail(s) |
greg.crawford@duke.edu
|
Organization name |
Duke University
|
Street address |
101 Science Drive
|
City |
Durham |
State/province |
NC |
ZIP/Postal code |
27708 |
Country |
USA |
|
|
Platform ID |
GPL26367 |
Series (1) |
GSE129034 |
Comparison of chromatin accessibility between human and non-human primates |
|
Relations |
BioSample |
SAMN11285968 |
SRA |
SRX5600568 |
Supplementary file |
Size |
Download |
File type/resource |
GSM3692003_orangutan2.20_million_read_set.bed.gz |
357.5 Mb |
(ftp)(http) |
BED |
GSM3692003_orangutan2.full_read_set.bed.gz |
357.5 Mb |
(ftp)(http) |
BED |
SRA Run Selector |
Raw data are available in SRA |
Processed data provided as supplementary file |
|
|
|
|
|