|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Apr 21, 2008 |
Title |
Col-0_methylC-seq |
Sample type |
SRA |
|
|
Source name |
Immature floral tissue
|
Organism |
Arabidopsis thaliana |
Characteristics |
Columbia-0, unopened flower buds
|
Treatment protocol |
Immature (unopened) flower buds were collected and frozen in liquid nitrogen.
|
Growth protocol |
All plants were grown in potting soil (Metro Mix 250; Grace-Sierra, Boca Raton, FL) at 23?C under a 16-hour light/8-hour dark cycle.
|
Extracted molecule |
genomic DNA |
Extraction protocol |
MethylC-seq library construction protocol: Genomic DNA was extracted using the DNeasy Plant Mini Kit (Qiagen, Valencia, CA), and 5 µg of was fragmented by sonication to 50-500 bp with a Bioruptor (Diagenode Sparta, NJ), followed by end repair and ligation of methylated adapters provided by Illumina (Illumina, San Diego, CA) as per manufacturer?s instructions for gDNA library construction. 100-200 ng of adapter-ligated gDNA of 120-170 bp was isolated by agarose gel electrophoresis, and subjected to two successive treatments of sodium bisulfite conversion using the EpiTect Bisulfite kit (Qiagen, Valencia, CA), using the subsequent FFPE purification step, as outlined in the manufacturer?s instructions. The reaction was then purified once more using the PCR purification kit (Qiagen, Valencia, CA). Five ng of bisulfite-converted, adapter-ligated DNA molecules were enriched by 18 cycles of PCR with the following reaction composition: 2.5 U of uracil-insensitive PfuTurboCx Hotstart DNA polymerase (Stratagene), 5 µl 10X PfuTurbo reaction buffer, 25 µM dNTPs, 1 µl Primer 1.1, 1 µl Primer 2.1 (50 µl final). The thermocyling was as follows: 95?C 2 min, 98?C 30 sec, then 18 cycles of 98?C 10 sec, 65?C 30 sec and 72?C 30 sec, completed with one 72?C 5 min step. The enriched library was purified with the PCR purification kit (Qiagen, Valencia, CA)and quantity and quality examined by spectrophotometry, gel electrophoresis, and limited sequencing of cloned library molecules.
|
|
|
Library strategy |
OTHER |
Library source |
genomic |
Library selection |
other |
Instrument model |
Illumina Genome Analyzer |
|
|
Description |
Sequence information was extracted from the image files with the Illumina Firecrest and Bustard applications and mapped to the Arabidopsis (Col-0) reference genome sequence (TAIR 7) with the Illumina ELAND algorithm. ELAND aligns 32 bases or shorter reads, allowing up to two mismatches to the reference sequence. For reads longer than 32 bases, only the first 32 bases will be used for alignment, while the remaining sequence will be appended regardless of similarity to the reference sequence. A Perl script was used to truncate the appended sequence at the point where the next four bases contain two or more errors relative to the reference sequence.
|
Data processing |
When mapping reads generated from bisulfite converted genomic DNA, converted cytosines will score as a mismatch and will adversely affect the ELAND alignment ability. Therefore reads were mapped against computationally bisulfite converted and non-converted genome sequences. As bisulfite conversion of cytosine to thymidine results in non-complementarity of the two strands of a DNA duplex, reads were mapped against two converted genome sequences, one with cytosine changed to thymidine to represent a converted Watson strand, and a second with guanine changed to adenosine to represent the converted Crick strand. For reads that aligned to multiple positions in the reference genome at 32 bases we utilized version (1.080214) of the cross_match algorithm (P. Green personal communication) to map these non-unique reads to a reference sequence that was repeat-masked for 50 bp perfect repeat sequence. To reduce clonal bias, short read sequences that mapped to the same start position were collapsed into a single consensus read. Where a base call within the consensus was contentious, the base to be retained was randomly selected.
|
|
|
Submission date |
Mar 21, 2008 |
Last update date |
May 15, 2019 |
Contact name |
Joseph R Ecker |
E-mail(s) |
ecker@salk.edu
|
Phone |
8584534100
|
Organization name |
HHMI-Salk-Institute
|
Department |
Genomic Analysis Laboratory
|
Lab |
Ecker lab
|
Street address |
10010 North Torrey Pines Road
|
City |
La Jolla |
State/province |
CA |
ZIP/Postal code |
92037 |
Country |
USA |
|
|
Platform ID |
GPL9062 |
Series (2) |
GSE10877 |
Highly integrated single base resolution maps of the epigenome in Arabidopsis |
GSE10966 |
Highly integrated epigenome maps in Arabidopsis - whole genome shotgun bisulfite sequencing |
|
Relations |
SRA |
SRX002495 |
BioSample |
SAMN02195398 |
Supplementary file |
Size |
Download |
File type/resource |
GSM276809.txt.gz |
453.3 Mb |
(ftp)(http) |
TXT |
SRA Run Selector |
Processed data provided as supplementary file |
Raw data not provided for this record |
Raw data are available in SRA |
|
|
|
|
|