|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Jan 20, 2016 |
Title |
Escherichia_coli_K12_MG1655_Tet |
Sample type |
SRA |
|
|
Source name |
Bacterial gDNA
|
Organism |
Escherichia coli str. K-12 substr. MG1655 |
Characteristics |
strain: K-12, MG1655 dna treatment: DNA fragments were treated with Tet-1 according to guidelines for using the WiseGene™ 5-mC Tet1 oxidation kit for SMRT sequencing on the Pacific Biosciences RS (WiseGene LLC., Chicago, IL USA; Item #K004; Pacific Biosciences Inc. Menlo Park, CA USA). pacbio sequencing chemistry: P4 / C2 genome build (img genome id): 646311926
|
Extracted molecule |
genomic DNA |
Extraction protocol |
gDNA was randomly sheared, end repaired, and ligated to blunt end hairpin adapters to form standard SMRTbell templates SMRT-sequencing
|
|
|
Library strategy |
OTHER |
Library source |
genomic |
Library selection |
other |
Instrument model |
PacBio RS II |
|
|
Data processing |
Sequencing reads were processed and mapped to the respective reference sequences using the BLASR mapper (http://www.pacbiodevnet.com/SMRT-Analysis/Algorithms/BLASR) and the Pacific Biosciences' SMRT Analysis pipeline (http://www.pacbiodevnet.com/SMRT-Analysis/Software/SMRT-Pipe) using the standard mapping protocol. To identify modified positions, we used Pacific Biosciences' SMRTPortal analysis platform, v. 2.0.1, which uses an in silico kinetic reference, and a t-test based kinetic score detection of modified base positions (details are available at http://www.pacb.com/pdf/TN_Detecting_DNA_Base_Modifications.pdf) Modified sites were grouped into motifs using MotifFinder (v1)2. Genome_build: As indicated by IMG genome ID in the individual samples section (http://img.jgi.doe.gov/home.html) Supplementary_files_format_and_content: modifications.gff files are comma separated text files that contain data for genome positions with Qmod scores greater than 20; Data includes Qmod scores, IPD ratios, coverage, and sequence contexts for a +/- 20 nt window surrounding the modified position (see https://github.com/PacificBiosciences/Bioinformatics-Training/wiki/Methylome-Analysis-Technical-Note for more details) Supplementary_files_format_and_content: modifications.csv files are comma separated text files that contain data for all positions (both strands) in the reference sequence; data includes Qmod score, IPD ratio, and coverage. (see https://github.com/PacificBiosciences/Bioinformatics-Training/wiki/Methylome-Analysis-Technical-Note for more details) Supplementary_files_format_and_content: motif_summary.csv files contain the genome-wide summary of the methyltransferase recognition motifs discovered in this sample. (see https://github.com/PacificBiosciences/Bioinformatics-Training/wiki/Methylome-Analysis-Technical-Note for more details). Supplementary_files_format_and_content: motifs.gff files are similar to the modifications.gff files. These files contain information about all sites detected as modified, all locations of a discovered motif (including those which are apparently unmodified), and also the overlap between the modifications and motifs. (see https://github.com/PacificBiosciences/Bioinformatics-Training/wiki/Methylome-Analysis-Technical-Note for more details).
|
|
|
Submission date |
Jun 15, 2015 |
Last update date |
May 15, 2019 |
Contact name |
Matthew James Blow |
E-mail(s) |
mjblow@lbl.gov
|
Phone |
510-486-6590
|
Fax |
510-486-7004
|
Organization name |
Lawrence Berkeley National Laboratory
|
Department |
Genomics Division
|
Lab |
Rubin / Pennacchio
|
Street address |
1 Cyclotron Road
|
City |
Berkeley |
State/province |
CA |
ZIP/Postal code |
94720 |
Country |
USA |
|
|
Platform ID |
GPL20400 |
Series (1) |
GSE69872 |
The Epigenomic Landscape of Prokaryotes |
|
Relations |
BioSample |
SAMN03775432 |
SRA |
SRX1058891 |
Supplementary file |
Size |
Download |
File type/resource |
GSM1711631_Escherichia_coli_K12_MG1655_Tet.modifications.csv.gz |
128.0 Mb |
(ftp)(http) |
CSV |
GSM1711631_Escherichia_coli_K12_MG1655_Tet.modifications.gff.gz |
4.0 Mb |
(ftp)(http) |
GFF |
GSM1711631_Escherichia_coli_K12_MG1655_Tet.motif_summary.csv.gz |
463 b |
(ftp)(http) |
CSV |
GSM1711631_Escherichia_coli_K12_MG1655_Tet.motifs.gff.gz |
4.4 Mb |
(ftp)(http) |
GFF |
SRA Run Selector |
Raw data are available in SRA |
Processed data provided as supplementary file |
|
|
|
|
|