GEO Accession viewer

NCBI > GEO > Accession Display

Not logged in | Login

GEO help: Mouse over screen elements for information.

Sample GSM4294714

Query DataSets for GSM4294714

Status

Public on Feb 24, 2021

Title

228_Wang_HiC_BWX3908

Sample type

SRA

Source name

Bacillus subtilis PY79

Organism

Bacillus subtilis PY79

Characteristics

protocol: CH, 37degree C
genotype: parSdelta9 (loxP-spec-loxP), -94degree parS (loxP no a.b.), -109degree 12x XDS (loxP-kan-loxP), deltaxerD (loxP-erm-loxP)
restriction enzyme: HindIII
strain: PY79

Treatment protocol

Cells were growing in CH medium at 37˚C, fixed by 3% formaldehyde for 30min at room temperature

Growth protocol

Bacillus subtilis strains were derived from the prototrophic strain PY79 (Youngman et al 1983, PMID: 6300908). Cells were grown in defined rich medium (CH) (Harwood and Cutting 1990, Molecular Biological Methods for Bacillus) at specified temperature. Staphylococcus aureus strains were derived from HG003 (Herbert et al 2010, PMID: 20212089). Cells were grown in tryptic soy broth (TSB) at specified temperature.

Extracted molecule

genomic DNA

Extraction protocol

Hi-C experiments were performed using HindIII according to previous publications (Le et al, 2013, PMID: 24158908; Wang et al 2015, PMID: 26253537). ChIP-seq experiments were performed using anti-SMC, or anti-His antibodies according to previous publication (Graham et al 2014, PMID: 24829297).
Standard library construction protocol was used for Illumina HiSeq2500, Nextseq550, or MiSeq sequencing platforms. Illumina Truseq indexes were used.

Library strategy

OTHER

Library source

genomic

Library selection

other

Instrument model

NextSeq 550

Description

parS∆9 (loxP-spec-loxP), -94˚ parS (loxP no a.b.), -109˚ 12x XDS (loxP-kan-loxP), ∆xerD (loxP-erm-loxP)

Data processing

Library strategy: HiC-Seq
Reads from each end of a DNA fragment were represented in each of the two FASTQ files generated by Illumina paired-end sequencing.
For Bacillus subtilis Hi-C, we mapped reads in each FASTQ file back to the genome of Bacillus subtilis PY79 (NCBI Reference Sequence: NC_022898.1) independently by Bowtie version 2.1.0 to preserve the order of reads. All the processing steps afterwards were performed using an in-house Perl scripts. For Staphylococcus aureus Hi-C, we mapped reads in each FASTQ file back to the genome of Staphylococcus aureus NCTC8325 genome (NCBI Reference Sequence: NC_007795.1) independently by Bowtie version 2.1.0 to preserve the order of reads. All the processing steps afterwards were performed using an in-house Perl scripts. For ChIP-seq, we mapped reads in each FASTQ file back to the genome of Bacillus subtilis PY79 (NCBI Reference Sequence: NC_022898.1) or Staphylococcus aureus NCTC8325 genome (NCBI Reference Sequence: NC_007795.1) independently using CLC genomics workbench 8.0, and exported .csv file.
For Hi-C, the two resulting SAM files were then merged together. Unaligned reads were discarded.
For Hi-C, the Bacillus subtilis (PY79) and Staphylococcus aureus (NCTC8325) genomes was divided into HindIII restriction fragments. Each aligned read was sorted into its corresponding restriction fragment. We inferred that a DNA fragment resulted from non-ligation or self-ligation if reads from both ends were from the same restriction fragment; these reads were discarded. Only DNA fragments for which the reads came from different restriction fragments were retained and used for construction of a Hi-C contact map.
For Hi-C, the genomes of Bacillus subtilis (PY79) and Staphylococcus aureus (NCTC8325) was then divided into bins 10 kb and the remaining reads were allocated to their corresponding bin. We then counted the number of fragments having reads with in different bins. A raw Hi-C contact map is the matrix of read counts in which each entry, mij, indicates the number of fragments with ends mapping to bins i and j. The raw Hi-C contact map/matrix is biased due to the uneven distribution of restriction enzyme sites and, to a lesser extent, differences in GC content and the mappability of individual reads. We normalized raw contact maps using an iterative normalization procedure (Imakaev et al., 2012 PMID: 22941365). Essentially, we converted the number of interactions, or read counts, into Hi-C scores by applying the following equation and iteratively repeating it for the resulting contact map after each cycle: mij =mij * (total reads) / (total reads in bin i * total reads in bin j). The iterative procedure was repeated until the maximum relative error of the total number of Hi-C scores in a bin was less than 10-5. Resulting matrices were normalized so that Hi-C scores for each row and column sum to 1. Subsequent analysis and visualization was done using R scripts.
Genome_build: NC_022898.1; NC_007795.1
Supplementary_files_format_and_content: Files ending with .matrix.txt: tab-delimited text files of the iteratively-corrected Hi-C contact maps/matrices. Files ending with .csv: comma-separated values files of ChIP-seq results.
Supplementary_files_format_and_content: matrix.txt files: 10kb bins of B. subtilis genome; normalized Hi-C interaction scores
Supplementary_files_format_and_content: csv files: genome position; reads; number of reads at each genome position

Submission date

Feb 04, 2020

Last update date

Feb 25, 2021

Contact name

Xindan Wang

E-mail(s)

xindan@iu.edu

Organization name

Indiana University at Bloomington

Department

Biology

Lab

Biology Building 225

Street address

1001 E 3rd St

City

Bloomington

State/province

ZIP/Postal code

47405

Country

USA

Platform ID

GPL28115

Series (1)

GSE144742

XerD is required to unload bacterial SMC complexes at the replication terminus

Relations

BioSample

SAMN13981336

SRA

SRX7671096

Supplementary file	Size	Download	File type/resource
GSM4294714_228_Wang_HiC_BWX3908.matrix.txt.gz	145.0 Kb	(ftp)(http)	TXT
SRA Run Selector
Raw data are available in SRA
Processed data provided as supplementary file