GEO Accession viewer

NCBI > GEO > Accession Display

Not logged in | Login

GEO help: Mouse over screen elements for information.

Sample GSM5182724

Query DataSets for GSM5182724

Status

Public on Apr 23, 2021

Title

Purple sea urchin muscle (Sample2581)

Sample type

SRA

Source name

muscle

Organism

Strongylocentrotus purpuratus

Characteristics

tissue: muscle

Extracted molecule

genomic DNA

Extraction protocol

Extraction protocol is described as part of the library construction protocol.
The Hi-C libraries were prepared following the in situ Hi-C strategy (Rao, Huntley et al., 2014). Briefly, cells or tissue were crosslinked and then lysed with nuclei permeabilized but still intact. DNA was then restricted and the overhangs filled in incorporating a biotinylated base. Free ends were then ligated together in situ. Crosslinks were reversed, the DNA was sheared to 300-500bp and then biotinylated ligation junctions were recovered with streptavidin beads. Afterward DNA was handled according to standard Illumina library construction protocol. DNA was end-repaired using a combination of T4 DNA polymerase, E. coli DNA Pol I large fragment (Klenow polymerase) and T4 polynucleotide kinase. The blunt, phosphorylated ends were treated with Klenow fragment (3' to 5' exo-) and dATP to yield a protruding 3- 'A' base for ligation of Illumina's adapters which have a single 'T' base overhang at the 3 end. After adapter ligation DNA was PCR amplified with Illumina primers and library fragments of 400-600 bp (insert plus adaptor and PCR primer sequences) were purified using SPRI beads. The purified DNA was captured on an Illumina flow cell for cluster generation. Libraries were sequenced on various Illumina instruments following the manufacturer's protocols. Fpr the tammar wallaby, a short insert-size PCR-free DNA-Seq library was prepared following the ClaSeek protocol.

Library strategy

Hi-C

Library source

genomic

Library selection

other

Instrument model

Illumina NovaSeq 6000

Description

Sample2581

Data processing

Hi-C-guided genome assembly (3D-DNA, JBAT)
Construction of the chromosome-length Hi-C contact map (3D-DNA, Juicer)
Centromere position identification (repeat sequence mapping or structural analysis, 3D-DNA/JBAT)
Aggregate chromosome analysis (3D-DNA)
Variant calling (DRAGEN pipeline)
Chromosome-length variant phasing (3D-DNA)
Genome_build: In addition to those generate as part of the study, the following existing genome builds were used: GRCg6a (domestic chicken); Xla.v91 (African clawed frog); AaegL5.0 (Yellow fever mosquito), CpipJ3 (Southern domestic mosquito); Release_6_plus_ISO1_MT (fruit fly, with chromosome arms merged for ACA analysis resulting in Release_6_plus_ISO1_MT_merged); WBcel235 (C. elegans); sacCer3 (baker's yeast); arahy.Tifrunner.gnm2.J5K5 (ground nut); 161010_Chinese_Spring_v1.0_pseudomolecules (bread wheat).
Supplementary_files_format_and_content: fasta.gz: chromosome-length nucleotide sequences generated usign Hi-C-guided genome assembly
Supplementary_files_format_and_content: hic: contact maps generated by aligning Hi-C data to corresponding chromosome-length reference sequences. In addition to standard contact maps generated for all genomic positions across chromosomes the uploaded files include ACA (aggregate chromosome analysis) maps and phasing contact maps representing interactions between positions on the genome that contain heterogyzous SNPs. For more details see Hoencamp et al., Supplement.
Supplementary_files_format_and_content: wig: tracks showing the alignment frequency of Hi-C reads from read pairs that contain centromere repeat-specifc sequences.
Supplementary_files_format_and_content: bedpe: Files describing the tentative position of centromeres in chromosome-length genome assemblies, identified either by mapping repeat sequences associated with the centromere or via analysis of the contact map for evidence of centromere-to-telomere folding axis. For details see Hoencamp et al., Supplement.
Supplementary_files_format_and_content: vcf.gz: Variants (specifically SNPs) identivied by the DRAGEN variant calling pipeline from aligning Hi-C data to chromosome-length references generated via Hi-C-guided assembly and phased using the 3D-DNA Hi-C-guided phasing module.

Submission date

Mar 18, 2021

Last update date

Apr 23, 2021

Contact name

Olga Dudchenko

E-mail(s)

Olga.Dudchenko@bcm.edu

Organization name

Baylor College of Medicine

Street address

1 Baylor Plaza

City

Houston

State/province

ZIP/Postal code

77030

Country

USA

Platform ID

GPL29871

Series (1)

GSE169088

3D genomics across the tree of life reveals condensin II as a determinant of architecture type

Relations

BioSample

SAMN17221987

SRA

SRX9788348

Supplementary file	Size	Download	File type/resource
GSM5182724_Spur_5.0.contigs.rawchrom_purged_HiC.aca.hic	580.5 Kb	(ftp)(http)	HIC
GSM5182724_Spur_5.0.contigs_purged_HiC.fasta.gz	225.4 Mb	(ftp)(http)	FASTA
GSM5182724_Spur_5.0.contigs_purged_HiC.hic	218.3 Mb	(ftp)(http)	HIC
GSM5182724_Strongylocentrotus_purpuratus.hard-filtered.snp.phased.vcf.gz	911.1 Mb	(ftp)(http)	VCF
SRA Run Selector
Raw data are available in SRA
Processed data provided as supplementary file