NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM5182724 Query DataSets for GSM5182724
Status Public on Apr 23, 2021
Title Purple sea urchin muscle (Sample2581)
Sample type SRA
 
Source name muscle
Organism Strongylocentrotus purpuratus
Characteristics tissue: muscle
Extracted molecule genomic DNA
Extraction protocol Extraction protocol is described as part of the library construction protocol.
The Hi-C libraries were prepared following the in situ Hi-C strategy (Rao, Huntley et al., 2014). Briefly, cells or tissue were crosslinked and then lysed with nuclei permeabilized but still intact. DNA was then restricted and the overhangs filled in incorporating a biotinylated base. Free ends were then ligated together in situ. Crosslinks were reversed, the DNA was sheared to 300-500bp and then biotinylated ligation junctions were recovered with streptavidin beads. Afterward DNA was handled according to standard Illumina library construction protocol. DNA was end-repaired using a combination of T4 DNA polymerase, E. coli DNA Pol I large fragment (Klenow polymerase) and T4 polynucleotide kinase. The blunt, phosphorylated ends were treated with Klenow fragment (3' to 5' exo-) and dATP to yield a protruding 3- 'A' base for ligation of Illumina's adapters which have a single 'T' base overhang at the 3 end. After adapter ligation DNA was PCR amplified with Illumina primers and library fragments of 400-600 bp (insert plus adaptor and PCR primer sequences) were purified using SPRI beads. The purified DNA was captured on an Illumina flow cell for cluster generation. Libraries were sequenced on various Illumina instruments following the manufacturer's protocols. Fpr the tammar wallaby, a short insert-size PCR-free DNA-Seq library was prepared following the ClaSeek protocol.
 
Library strategy Hi-C
Library source genomic
Library selection other
Instrument model Illumina NovaSeq 6000
 
Description Sample2581
Data processing Hi-C-guided genome assembly (3D-DNA, JBAT)
Construction of the chromosome-length Hi-C contact map (3D-DNA, Juicer)
Centromere position identification (repeat sequence mapping or structural analysis, 3D-DNA/JBAT)
Aggregate chromosome analysis (3D-DNA)
Variant calling (DRAGEN pipeline)
Chromosome-length variant phasing (3D-DNA)
Genome_build: In addition to those generate as part of the study, the following existing genome builds were used: GRCg6a (domestic chicken); Xla.v91 (African clawed frog); AaegL5.0 (Yellow fever mosquito), CpipJ3 (Southern domestic mosquito); Release_6_plus_ISO1_MT (fruit fly, with chromosome arms merged for ACA analysis resulting in Release_6_plus_ISO1_MT_merged); WBcel235 (C. elegans); sacCer3 (baker's yeast); arahy.Tifrunner.gnm2.J5K5 (ground nut); 161010_Chinese_Spring_v1.0_pseudomolecules (bread wheat).
Supplementary_files_format_and_content: fasta.gz: chromosome-length nucleotide sequences generated usign Hi-C-guided genome assembly
Supplementary_files_format_and_content: hic: contact maps generated by aligning Hi-C data to corresponding chromosome-length reference sequences. In addition to standard contact maps generated for all genomic positions across chromosomes the uploaded files include ACA (aggregate chromosome analysis) maps and phasing contact maps representing interactions between positions on the genome that contain heterogyzous SNPs. For more details see Hoencamp et al., Supplement.
Supplementary_files_format_and_content: wig: tracks showing the alignment frequency of Hi-C reads from read pairs that contain centromere repeat-specifc sequences.
Supplementary_files_format_and_content: bedpe: Files describing the tentative position of centromeres in chromosome-length genome assemblies, identified either by mapping repeat sequences associated with the centromere or via analysis of the contact map for evidence of centromere-to-telomere folding axis. For details see Hoencamp et al., Supplement.
Supplementary_files_format_and_content: vcf.gz: Variants (specifically SNPs) identivied by the DRAGEN variant calling pipeline from aligning Hi-C data to chromosome-length references generated via Hi-C-guided assembly and phased using the 3D-DNA Hi-C-guided phasing module.
 
Submission date Mar 18, 2021
Last update date Apr 23, 2021
Contact name Olga Dudchenko
E-mail(s) Olga.Dudchenko@bcm.edu
Organization name Baylor College of Medicine
Street address 1 Baylor Plaza
City Houston
State/province TX
ZIP/Postal code 77030
Country USA
 
Platform ID GPL29871
Series (1)
GSE169088 3D genomics across the tree of life reveals condensin II as a determinant of architecture type
Relations
BioSample SAMN17221987
SRA SRX9788348

Supplementary file Size Download File type/resource
GSM5182724_Spur_5.0.contigs.rawchrom_purged_HiC.aca.hic 580.5 Kb (ftp)(http) HIC
GSM5182724_Spur_5.0.contigs_purged_HiC.fasta.gz 225.4 Mb (ftp)(http) FASTA
GSM5182724_Spur_5.0.contigs_purged_HiC.hic 218.3 Mb (ftp)(http) HIC
GSM5182724_Strongylocentrotus_purpuratus.hard-filtered.snp.phased.vcf.gz 911.1 Mb (ftp)(http) VCF
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap