|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Apr 23, 2021 |
Title |
Purple sea urchin muscle (Sample2581) |
Sample type |
SRA |
|
|
Source name |
muscle
|
Organism |
Strongylocentrotus purpuratus |
Characteristics |
tissue: muscle
|
Extracted molecule |
genomic DNA |
Extraction protocol |
Extraction protocol is described as part of the library construction protocol. The Hi-C libraries were prepared following the in situ Hi-C strategy (Rao, Huntley et al., 2014). Briefly, cells or tissue were crosslinked and then lysed with nuclei permeabilized but still intact. DNA was then restricted and the overhangs filled in incorporating a biotinylated base. Free ends were then ligated together in situ. Crosslinks were reversed, the DNA was sheared to 300-500bp and then biotinylated ligation junctions were recovered with streptavidin beads. Afterward DNA was handled according to standard Illumina library construction protocol. DNA was end-repaired using a combination of T4 DNA polymerase, E. coli DNA Pol I large fragment (Klenow polymerase) and T4 polynucleotide kinase. The blunt, phosphorylated ends were treated with Klenow fragment (3' to 5' exo-) and dATP to yield a protruding 3- 'A' base for ligation of Illumina's adapters which have a single 'T' base overhang at the 3 end. After adapter ligation DNA was PCR amplified with Illumina primers and library fragments of 400-600 bp (insert plus adaptor and PCR primer sequences) were purified using SPRI beads. The purified DNA was captured on an Illumina flow cell for cluster generation. Libraries were sequenced on various Illumina instruments following the manufacturer's protocols. Fpr the tammar wallaby, a short insert-size PCR-free DNA-Seq library was prepared following the ClaSeek protocol.
|
|
|
Library strategy |
Hi-C |
Library source |
genomic |
Library selection |
other |
Instrument model |
Illumina NovaSeq 6000 |
|
|
Description |
Sample2581
|
Data processing |
Hi-C-guided genome assembly (3D-DNA, JBAT) Construction of the chromosome-length Hi-C contact map (3D-DNA, Juicer) Centromere position identification (repeat sequence mapping or structural analysis, 3D-DNA/JBAT) Aggregate chromosome analysis (3D-DNA) Variant calling (DRAGEN pipeline) Chromosome-length variant phasing (3D-DNA) Genome_build: In addition to those generate as part of the study, the following existing genome builds were used: GRCg6a (domestic chicken); Xla.v91 (African clawed frog); AaegL5.0 (Yellow fever mosquito), CpipJ3 (Southern domestic mosquito); Release_6_plus_ISO1_MT (fruit fly, with chromosome arms merged for ACA analysis resulting in Release_6_plus_ISO1_MT_merged); WBcel235 (C. elegans); sacCer3 (baker's yeast); arahy.Tifrunner.gnm2.J5K5 (ground nut); 161010_Chinese_Spring_v1.0_pseudomolecules (bread wheat). Supplementary_files_format_and_content: fasta.gz: chromosome-length nucleotide sequences generated usign Hi-C-guided genome assembly Supplementary_files_format_and_content: hic: contact maps generated by aligning Hi-C data to corresponding chromosome-length reference sequences. In addition to standard contact maps generated for all genomic positions across chromosomes the uploaded files include ACA (aggregate chromosome analysis) maps and phasing contact maps representing interactions between positions on the genome that contain heterogyzous SNPs. For more details see Hoencamp et al., Supplement. Supplementary_files_format_and_content: wig: tracks showing the alignment frequency of Hi-C reads from read pairs that contain centromere repeat-specifc sequences. Supplementary_files_format_and_content: bedpe: Files describing the tentative position of centromeres in chromosome-length genome assemblies, identified either by mapping repeat sequences associated with the centromere or via analysis of the contact map for evidence of centromere-to-telomere folding axis. For details see Hoencamp et al., Supplement. Supplementary_files_format_and_content: vcf.gz: Variants (specifically SNPs) identivied by the DRAGEN variant calling pipeline from aligning Hi-C data to chromosome-length references generated via Hi-C-guided assembly and phased using the 3D-DNA Hi-C-guided phasing module.
|
|
|
Submission date |
Mar 18, 2021 |
Last update date |
Apr 23, 2021 |
Contact name |
Olga Dudchenko |
E-mail(s) |
Olga.Dudchenko@bcm.edu
|
Organization name |
Baylor College of Medicine
|
Street address |
1 Baylor Plaza
|
City |
Houston |
State/province |
TX |
ZIP/Postal code |
77030 |
Country |
USA |
|
|
Platform ID |
GPL29871 |
Series (1) |
GSE169088 |
3D genomics across the tree of life reveals condensin II as a determinant of architecture type |
|
Relations |
BioSample |
SAMN17221987 |
SRA |
SRX9788348 |
Supplementary file |
Size |
Download |
File type/resource |
GSM5182724_Spur_5.0.contigs.rawchrom_purged_HiC.aca.hic |
580.5 Kb |
(ftp)(http) |
HIC |
GSM5182724_Spur_5.0.contigs_purged_HiC.fasta.gz |
225.4 Mb |
(ftp)(http) |
FASTA |
GSM5182724_Spur_5.0.contigs_purged_HiC.hic |
218.3 Mb |
(ftp)(http) |
HIC |
GSM5182724_Strongylocentrotus_purpuratus.hard-filtered.snp.phased.vcf.gz |
911.1 Mb |
(ftp)(http) |
VCF |
SRA Run Selector |
Raw data are available in SRA |
Processed data provided as supplementary file |
|
|
|
|
|