NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM5182723 Query DataSets for GSM5182723
Status Public on Apr 23, 2021
Title European lancelet whole animal(s) (Sample1824)
Sample type SRA
 
Source name whole animal
Organism Branchiostoma lanceolatum
Characteristics tissue: whole animal
Extracted molecule genomic DNA
Extraction protocol Extraction protocol is described as part of the library construction protocol.
The Hi-C libraries were prepared following the in situ Hi-C strategy (Rao, Huntley et al., 2014). Briefly, cells or tissue were crosslinked and then lysed with nuclei permeabilized but still intact. DNA was then restricted and the overhangs filled in incorporating a biotinylated base. Free ends were then ligated together in situ. Crosslinks were reversed, the DNA was sheared to 300-500bp and then biotinylated ligation junctions were recovered with streptavidin beads. Afterward DNA was handled according to standard Illumina library construction protocol. DNA was end-repaired using a combination of T4 DNA polymerase, E. coli DNA Pol I large fragment (Klenow polymerase) and T4 polynucleotide kinase. The blunt, phosphorylated ends were treated with Klenow fragment (3' to 5' exo-) and dATP to yield a protruding 3- 'A' base for ligation of Illumina's adapters which have a single 'T' base overhang at the 3 end. After adapter ligation DNA was PCR amplified with Illumina primers and library fragments of 400-600 bp (insert plus adaptor and PCR primer sequences) were purified using SPRI beads. The purified DNA was captured on an Illumina flow cell for cluster generation. Libraries were sequenced on various Illumina instruments following the manufacturer's protocols. Fpr the tammar wallaby, a short insert-size PCR-free DNA-Seq library was prepared following the ClaSeek protocol.
 
Library strategy Hi-C
Library source genomic
Library selection other
Instrument model Illumina NextSeq 500
 
Description Sample1824
Data processing Hi-C-guided genome assembly (3D-DNA, JBAT)
Construction of the chromosome-length Hi-C contact map (3D-DNA, Juicer)
Centromere position identification (repeat sequence mapping or structural analysis, 3D-DNA/JBAT)
Aggregate chromosome analysis (3D-DNA)
Variant calling (DRAGEN pipeline)
Chromosome-length variant phasing (3D-DNA)
Genome_build: In addition to those generate as part of the study, the following existing genome builds were used: GRCg6a (domestic chicken); Xla.v91 (African clawed frog); AaegL5.0 (Yellow fever mosquito), CpipJ3 (Southern domestic mosquito); Release_6_plus_ISO1_MT (fruit fly, with chromosome arms merged for ACA analysis resulting in Release_6_plus_ISO1_MT_merged); WBcel235 (C. elegans); sacCer3 (baker's yeast); arahy.Tifrunner.gnm2.J5K5 (ground nut); 161010_Chinese_Spring_v1.0_pseudomolecules (bread wheat).
Supplementary_files_format_and_content: fasta.gz: chromosome-length nucleotide sequences generated usign Hi-C-guided genome assembly
Supplementary_files_format_and_content: hic: contact maps generated by aligning Hi-C data to corresponding chromosome-length reference sequences. In addition to standard contact maps generated for all genomic positions across chromosomes the uploaded files include ACA (aggregate chromosome analysis) maps and phasing contact maps representing interactions between positions on the genome that contain heterogyzous SNPs. For more details see Hoencamp et al., Supplement.
Supplementary_files_format_and_content: wig: tracks showing the alignment frequency of Hi-C reads from read pairs that contain centromere repeat-specifc sequences.
Supplementary_files_format_and_content: bedpe: Files describing the tentative position of centromeres in chromosome-length genome assemblies, identified either by mapping repeat sequences associated with the centromere or via analysis of the contact map for evidence of centromere-to-telomere folding axis. For details see Hoencamp et al., Supplement.
Supplementary_files_format_and_content: vcf.gz: Variants (specifically SNPs) identivied by the DRAGEN variant calling pipeline from aligning Hi-C data to chromosome-length references generated via Hi-C-guided assembly and phased using the 3D-DNA Hi-C-guided phasing module.
 
Submission date Mar 18, 2021
Last update date Apr 23, 2021
Contact name Olga Dudchenko
E-mail(s) Olga.Dudchenko@bcm.edu
Organization name Baylor College of Medicine
Street address 1 Baylor Plaza
City Houston
State/province TX
ZIP/Postal code 77030
Country USA
 
Platform ID GPL29870
Series (1)
GSE169088 3D genomics across the tree of life reveals condensin II as a determinant of architecture type
Relations
BioSample SAMN17217640
SRA SRX9785439
SRA SRX9785440
SRA SRX9785443

Supplementary file Size Download File type/resource
GSM5182723_Bl71nemr_HiC.aca.hic 560.9 Kb (ftp)(http) HIC
GSM5182723_Bl71nemr_HiC.fasta.gz 145.5 Mb (ftp)(http) FASTA
GSM5182723_Bl71nemr_HiC.hic 112.1 Mb (ftp)(http) HIC
GSM5182723_Branchiostoma_lanceolatum.hard-filtered.snp.phased.vcf.gz 728.4 Mb (ftp)(http) VCF
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap