|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Mar 22, 2023 |
Title |
CPT0010100001_snATAC |
Sample type |
SRA |
|
|
Source name |
Kidney
|
Organism |
Homo sapiens |
Characteristics |
cell type: tissue tissue: Kidney
|
Extracted molecule |
genomic DNA |
Extraction protocol |
15-25 mg of pulverized tissue was placed in a 5mL Eppendorf tube on ice. Using a wide-bore pipette tip (Rainin), a lysis buffer prepared from the Nuclei Isolation protocol (10X Genomics) and SuperRNase inhibitor (Invitrogen) was added to the tube. The tissue solution was gently pipetted until the lysis liquid turned a slightly cloudy color. (The number of pipetting iterations depended on the specific tissue.) The tissue homogenate was then filtered through a 40-micron strainer (pluriSelect) and washed with a BSA wash buffer (2% BSA + 1x PBS + RNase inhibitor). The filtrate was collected, centrifuged at 500 x g for 6 minutes at 4°C, and resuspended with a BSA wash buffer. 100 uL of cell lysis solution was set aside for unstained reference, while the rest was stained with DRAQ5 or 7AAD for RNA or ATAC sequencing, respectively. Namely, snRNA-seq nuclei were stained with 1 uL of DRAQ5 per 300 uL of the sample, and snATAC-seq nuclei were stained with 1 uL of 7AAD per 500 uL of the sample. Sorting gates were based on size, granularity, and dye staining signal. Nuclei and barcoded beads were isolated in oil droplets via the 10x Genomics Chromium instrument. Single nuclei suspensions were counted and adjusted to a range of 500 to 1800 nuclei/µL using a hemocytometer. Reverse transcription was subsequently performed to incorporate cell and transcript-specific barcodes. All snRNA-seq samples were run using the Chromium Next GEM Single Cell 3’ Library and Gel Bead Kit v3.1 (10x Genomics). For snATAC-seq, Chromium Next GEM Single Cell ATAC Library and Gel Bead Kit v1.1 prep (10x Genomics) were used for all samples. Barcoded libraries were then pooled and sequenced on the Illumina NovaSeq 6000 system with specific flow cell types (snRNA-seq: S4; snATAC-seq: S1).
|
|
|
Library strategy |
ATAC-seq |
Library source |
genomic |
Library selection |
other |
Instrument model |
Illumina NovaSeq 6000 |
|
|
Description |
10x Genomics
|
Data processing |
After single-nuclei prep and sequencing, Cell Ranger (v3.1.0) from 10X Genomics (with Count functionality) was used for aligning reads to the human genome reference (GRCh38) with the addition of pre-mRNA reference (v3.0.0). The reference file was downloaded from the 10X Genomics website (https://support.10xgenomics.com/single-cell-gene-expression/software/downloads/latest). The pre-mRNA reference was added using the following code: awk 'BEGIN{FS="\t"; OFS="\t"} $3 == "transcript"{ $3="exon"; print}' refdata-cellranger-GRCh38-3.0.0/genes/genes.gtf > GRCh38-3.0.0.premrna.gtf; cellranger mkref --genome=GRCh38-3.0.0.premrna --fasta=refdata-cellranger-GRCh38-3.0.0/fasta/genome.fa --genes=GRCh38-3.0.0.premrna.gtf --nthreads 50. The parameters used with Count functionality include --chemistry=threeprime --expect-cells=6000 --jobmode=local --localcores=20 --localmem=350. The resulting gene-by-cell UMI count matrix was used by the R package Seurat (v.3.1.0)122 for all subsequent processing. Quality filters were applied to the data to remove barcodes that fell into any of the following categories: too few genes expressed (possible debris), too many associated UMIs (possibly more than one cell), and too high mitochondrial gene expression (possible dead cell). The cut-offs for these filters were based on recommendations by Seurat package documentation and manually adjusted to keep the number of cells after filtering under 6500 (detailed filtering parameters see Supplementary Data 2). Finally, doublets were filtered out using Scrublet (v.0.2.1). Scrublet was run on each sample separately with the following parameter settings: expected_doublet_rate=0.06, min_counts=2, min_cells=3, min_gene_variability_pctl=85, n_prin_comps=30. The doublet score threshold was adjusted manually, which can separate the two peaks of a bimodal simulated doublet score histogram. The Cell Ranger ATAC tool (v.1.2.0, 10X Genomics) was used to process the raw snATAC-seq data (FASTQ). We utilized the cellranger-atac count pipeline to filter and map snATAC-reads and to identify transposase cut sites. The GRCh38 human reference was used for the reads mapping. Next, MACS2126 (v2.2.7.1) was used to perform peak calling. All peaks were resized to 501 bp centered at the peak summit defined by MACS2. After this, we combined all peaks and removed the ones overlapping with the peaks with greater signal, to get the set of non-overlapping peaks, as described in Schep et al127. The resulting set of sample peaks was used to calculate the peak-count matrix using FeatureMatrix function from the R package Signac (v.1.2.0; https://github.com/timoast/signac), which was also used for downstream analysis. QC-filtering of the snATAC-seq data was performed using functions from the Signac package. Filters that were applied for the cell calling include: 1,000 < number of fragments in peaks < 20,000, percentage of reads in peaks > 15, ENCODE blacklist regions percentage < 0.05 (https://www.encodeproject.org/annotations/ENCSR636HFF/), nucleosome banding pattern score < 10, and enrichment-score for Tn5-integration events at transcriptional start sites > 2. Assembly: hg38 Supplementary files format and content: Tab-separated values files and matrix files.
|
|
|
Submission date |
Mar 21, 2023 |
Last update date |
Mar 31, 2023 |
Contact name |
Yige Wu |
E-mail(s) |
yigewu.app@gmail.com
|
Phone |
3142950479
|
Organization name |
Washington University in St. Louis
|
Department |
Department of Medicine
|
Street address |
4520 McPherson Ave
|
City |
St. Louis |
State/province |
MO |
ZIP/Postal code |
63108 |
Country |
USA |
|
|
Platform ID |
GPL24676 |
Series (1) |
GSE227898 |
Epigenetic and transcriptomic characterization reveals progression markers and essential pathways in clear cell renal cell carcinoma |
|
Supplementary file |
Size |
Download |
File type/resource |
GSM7110475_CPT0010100001_fragments.tsv.gz |
1.4 Gb |
(ftp)(http) |
TSV |
GSM7110475_snATAC.CPT0010100001.tar.gz |
140.4 Mb |
(ftp)(http) |
TAR |
Raw data not provided for this record |
Processed data provided as supplementary file |
|
|
|
|
|