GEO Accession viewer

NCBI > GEO > Accession Display

Not logged in | Login

GEO help: Mouse over screen elements for information.

Sample GSM8385345

Query DataSets for GSM8385345

Status

Public on Jul 09, 2024

Title

Chicago Mix F10, PBMC

Sample type

SRA

Source name

PBMC

Organism

Homo sapiens

Characteristics

tissue: PBMC
cohort: Chicago
samples multiplexed_(chicago_only): HPC12, HPC13
age (singleton or multiplexed): 66, 77
sex (singleton or multiplexed): pooled: F, F
disease (singleton or multiplexed): FHP, FHP
smoking status: NA
fev1: NA
dlco: NA

Extracted molecule

polyA RNA

Extraction protocol

At each sample collection site, PBMCs were separated from the whole blood using density gradient centrifugation, and PBMC and BAL samples were cryopreserved in liquid nitrogen as separate samples for single-cell experiments. The PBMCs were cryopreserved in 1.5 ml cryotubes using 10% DMSO in heat-inactivated FBS and were stored long-term in liquid nitrogen. Initial PBMC sample collection, processing, and storage did not differ significantly among the three patient sample collection sites. In Mexico City, bronchoalveolar lavage samples were collected and stored long-term in liquid nitrogen using 10% DMSO in heat-inactivated FBS.
All patient samples were shipped for processing in New Haven, CT, to minimize technical and batch effects arising from sample processing. Samples were randomized based on age, sex, diagnosis, and smoking history into different batches for sample preparation and single-cell library construction. Sixteen samples were processed at a time to reduce batch effects. The sample preparation protocol used was modified from 10X Genomics with the following modifications. Cryopreserved samples were thawed in a 37 °C water bath, and 1.5 ml of 10% FBS in Dulbecco’s Modified Eagle Medium (DMEM) was added to each tube. The cells were centrifuged, the supernatant was removed, and the cells were resuspended in D-PBS with 0.04% bovine serum albumin (BSA). The sample was passed through a 40 µm cell strainer into a 2-mL microcentrifuge tube. The cell concentration was determined using Trypan Blue solution, and Invitrogen Countess Automated Cell Counter. Samples with isolated single cells were diluted, and these cells were loaded into the chip with an anticipated recovery of 10,000 cells according to the manufacturer’s protocol for Chromium Next GEM Single Cell 5’ Reagent Kits v2 (Dual Index) (from samples from New Haven and Mexico City only). To further reduce batch effects, Chicago samples were pooled, such that each chip lane was loaded with two or three patient samples, with an anticipated recovery cell of 20,000 cells to 30,000 cells, respectively. Any leftover samples were fixed in 80% methanol and stored at -20C to preserve nucleotide integrity for downstream DNA extraction and whole-exome sequencing to SNP-demultiplex mixed patient samples from Chicago. All subsequent library construction steps were followed precisely, and all cDNA library QC was performed using an Agilent 2100 Bioanalyzer.
The single-cell cDNA libraries were sequenced using an Illumina HiSeq4000 machine at the Yale Stem Cell Genomics Center (samples from New Haven and Mexico City) and from an Illumina NovaSeq X at the Yale Center for Genomic Analysis (samples from Chicago). Samples were sequenced at a depth of 150 million reads per sample to allow for adequate transcriptome capture.

Library strategy

RNA-Seq

Library source

transcriptomic single cell

Library selection

cDNA

Instrument model

Illumina NovaSeq X

Description

F10_HHT
metadata in order of appearance of multiplexed samples

Data processing

The CellRanger v7.0.0 pipeline (mkfastq and count) was used to demultiplex the raw sequencing reads from the single-cell libraries and align them to the GRCh38 reference genome (CellRanger 2020-A).
To demultiplex the mixed samples from Chicago, the fastq files corresponding to the whole-exome sequencing data of these samples were aligned to the CellRanger reference fasta file (refdata-gex-GRCh38-2020-A/genome.fa) using bwa-mem. The resulting .sam files were converted to .bam files using picard. Picard was also used to sort, mark duplicates in, and build indices for the .bam files. The Genome Analysis Toolkit (GATK)’s Best Practices for Germline Short Variant Discovery were followed. These .vcf files were filtered for common single nucleotide polymorphisms (SNPs) with a 5% minor allele frequency cutoff. Demuxlet, implemented in the Demuxafy framework, was used to demultiplex these samples based on their common SNPs [25]. Only samples that had a posterior likelihood of >0.95 were kept.
The feature-barcode matrices for each patient sample were loaded into R and converted into a Seurat (v4.0) object. Cells with fewer than 200 features and those expressing greater than 20% of mitochondrial genes were filtered out to remove dead and dying cells. Samples with fewer than 500 cells were removed to eliminate low-quality samples. The filtered gene expression matrix was normalized using Seurat's NormalizeData function with default settings. Genes with high cell-to-cell variation were identified using the FindVariableFeatures function with the “vst” method. The data was integrated on an assay level (5’ v1 versus v2) using reciprocal PCA to reduce 10X assay-specific effects, and no significant batch effects were noted after this integration. The data were then scaled and centered using the ScaleData function, and the top 3,000 variable features were used for Principal Component Analysis. The top 30 principal components (PCs) were selected based on the elbow plot generated by the ElbowPlot function. Graph-based clustering was performed using the FindNeighbors and FindClusters functions with the selected 30 PCs.
Assembly: Assembly: GRCh38
Supplementary files format and content: Supplementary files format and content: For each sample, processed data include Cell Ranger Count filtered outputs: cell barcodes (barcodes.tsv.gz), genes (features.tsv.gz) and count matrix (matrix.mtx.gz).
Supplementary files format and content: demuxlet-outputs.zip (Chicago Mix A10, B10, C10, etc. include barcodes for multiple samples that were multiplexed together in a single 10X lane. demuxlet-outputs.zip contains the demuxlet.best files for each mix, which are the outputs the demultiplexing software demuxlet, that link each barcode to a specific sample (e.g. A10-demuxlet.best links each barcode to a sample). )

Submission date

Jul 08, 2024

Last update date

Jul 12, 2024

Contact name

Xiting Yan

E-mail(s)

xiting.yan@yale.edu

Phone

2037855567

Organization name

Yale School of Medicine

Department

Internal Medicine

Street address

300 Cedar Street, Ste S441D

City

New Haven

State/province

Connecticut

ZIP/Postal code

06519

Country

USA

Platform ID

GPL34281

Series (1)

GSE271789

Single-Cell Analysis Reveals Novel Immune Perturbations in Fibrotic Hypersensitivity Pneumonitis

Relations

BioSample

SAMN42440296

SRA

SRX25261750

Supplementary file	Size	Download	File type/resource
GSM8385345_F10_barcodes.tsv.gz	223.7 Kb	(ftp)(http)	TSV
GSM8385345_F10_features.tsv.gz	325.6 Kb	(ftp)(http)	TSV
GSM8385345_F10_matrix.mtx.gz	197.7 Mb	(ftp)(http)	MTX
SRA Run Selector
Raw data are available in SRA