NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM8385345 Query DataSets for GSM8385345
Status Public on Jul 09, 2024
Title Chicago Mix F10, PBMC
Sample type SRA
 
Source name PBMC
Organism Homo sapiens
Characteristics tissue: PBMC
cohort: Chicago
samples multiplexed_(chicago_only): HPC12, HPC13
age (singleton or multiplexed): 66, 77
sex (singleton or multiplexed): pooled: F, F
disease (singleton or multiplexed): FHP, FHP
smoking status: NA
fev1: NA
dlco: NA
Extracted molecule polyA RNA
Extraction protocol At each sample collection site, PBMCs were separated from the whole blood using density gradient centrifugation, and PBMC and BAL samples were cryopreserved in liquid nitrogen as separate samples for single-cell experiments. The PBMCs were cryopreserved in 1.5 ml cryotubes using 10% DMSO in heat-inactivated FBS and were stored long-term in liquid nitrogen. Initial PBMC sample collection, processing, and storage did not differ significantly among the three patient sample collection sites. In Mexico City, bronchoalveolar lavage samples were collected and stored long-term in liquid nitrogen using 10% DMSO in heat-inactivated FBS.
All patient samples were shipped for processing in New Haven, CT, to minimize technical and batch effects arising from sample processing. Samples were randomized based on age, sex, diagnosis, and smoking history into different batches for sample preparation and single-cell library construction. Sixteen samples were processed at a time to reduce batch effects. The sample preparation protocol used was modified from 10X Genomics with the following modifications. Cryopreserved samples were thawed in a 37 °C water bath, and 1.5 ml of 10% FBS in Dulbecco’s Modified Eagle Medium (DMEM) was added to each tube. The cells were centrifuged, the supernatant was removed, and the cells were resuspended in D-PBS with 0.04% bovine serum albumin (BSA). The sample was passed through a 40 µm cell strainer into a 2-mL microcentrifuge tube. The cell concentration was determined using Trypan Blue solution, and Invitrogen Countess Automated Cell Counter. Samples with isolated single cells were diluted, and these cells were loaded into the chip with an anticipated recovery of 10,000 cells according to the manufacturer’s protocol for Chromium Next GEM Single Cell 5’ Reagent Kits v2 (Dual Index) (from samples from New Haven and Mexico City only). To further reduce batch effects, Chicago samples were pooled, such that each chip lane was loaded with two or three patient samples, with an anticipated recovery cell of 20,000 cells to 30,000 cells, respectively. Any leftover samples were fixed in 80% methanol and stored at -20C to preserve nucleotide integrity for downstream DNA extraction and whole-exome sequencing to SNP-demultiplex mixed patient samples from Chicago. All subsequent library construction steps were followed precisely, and all cDNA library QC was performed using an Agilent 2100 Bioanalyzer.
The single-cell cDNA libraries were sequenced using an Illumina HiSeq4000 machine at the Yale Stem Cell Genomics Center (samples from New Haven and Mexico City) and from an Illumina NovaSeq X at the Yale Center for Genomic Analysis (samples from Chicago). Samples were sequenced at a depth of 150 million reads per sample to allow for adequate transcriptome capture.
 
Library strategy RNA-Seq
Library source transcriptomic single cell
Library selection cDNA
Instrument model Illumina NovaSeq X
 
Description F10_HHT
metadata in order of appearance of multiplexed samples
Data processing The CellRanger v7.0.0 pipeline (mkfastq and count) was used to demultiplex the raw sequencing reads from the single-cell libraries and align them to the GRCh38 reference genome (CellRanger 2020-A).
To demultiplex the mixed samples from Chicago, the fastq files corresponding to the whole-exome sequencing data of these samples were aligned to the CellRanger reference fasta file (refdata-gex-GRCh38-2020-A/genome.fa) using bwa-mem. The resulting .sam files were converted to .bam files using picard. Picard was also used to sort, mark duplicates in, and build indices for the .bam files. The Genome Analysis Toolkit (GATK)’s Best Practices for Germline Short Variant Discovery were followed. These .vcf files were filtered for common single nucleotide polymorphisms (SNPs) with a 5% minor allele frequency cutoff. Demuxlet, implemented in the Demuxafy framework, was used to demultiplex these samples based on their common SNPs [25]. Only samples that had a posterior likelihood of >0.95 were kept.
The feature-barcode matrices for each patient sample were loaded into R and converted into a Seurat (v4.0) object. Cells with fewer than 200 features and those expressing greater than 20% of mitochondrial genes were filtered out to remove dead and dying cells. Samples with fewer than 500 cells were removed to eliminate low-quality samples. The filtered gene expression matrix was normalized using Seurat's NormalizeData function with default settings. Genes with high cell-to-cell variation were identified using the FindVariableFeatures function with the “vst” method. The data was integrated on an assay level (5’ v1 versus v2) using reciprocal PCA to reduce 10X assay-specific effects, and no significant batch effects were noted after this integration. The data were then scaled and centered using the ScaleData function, and the top 3,000 variable features were used for Principal Component Analysis. The top 30 principal components (PCs) were selected based on the elbow plot generated by the ElbowPlot function. Graph-based clustering was performed using the FindNeighbors and FindClusters functions with the selected 30 PCs.
Assembly: Assembly: GRCh38
Supplementary files format and content: Supplementary files format and content: For each sample, processed data include Cell Ranger Count filtered outputs: cell barcodes (barcodes.tsv.gz), genes (features.tsv.gz) and count matrix (matrix.mtx.gz).
Supplementary files format and content: demuxlet-outputs.zip (Chicago Mix A10, B10, C10, etc. include barcodes for multiple samples that were multiplexed together in a single 10X lane. demuxlet-outputs.zip contains the demuxlet.best files for each mix, which are the outputs the demultiplexing software demuxlet, that link each barcode to a specific sample (e.g. A10-demuxlet.best links each barcode to a sample). )
 
Submission date Jul 08, 2024
Last update date Jul 12, 2024
Contact name Xiting Yan
E-mail(s) xiting.yan@yale.edu
Phone 2037855567
Organization name Yale School of Medicine
Department Internal Medicine
Street address 300 Cedar Street, Ste S441D
City New Haven
State/province Connecticut
ZIP/Postal code 06519
Country USA
 
Platform ID GPL34281
Series (1)
GSE271789 Single-Cell Analysis Reveals Novel Immune Perturbations in Fibrotic Hypersensitivity Pneumonitis
Relations
BioSample SAMN42440296
SRA SRX25261750

Supplementary file Size Download File type/resource
GSM8385345_F10_barcodes.tsv.gz 223.7 Kb (ftp)(http) TSV
GSM8385345_F10_features.tsv.gz 325.6 Kb (ftp)(http) TSV
GSM8385345_F10_matrix.mtx.gz 197.7 Mb (ftp)(http) MTX
SRA Run SelectorHelp
Raw data are available in SRA

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap