|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Jul 09, 2024 |
Title |
Chicago Mix F10, PBMC |
Sample type |
SRA |
|
|
Source name |
PBMC
|
Organism |
Homo sapiens |
Characteristics |
tissue: PBMC cohort: Chicago samples multiplexed_(chicago_only): HPC12, HPC13 age (singleton or multiplexed): 66, 77 sex (singleton or multiplexed): pooled: F, F disease (singleton or multiplexed): FHP, FHP smoking status: NA fev1: NA dlco: NA
|
Extracted molecule |
polyA RNA |
Extraction protocol |
At each sample collection site, PBMCs were separated from the whole blood using density gradient centrifugation, and PBMC and BAL samples were cryopreserved in liquid nitrogen as separate samples for single-cell experiments. The PBMCs were cryopreserved in 1.5 ml cryotubes using 10% DMSO in heat-inactivated FBS and were stored long-term in liquid nitrogen. Initial PBMC sample collection, processing, and storage did not differ significantly among the three patient sample collection sites. In Mexico City, bronchoalveolar lavage samples were collected and stored long-term in liquid nitrogen using 10% DMSO in heat-inactivated FBS. All patient samples were shipped for processing in New Haven, CT, to minimize technical and batch effects arising from sample processing. Samples were randomized based on age, sex, diagnosis, and smoking history into different batches for sample preparation and single-cell library construction. Sixteen samples were processed at a time to reduce batch effects. The sample preparation protocol used was modified from 10X Genomics with the following modifications. Cryopreserved samples were thawed in a 37 °C water bath, and 1.5 ml of 10% FBS in Dulbecco’s Modified Eagle Medium (DMEM) was added to each tube. The cells were centrifuged, the supernatant was removed, and the cells were resuspended in D-PBS with 0.04% bovine serum albumin (BSA). The sample was passed through a 40 µm cell strainer into a 2-mL microcentrifuge tube. The cell concentration was determined using Trypan Blue solution, and Invitrogen Countess Automated Cell Counter. Samples with isolated single cells were diluted, and these cells were loaded into the chip with an anticipated recovery of 10,000 cells according to the manufacturer’s protocol for Chromium Next GEM Single Cell 5’ Reagent Kits v2 (Dual Index) (from samples from New Haven and Mexico City only). To further reduce batch effects, Chicago samples were pooled, such that each chip lane was loaded with two or three patient samples, with an anticipated recovery cell of 20,000 cells to 30,000 cells, respectively. Any leftover samples were fixed in 80% methanol and stored at -20C to preserve nucleotide integrity for downstream DNA extraction and whole-exome sequencing to SNP-demultiplex mixed patient samples from Chicago. All subsequent library construction steps were followed precisely, and all cDNA library QC was performed using an Agilent 2100 Bioanalyzer. The single-cell cDNA libraries were sequenced using an Illumina HiSeq4000 machine at the Yale Stem Cell Genomics Center (samples from New Haven and Mexico City) and from an Illumina NovaSeq X at the Yale Center for Genomic Analysis (samples from Chicago). Samples were sequenced at a depth of 150 million reads per sample to allow for adequate transcriptome capture.
|
|
|
Library strategy |
RNA-Seq |
Library source |
transcriptomic single cell |
Library selection |
cDNA |
Instrument model |
Illumina NovaSeq X |
|
|
Description |
F10_HHT metadata in order of appearance of multiplexed samples
|
Data processing |
The CellRanger v7.0.0 pipeline (mkfastq and count) was used to demultiplex the raw sequencing reads from the single-cell libraries and align them to the GRCh38 reference genome (CellRanger 2020-A). To demultiplex the mixed samples from Chicago, the fastq files corresponding to the whole-exome sequencing data of these samples were aligned to the CellRanger reference fasta file (refdata-gex-GRCh38-2020-A/genome.fa) using bwa-mem. The resulting .sam files were converted to .bam files using picard. Picard was also used to sort, mark duplicates in, and build indices for the .bam files. The Genome Analysis Toolkit (GATK)’s Best Practices for Germline Short Variant Discovery were followed. These .vcf files were filtered for common single nucleotide polymorphisms (SNPs) with a 5% minor allele frequency cutoff. Demuxlet, implemented in the Demuxafy framework, was used to demultiplex these samples based on their common SNPs [25]. Only samples that had a posterior likelihood of >0.95 were kept. The feature-barcode matrices for each patient sample were loaded into R and converted into a Seurat (v4.0) object. Cells with fewer than 200 features and those expressing greater than 20% of mitochondrial genes were filtered out to remove dead and dying cells. Samples with fewer than 500 cells were removed to eliminate low-quality samples. The filtered gene expression matrix was normalized using Seurat's NormalizeData function with default settings. Genes with high cell-to-cell variation were identified using the FindVariableFeatures function with the “vst” method. The data was integrated on an assay level (5’ v1 versus v2) using reciprocal PCA to reduce 10X assay-specific effects, and no significant batch effects were noted after this integration. The data were then scaled and centered using the ScaleData function, and the top 3,000 variable features were used for Principal Component Analysis. The top 30 principal components (PCs) were selected based on the elbow plot generated by the ElbowPlot function. Graph-based clustering was performed using the FindNeighbors and FindClusters functions with the selected 30 PCs. Assembly: Assembly: GRCh38 Supplementary files format and content: Supplementary files format and content: For each sample, processed data include Cell Ranger Count filtered outputs: cell barcodes (barcodes.tsv.gz), genes (features.tsv.gz) and count matrix (matrix.mtx.gz). Supplementary files format and content: demuxlet-outputs.zip (Chicago Mix A10, B10, C10, etc. include barcodes for multiple samples that were multiplexed together in a single 10X lane. demuxlet-outputs.zip contains the demuxlet.best files for each mix, which are the outputs the demultiplexing software demuxlet, that link each barcode to a specific sample (e.g. A10-demuxlet.best links each barcode to a sample). )
|
|
|
Submission date |
Jul 08, 2024 |
Last update date |
Jul 12, 2024 |
Contact name |
Xiting Yan |
E-mail(s) |
xiting.yan@yale.edu
|
Phone |
2037855567
|
Organization name |
Yale School of Medicine
|
Department |
Internal Medicine
|
Street address |
300 Cedar Street, Ste S441D
|
City |
New Haven |
State/province |
Connecticut |
ZIP/Postal code |
06519 |
Country |
USA |
|
|
Platform ID |
GPL34281 |
Series (1) |
GSE271789 |
Single-Cell Analysis Reveals Novel Immune Perturbations in Fibrotic Hypersensitivity Pneumonitis |
|
Relations |
BioSample |
SAMN42440296 |
SRA |
SRX25261750 |
Supplementary file |
Size |
Download |
File type/resource |
GSM8385345_F10_barcodes.tsv.gz |
223.7 Kb |
(ftp)(http) |
TSV |
GSM8385345_F10_features.tsv.gz |
325.6 Kb |
(ftp)(http) |
TSV |
GSM8385345_F10_matrix.mtx.gz |
197.7 Mb |
(ftp)(http) |
MTX |
SRA Run Selector |
Raw data are available in SRA |
|
|
|
|
|