GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
Series GSE212396 Query DataSets for GSE212396
Status Public on Nov 01, 2022
Title High-content CRISPR screens link coronary artery disease genes to endothelial cell programs [Pilot scRNA-seq]
Organism Homo sapiens
Experiment type Expression profiling by high throughput sequencing
Summary Genome-wide association studies (GWAS) have discovered thousands of risk loci for common, complex diseases, each of which could point to genes and gene programs that influence disease. For some diseases, it has been observed that GWAS signals converge on a smaller number of biological programs, and that this convergence can help to identify causal genes. However, identifying such convergence remains challenging: each GWAS locus can have 2-20 candidate genes, the cellular programs a gene participates in are difficult to define in an unbiased fashion, and it remains unclear which genes and programs would be likely to influence disease risk. Here, we explored a new approach to address this challenge, by creating an unbiased catalog of gene programs and their regulators in endothelial cells to link variants to functions for coronary artery disease (CAD). To do so, we applied CRISPRi-Perturb-seq to knock down all expressed genes within 500 Kb of all CAD GWAS loci (2,285 genes in total) and measure their effects on the transcriptome using single-cell RNA-seq. We used consensus non-negative matrix factorization to define 60 gene expression programs—including core cellular programs, such as ribosome biogenesis, and endothelial cell-specific programs, such as flow response and angiogenesis—and link these programs to upstream regulators including transcription factors, chromatin regulators, metabolic enzymes, and signaling cascades. By combining this gene-to-program catalog with variant-to-gene maps, we find that candidate CAD genes converge onto 6 interrelated gene programs, together involving known and novel genes in 39 of 229 CAD GWAS loci. Analysis of these programs revealed that the cerebral cavernous malformations (CCM) complex—whose potential connection to CAD has not been previously explored—acts upstream to regulate other CAD genes involved in cytoskeletal organization, extracellular matrix remodeling, and cell migration. The strongest regulator of these programs is TLNRD1, a highly conserved but poorly studied gene that we show acts in the CCM pathway and regulates actin organization and endothelial cell barrier function. Together, our study nominates new genes that likely influence risk for CAD, identifies convergence of CAD risk loci into certain gene programs in endothelial cells, and demonstrates a generalizable strategy to catalog gene programs to connect disease variants to functions.
Overall design This series contains pilot Perturb-seq studies, which were used to optimize conditions for and cross validate the larger Perturb-seq study reported in GSE210681. Human immortalized endothelial cells engineered to contain doxycycline inducible CRISPR interference machinery (CRISPRi TeloHAEC or CRISPRi Eahy926), were transduced with one of two guide RNA libraries targeting gene promoters (referred to as the 50gene & 200gene libraries), in a CROP-seq vector. After 5 days of doxycycline treatment, cells were run on a single lane of a 10X Chromium Controller and scRNA-seq libraries generated. For Eahy926, the maximum number of cells recommended in the 10x protocol was used (16k), or the lane was "superloaded" with ~10x more cells (150k). For TeloHAEC, only the superloading condition was used. polyA+ cDNA surrounding the guide sequences was amplified to make "dialout" libraries, used to link guide RNA sequences to scRNAseq cell barcodes.
Contributor(s) Schnitzler GR, Kang H, Lee-Kim V, Ma XR, Zeng T, Vellarikkal SK, Zhou R, Guo K, Sias-Garcia O, Bloemendal A, Munson G, Guckelberger P, Nguyen TH, Bergman DT, Atri D, Cheng N, Cleary B, Lander ES, Finucane HK, Gupta RM, Engreitz JM
Citation(s) 38326615
Submission date Aug 31, 2022
Last update date Mar 04, 2024
Contact name Jesse Engreitz
Organization name Broad Institute
Street address 415 Main Street
City Cambridge
State/province MA
ZIP/Postal code 02142
Country USA
Platforms (2)
GPL18573 Illumina NextSeq 500 (Homo sapiens)
GPL24676 Illumina NovaSeq 6000 (Homo sapiens)
Samples (15)
GSM6529416 50gLib_Eahy926_150K_scRNAseq.1
GSM6529417 50gLib_Eahy926_150K_scRNAseq.2
GSM6529418 50gLib_Eahy926_150K_scRNAseq.3
This SubSeries is part of SuperSeries:
GSE210523 High-content CRISPR screens link coronary artery disease genes to endothelial cell programs
BioProject PRJNA875320

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE212396_200gLib_TeloHAEC_DE_log2fcs.txt.gz 33.9 Mb (ftp)(http) TXT
GSE212396_200gLib_TeloHAEC_DE_p.values.txt.gz 33.3 Mb (ftp)(http) TXT
GSE212396_200gLib_TeloHAEC_scRNAseq_1_barcodes.tsv.gz 99.9 Kb (ftp)(http) TSV
GSE212396_200gLib_TeloHAEC_scRNAseq_1_calls.tsv.gz 367.4 Kb (ftp)(http) TSV
GSE212396_200gLib_TeloHAEC_scRNAseq_1_features.tsv.gz 264.3 Kb (ftp)(http) TSV
GSE212396_200gLib_TeloHAEC_scRNAseq_1_matrix.mtx.gz 204.2 Mb (ftp)(http) MTX
GSE212396_200gLib_TeloHAEC_scRNAseq_2_barcodes.tsv.gz 129.9 Kb (ftp)(http) TSV
GSE212396_200gLib_TeloHAEC_scRNAseq_2_calls.tsv.gz 476.5 Kb (ftp)(http) TSV
GSE212396_200gLib_TeloHAEC_scRNAseq_2_features.tsv.gz 264.3 Kb (ftp)(http) TSV
GSE212396_200gLib_TeloHAEC_scRNAseq_2_matrix.mtx.gz 211.0 Mb (ftp)(http) MTX
GSE212396_200gLib_targets_n_guides.txt.gz 35.5 Kb (ftp)(http) TXT
GSE212396_50gLib_Eahy926_150K_scRNAseq_barcodes.tsv.gz 154.3 Kb (ftp)(http) TSV
GSE212396_50gLib_Eahy926_150K_scRNAseq_calls.tsv.gz 343.8 Kb (ftp)(http) TSV
GSE212396_50gLib_Eahy926_150K_scRNAseq_features.tsv.gz 265.5 Kb (ftp)(http) TSV
GSE212396_50gLib_Eahy926_150K_scRNAseq_matrix.mtx.gz 319.4 Mb (ftp)(http) MTX
GSE212396_50gLib_Eahy926_16K_scRNAseq_barcodes.tsv.gz 22.9 Kb (ftp)(http) TSV
GSE212396_50gLib_Eahy926_16K_scRNAseq_calls.tsv.gz 52.1 Kb (ftp)(http) TSV
GSE212396_50gLib_Eahy926_16K_scRNAseq_features.tsv.gz 264.3 Kb (ftp)(http) TSV
GSE212396_50gLib_Eahy926_16K_scRNAseq_matrix.mtx.gz 56.9 Mb (ftp)(http) MTX
GSE212396_50gLib_TeloHAEC_150K_scRNAseq_barcodes.tsv.gz 213.4 Kb (ftp)(http) TSV
GSE212396_50gLib_TeloHAEC_150K_scRNAseq_calls.tsv.gz 415.8 Kb (ftp)(http) TSV
GSE212396_50gLib_TeloHAEC_150K_scRNAseq_features.tsv.gz 264.3 Kb (ftp)(http) TSV
GSE212396_50gLib_TeloHAEC_150K_scRNAseq_matrix.mtx.gz 252.3 Mb (ftp)(http) MTX
GSE212396_50gLib_targets_n_guides.txt.gz 1.6 Kb (ftp)(http) TXT
SRA Run SelectorHelp
Raw data are available in SRA
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap