Bayesian Genome-wide TWAS Method to Leverage both cis- and trans-eQTL Information through Summary Statistics

Am J Hum Genet. 2020 Oct 1;107(4):714-726. doi: 10.1016/j.ajhg.2020.08.022. Epub 2020 Sep 21.

Abstract

Transcriptome-wide association studies (TWASs) have been widely used to integrate gene expression and genetic data for studying complex traits. Due to the computational burden, existing TWAS methods do not assess distant trans-expression quantitative trait loci (eQTL) that are known to explain important expression variation for most genes. We propose a Bayesian genome-wide TWAS (BGW-TWAS) method that leverages both cis- and trans-eQTL information for a TWAS. Our BGW-TWAS method is based on Bayesian variable selection regression, which not only accounts for cis- and trans-eQTL of the target gene but also enables efficient computation by using summary statistics from standard eQTL analyses. Our simulation studies illustrated that BGW-TWASs achieved higher power compared to existing TWAS methods that do not assess trans-eQTL information. We further applied BWG-TWAS to individual-level GWAS data (N = ∼3.3K), which identified significant associations between the genetically regulated gene expression (GReX) of ZC3H12B and Alzheimer dementia (AD) (p value = 5.42 × 10-13), neurofibrillary tangle density (p value = 1.89 × 10-6), and global measure of AD pathology (p value = 9.59 × 10-7). These associations for ZC3H12B were completely driven by trans-eQTL. Additionally, the GReX of KCTD12 was found to be significantly associated with β-amyloid (p value = 3.44 × 10-8) which was driven by both cis- and trans-eQTL. Four of the top driven trans-eQTL of ZC3H12B are located within APOC1, a known major risk gene of AD and blood lipids. Additionally, by applying BGW-TWAS with summary-level GWAS data of AD (N = ∼54K), we identified 13 significant genes including known GWAS risk genes HLA-DRB1 and APOC1, as well as ZC3H12B.

Keywords: Alzheimer dementia; Bayesian variable selection model; TWAS; cis-eQTL; summary statistics; trans-eQTL; transcriptome-wide association study.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aged
  • Aged, 80 and over
  • Alzheimer Disease / diagnosis
  • Alzheimer Disease / genetics*
  • Alzheimer Disease / metabolism
  • Alzheimer Disease / pathology
  • Amyloid beta-Peptides / genetics
  • Amyloid beta-Peptides / metabolism
  • Apolipoprotein C-I / genetics*
  • Apolipoprotein C-I / metabolism
  • Bayes Theorem
  • Case-Control Studies
  • Computer Simulation
  • Female
  • Gene Expression
  • Genetic Markers
  • Genome, Human*
  • Genome-Wide Association Study
  • HLA-DRB1 Chains / genetics
  • HLA-DRB1 Chains / metabolism
  • Humans
  • Male
  • Models, Statistical*
  • Neurofibrillary Tangles / metabolism
  • Neurofibrillary Tangles / pathology
  • Proteins / genetics*
  • Proteins / metabolism
  • Quantitative Trait Loci*
  • Ribonucleases / genetics*
  • Ribonucleases / metabolism
  • Transcriptome

Substances

  • APOC1 protein, human
  • Amyloid beta-Peptides
  • Apolipoprotein C-I
  • Genetic Markers
  • HLA-DRB1 Chains
  • KCTD12 protein, human
  • Proteins
  • Ribonucleases
  • ZC3H12B protein, human