NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE265942 Query DataSets for GSE265942
Status Public on Apr 27, 2024
Title Site saturation mutagenesis of 500 human protein domains
Organism Saccharomyces cerevisiae
Experiment type Other
Summary Missense variants that change the amino acid sequences of proteins cause one third of human genetic diseases. Tens of millions of missense variants exist in the current human population, with the vast majority having unknown functional consequences. Here we present the first large-scale experimental analysis of human missense variants. Using DNA synthesis and cellular selection experiments we quantify the impact of >500,000 variants on the abundance of >500 human protein domains. This dataset, Domainome 1.0, reveals that >60% of disease-causing variants destabilize proteins. The contribution of stability to protein fitness varies across proteins and diseases, and is particularly important in recessive disorders. Combining experimental stability measurements with large language models we annotate functionally important sites across domains. Fitting energy models to the data demonstrates the conservation of mutation effects in homologous domains and allows stability to be accurately predicted for entire domain families. Domainome 1.0 demonstrates the feasibility of assaying human protein variant effects at scale and provides a large consistent reference dataset for clinical variant interpretation and the training and benchmarking of computational methods.
 
Overall design We designed site saturation mutagenesis variant libraries of >500 protein domains, and selected them using an in cell abundance assay to measure the effects of mutations on protein abundance.
 
Contributor(s) Antoni B, Ben L
Citation missing Has this study been published? Please login to update or notify GEO.
Submission date Apr 26, 2024
Last update date Apr 27, 2024
Contact name Antoni Beltran
E-mail(s) toni.beltran@crg.eu
Organization name CRG Barcelona
Street address Dr Aiguader, 88
City Barcelona
ZIP/Postal code 08003
Country Spain
 
Platforms (3)
GPL19756 Illumina NextSeq 500 (Saccharomyces cerevisiae)
GPL27812 Illumina NovaSeq 6000 (Saccharomyces cerevisiae)
GPL31112 NextSeq 2000 (Saccharomyces cerevisiae)
Samples (78)
GSM8232325 A1_input_1
GSM8232326 A1_input_2
GSM8232327 A1_input_3
Relations
BioProject PRJNA1105013

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE265942_aPCA_variant_growthrates.txt.gz 77.4 Mb (ftp)(http) TXT
SRA Run SelectorHelp
Raw data are available in SRA

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap