|
Status |
Public on Apr 27, 2024 |
Title |
Site saturation mutagenesis of 500 human protein domains |
Organism |
Saccharomyces cerevisiae |
Experiment type |
Other
|
Summary |
Missense variants that change the amino acid sequences of proteins cause one third of human genetic diseases. Tens of millions of missense variants exist in the current human population, with the vast majority having unknown functional consequences. Here we present the first large-scale experimental analysis of human missense variants. Using DNA synthesis and cellular selection experiments we quantify the impact of >500,000 variants on the abundance of >500 human protein domains. This dataset, Domainome 1.0, reveals that >60% of disease-causing variants destabilize proteins. The contribution of stability to protein fitness varies across proteins and diseases, and is particularly important in recessive disorders. Combining experimental stability measurements with large language models we annotate functionally important sites across domains. Fitting energy models to the data demonstrates the conservation of mutation effects in homologous domains and allows stability to be accurately predicted for entire domain families. Domainome 1.0 demonstrates the feasibility of assaying human protein variant effects at scale and provides a large consistent reference dataset for clinical variant interpretation and the training and benchmarking of computational methods.
|
|
|
Overall design |
We designed site saturation mutagenesis variant libraries of >500 protein domains, and selected them using an in cell abundance assay to measure the effects of mutations on protein abundance.
|
|
|
Contributor(s) |
Antoni B, Ben L |
Citation missing |
Has this study been published? Please login to update or notify GEO. |
Submission date |
Apr 26, 2024 |
Last update date |
Apr 27, 2024 |
Contact name |
Antoni Beltran |
E-mail(s) |
toni.beltran@crg.eu
|
Organization name |
CRG Barcelona
|
Street address |
Dr Aiguader, 88
|
City |
Barcelona |
ZIP/Postal code |
08003 |
Country |
Spain |
|
|
Platforms (3) |
GPL19756 |
Illumina NextSeq 500 (Saccharomyces cerevisiae) |
GPL27812 |
Illumina NovaSeq 6000 (Saccharomyces cerevisiae) |
GPL31112 |
NextSeq 2000 (Saccharomyces cerevisiae) |
|
Samples (78)
|
|
Relations |
BioProject |
PRJNA1105013 |