Epistasis complicates our understanding of protein sequence-function relationships and impedes our ability to build accurate predictive models for novel genotypes. Although pairwise epistasis has been extensively studied in proteins, the significance of higher-order epistasis for protein sequence-function relationships remains contentious, largely due to challenges in fitting higher-order epistatic interactions for full-length proteins. Here, we introduce a novel transformer-based approach. The key feature of our method is that we can adjust the order of interactions fit by the model by changing the number of attention layers, while also accounting for any global nonlinearity induced by the experimental conditions. This allows us to test whether including higher-order interactions improves model performance. Applying our method to 10 large protein sequence-function datasets, we found that the importance of higher-order epistasis differs substantially between proteins, accounting for up to 60% of the total variance attributed to epistasis. We also found that including higher-order epistasis is particularly important for generalizing locally sampled fitness data to distant regions of sequence space, and for modeling an additional multi-peak fitness landscape derived from combining mutagenesis data from 4 orthologous green fluorescent proteins. Our findings suggest that higher-order epistasis often plays an important role in protein sequence-function relationships, and thus should be properly incorporated during protein engineering and evolutionary data analysis.
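To make the notion of "variance attributed to epistasis" by interaction order concrete, here is a minimal sketch that is not the transformer method described in the abstract. It swaps in a classical Walsh-Hadamard decomposition of a small, fully enumerated biallelic landscape, and it ignores the global nonlinearity the abstract mentions. All function names and the toy landscape are assumptions made up for illustration; none of the numbers come from the paper.

```python
def walsh_coefficients(fitness, n_sites):
    """Walsh-Hadamard coefficients of a landscape over {0,1}^n_sites.

    fitness[g] is the phenotype of genotype g, where bit i of g is the
    allele at site i. Coefficient s describes the interaction among the
    sites whose bits are set in s.
    """
    n = 1 << n_sites
    coeffs = []
    for s in range(n):
        total = 0.0
        for g in range(n):
            # Character of subset s evaluated at genotype g: (-1)^|g & s|
            sign = -1 if bin(g & s).count("1") % 2 else 1
            total += sign * fitness[g]
        coeffs.append(total / n)
    return coeffs


def variance_by_order(coeffs, n_sites):
    """Sum squared coefficients by interaction order (number of sites)."""
    var = [0.0] * (n_sites + 1)
    for s, a in enumerate(coeffs):
        var[bin(s).count("1")] += a * a
    var[0] = 0.0  # the order-0 term is the mean, not variance
    return var


def higher_order_share(var):
    """Fraction of epistatic variance (order >= 2) due to order >= 3."""
    epistatic = sum(var[2:])
    return sum(var[3:]) / epistatic if epistatic > 0 else 0.0


def toy_landscape(n_sites=3):
    """Hypothetical 3-site landscape with known interaction terms."""
    fitness = []
    for g in range(1 << n_sites):
        # Encode each allele as +1 (bit 0) or -1 (bit 1)
        s = [1 - 2 * ((g >> i) & 1) for i in range(n_sites)]
        fitness.append(
            1.0 * s[0] + 0.5 * s[1]          # additive effects
            + 0.3 * s[0] * s[1]              # pairwise epistasis
            + 0.4 * s[0] * s[1] * s[2]       # third-order epistasis
        )
    return fitness


if __name__ == "__main__":
    var = variance_by_order(walsh_coefficients(toy_landscape(), 3), 3)
    print(round(higher_order_share(var), 2))  # prints 0.64
```

In this toy case the pairwise term contributes 0.3² = 0.09 and the third-order term 0.4² = 0.16, so higher-order epistasis makes up 0.16 / 0.25 = 64% of the epistatic variance. Exhaustive enumeration like this scales as 4^L in the number of sites and is only feasible for very short sequences, which is one reason a model-based estimate is needed for full-length proteins.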
Download full-text PDF | Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11463489 | PMC |
http://dx.doi.org/10.1101/2024.09.22.614318 | DOI Listing |