Nuclear receptors (NRs) are a superfamily of transcription factors central to regulating many biological processes, including cell growth, death, metabolism, and immune responses. NR-mediated gene expression can be modulated by coactivators and corepressors through direct physical interaction or protein complexes with functional domains in NRs. One class of these domains includes short linear motifs (SLiMs), which facilitate protein-protein interactions, phosphorylation, and ligand binding primarily in the intrinsically disordered regions (IDRs) of proteins. Across all proteins, the number of known SLiMs is limited due to the difficulty in studying IDRs experimentally. Computational tools provide a systematic and data-driven approach for predicting functional motifs that can be used to prioritize experimental efforts. Accordingly, several tools have been developed based on sequence conservation or biophysical features; however, discrepancies in predictions make it difficult to determine the true candidate SLiMs. In this work, we present the ensemble predictor for short linear motifs (EPSLiM), a novel strategy to prioritize the residues that are most likely to be SLiMs in IDRs. EPSLiM applies a generalized linear model to integrate predictions from individual methodologies. We show that EPSLiM outperforms individual predictors, and we apply our method to NRs. The androgen receptor is an example with an N-terminal domain of 559 disordered amino acids that contains several validated SLiMs important for transcriptional activation. We use the androgen receptor to illustrate the predictive performance of EPSLiM and make the results of all human and mouse NRs publically available through the web service http://epslim.bwh.harvard.edu.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4004780 | PMC |
http://dx.doi.org/10.1210/me.2014-1006 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!