Aurora-A is an essential cell-cycle kinase with critical roles in mitotic entry and spindle dynamics. These functions require binding partners such as CEP192 and TPX2, which modulate both kinase activity and localisation of Aurora-A. Here we investigate the structure and role of the centrosomal Aurora-A:CEP192 complex in the wider molecular network.
In eukaryotes, protein kinase signaling is regulated by a diverse array of post-translational modifications (PTMs), including phosphorylation of Ser/Thr residues and oxidation of cysteine (Cys) residues. While regulation by activation segment phosphorylation of Ser/Thr residues is well understood, relatively little is known about how oxidation of cysteine residues modulates catalysis. In this study, we investigate redox regulation of the AMPK-related Brain-selective kinases (BRSK) 1 and 2, and detail how broad catalytic activity is directly regulated through reversible oxidation and reduction of evolutionarily conserved Cys residues within the catalytic domain.
The understudied members of the druggable proteomes offer promising prospects for drug discovery efforts. While large-scale initiatives have generated valuable functional information on understudied members of the druggable gene families, translating this information into actionable knowledge for drug discovery requires specialized informatics tools and resources. Here, we review the unique informatics challenges and advances in annotating understudied members of the druggable proteome.
Motivation: Phosphorylation, a post-translational modification regulated by protein kinase enzymes, plays an essential role in almost all cellular processes. Understanding how each of the nearly 500 human protein kinases selectively phosphorylates its substrates is a foundational challenge in bioinformatics and cell signaling. Although deep learning models have been a popular means to predict kinase-substrate relationships, existing models often lack interpretability and are trained on datasets skewed toward a subset of well-studied kinases.
The Protein Kinase Ontology (ProKinO) is an integrated knowledge graph that conceptualizes the complex relationships among protein kinase sequence, structure, function, and disease in a human and machine-readable format. In this study, we have significantly expanded ProKinO by incorporating additional data on expression patterns and drug interactions. Furthermore, we have developed a completely new browser from the ground up to render the knowledge graph visible and interactive on the web.
Catalytic signaling outputs of protein kinases are dynamically regulated by an array of structural mechanisms, including allosteric interactions mediated by intrinsically disordered segments flanking the conserved catalytic domain. The doublecortin-like kinases (DCLKs) are a family of microtubule-associated proteins characterized by a flexible C-terminal autoregulatory 'tail' segment that varies in length across the various human DCLK isoforms. However, the mechanism whereby these isoform-specific variations contribute to unique modes of autoregulation is not well understood.
Autophosphorylation controls the transition between discrete functional and conformational states in protein kinases, yet the structural and molecular determinants underlying this fundamental process remain unclear. Here we show that C-terminal Tyr 530 is a de facto c-Src autophosphorylation site with slow time-resolution kinetics and a strong intermolecular component. In contrast, activation-loop Tyr 419 undergoes faster kinetics and a cis-to-trans phosphorylation switch that controls C-terminal Tyr 530 autophosphorylation, enzyme specificity, and, strikingly, c-Src non-catalytic function as a substrate.
The current understanding of farnesyltransferase (FTase) specificity was pioneered through investigations of reporters like Ras and Ras-related proteins that possess a C-terminal CaaX motif that consists of 4 amino acid residues: cysteine-aliphatic1-aliphatic2-variable (X). These studies led to the finding that proteins with the CaaX motif are subject to a 3-step post-translational modification pathway involving farnesylation, proteolysis, and carboxylmethylation. Emerging evidence indicates, however, that FTase can farnesylate sequences outside the CaaX motif and that these sequences do not undergo the canonical 3-step pathway.
Catalytic signaling outputs of protein kinases are dynamically regulated by an array of structural mechanisms, including allosteric interactions mediated by intrinsically disordered segments flanking the conserved catalytic domain. The Doublecortin Like Kinases (DCLKs) are a family of microtubule-associated proteins characterized by a flexible C-terminal autoregulatory 'tail' segment that varies in length across the various human DCLK isoforms. However, the mechanism whereby these isoform-specific variations contribute to unique modes of autoregulation is not well understood.
Motivation: The human genome encodes over 500 distinct protein kinases which regulate nearly all cellular processes by the specific phosphorylation of protein substrates. While advances in mass spectrometry and proteomics studies have identified thousands of phosphorylation sites across species, information on the specific kinases that phosphorylate these sites is currently lacking for the vast majority of phosphosites. Recently, there has been a major focus on the development of computational models for predicting kinase-substrate associations.
Protein language models, trained on millions of biologically observed sequences, generate feature-rich numerical representations of protein sequences. These representations, called sequence embeddings, can infer structure-functional properties, despite protein language models being trained on primary sequence alone. While sequence embeddings have been applied toward tasks such as structure and function prediction, applications toward alignment-free sequence classification have been hindered by the lack of studies to derive, quantify and evaluate relationships between protein sequence embeddings.
Protein language modeling is a fast-emerging deep learning method in bioinformatics with diverse applications such as structure prediction and protein design. However, application toward estimating sequence conservation for functional site prediction has not been systematically explored. Here, we present a method for the alignment-free estimation of sequence conservation using sequence embeddings generated from protein language models.
Hydrophobic cores are fundamental structural properties of proteins typically associated with protein folding and stability; however, how the hydrophobic core shapes protein evolution and function is poorly understood. Here, we investigated the role of conserved hydrophobic cores in fold-A glycosyltransferases (GT-As), a large superfamily of enzymes that catalyze formation of glycosidic linkages between diverse donor and acceptor substrates through distinct catalytic mechanisms (inverting versus retaining). Using hidden Markov models and protein structural alignments, we identify similarities in the phosphate-binding cassette (PBC) of GT-As and unrelated nucleotide-binding proteins, such as UDP-sugar pyrophosphorylases.
Protein prenylation by farnesyltransferase (FTase) is often described as the targeting of a cysteine-containing motif (CaaX) that is enriched for aliphatic amino acids at the a1 and a2 positions, while quite flexible at the X position. Prenylation prediction methods often rely on these features despite emerging evidence that FTase has broader target specificity than previously considered. Using a machine learning approach and training sets based on canonical (prenylated, proteolyzed, and carboxymethylated) and recently identified shunted motifs (prenylation only), this study aims to improve prenylation predictions with the goal of determining the full scope of prenylation potential among the 8000 possible Cxxx sequence combinations.
Glycosyltransferases (GTs) play fundamental roles in nearly all cellular processes through the biosynthesis of complex carbohydrates and glycosylation of diverse protein and small molecule substrates. The extensive structural and functional diversification of GTs presents a major challenge in mapping the relationships connecting sequence, structure, fold and function using traditional bioinformatics approaches. Here, we present a convolutional neural network with attention (CNN-attention) based deep learning model that leverages simple secondary structure representations generated from primary sequences to provide GT fold prediction with high accuracy.
Background: Protein kinases are among the largest druggable families of signaling proteins, involved in various human diseases, including cancers and neurodegenerative disorders. Despite their clinical relevance, nearly 30% of the 545 human protein kinases remain highly understudied. Comparative genomics is a powerful approach for predicting and investigating the functions of understudied kinases.
The emergence of multicellularity is strongly correlated with the expansion of tyrosine kinases, a conserved family of signaling enzymes that regulates pathways essential for cell-to-cell communication. Although tyrosine kinases have been classified from several model organisms, a molecular-level understanding of tyrosine kinase evolution across all holozoans is currently lacking. Using a hierarchical sequence constraint-based classification of diverse holozoan tyrosine kinases, we construct a new phylogenetic tree that identifies two ancient clades of cytoplasmic and receptor tyrosine kinases separated by the presence of an extended insert segment in the kinase domain connecting the D and E-helices.
Glycosyltransferases (GTs) play a central role in sustaining all forms of life through the biosynthesis of complex carbohydrates. Despite significant strides made in recent years to establish computational resources, databases and tools to understand the nature and role of carbohydrates and related glycoenzymes, a data analytics framework that connects the sequence-structure-function relationships to the evolution of GTs is currently lacking. This hinders the characterization of understudied GTs and the synthetic design of GTs for medical and biotechnology applications.
Phosphorylation of the MLKL pseudokinase by the RIPK3 kinase leads to MLKL oligomerization, translocation to, and permeabilization of, the plasma membrane to induce necroptotic cell death. The precise choreography of MLKL activation remains incompletely understood. Here, we report Monobodies, synthetic binding proteins, that bind the pseudokinase domain of MLKL within human cells and their crystal structures in complex with the human MLKL pseudokinase domain.
The complex mTORC2 is accepted to be the kinase that controls the phosphorylation of the hydrophobic motif, a key regulatory switch for AGC kinases, although whether mTOR directly phosphorylates this motif remains controversial. Here, we identified an mTOR-mediated phosphorylation site that we termed the TOR interaction motif (TIM; F-x-F-pT), which controls the phosphorylation of the hydrophobic motif of PKC and Akt and the activity of these kinases. The TIM is invariant in mTORC2-dependent AGC kinases, is evolutionarily conserved, and coevolved with mTORC2 components.
Background: Protein kinases are a large family of druggable proteins that are genomically and proteomically altered in many human cancers. Kinase-targeted drugs are emerging as promising avenues for personalized medicine because of the differential response shown by altered kinases to drug treatment in patients and cell-based assays. However, an incomplete understanding of the relationships connecting genome, proteome and drug sensitivity profiles presents a major bottleneck in targeting kinases for personalized medicine.
Not much is known about how organelles organize into patterns. In ciliates, the cortical pattern is propagated during "tandem duplication," a cell division that remodels the parental cell into two daughter cells. A key step is the formation of the division boundary along the cell's equator.
The MLKL pseudokinase is the terminal effector in the necroptosis cell death pathway. Phosphorylation by its upstream regulator, RIPK3, triggers MLKL's conversion from a dormant cytoplasmic protein into oligomers that translocate to, and permeabilize, the plasma membrane to kill cells. The precise mechanisms underlying these processes are incompletely understood, and were proposed to differ between mouse and human cells.
Glycosyltransferases (GTs) are prevalent across the tree of life and regulate nearly all aspects of cellular functions. The evolutionary basis for their complex and diverse modes of catalytic functions remains enigmatic. Here, based on deep mining of over half a million GT-A fold sequences, we define a minimal core component shared among functionally diverse enzymes.
The faithful propagation of cellular signals in most organisms relies on the coordinated functions of a large family of protein kinases that share a conserved catalytic domain. The catalytic domain is a dynamic scaffold that undergoes large conformational changes upon activation. Most of these conformational changes, such as movement of the regulatory αC-helix from an "out" to "in" conformation, hinge on a conserved, but understudied, loop termed the αC-β4 loop, which mediates conserved interactions to tether flexible structural elements to the kinase core.