ASAP: a machine learning framework for local protein properties.

Database (Oxford)

Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, The Hebrew University, Jerusalem 91904, Israel

Published: November 2017

Determining residue-level protein properties, such as sites of post-translational modifications (PTMs), is vital to understanding protein function. Experimental methods are costly and time-consuming, while traditional rule-based computational methods fail to annotate sites lacking substantial similarity. Machine learning (ML) methods are becoming fundamental in annotating unknown proteins and their heterogeneous properties. We present ASAP (Amino-acid Sequence Annotation Prediction), a universal ML framework for predicting residue-level properties. ASAP extracts numerous features from raw sequences and supports easy integration of external features such as secondary structure, solvent accessibility, intrinsic disorder or PSSM profiles. These features are then used to train ML classifiers. ASAP can create new classifiers within minutes for a variety of tasks, including PTM prediction (e.g. cleavage sites by convertase, phosphoserine modification). We present a detailed case study for ASAP: CleavePred, an ASAP-based model that predicts protein precursor cleavage sites with state-of-the-art results. Protein cleavage is a PTM common to a wide variety of proteins that share minimal sequence similarity. Current rule-based methods suffer from high false positive rates, making them suboptimal. The high performance of CleavePred makes it suitable for analyzing new proteomes at a genomic scale. The tool is attractive for protein design, mass spectrometry search engines and the discovery of new bioactive peptides from precursors. ASAP functions as a baseline approach for residue-level protein sequence prediction. CleavePred is freely accessible as a web-based application. Both ASAP and CleavePred are open-source with a flexible Python API.

Database URL: Source code, webtool and tutorials for ASAP and CleavePred are available at: https://github.com/ddofer/asap; http://protonet.cs.huji.ac.il/cleavepred.
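
The abstract describes extracting features from raw sequences and feeding them to ML classifiers. A minimal sketch of one common way to do this, windowed one-hot encoding around each residue, follows. It is not ASAP's actual API: the function names (`residue_windows`, `one_hot`, `featurize`), the padding symbol and the default window size are all assumptions made for illustration.

```python
from typing import List

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids
PAD = "_"  # hypothetical padding symbol for positions past the sequence ends


def residue_windows(sequence: str, flank: int = 3) -> List[str]:
    """Return one window of length 2*flank+1 centered on each residue."""
    padded = PAD * flank + sequence.upper() + PAD * flank
    width = 2 * flank + 1
    return [padded[i:i + width] for i in range(len(sequence))]


def one_hot(window: str) -> List[int]:
    """Encode a window as a flat 0/1 vector with 20 slots per position.

    Padding and non-standard letters are encoded as all zeros.
    """
    vector: List[int] = []
    for residue in window:
        vector.extend(1 if residue == aa else 0 for aa in AMINO_ACIDS)
    return vector


def featurize(sequence: str, flank: int = 3) -> List[List[int]]:
    """Build one feature vector per residue (illustrative only)."""
    return [one_hot(window) for window in residue_windows(sequence, flank)]


if __name__ == "__main__":
    features = featurize("MKRSLLA", flank=2)
    print(len(features), "residues,", len(features[0]), "features each")
```

Each per-residue vector could then be paired with a label (for example, cleavage site or not) and passed to any standard classifier. External per-residue features such as predicted secondary structure would simply be appended to each vector.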


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5045867
DOI: http://dx.doi.org/10.1093/database/baw133

