The development of intracellular ice in the bodies of cold-blooded living organisms may cause them to die. These species yield antifreeze proteins (AFPs) to live in subzero temperature environments. Additionally, AFPs are implemented in biotechnological, industrial, agricultural, and medical fields. Machine learning-based predictors were presented for AFP identification. However, more accurate predictors are still highly desirable for boosting the AFP prediction. This work presents a novel approach, named AFP-SPTS, for the correct prediction of AFPs. We explored the discriminative features with four schemes, namely, dipeptide deviation from the expected mean (DDE), reduced amino acid alphabet (RAAA), grouped dipeptide composition (GDPC), and a novel representative method, called pseudo-position-specific scoring matrix tri-slicing (PseTS-PSSM). Considering the advantages of ensemble learning strategy, we fused each feature vector into different combinations and trained the models with five machine learning algorithms, i.e., multilayer perceptron (MLP), extremely randomized tree (ERT), decision tree (DT), random forest (RF), and AdaBoost. Among all models, PseTS-PSSM + RAAA with an extremely randomized tree attained the best outcomes. The proposed predictor (AFP-SPTS) boosted the accuracies of AFPs in the literature by 1.82 and 4.1%.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.jcim.2c01417DOI Listing

Publication Analysis

Top Keywords

extremely randomized
12
randomized tree
12
antifreeze proteins
8
afp-spts accurate
4
accurate prediction
4
prediction antifreeze
4
proteins sequential
4
sequential pseudo-tri-slicing
4
pseudo-tri-slicing evolutionary
4
evolutionary features
4

Similar Publications

Background: Depression is being increasingly acknowledged as an important risk factor contributing to coronary heart disease (CHD). Currently, there is no predictive model specifically designed to evaluate the risk of coronary heart disease among individuals with depression. We aim to develop a machine learning (ML) model that will analyze risk factors and forecast the probability of coronary heart disease in individuals suffering from depression.

View Article and Find Full Text PDF

Background: Unbound bilirubin (UB) was measured on day 5 ± 1 in 1101 ELBW newborns in the Aggressive vs Conservative Phototherapy randomized controlled trial. We accessed this dataset to quantify the UB-mediated risk of severe neurodevelopmental impairment (sNDI) in extremely low birthweight (ELBW) newborns.

Methods: UB levels were standardized within laboratories as z-score percentiles.

View Article and Find Full Text PDF

Evaluating Machine Learning and Deep Learning models for predicting Wind Turbine power output from environmental factors.

PLoS One

January 2025

Renewable Energy Science and Engineering Department, Faculty of Postgraduate Studies for Advanced Sciences (PSAS), Beni-Suef University, Beni-Suef, Egypt.

This study presents a comprehensive comparative analysis of Machine Learning (ML) and Deep Learning (DL) models for predicting Wind Turbine (WT) power output based on environmental variables such as temperature, humidity, wind speed, and wind direction. Along with Artificial Neural Network (ANN), Long Short-Term Memory (LSTM), Recurrent Neural Network (RNN), and Convolutional Neural Network (CNN), the following ML models were looked at: Linear Regression (LR), Support Vector Regressor (SVR), Random Forest (RF), Extra Trees (ET), Adaptive Boosting (AdaBoost), Categorical Boosting (CatBoost), Extreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM). Using a dataset of 40,000 observations, the models were assessed based on R-squared, Mean Absolute Error (MAE), and Root Mean Square Error (RMSE).

View Article and Find Full Text PDF

Interpreting Lung Cancer Health Disparity between African American Males and European American Males.

Proceedings (IEEE Int Conf Bioinformatics Biomed)

December 2024

Knight Foundation School of Computing and Information Sciences, Florida International University, Miami, USA.

Lung cancer remains a predominant cause of cancer-related deaths, with notable disparities in incidence and outcomes across racial and gender groups. This study addresses these disparities by developing a computational framework leveraging explainable artificial intelligence (XAI) to identify both patient- and cohort-specific biomarker genes in lung cancer. Specifically, we focus on two lung cancer subtypes, Lung Adenocarcinoma (LUAD) and Lung Squamous Cell Carcinoma (LUSC), examining distinct racial and sex-specific cohorts: African American males (AAMs) and European American males (EAMs).

View Article and Find Full Text PDF

Objective: The objective of this study is to compare the effectiveness of low-load blood flow restriction training (LL-BFRT) to heavy-load resistance training (HL-RT) in male collegiate athletes with chronic non-specific low back pain (CNLBP).

Methods: Twenty-six participants were randomly assigned to LL-BFRT ( = 13) or HL-RT ( = 13). All participants supervised exercises (deep-squat, lateral pull-down, bench-press and machine seated crunch) cycled 4 times per week for 4 weeks (16 sessions).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!