This paper presents an algorithm which is able to extract discriminant rules from oligopeptides for protease proteolytic cleavage activity prediction. The algorithm is developed using genetic programming. Three important components in the algorithm are a min-max scoring function, the reverse Polish notation (RPN) and the use of minimum description length. The min-max scoring function is developed using amino acid similarity matrices for measuring the similarity between an oligopeptide and a rule, which is a complex algebraic equation of amino acids rather than a simple pattern sequence. The Fisher ratio is then calculated on the scoring values using the class label associated with the oligopeptides. The discriminant ability of each rule can therefore be evaluated. The use of RPN makes the evolutionary operations simpler and therefore reduces the computational cost. To prevent overfitting, the concept of minimum description length is used to penalize over-complicated rules. A fitness function is therefore composed of the Fisher ratio and the use of minimum description length for an efficient evolutionary process. In the application to four protease datasets (Trypsin, Factor Xa, Hepatitis C Virus and HIV protease cleavage site prediction), our algorithm is superior to C5, a conventional method for deriving decision trees.

Download full-text PDF

Source
http://dx.doi.org/10.1016/s0303-2647(03)00141-2DOI Listing

Publication Analysis

Top Keywords

min-max scoring
12
scoring function
12
minimum description
12
description length
12
protease proteolytic
8
proteolytic cleavage
8
cleavage activity
8
genetic programming
8
prediction algorithm
8
fisher ratio
8

Similar Publications

Aim: The aim was to investigate feeding type at discharge; exclusively breastfeeding (EBF), mixed breastfeeding (MBF), and formula milk feeding (FMF), factors associated with feeding type, and changes in weight-for-age z-score (ΔWAZ) in infants admitted to Danish neonatal units.

Methods: Using data from the Danish National Quality Database for Births and the Danish Newborn Quality Database, we included 8639 mother-infant dyads admitted ≥5 days between February 2019 and December 2021. We used logistic regression to investigate associations between maternal and infant factors and feeding type, and descriptive statistics to describe ΔWAZ and feeding type at discharge.

View Article and Find Full Text PDF

Background: The aim of the study was to determine the level of dietary self-efficacy, physical activity and obesogenic environment in several districts of Istanbul and to examine the relationship between them.

Methods: In this cross-sectional study, a questionnaire was administered to primary health care workers working in three different districts of Istanbul. Sociodemographic questions, Dieting Self-Efficiency Scale (DSES), The Assessment of the Obesogenic Environment Scale (AOES), and International Physical Activity Questionnaire (IPAQ) were used in the survey.

View Article and Find Full Text PDF

Background: This study aimed to investigate the utility of minimally invasive sinus laser therapy (SiLaT) versus flap surgery (Karydakis flap procedure) in terms of intraoperative parameters and postoperative outcome in patients with pilonidal sinus disease (PSD).

Methods: A total of 106 patients with PSD (mean ± SD age: 26.4 ± 7.

View Article and Find Full Text PDF

Human milk oligosaccharides (HMOs) have been positively associated with child neurodevelopment in some cohort studies. However, there is a lack of consistency in the association between HMOs and benefits to infants' brains. Moreover, the quantification methods for HMOs have not yet been standardized.

View Article and Find Full Text PDF

Effect of alemtuzumab on fatigue, quality of life, and patient/caregiver-reported outcomes in relapsing-remitting multiple sclerosis-A real-world evidence study.

Mult Scler Relat Disord

December 2024

Department of Clinical Medicine, University of Bergen, Bergen, Norway; Neuro-SysMed, Department of Neurology, Haukeland University Hospital, Bergen, Norway. Electronic address:

Background: Alemtuzumab is approved in the European Union for treating highly active relapsing-remitting multiple sclerosis (RRMS). Patient-reported outcomes measure the treatment impact on quality of life (QoL), including fatigue, a common symptom in multiple sclerosis (MS). Chronic diseases like MS also affect the patient's caregiver.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!