Constrained neuro fuzzy inference methodology for explainable personalised modelling with applications on gene expression data.

Sci Rep

Knowledge Engineering and Discovery Research Innovation (KEDRI), School of Engineering Computer and Mathematical Sciences, Auckland University of Technology, Auckland, New Zealand.

Published: January 2023

Interpretable machine learning models for gene expression datasets are important for understanding the decision-making process of a classifier and gaining insights on the underlying molecular processes of genetic conditions. Interpretable models can potentially support early diagnosis before full disease manifestation. This is particularly important yet, challenging for mental health. We hypothesise this is due to extreme heterogeneity issues which may be overcome and explained by personalised modelling techniques. Thus far, most machine learning methods applied to gene expression datasets, including deep neural networks, lack personalised interpretability. This paper proposes a new methodology named personalised constrained neuro fuzzy inference (PCNFI) for learning personalised rules from high dimensional datasets which are structurally and semantically interpretable. Case studies on two mental health related datasets (schizophrenia and bipolar disorders) have shown that the relatively short and simple personalised fuzzy rules provided enhanced interpretability as well as better classification performance compared to other commonly used machine learning methods. Performance test on a cancer dataset also showed that PCNFI matches previous benchmarks. Insights from our approach also indicated the importance of two genes (ATRX and TSPAN2) as possible biomarkers for early differentiation of ultra-high risk, bipolar and healthy individuals. These genes are linked to cognitive ability and impulsive behaviour. Our findings suggest a significant starting point for further research into the biological role of cognitive and impulsivity-related differences. With potential applications across bio-medical research, the proposed PCNFI method is promising for diagnosis, prognosis, and the design of personalised treatment plans for better outcomes in the future.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9829920PMC
http://dx.doi.org/10.1038/s41598-022-27132-8DOI Listing

Publication Analysis

Top Keywords

gene expression
12
machine learning
12
constrained neuro
8
neuro fuzzy
8
fuzzy inference
8
personalised modelling
8
expression datasets
8
mental health
8
learning methods
8
personalised
7

Similar Publications

The most damaging disease affecting citrus globally is Huanglongbing (HLB), primarily attributed to the infection by ' asiaticus' (Las). Based on comparative transcriptome data, two cellulose synthase (CESA) genes responsive to Las infection induction were screened, and one gene cloned with higher differential expression level was selected and named . we verified the interaction between CsCESA1 and citrus exopolysaccharide 2 (CsEPS2) proteins.

View Article and Find Full Text PDF

Defining the cellular factors that drive growth rate and proteome composition is essential for understanding and manipulating cellular systems. In bacteria, ribosome concentration is known to be a constraining factor of cell growth rate, while gene concentration is usually assumed not to be limiting. Here, using single-molecule tracking, quantitative single-cell microscopy, and modeling, we show that genome dilution in cells arrested for DNA replication limits total RNA polymerase activity within physiological cell sizes across tested nutrient conditions.

View Article and Find Full Text PDF

Purpose: Renal cell carcinoma (RCC), exhibiting remarkable heterogeneity, can be highly infiltrated by regulatory T cells (Tregs). However, the relationship between Treg and the heterogeneity of RCC remains to be explored.

Methods: We acquired single-cell RNA-seq profiles and 537 bulk RNA-seq profiles of TCGA-KIRC cohort.

View Article and Find Full Text PDF

Objective: Searching for potential biomarkers and therapeutic targets for early diagnosis of gynecological tumors to improve patient survival.

Methods: Microarray datasets of cervical cancer (CC) and ovarian cancer (OC) were downloaded from the Gene Expression Omnibus (GEO) database, then, differential gene expression between cancerous and normal tissues in the datasets was analyzed. Weighted gene co-expression network analysis (WGCNA) was performed to screen for co-expression modules associated with CC and OC.

View Article and Find Full Text PDF

Background: Gliomas, particularly glioblastoma (GBM), are the most common and aggressive primary brain tumors in adults, characterized by high malignancy and frequent recurrence. Despite standard treatments, including surgery, radiotherapy, and chemotherapy, the prognosis for GBM remains poor, with a median survival of less than 15 months and a five-year survival rate below 10%. Tumor heterogeneity and resistance to treatment create significant challenges in controlling glioma progression.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!