MicroRNAs (miRNAs) are known as an important indicator of cancers. The presence of cancer can be detected by identifying the responsible miRNAs. A fuzzy-rough entropy measure (FREM) is developed which can rank the miRNAs and thereby identify the relevant ones. FREM is used to determine the relevance of a miRNA in terms of separability between normal and cancer classes. While computing the FREM for a miRNA, fuzziness takes care of the overlapping between normal and cancer expressions, whereas rough lower approximation determines their class sizes. MiRNAs are sorted according to the highest relevance (i.e., the capability of class separation) and a percentage among them is selected from the top ranked ones. FREM is also used to determine the redundancy between two miRNAs and the redundant ones are removed from the selected set, as per the necessity. A histogram based patient selection method is also developed which can help to reduce the number of patients to be dealt during the computation of FREM, while compromising very little with the performance of the selected miRNAs for most of the data sets. The superiority of the FREM as compared to some existing methods is demonstrated extensively on six data sets in terms of sensitivity, specificity, and score. While for these data sets the score of the miRNAs selected by our method varies from 0.70 to 0.91 using SVM, those results vary from 0.37 to 0.90 for some other methods. Moreover, all the selected miRNAs corroborate with the findings of biological investigations or pathway analysis tools. The source code of FREM is available at http://www.jayanta.droppages.com/FREM.html.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TCBB.2016.2623605DOI Listing

Publication Analysis

Top Keywords

data sets
12
fuzzy-rough entropy
8
entropy measure
8
histogram based
8
based patient
8
patient selection
8
mirnas
8
frem determine
8
normal cancer
8
selected mirnas
8

Similar Publications

Pharmacogenetics: Opportunities for the Research Program and Other Large Data Sets to Advance the Field.

Annu Rev Pharmacol Toxicol

January 2025

Clinical and Translational Science Institute, Colleges of Medicine and Pharmacy, The Ohio State University, Columbus, Ohio, USA.

Pharmacogenetic variation is common and an established driver of response for many drugs. There has been tremendous progress in pharmacogenetics knowledge over the last 30 years and in clinical implementation of that knowledge over the last 15 years. But there have also been many examples where translation has stalled because of the lack of available data sets for discovery or validation research.

View Article and Find Full Text PDF

Neuroinflammation immediately follows the onset of ischemic stroke in the middle cerebral artery. During this process, microglial cells are activated in and recruited to the penumbra. Microglial cells can be activated into two different phenotypes: M1, which can worsen brain injury; or M2, which can aid in long-term recovery.

View Article and Find Full Text PDF

Idiopathic pulmonary fibrosis (IPF) is a progressive lung disease characterized by repetitive alveolar injuries with excessive deposition of extracellular matrix (ECM) proteins. A crucial need in understanding IPF pathogenesis is identifying cell types associated with histopathological regions, particularly local fibrosis centers known as fibroblast foci. To address this, we integrated published spatial transcriptomics and single-cell RNA sequencing (scRNA-seq) transcriptomics and adopted the Query method and the Overlap method to determine cell type enrichments in histopathological regions.

View Article and Find Full Text PDF

BaNDyT: Bayesian Network Modeling of Molecular Dynamics Trajectories.

J Chem Inf Model

January 2025

Department of Computational and Quantitative Medicine, Beckman Research Institute of the City of Hope, 1218 S 5th Ave, Monrovia, California 91016, United States.

Bayesian network modeling (BN modeling, or BNM) is an interpretable machine learning method for constructing probabilistic graphical models from the data. In recent years, it has been extensively applied to diverse types of biomedical data sets. Concurrently, our ability to perform long-time scale molecular dynamics (MD) simulations on proteins and other materials has increased exponentially.

View Article and Find Full Text PDF

Anonymize or synthesize? Privacy-preserving methods for heart failure score analytics.

Eur Heart J Digit Health

January 2025

Klaus Tschira Institute for Integrative Computational Cardiology, University Hospital Heidelberg, Im Neuenheimer Feld 669, 69120 Heidelberg, Germany.

Aims: Data availability remains a critical challenge in modern, data-driven medical research. Due to the sensitive nature of patient health records, they are rightfully subject to stringent privacy protection measures. One way to overcome these restrictions is to preserve patient privacy by using anonymization and synthetization strategies.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!