Motivation: Identifying differentially expressed genes (DEGs) in transcriptome data is an important task. However, the performance of existing DEG detection methods varies significantly across data sets measured under different conditions, and no single statistical or machine learning model performs consistently well on data sets of different traits. In addition, choosing a cutoff value for the significance of differential expression is itself a confounding factor in determining DEGs.

Results: We address these problems by developing an ensemble model that refines the heterogeneous and inconsistent results of existing methods by taking into account network information such as network propagation and network properties. DEG candidates that existing tools predict with weak evidence are re-classified by the proposed ensemble model for the transcriptome data. Tested on 10 RNA-seq data sets downloaded from the Gene Expression Omnibus (GEO), our method ranked first in detecting ground truth (GT) genes in eight data sets and found almost all GT genes in six data sets. In contrast, the performance of all existing methods varied significantly across the 10 data sets. Because of its design principle, our method can naturally accommodate any new DEG method.
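
As an illustration of the general idea only, network propagation is commonly implemented as a random walk with restart over a gene interaction network. The abstract does not specify how the ensemble model uses propagation, so the sketch below is an assumption and not the MLDEG implementation: the function name `propagate`, the toy network, the gene labels, and the vote fractions are all hypothetical. It shows how two weak candidates with identical evidence from existing tools can be separated by their proximity to strongly supported DEGs.

```python
def propagate(adjacency, seeds, restart=0.5, iterations=100, tol=1e-9):
    """Random walk with restart over an undirected gene network.

    adjacency: dict mapping each gene to the set of its neighbours
    seeds: dict mapping gene -> initial evidence (hypothetical: the
           fraction of existing DEG tools that called the gene)
    """
    genes = sorted(adjacency)
    total = sum(seeds.get(g, 0.0) for g in genes)
    if total == 0:
        return {g: 0.0 for g in genes}
    # Normalise the seed evidence into a starting distribution.
    p0 = {g: seeds.get(g, 0.0) / total for g in genes}
    p = dict(p0)
    for _ in range(iterations):
        nxt = {}
        for g in genes:
            # Each neighbour splits its score equally among its own neighbours.
            inflow = sum(p[n] / len(adjacency[n]) for n in adjacency[g])
            nxt[g] = (1 - restart) * inflow + restart * p0[g]
        delta = sum(abs(nxt[g] - p[g]) for g in genes)
        p = nxt
        if delta < tol:
            break
    return p


# Toy network (hypothetical): G1 and G2 are called by every tool,
# G3 and G5 are weak candidates with the same initial evidence.
network = {
    "G1": {"G2", "G3"},
    "G2": {"G1", "G3"},
    "G3": {"G1", "G2", "G4"},
    "G4": {"G3", "G5"},
    "G5": {"G4"},
}
votes = {"G1": 1.0, "G2": 1.0, "G3": 0.33, "G5": 0.33}
scores = propagate(network, votes)
```

In this toy example G3 neighbours both strongly supported genes while G5 does not, so G3 ends up with the higher propagated score even though the two started with equal evidence. A real pipeline would feed such scores, together with other network features, into a classifier; that step is not shown here.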

Availability: The source code of our method is available at https://github.com/jihmoon/MLDEG.

Source: http://dx.doi.org/10.1109/TCBB.2021.3067613
