Motivation: We have proposed a mixture model based approach to the concordant integrative analysis of multiple large-scale two-sample expression datasets. Since the mixture model is based on the transformed differential expression test P-values (z-scores), it is generally applicable to the expression data generated by either microarray or RNA-seq platforms. The mixture model is simple with three normal distribution components for each dataset to represent down-regulation, up-regulation and no differential expression. However, when the number of datasets increases, the model parameter space increases exponentially due to the component combination from different datasets.
Results: In this study, motivated by the well-known generalized estimating equations (GEEs) for longitudinal data analysis, we focus on the concordant components and assume that the proportions of non-concordant components follow a special structure. We discuss the exchangeable, multiset coefficient and autoregressive structures for model reduction, and their related expectation-maximization (EM) algorithms. Then, the parameter space is linear with the number of datasets. In our previous study, we have applied the general mixture model to three microarray datasets for lung cancer studies. We show that more gene sets (or pathways) can be detected by the reduced mixture model with the exchangeable structure. Furthermore, we show that more genes can also be detected by the reduced model. The Cancer Genome Atlas (TCGA) data have been increasingly collected. The advantage of incorporating the concordance feature has also been clearly demonstrated based on TCGA RNA sequencing data for studying two closely related types of cancer.
Availability And Implementation: Additional results are included in a supplemental file. Computer program R-functions are freely available at http://home.gwu.edu/∼ylai/research/Concordance.
Contact: ylai@gwu.edu.
Supplementary Information: Supplementary data are available at Bioinformatics online.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860313 | PMC |
http://dx.doi.org/10.1093/bioinformatics/btx061 | DOI Listing |
Langmuir
January 2025
College of Chemistry and Chemical Engineering, Qingdao University, Qingdao 266071, China.
The recovery of valuable materials from spent lithium-ion batteries (LIBs) has experienced increasing demand in recent years. Current recycling technologies are typically energy-intensive and are often plagued by high operation costs, low processing efficiency, and environmental pollution concerns. In this study, an efficient and environmentally friendly dielectrophoresis (DEP)-based approach is proposed to separate the main components of "black mass" mixtures from LIBs, specifically lithium iron phosphate (LFP) and graphite, based on their polarizability differences.
View Article and Find Full Text PDFBiosci Microbiota Food Health
August 2024
Central Research Institute, Itoen Ltd., 21 Mekami, Sagara-cho, Haibara-gun, Shizuoka, Japan.
Probiotics exert their beneficial effects by improving the intestinal environment. Heat-inactivated probiotics may show similar effects. However, whether multi-strain mixtures (MSM) are better than single strains, irrespective of whether the bacteria are alive or dead, is unknown.
View Article and Find Full Text PDFCharacterizing brain dynamic functional connectivity (dFC) patterns from functional Magnetic Resonance Imaging (fMRI) data is of paramount importance in neuroscience and medicine. Recently, many graph neural network (GNN) models, combined with transformers or recurrent neural networks (RNNs), have shown great potential for modeling the dFC patterns. However, these methods face challenges in effectively characterizing the modularity organization of brain networks and capturing varying dFC state patterns.
View Article and Find Full Text PDFSci Rep
January 2025
Health Management Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, China.
This longitudinal study sought to identify distinct body mass index (BMI) trajectories and investigate the impact of these level-independent BMI trajectories on the prevalence of thyroid nodules (TN).This study encompassed a cohort of 1967 participants from a hospital in China. Utilizing latent class growth mixture modeling (LCGMM), four BMI trajectory groups were identified based on the BMI of individuals without TN from 2017 to 2019.
View Article and Find Full Text PDFSci Rep
January 2025
IRC-ISS, King Fahd University of Petroleum and Minerals, Dhahran, 34463, Saudi Arabia.
In real-world scenarios, mixture models are frequently employed to fit complex data, demonstrating remarkable flexibility and efficacy. This paper introduces an innovative Pufferfish privacy algorithm based on Gaussian priors, specifically designed for Gaussian mixture models. By leveraging a sophisticated masking mechanism, the algorithm effectively safeguards data privacy.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!