Publications by authors named "Xiaoning Qian"

(Cb), the causative agent of Q fever, replicates within host macrophages by modulating immune responses through poorly understood mechanisms. Long non-coding RNAs (lncRNAs) are emerging as critical regulators of inflammation, yet their role in Cb pathogenesis remains largely unexplored. Here, we employed a global transcriptomic approach to identify lncRNAs specific to Cb infection in THP-1 derived macrophages, compared to 15 other microbial infections.

View Article and Find Full Text PDF

Radiation exposure poses a significant threat to human health. Emerging research indicates that even low-dose radiation once believed to be safe, may have harmful effects. This perception has spurred a growing interest in investigating the potential risks associated with low-dose radiation exposure across various scenarios.

View Article and Find Full Text PDF

Spatially resolved transcriptomics has revolutionized genome-scale transcriptomic profiling by providing high-resolution characterization of transcriptional patterns. Here, we present our spatial transcriptomics analysis framework, MUSTANG (MUlti-sample Spatial Transcriptomics data ANalysis with cross-sample transcriptional similarity Guidance), which is capable of performing multi-sample spatial transcriptomics spot cellular deconvolution by allowing both cross-sample expression-based similarity information sharing as well as spatial correlation in gene expression patterns within samples. Experiments on a semi-synthetic spatial transcriptomics dataset and three real-world spatial transcriptomics datasets demonstrate the effectiveness of MUSTANG in revealing biological insights inherent in the cellular characterization of tissue samples under study.

View Article and Find Full Text PDF

Many powerful methods have been employed to elucidate the global transcriptomic, proteomic, or metabolic responses to pathogen-infected host cells. However, the host glycome responses to bacterial infection remain largely unexplored, and hence, our understanding of the molecular mechanisms by which bacterial pathogens manipulate the host glycome to favor infection remains incomplete. Here, we address this gap by performing a systematic analysis of the host glycome during infection by the bacterial pathogen Brucella spp.

View Article and Find Full Text PDF

Significant acceleration of the future discovery of novel functional materials requires a fundamental shift from the current materials discovery practice, which is heavily dependent on trial-and-error campaigns and high-throughput screening, to one that builds on knowledge-driven advanced informatics techniques enabled by the latest advances in signal processing and machine learning. In this review, we discuss the major research issues that need to be addressed to expedite this transformation along with the salient challenges involved. We especially focus on Bayesian signal processing and machine learning schemes that are uncertainty aware and physics informed for knowledge-driven learning, robust optimization, and efficient objective-driven experimental design.

View Article and Find Full Text PDF

The need for efficient computational screening of molecular candidates that possess desired properties frequently arises in various scientific and engineering problems, including drug discovery and materials design. However, the enormous search space containing the candidates and the substantial computational cost of high-fidelity property prediction models make screening practically challenging. In this work, we propose a general framework for constructing and optimizing a high-throughput virtual screening (HTVS) pipeline that consists of multi-fidelity models.

View Article and Find Full Text PDF

TRIMER, Transcription Regulation Integrated with MEtabolic Regulation, is a genome-scale modeling pipeline targeting at metabolic engineering applications. Using TRIMER, regulated metabolic reactions can be effectively predicted by integrative modeling of metabolic reactions with a transcription factor-gene regulatory network (TRN), which is modeled through a Bayesian network (BN). In this article, we focus on sensitivity analysis of metabolic flux prediction for uncertainty quantification of BN structures for TRN modeling in TRIMER.

View Article and Find Full Text PDF

Purpose: Researchers have developed a constellation model of decoding-related reading disabilities (RD) to improve the RD risk determination. The model's hallmark is its inclusion of various RD indicators to determine RD risk. Classification methods such as logistic regression (LR) might be one way to determine RD risk within the constellation model framework.

View Article and Find Full Text PDF

TOPAS (TOPological network-based Alignment of Structural RNAs) is a network-based alignment algorithm that predicts structurally sound pairwise alignment of RNAs. In order to take advantage of recent advances in comparative network analysis for efficient structurally sound RNA alignment, TOPAS constructs topological network representations for RNAs, which consist of sequential edges connecting nucleotide bases as well as structural edges reflecting the underlying folding structure. Structural edges are weighted by the estimated base-pairing probabilities.

View Article and Find Full Text PDF

Van der Waals (vdW) heterostructures are constructed by different two-dimensional (2D) monolayers vertically stacked and weakly coupled by van der Waals interactions. VdW heterostructures often possess rich physical and chemical properties that are unique to their constituent monolayers. As many 2D materials have been recently identified, the combinatorial configuration space of vdW-stacked heterostructures grows exceedingly large, making it difficult to explore through traditional experimental or computational approaches in a trial-and-error manner.

View Article and Find Full Text PDF
Article Synopsis
  • The immune system uses a process called phagocytosis to destroy germs, but some germs, like the one that causes brucellosis, can trick this defense.
  • This germ activates a special process that breaks down a protein (BLOS1) needed for moving germs to lysosomes, where they get destroyed.
  • When certain cells or mice couldn't use this breakdown process, they were better at fighting off the infection, showing that this protein plays an important role in protecting against germs.
View Article and Find Full Text PDF

Classification has been a major task for building intelligent systems because it enables decision-making under uncertainty. Classifier design aims at building models from training data for representing feature-label distributions-either explicitly or implicitly. In many scientific or clinical settings, training data are typically limited, which impedes the design and evaluation of accurate classifiers.

View Article and Find Full Text PDF

Transfer learning (TL) techniques can enable effective learning in data scarce domains by allowing one to re-purpose data or scientific knowledge available in relevant source domains for predictive tasks in a target domain of interest. In this Data in Brief article, we present a synthetic dataset for binary classification in the context of Bayesian transfer learning, which can be used for the design and evaluation of TL-based classifiers. For this purpose, we consider numerous combinations of classification settings, based on which we simulate a diverse set of feature-label distributions with varying learning complexity.

View Article and Find Full Text PDF

This protocol explains the pipeline for condition-dependent metabolite yield prediction using Transcription Regulation Integrated with MEtabolic Regulation (TRIMER). TRIMER targets metabolic engineering applications via a hybrid model integrating transcription factor (TF)-gene regulatory network (TRN) with a Bayesian network (BN) inferred from transcriptomic expression data to effectively regulate metabolic reactions. For and yeast, TRIMER achieves reliable knockout phenotype and flux predictions from the deletion of one or more TFs at the genome scale.

View Article and Find Full Text PDF

Motivation: Accurate disease diagnosis and prognosis based on omics data rely on the effective identification of robust prognostic and diagnostic markers that reflect the states of the biological processes underlying the disease pathogenesis and progression. In this article, we present GCNCC, a Graph Convolutional Network-based approach for Clustering and Classification, that can identify highly effective and robust network-based disease markers. Based on a geometric deep learning framework, GCNCC learns deep network representations by integrating gene expression data with protein interaction data to identify highly reproducible markers with consistently accurate prediction performance across independent datasets possibly from different platforms.

View Article and Find Full Text PDF

There has been extensive research in predictive modeling of genome-scale metabolic reaction networks. Living systems involve complex stochastic processes arising from interactions among different biomolecules. For more accurate and robust prediction of target metabolic behavior under different conditions, not only metabolic reactions but also the genetic regulatory relationships involving transcription factors (TFs) affecting these metabolic reactions should be modeled.

View Article and Find Full Text PDF
Article Synopsis
  • CPI-613 disrupts mitochondrial metabolism by inhibiting the tricarboxylic acid (TCA) cycle, affecting the overall physiology of zebrafish.
  • Embryo-larval zebrafish exposed to CPI-613 showed an initial decrease in oxygen consumption, but by Day 20, their oxygen use returned to normal levels.
  • The study combined experimental data with computational models, revealing that CPI-613 exposure impaired ATP synthesis and fatty acid metabolism, ultimately impacting the zebrafish's cellular energy production and overall health.
View Article and Find Full Text PDF

Motivation: When learning to subtype complex disease based on next-generation sequencing data, the amount of available data is often limited. Recent works have tried to leverage data from other domains to design better predictors in the target domain of interest with varying degrees of success. But they are either limited to the cases requiring the outcome label correspondence across domains or cannot leverage the label information at all.

View Article and Find Full Text PDF

Survival data analysis has been leveraged in medical research to study disease morbidity and mortality, and to discover significant bio-markers affecting them. A crucial objective in studying high dimensional medical data is the development of inherently interpretable models that can efficiently capture sparse underlying signals while retaining a high predictive accuracy. Recently developed rule ensemble models have been shown to effectively accomplish this objective; however, they are computationally expensive when applied to survival data and do not account for sparsity in the number of variables included in the generated rules.

View Article and Find Full Text PDF

Melon (Cucumis melo L.) is an important diploid crop with a wide variety of flavors due to its distinct aromatic volatile organic compounds (VOC). To understand the development of VOC profiles during fruit development, we performed metabolomic and transcriptomic analysis of two cantaloupe varieties over the course of fruit development.

View Article and Find Full Text PDF

Multi-state model (MSM) is a useful tool to analyze longitudinal data for modeling disease progression at multiple time points. While the regularization approaches to variable selection have been widely used, extending them to MSM remains largely unexplored. In this paper, we have developed the L1-regularized multi-state model (L1MSTATE) framework that enables parameter estimation and variable selection simultaneously.

View Article and Find Full Text PDF

Melon is one of the most popular fruits worldwide and has been bred into various cultivars. RNA-sequencing using healthy melon fruit was performed to determine differences in gene expression among cultivars. Unexpected RNA-seq results revealed that viruses asymptomatically infected fruits at a high frequency (16 of 21 fruits examined were infected) and that viral transcripts highly accumulated in comparison with host transcripts (15 %-75 % of total reads).

View Article and Find Full Text PDF

Background: The current computational methods on identifying conserved protein complexes across multiple Protein-Protein Interaction (PPI) networks suffer from the lack of explicit modeling of the desired topological properties within conserved protein complexes as well as their scalability.

Results: To overcome those issues, we propose a scalable algorithm-ClusterM-for identifying conserved protein complexes across multiple PPI networks through the integration of network topology and protein sequence similarity information. ClusterM overcomes the computational barrier that existed in previous methods, where the complexity escalates exponentially when handling an increasing number of PPI networks; and it is able to detect conserved protein complexes with both topological separability and cohesive protein sequence conservation.

View Article and Find Full Text PDF