Background: RNA sequencing (RNA-Seq) offers profound insights into the complex transcriptomes of diverse biological systems. However, standard differential expression analysis pipelines based on DESeq2 and edgeR encounter challenges when applied to the immediate early transcriptomes of Chlamydia spp., obligate intracellular bacteria.
View Article and Find Full Text PDFThis Voices piece will highlight the impact of artificial intelligence on algorithm development among computational biologists. How has worldwide focus on AI changed the path of research in computational biology? What is the impact on the algorithmic biology research community?
View Article and Find Full Text PDFMotivation: RNA sequencing (RNA-Seq) offers profound insights into the complex transcriptomes of diverse biological systems. However, standard differential expression analysis pipelines based on DESeq2 and edgeR encounter challenges when applied to the immediate early transcriptomes of spp., obligate intracellular bacteria.
View Article and Find Full Text PDFSpatially resolved transcriptomics technologies have opened new avenues for understanding gene expression heterogeneity in spatial contexts. However, existing methods for identifying spatially variable genes often focus solely on statistical significance, limiting their ability to capture continuous expression patterns and integrate spot-level covariates. To address these challenges, we introduce spVC, a statistical method based on a generalized Poisson model.
View Article and Find Full Text PDFBackground & Aims: Lacticaseibacillus rhamnosus GG (LGG) is the world's most consumed probiotic but its mechanism of action on intestinal permeability and differentiation along with its interactions with an essential source of signaling metabolites, dietary tryptophan (trp), are unclear.
Methods: Untargeted metabolomic and transcriptomic analyses were performed in LGG monocolonized germ-free mice fed trp-free or -sufficient diets. LGG-derived metabolites were profiled in vitro under anaerobic and aerobic conditions.
The obligate intracellular bacterium has a unique developmental cycle that alternates between two contrasting cell types. With a hardy envelope and highly condensed genome, the small elementary body (EB) maintains limited metabolic activities yet survives in extracellular environments and is infectious. After entering host cells, EBs differentiate into larger and proliferating reticulate bodies (RBs).
View Article and Find Full Text PDFAnalyzing single-cell RNA sequencing (scRNA-seq) data remains a challenge due to its high dimensionality, sparsity and technical noise. Recognizing the benefits of dimensionality reduction in simplifying complexity and enhancing the signal-to-noise ratio, we introduce scBiG, a novel graph node embedding method designed for representation learning in scRNA-seq data. scBiG establishes a bipartite graph connecting cells and expressed genes, and then constructs a multilayer graph convolutional network to learn cell and gene embeddings.
View Article and Find Full Text PDFHallmarks of the developmental cycle of the obligate intracellular pathogenic bacterium are the primary differentiation of the infectious elementary body (EB) into the proliferative reticulate body (RB) and the secondary differentiation of RBs back into EBs. The mechanisms regulating these transitions remain unclear. In this report, we developed an effective novel strategy termed dependence on plasmid-mediated expression (DOPE) that allows for the knockdown of essential genes in .
View Article and Find Full Text PDFChronic infection of hepatitis B virus (HBV) is the major cause of hepatocellular carcinoma (HCC). Notably, 90% of HBV-positive HCC cases exhibit detectable HBV integrations, hinting at the potential early entanglement of these viral integrations in tumorigenesis and their subsequent oncogenic implications. Nevertheless, the precise chronology of integration events during HCC tumorigenesis, alongside their sequential structural patterns, has remained elusive thus far.
View Article and Find Full Text PDFPaneth cells (PCs), a specialized secretory cell type in the small intestine, are increasingly recognized as having an essential role in host responses to microbiome and environmental stresses. Whether and how commensal and pathogenic microbes modify PC composition to modulate inflammation remain unclear. Using newly developed PC-reporter mice under conventional and gnotobiotic conditions, we determined PC transcriptomic heterogeneity in response to commensal and invasive microbes at single cell level.
View Article and Find Full Text PDFMotivation: Since the development of single-cell RNA sequencing (scRNA-seq) technologies, clustering analysis of single-cell gene expression data has been an essential tool for distinguishing cell types and identifying novel cell types. Even though many methods have been available for scRNA-seq clustering analysis, the majority of them are constrained by the requirement on predetermined cluster numbers or the dependence on selected initial cluster assignment.
Results: In this article, we propose an adaptive embedding and clustering method named scAce, which constructs a variational autoencoder to simultaneously learn cell embeddings and cluster assignments.
Brief Bioinform
September 2023
The advent of single-cell RNA sequencing (scRNA-seq) technologies has enabled gene expression profiling at the single-cell resolution, thereby enabling the quantification and comparison of transcriptional variability among individual cells. Although alterations in transcriptional variability have been observed in various biological states, statistical methods for quantifying and testing differential variability between groups of cells are still lacking. To identify the best practices in differential variability analysis of single-cell gene expression data, we propose and compare 12 statistical pipelines using different combinations of methods for normalization, feature selection, dimensionality reduction and variability calculation.
View Article and Find Full Text PDF, an obligate intracellular bacterial pathogen, has a unique developmental cycle involving the differentiation of invading elementary bodies (EBs) to noninfectious reticulate bodies (RBs), replication of RBs, and redifferentiation of RBs into progeny EBs. Progression of this cycle is regulated by three sigma factors, which direct the RNA polymerase to their respective target gene promoters. We hypothesized that the -specific transcriptional regulator GrgA, previously shown to activate σ66 and σ28, plays an essential role in chlamydial development and growth.
View Article and Find Full Text PDFBrief Bioinform
January 2023
Data integration is a critical step in the analysis of multiple single-cell RNA sequencing samples to account for heterogeneity due to both biological and technical variability. scINSIGHT is a new integration method for single-cell gene expression data, and can effectively use the information of biological condition to improve the integration of multiple single-cell samples. scINSIGHT is based on a novel non-negative matrix factorization model that learns common and condition-specific gene modules in samples from different biological or experimental conditions.
View Article and Find Full Text PDFThe increasing number of scRNA-seq data emphasizes the need for integrative analysis to interpret similarities and differences between single-cell samples. Although different batch effect removal methods have been developed, none are suitable for heterogeneous single-cell samples coming from multiple biological conditions. We propose a method, scINSIGHT, to learn coordinated gene expression patterns that are common among, or specific to, different biological conditions, and identify cellular identities and processes across single-cell samples.
View Article and Find Full Text PDFMotivation: Single-cell RNA sequencing technologies facilitate the characterization of transcriptomic landscapes in diverse species, tissues and cell types with unprecedented molecular resolution. In order to better understand animal development, physiology, and pathology, unsupervised clustering analysis is often used to identify relevant cell populations. Although considerable progress has been made in terms of clustering algorithms in recent years, it remains challenging to evaluate the quality of the inferred single-cell clusters, which can greatly impact downstream analysis and interpretation.
View Article and Find Full Text PDFscDesign2 is a transparent simulator that generates high-fidelity single-cell gene expression count data with gene correlations captured. This article shows how to download and install the scDesign2 R package, how to fit probabilistic models (one per cell type) to real data and simulate synthetic data from the fitted models, and how to use scDesign2 to guide experimental design and benchmark computational methods. Finally, a note is given about cell clustering as a preprocessing step before model fitting and data simulation.
View Article and Find Full Text PDFMany computational pipelines exist for the detection of differentially expressed genes. However, computational pipelines for functional gene detection rarely exist. We developed a new computational pipeline for functional gene identification from transcriptome profiling data.
View Article and Find Full Text PDFMost eukaryotic genes express alternative polyadenylation (APA) isoforms. A growing number of RNA sequencing methods, especially those used for single-cell transcriptome analysis, generate reads close to the polyadenylation site (PAS), termed nearSite reads, hence inherently containing information about APA isoform abundance. Here, we present a probabilistic model-based method named MAAPER to utilize nearSite reads for APA analysis.
View Article and Find Full Text PDFBrief Bioinform
November 2021
Single-cell RNA sequencing (scRNA-seq) technologies facilitate the characterization of transcriptomic landscapes in diverse species, tissues, and cell types with unprecedented molecular resolution. In order to evaluate various biological hypotheses using high-dimensional single-cell gene expression data, most computational and statistical methods depend on a gene feature selection step to identify genes with high biological variability and reduce computational complexity. Even though many gene selection methods have been developed for scRNA-seq analysis, there lacks a systematic comparison of the assumptions, statistical models, and selection criteria used by these methods.
View Article and Find Full Text PDF