Publications by authors named "Kim Anh Le Cao"

Identifying statistical associations between biological variables is crucial to understanding molecular mechanisms. Most association studies are based on correlation or linear regression analyses, but the identified associations often lack reproducibility and interpretability due to the complexity and variability of omics datasets, making it difficult to translate associations into meaningful biological hypotheses. We developed StableMate, a regression framework, to address these challenges through a process of variable selection across heterogeneous datasets.

Recent advances in bioinformatics and high-throughput sequencing have enabled the large-scale recovery of genomes from metagenomes. This has the potential to bring important insights as researchers can bypass cultivation and analyze genomes sourced directly from environmental samples. There are, however, technical challenges associated with this process, most notably the complexity of computational workflows required to process metagenomic data, which include dozens of bioinformatics software tools, each with its own set of customizable parameters that affect the final output of the workflow.

Introduction: Preeclampsia is a life-threatening disorder of pregnancy unique to humans. Interleukin (IL)11 is elevated in serum from pregnancies that subsequently develop early-onset preeclampsia and pharmacological elevation of IL11 in pregnant mice causes the development of early-onset preeclampsia-like features (hypertension, proteinuria, and fetal growth restriction). However, the mechanism by which IL11 drives preeclampsia is unknown.

The Environmental Determinants of Islet Autoimmunity (ENDIA) pregnancy-birth cohort investigates the developmental origins of type 1 diabetes (T1D), with recruitment between 2013 and 2019. ENDIA is the first study in the world with comprehensive data and biospecimen collection during pregnancy, at birth and through childhood from at-risk children who have a first-degree relative with T1D. Environmental exposures are thought to drive the progression to clinical T1D, with pancreatic islet autoimmunity (IA) developing in genetically susceptible individuals.

Microbial communities are highly dynamic and sensitive to changes in the environment. Thus, microbiome data are highly susceptible to batch effects, defined as sources of unwanted variation that are not related to and obscure any factors of interest. Existing batch effect correction methods have been primarily developed for gene expression data.

Fecal samples are frequently used to characterize bacterial populations of the gastrointestinal tract. A protocol is provided to profile gut bacterial populations using rodent fecal samples. We describe the optimal procedures for collecting rodent fecal samples, isolating genomic DNA, 16S rRNA gene V4 region sequencing, and bioinformatic analyses.

The high-dimensional nature of proteomics data presents challenges for statistical analysis and biological interpretation. Multivariate analysis, combined with insightful visualization, can help to reveal the underlying patterns in complex biological data. This chapter introduces the R package mixOmics, which focuses on data exploration and integration.

BACKGROUND: Antigen-specific regulation of autoimmune disease is a major goal. In seropositive rheumatoid arthritis (RA), T cell help to autoreactive B cells matures the citrullinated (Cit) antigen-specific immune response, generating RA-specific V domain glycosylated anti-Cit protein antibodies (ACPA VDG) before arthritis onset. Low or escalating antigen administration under "sub-immunogenic" conditions favors tolerance.

Article Synopsis
  • Huntington's disease is a serious neurodegenerative disorder that affects the brain and body, and it currently has no effective treatments to slow its progression.
  • Recent research indicates that disruptions in the gut microbiome may influence cognitive function in Huntington's disease, suggesting a link between gut health and brain health known as the microbiota-gut-brain axis.
  • A study found that fecal microbiota transplant (FMT) from healthy mice improved cognitive abilities in female Huntington's disease mice, indicating a potential new therapeutic approach; male mice responded less well, which was attributed to more pronounced gut microbiome changes and imbalances.

The microbiome is a complex and dynamic community of microorganisms that co-exist interdependently within an ecosystem and interact with their host or environment. Longitudinal studies can capture temporal variation within the microbiome to gain mechanistic insights into microbial systems; however, current statistical methods are limited due to the complex and inherent features of the data. We have identified three analytical objectives in longitudinal microbial studies: (1) differential abundance over time and between sample groups, demographic factors or clinical variables of interest; (2) clustering of microorganisms evolving concomitantly across time; and (3) network modelling to identify temporal relationships between microorganisms.

Characterizing the molecular identity of a cell is an essential step in single-cell RNA sequencing (scRNA-seq) data analysis. Numerous tools exist for predicting cell identity using single-cell reference atlases. However, many challenges remain, including correcting for inherent batch effects between reference and query data and insufficient phenotype data from the reference.

Huntington's disease (HD) is a neurodegenerative disorder caused by a trinucleotide expansion in the HTT gene, which is expressed throughout the brain and body, including the gut epithelium and enteric nervous system. Afflicted individuals suffer from progressive impairments in motor, psychiatric, and cognitive faculties, as well as peripheral deficits, including the alteration of the gut microbiome. However, studies characterizing the gut microbiome in HD have focused entirely on the bacterial component, while the fungal community (mycobiome) has been overlooked.

The success of tropical scleractinian corals depends on their ability to establish symbioses with microbial partners. Host phylogeny and traits are known to shape the coral microbiome, but to what extent they affect its composition remains unclear. Here, by using 12 coral species representing the complex and robust clades, we explored the influence of host phylogeny, skeletal architecture, and reproductive mode on the microbiome composition, and further investigated the structure of the tissue and skeleton bacterial communities.

Gut dysbiosis in Huntington's disease (HD) has recently been reported using microbiome profiling in R6/1 HD mice and replicated in clinical HD. In HD mice, environmental enrichment (EE) and exercise (EX) were shown to have therapeutic impacts on the brain and associated symptoms. We hypothesize that these housing interventions modulate the gut microbiome, configuring one of the mechanisms that mediate their therapeutic effects observed in HD.

Multi-omics integration is key to fully understanding complex biological processes in a holistic manner. Furthermore, multi-omics combined with new longitudinal experimental designs can reveal dynamic relationships between omics layers and identify key players or interactions in system development or complex phenotypes. However, integration methods have to address various experimental designs and do not guarantee interpretable biological results.

Evolving in sync with the computation revolution over the past 30 years, computational biology has emerged as a mature scientific field. While the field has made major contributions toward improving scientific knowledge and human health, individual computational biology practitioners at various institutions often languish in career development. As optimistic biologists passionate about the future of our field, we propose solutions for both eager and reluctant individual scientists, institutions, publishers, funding agencies, and educators to fully embrace computational biology.

Motivation: Multi-omics data integration enables the global analysis of biological systems and discovery of new biological insights. Multi-omics experimental designs have been further extended with a longitudinal dimension to study dynamic relationships between molecules. However, methods that integrate longitudinal multi-omics data are still in their infancy.

Omics longitudinal studies are effective experimental designs to inform on the stability and dynamics of microbial communities in response to perturbations, but time-course analytical frameworks are required to fully exploit the temporal information acquired in this context. In this study, we investigate the influence of ammonia on the stability of the anaerobic digestion (AD) microbiome with a new statistical framework. Ammonia can severely reduce AD performance.

The first 1000 days of life, from conception to 2 years, are a critical window for the influence of environmental exposures on the assembly of the oral microbiome, which is the precursor to dental caries (decay), one of the most prevalent microbially induced disorders worldwide. While it is known that the human microbiome is susceptible to environmental exposures, there is limited understanding of the impact of prenatal and early childhood exposures on the oral microbiome trajectory and oral health. A barrier has been the lack of technology to directly measure the foetal "exposome", which includes nutritional and toxic exposures crossing the placenta.

Despite the volume of experiments performed and data available, the complex biology of the coronavirus SARS-CoV-2 is not yet fully understood. Existing molecular profiling studies have focused on analysing functional omics data of a single type, which captures changes in a small subset of the molecular perturbations caused by the virus. As the logical next step, results from multiple such omics analyses may be aggregated to comprehensively interpret the molecular mechanisms of SARS-CoV-2.

The Stemformatics myeloid atlas is an integrated transcriptome atlas of human macrophages and dendritic cells that systematically compares freshly isolated tissue-resident, cultured, and pluripotent stem cell-derived myeloid cells. Three classes of tissue-resident macrophage were identified: Kupffer cells and microglia; monocyte-associated; and tumor-associated macrophages. Culture had a major impact on all primary cell phenotypes.

The integration of multiple omics datasets measured on the same samples is a challenging task: data come from heterogeneous sources and vary in signal quality. In addition, some omics data are inherently compositional, e.g.
