Spatial transcriptomics has allowed researchers to analyze transcriptome data in its tissue sample's spatial context. Various methods have been developed for detecting spatially variable genes (SV genes), whose gene expression over the tissue space shows strong spatial autocorrelation. Such genes are often used to define clusters in cells or spots downstream.
View Article and Find Full Text PDFBackground: Preeclampsia is a potentially fatal complication during pregnancy, characterized by high blood pressure and the presence of excessive proteins in the urine. Due to its complexity, the prediction of preeclampsia onset is often difficult and inaccurate.
Objective: This study aimed to create quantitative models to predict the onset gestational age of preeclampsia using electronic health records.
Background: Maternal obesity is a health concern that may predispose newborns to a high risk of medical problems later in life. To understand the transgenerational effect of maternal obesity, we conducted a multi-omics study, using DNA methylation and gene expression in the CD34+/CD38-/Lin- umbilical cord blood hematopoietic stem cells (uHSCs) and metabolomics of the cord blood, all from a multi-ethnic cohort (n=72) from Kapiolani Medical Center for Women and Children in Honolulu, Hawaii (collected between 2016 and 2018).
Results: Differential methylation (DM) analysis unveiled a global hypermethylation pattern in the maternal pre-pregnancy obese group (BH adjusted p<0.
Single-cell RNA sequencing (scRNA-Seq) data from complex human tissues have prevalent blood cell contamination due to the sample preparation process and may comprise cells of different genetic makeups. To reveal such complexity and annotate cells appropriately, we propose the first-of-its-kind computational framework, Originator, which deciphers single cells by genetic origin and separates blood cells from tissue-resident cells. We show that blood contamination is widely spread in scRNA-Seq data from a variety of tissues.
View Article and Find Full Text PDFDeciphering cell-type heterogeneity is crucial for systematically understanding tissue homeostasis and its dysregulation in diseases. Computational deconvolution is an efficient approach for estimating cell-type abundances from a variety of omics data. Despite substantial methodological progress in computational deconvolution in recent years, challenges are still outstanding.
View Article and Find Full Text PDFBackground: Single-cell multiplex imaging data have provided new insights into disease subtypes and prognoses recently. However, quantitative models that explicitly capture single-cell resolution cell-cell interaction features to predict patient survival at a population scale are currently missing.
Methods: We quantified hundreds of single-cell resolution cell-cell interaction features through neighborhood calculation, in addition to cellular phenotypes.
Objectives: To identify and internally validate metabolites predictive of spontaneous preterm birth (sPTB) using multiple machine learning methods and sequential maternal serum samples, and to predict spontaneous early term birth (sETB) using these metabolites.
Design: Case-cohort design within a prospective cohort study.
Setting: Cambridge, UK.
Unlabelled: Quantitative models that explicitly capture single-cell resolution cell-cell interaction features to predict patient survival at population scale are currently missing. Here, we computationally extracted hundreds of features describing single-cell based cell-cell interactions and cellular phenotypes from a large, published cohort of cyto-images of breast cancer patients. We applied these features to a neural-network based Cox-nnet survival model and obtained high accuracy in predicting patient survival in test data (Concordance Index > 0.
View Article and Find Full Text PDFEpigenome-wide DNA methylation analysis (EWAS) is an important approach to identify biomarkers for early disease detection and prognosis prediction, yet its results could be confounded by other factors such as cell-type heterogeneity and patient characteristics. In this study, we address the importance of confounding adjustment by examining DNA methylation patterns in cord blood exposed to severe preeclampsia (PE), a prevalent and potentially fatal pregnancy complication. Without such adjustment, a misleading global hypomethylation pattern is obtained.
View Article and Find Full Text PDFSingle-cell RNA sequencing technology has enabled in-depth analysis of intercellular heterogeneity in various diseases. However, its full potential for precision medicine has yet to be reached. Towards this, we propose A Single-cell Guided Pipeline to Aid Repurposing of Drugs (ASGARD) that defines a drug score to recommend drugs by considering all cell clusters to address the intercellular heterogeneity within each patient.
View Article and Find Full Text PDFGenomics Proteomics Bioinformatics
October 2022
Recently developed technologies to generate single-cell genomic data have made a revolutionary impact in the field of biology. Multi-omics assays offer even greater opportunities to understand cellular states and biological processes. The problem of integrating different omics data with very different dimensionality and statistical properties remains, however, quite challenging.
View Article and Find Full Text PDFVariations in telomere length (TL) have been associated with aging, stress, and many diseases. Placenta TL is an essential molecular component influencing the outcome of birth. Previous investigations into the relationship between placenta TL and preeclampsia (PE) have produced conflicting findings.
View Article and Find Full Text PDFSevere respiratory viral infections, including SARS-CoV-2, have resulted in high mortality rates despite corticosteroids and other immunomodulatory therapies. Despite recognition of the pathogenic role of neutrophils, in-depth analyses of this cell population have been limited, due to technical challenges of working with neutrophils. We undertook an unbiased, detailed analysis of neutrophil responses in adult patients with COVID-19 and healthy controls, to determine whether distinct neutrophil phenotypes could be identified during infections compared to the healthy state.
View Article and Find Full Text PDFComput Struct Biotechnol J
June 2022
Spatial transcriptomics (ST) has advanced significantly in the last few years. Such advancement comes with the urgent need for novel computational methods to handle the unique challenges of ST data analysis. Many artificial intelligence (AI) methods have been developed to utilize various machine learning and deep learning techniques for computational ST analysis.
View Article and Find Full Text PDFCancer ranks as one of the deadliest diseases worldwide. The high mortality rate associated with cancer is partially due to the lack of reliable early detection methods and/or inaccurate diagnostic tools such as certain protein biomarkers. Cell-free nucleic acids (cfNA) such as circulating long noncoding RNAs (lncRNAs) have been proposed as a new class of potential biomarkers for cancer diagnosis.
View Article and Find Full Text PDFHepatocellular carcinoma (HCC) is a leading cause of cancer death worldwide. Identification and sequencing of circulating tumor (CT) cells and clusters may allow for noninvasive molecular characterization of HCC, which is an unmet need, as many patients with HCC do not undergo biopsy. We evaluated CT cells and clusters, collected using a dual-filtration system in patients with HCC.
View Article and Find Full Text PDFMotif discovery and characterization are important for gene regulation analysis. The lack of intuitive and integrative web servers impedes the effective use of motifs. Most motif discovery web tools are either not designed for non-expert users or lacking optimization steps when using default settings.
View Article and Find Full Text PDFGenomics Proteomics Bioinformatics
June 2021
We present GranatumX, a next-generation software environment for single-cell RNA sequencing (scRNA-seq) data analysis. GranatumX is inspired by the interactive webtool Granatum. GranatumX enables biologists to access the latest scRNA-seq bioinformatics methods in a web-based graphical environment.
View Article and Find Full Text PDFPreeclampsia is a pregnancy-specific syndrome characterized by hypertension and proteinuria after 20 weeks of gestation. However, it is not well understood what lipids are involved in the development of this condition, and even less is known how these lipids mediate its formation. To reveal the relationship between lipids and preeclampsia, we conducted lipidomic profiling of maternal sera of 44 severe preeclamptic and 20 healthy pregnant women from a multiethnic cohort in Hawaii.
View Article and Find Full Text PDFIntercellular heterogeneity is a major obstacle to successful precision medicine. Single-cell RNA sequencing (scRNA-seq) technology has enabled in-depth analysis of intercellular heterogeneity in various diseases. However, its full potential for precision medicine has yet to be reached.
View Article and Find Full Text PDFThe understanding of the foreign-body responses to implanted biomaterials would benefit from the reconstruction of intracellular and intercellular signalling networks in the microenvironment surrounding the implant. Here, by leveraging single-cell RNA-sequencing data from 42,156 cells collected from the site of implantation of either polycaprolactone or an extracellular-matrix-derived scaffold in a mouse model of volumetric muscle loss, we report a computational analysis of intercellular signalling networks reconstructed from predictions of transcription-factor activation. We found that intercellular signalling networks can be clustered into modules associated with specific cell subsets, and that biomaterial-specific responses can be characterized by interactions between signalling modules for immune, fibroblast and tissue-specific cells.
View Article and Find Full Text PDFMulti-omics data are good resources for prognosis and survival prediction; however, these are difficult to integrate computationally. We introduce DeepProg, a novel ensemble framework of deep-learning and machine-learning approaches that robustly predicts patient survival subtypes using multi-omics data. It identifies two optimal survival subtypes in most cancers and yields significantly better risk-stratification than other multi-omics integration methods.
View Article and Find Full Text PDFPathological images are easily accessible data with the potential of prognostic biomarkers. Moreover, integration of heterogeneous data types from multi-modality, such as pathological image and gene expression data, is invaluable to help predicting cancer patient survival. However, the analytical challenges are significant.
View Article and Find Full Text PDFMaternal pre-pregnancy obesity may have an impact on both maternal and fetal health. We examined the microbiome recovered from placentas in a multi-ethnic maternal pre-pregnant obesity cohort, through an optimized microbiome protocol to enrich low bacterial biomass samples. We found that the microbiomes recovered from the placentas of obese pre-pregnant mothers are less abundant and less diverse when compared to those from mothers of normal pre-pregnancy weight.
View Article and Find Full Text PDF