For most biological and medical applications of single-cell transcriptomics, an integrative study of multiple heterogeneous single-cell RNA sequencing (scRNA-seq) data sets is crucial. However, present approaches are unable to integrate diverse data sets from various biological conditions effectively because of the confounding effects of biological and technical differences. We introduce single-cell integration (scInt), an integration method based on accurate, robust cell-cell similarity construction and unified contrastive biological variation learning from multiple scRNA-seq data sets.
View Article and Find Full Text PDFEpstein-Barr virus (EBV) persists in human cells as episomes. EBV episomes are chromatinized and their 3D conformation varies greatly in cells expressing different latency genes. We used HiChIP, an assay which combines genome-wide chromatin conformation capture followed by deep sequencing (Hi-C) and chromatin immunoprecipitation (ChIP), to interrogate the EBV episome 3D conformation in different cancer cell lines.
View Article and Find Full Text PDFBackground: Single-cell RNA sequencing (scRNA-seq) provides a powerful tool to capture transcriptomes at single-cell resolution. However, dropout events distort the gene expression levels and underlying biological signals, misleading the downstream analysis of scRNA-seq data.
Results: We develop a statistical model-based multidimensional imputation algorithm, scMTD, that identifies local cell neighbors and specific gene co-expression networks based on the pseudo-time of cells, leveraging information on cell-level, gene-level, and transcriptome dynamic to recover scRNA-seq data.
With the rapid development of methylation profiling technology, many datasets are generated to quantify genome-wide methylation patterns. Given the heavy burden of multiple testing of hundreds of thousands of DNA methylation markers, individual studies often have limited sample sizes and power. The EWAS meta-analysis is an approach that combines results from multiple studies on the same scientific question.
View Article and Find Full Text PDFBackground: Epithelial-to-mesenchymal transition (EMT) is a process linked to metastasis and drug resistance with non-coding RNAs (ncRNAs) playing pivotal roles. We previously showed that miR-100 and miR-125b, embedded within the third intron of the ncRNA host gene MIR100HG, confer resistance to cetuximab, an anti-epidermal growth factor receptor (EGFR) monoclonal antibody, in colorectal cancer (CRC). However, whether the MIR100HG transcript itself has a role in cetuximab resistance or EMT is unknown.
View Article and Find Full Text PDFThe single-cell RNA sequencing (scRNA-seq) technologies obtain gene expression at single-cell resolution and provide a tool for exploring cell heterogeneity and cell types. As the low amount of extracted mRNA copies per cell, scRNA-seq data exhibit a large number of dropouts, which hinders the downstream analysis of the scRNA-seq data. We propose a statistical method, SDImpute (Single-cell RNA-seq Dropout Imputation), to implement block imputation for dropout events in scRNA-seq data.
View Article and Find Full Text PDFCOVID-19 patients always develop multiple organ dysfunction syndromes other than lungs, suggesting the novel virus SARS-CoV-2 also invades other organs. Therefore, studying the viral susceptibility of other organs is important for a deeper understanding of viral pathogenesis. Angiotensin-converting enzyme II (ACE2) is the receptor protein of SARS-CoV-2, and TMPRSS2 promotes virus proliferation and transmission.
View Article and Find Full Text PDFT-cell receptor (TCR) is crucial in T cell-mediated virus clearance. To date, TCR bias has been observed in various diseases. However, studies on the TCR repertoire of COVID-19 patients are lacking.
View Article and Find Full Text PDFBMC Bioinformatics
December 2020
Background: Single-cell RNA sequencing can be used to fairly determine cell types, which is beneficial to the medical field, especially the many recent studies on COVID-19. Generally, single-cell RNA data analysis pipelines include data normalization, size reduction, and unsupervised clustering. However, different normalization and size reduction methods will significantly affect the results of clustering and cell type enrichment analysis.
View Article and Find Full Text PDFBackground: With the rapid development of high-throughput sequencing technologies, many datasets on the same biological subject are generated. A meta-analysis is an approach that combines results from different studies on the same topic. The random-effects model in a meta-analysis enables the modeling of differences between studies by incorporating the between-study variance.
View Article and Find Full Text PDFBackground: The association between BIN1 rs744373 variant and Alzheimer's disease (AD) had been identified by genome-wide association studies (GWASs) as well as candidate gene studies in Caucasian populations. But in East Asian populations, both positive and negative results had been identified by association studies. Considering the smaller sample sizes of the studies in East Asian, we believe that the results did not have enough statistical power.
View Article and Find Full Text PDFObservational studies strongly supported the association of low levels of circulating 25-hydroxyvitamin D (25OHD) and cognitive impairment or dementia in aging populations. However, randomized controlled trials have not shown clear evidence that vitamin D supplementation could improve cognitive outcomes. In fact, some studies reported the association between vitamin D and cognitive impairment based on individuals aged 60 years and over.
View Article and Find Full Text PDFBackground: During procedures for conducting multiple sequence alignment, that is so essential to use the substitution score of pairwise alignment. To compute adaptive scores for alignment, researchers usually use Hidden Markov Model or probabilistic consistency methods such as partition function. Recent studies show that optimizing the parameters for hidden Markov model, as well as integrating hidden Markov model with partition function can raise the accuracy of alignment.
View Article and Find Full Text PDFThe advancement of high-throughput RNA sequencing has uncovered the profound truth in biology, ranging from the study of differential expressed genes to the identification of different genomic phenotype across multiple conditions. However, lack of biological replicates and low expressed data are still obstacles to measuring differentially expressed genes effectively. We present an algorithm based on differential entropy-like function (DEF) to test for the differential expression across time-course data or multi-sample data with few biological replicates.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
February 2019
Alzheimer's disease (AD), a neurodegenerative diseases (neuro-diseases) which is prevalent in the elderly and seriously affects the lives of individuals. Many studies have discussed the relationship between immune system and AD pathogenesis. Here, the meta-analysis of differentially expressed (DE) genes based on microarray data was conducted to study the association between AD and immune system.
View Article and Find Full Text PDFBackground: Alzheimer's disease (AD) and Parkinson's disease (PD) are the top two common neurodegenerative diseases in elderly. Recent studies found the α-synuclein have a key role in AD. Although many clinical and pathological features between AD and PD are shared, the genetic association between them remains unclear, especially whether α-synuclein in PD genetically alters AD risk.
View Article and Find Full Text PDFNeurobiol Aging
December 2018
Alzheimer's disease (AD) is the leading cause of dementia in older adults. It is more than 50 years since vitamin E was recognized as critical for optimal neurological health. Clinical studies have yielded inconsistent findings on the effect of vitamin E on AD risk.
View Article and Find Full Text PDFGenetic association studies have identified significant association between the GAB2 rs2373115 variant and Alzheimer's disease (AD). However, it is unknown whether rs2373115 affects the regulation of nearby genes. Here, we evaluate the potential effect of rs2373115 on gene expression using multiple eQTL (expression quantitative trait loci) datasets from human brain tissues from the Mayo Clinic brain expression genome-wide association study (eGWAS), the UK Brain Expression Consortium (UKBEC), the Genotype-Tissue Expression (GTEx) project, and the Brain xQTL Serve.
View Article and Find Full Text PDFMotivation: With the development of biotechnology, DNA methylation data showed exponential growth. Epigenome-wide association study (EWAS) provide a systematic approach to uncovering epigenetic variants underlying common diseases/phenotypes. But the EWAS software has lagged behind compared with genome-wide association study (GWAS).
View Article and Find Full Text PDF