Publications by authors named "Ruibin Xi"

Feature selection by expectation maximization test (Festem) enables the direct selection of cell type marker genes, facilitating downstream clustering of single-cell RNA sequencing (scRNA-seq) data. Here, we present a protocol for using Festem to identify marker genes in scRNA-seq data and perform subsequent analyses. We describe comprehensive steps for setting up the environment, marker gene selection, clustering, and marker gene assignment.


Background: Determining the impact of somatic mutations requires understanding the functional relationship of genes acquiring mutations; however, it is largely unknown how mutations in functionally related genes influence each other.

Methods: We employed non-synonymous-to-synonymous (dN/dS) ratios to evaluate the evolutionary dependency (ED) of gene pairs, assuming that a mutation in one gene of a pair acts as a mutation context that can affect the evolutionary fitness of mutations in its partner gene. We used PanCancer- and tumor type-specific mutational profiles to infer the ED of gene pairs and evaluated their biological relevance with respect to gene dependency and drug sensitivity.
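
As a rough illustration of the ratio named above, the following Python sketch computes a crude non-synonymous-to-synonymous ratio for one gene and compares it between two groups of tumors (partner gene mutated or not). The function name `dnds_ratio`, the site counts, and all mutation counts are hypothetical, and this simple per-site normalization is an assumption for illustration; it is not the authors' estimator.

```python
def dnds_ratio(n_nonsyn, n_syn, nonsyn_sites, syn_sites):
    """Crude dN/dS: per-site non-synonymous rate divided by per-site synonymous rate."""
    if n_syn <= 0 or nonsyn_sites <= 0 or syn_sites <= 0:
        raise ValueError("synonymous count and site counts must be positive")
    dn = n_nonsyn / nonsyn_sites  # non-synonymous mutations per non-synonymous site
    ds = n_syn / syn_sites        # synonymous mutations per synonymous site
    return dn / ds


# Hypothetical counts for gene B, split by whether partner gene A is mutated.
ratio_a_mutated = dnds_ratio(n_nonsyn=30, n_syn=10, nonsyn_sites=300, syn_sites=100)
ratio_a_wildtype = dnds_ratio(n_nonsyn=15, n_syn=10, nonsyn_sites=300, syn_sites=100)

# A higher ratio in the A-mutated context would hint that mutations in A
# change the selective pressure on mutations in B.
print(ratio_a_mutated, ratio_a_wildtype)
```

A ratio near 1 is conventionally read as neutral evolution, above 1 as positive selection, and below 1 as negative selection; a real analysis would also need a mutational-spectrum model and a statistical test.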


In single-cell RNA sequencing (scRNA-seq) studies, cell types and their marker genes are often identified by clustering and differentially expressed gene (DEG) analysis. A common practice is to select genes using surrogate criteria such as variance and deviance, then cluster cells using the selected genes and detect markers by DEG analysis assuming known cell types. The surrogate criteria can miss important genes or select unimportant genes, while DEG analysis suffers from the selection-bias problem.


Background: Gallbladder cancer (GBC) is the most common and lethal malignancy of the biliary tract and lacks effective therapy. In many GBC cases, infiltration into adjacent organs or distant metastasis has occurred long before diagnosis; direct liver invasion is the most common and least favorable route of spread.

Methods: Single-cell RNA sequencing (scRNA-seq), spatial transcriptomics (ST), proteomics, and multiplexed immunohistochemistry (mIHC) were performed on GBC across multiple tumor stages to characterize the tumor microenvironment (TME), focusing specifically on the preferential enrichment of neutrophils in GBC liver invasion (GBC-LI).

Article Synopsis
  • Improvements in single-cell whole-genome sequencing (scWGS) have allowed for better analysis of somatic copy number alterations (CNAs) on a single-cell basis, making it possible to observe genetic variations within individual cells.
  • The newly developed tool HiScanner combines various data metrics to identify CNAs with greater precision, outperforming existing methods in simulated tests and real high-coverage scWGS data from human brain cells.
  • HiScanner's application revealed detailed differences in CNA patterns between neuron types and tracked evolutionary changes in tumor cells by integrating CNAs with point mutations across meningioma samples.

Recurrent patellar dislocation (RPD) is known to be heritable, but its susceptibility genes remain unidentified. Here, we performed the first whole exome sequencing (WES) cohort study to identify the susceptibility genes. The results showed that eight genes were associated with this disease.


Many patient-derived tumor models have emerged recently. However, their potential to guide personalized drug selection remains unclear. Here, we report patient-derived tumor-like cell clusters (PTCs) for non-small cell lung cancer (NSCLC), capable of conducting 100-5,000 drug tests within 10 days.


Identifying expressed somatic mutations from single-cell RNA sequencing data de novo is challenging but highly valuable. We propose RESA (Recurrently Expressed SNV Analysis), a computational framework to identify expressed somatic mutations from scRNA-seq data. RESA achieves an average precision of 0.


Single-molecule real-time isoform sequencing (Iso-seq) of transcriptomes by PacBio can generate very long and accurate reads, thus providing an ideal platform for full-length transcriptome analysis. We present an integrated computational toolkit named TAGET for Iso-seq full-length transcript data analyses, including transcript alignment, annotation, gene fusion detection, and quantification analyses such as differential gene expression analysis and differential isoform usage analysis. We evaluate the performance of TAGET using a public Iso-seq dataset and newly sequenced Iso-seq datasets from tumor patients.


Background: Neoantigens are critical for anti-tumor immunity and have long been envisioned as promising therapeutic targets. However, current neoantigen analyses mostly focus on single nucleotide variations (SNVs) and indel mutations and seldom consider structural variations (SVs), which are also prevalent in cancer.

Results: Here, we develop a computational method termed NeoSV, which incorporates SV annotation, protein fragmentation, and MHC binding prediction together, to predict SV-derived neoantigens.


Background: Acute cellular rejection (ACR) is a major barrier to the long-term survival of cardiac allografts. Although immune cells are well known to play critical roles in ACR, the dynamic cellular landscape of allografts with ACR remains obscure.

Methods: Single-cell RNA sequencing (scRNA-seq) was carried out for mouse cardiac allografts with ACR.


Mutation signature analysis has been used to infer the contributions of various DNA mutagenic-repair events in individual cancer genomes. Here, we build a statistical framework using a multinomial distribution to assign individual mutations to their cognate mutation signatures. We applied it to 47 million somatic mutations in 1925 publicly available cancer genomes to obtain a mutation signature map at the resolution of individual somatic mutations.
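
The idea of assigning a single mutation to a signature can be sketched with Bayes' rule over a multinomial mixture: the probability that a mutation of a given type came from signature k is proportional to that signature's exposure in the sample times the signature's probability for that mutation type. The sketch below is only an illustration under that assumption; the function name `assign_mutation`, the signature names, the two-category mutation alphabet, and all numbers are hypothetical and are not taken from the paper.

```python
def assign_mutation(mutation_type, signatures, exposures):
    """Posterior P(signature | mutation type) under a simple multinomial mixture."""
    # Unnormalized weight: exposure of the signature times its probability
    # of emitting this mutation type.
    weights = {
        name: exposures[name] * profile.get(mutation_type, 0.0)
        for name, profile in signatures.items()
    }
    total = sum(weights.values())
    if total == 0:
        raise ValueError("no signature can explain this mutation type")
    return {name: w / total for name, w in weights.items()}


# Hypothetical toy signatures over two mutation types (real ones use 96 contexts).
signatures = {
    "SigA": {"C>T": 0.8, "C>A": 0.2},
    "SigB": {"C>T": 0.2, "C>A": 0.8},
}
exposures = {"SigA": 0.5, "SigB": 0.5}  # hypothetical per-sample signature weights

posterior = assign_mutation("C>T", signatures, exposures)
best = max(posterior, key=posterior.get)  # most probable signature for this mutation
print(best, posterior)
```

Taking the signature with the highest posterior gives a hard assignment; keeping the full posterior preserves the uncertainty for mutations that several signatures explain about equally well.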


Intrahepatic cholangiocarcinoma (iCCA) is a highly heterogeneous cancer with limited understanding of its classification and tumor microenvironment. Here, by performing single-cell RNA sequencing on 144,878 cells from 14 pairs of iCCA tumors and non-tumor liver tissues, we find that S100P and SPP1 are two markers for iCCA perihilar large duct type (iCCA) and peripheral small duct type (iCCA). S100P+SPP1- iCCA has significantly reduced levels of infiltrating CD4+ T cells and CD56+ NK cells, and increased levels of CCL18+ macrophages and PD1+CD8+ T cells, compared to S100P-SPP1+ iCCA.


Gene fusions can play important roles in tumor initiation and progression. While fusion detection so far has been from bulk samples, full-length single-cell RNA sequencing (scRNA-seq) offers the possibility of detecting gene fusions at the single-cell level. However, scRNA-seq data have a high noise level and contain various technical artifacts that can lead to spurious fusion discoveries.


Streptococcus (S.) thermophilus, an indispensable dairy starter, has been used in autochthonous as well as industrial milk fermentation. However, the genetic architecture underlying S.


Liver metastasis, the leading cause of colorectal cancer mortality, exhibits a highly heterogeneous and suppressive immune microenvironment. Here, we sequenced 97 matched samples by using single-cell RNA sequencing and spatial transcriptomics. Strikingly, the metastatic microenvironment underwent remarkable spatial reprogramming of immunosuppressive cells such as M2-like macrophages.


Diverse immune cells in the tumor microenvironment form a complex ecosystem, but our knowledge of their heterogeneity and dynamics within hepatocellular carcinoma (HCC) remains limited. To assess the plasticity and phenotypes of immune cells within the HBV/HCV-related HCC microenvironment at the single-cell level, we performed single-cell RNA sequencing on 41,698 immune cells from seven pairs of HBV/HCV-related HCC tumors and non-tumor liver tissues. We combined bioinformatic analyses, flow cytometry, and multiplex immunohistochemistry to assess the heterogeneity of different immune cell subsets in functional characteristics, transcriptional regulation, phenotypic switching, and interactions.


Several patient-derived tumor models emerged recently as robust preclinical drug-testing platforms. However, their potential to guide clinical therapy remained unclear. Here, we report a model called patient-derived tumor-like cell clusters (PTCs).


Esophageal squamous cell carcinoma (ESCC) is a poor-prognosis cancer type with limited understanding of its molecular etiology. Using 508 ESCC genomes, we identified five novel significantly mutated genes and uncovered mutational signature clusters associated with metastasis and patient outcomes. Several functional assays indicated that NFE2L2 may act as a tumor suppressor in ESCC and that mutations in NFE2L2 probably impaired its tumor-suppressive function, or even conferred oncogenic activities.


Motivation: Whole-genome sequencing (WGS) is widely used for copy number variation (CNV) detection. However, in most bacteria, the circular genome structure and high replication rate cause reads to be enriched near the replication origin. CNV detection based on read depth can be seriously affected by this replication bias.
