Publications by authors named "Jin-xing Liu"

Machine learning techniques have become increasingly important in analyzing single-cell RNA and identifying cell types, providing valuable insights into cellular development and disease mechanisms. However, the presence of batch effects poses major challenges in scRNA-seq analysis due to data distribution variation across batches. Although several batch effect mitigation algorithms have been proposed, most of them focus only on the correlation of local structure embeddings, ignoring global distribution matching and discriminative feature representation in batch correction.

View Article and Find Full Text PDF

Background: Epilepsy is a prevalent neurological disorder in which seizures cause recurrent episodes of unconsciousness or muscle convulsions, seriously affecting the patient's work, quality of life, and health and safety. Timely prediction of seizures is critical for patients to take appropriate therapeutic measures. Accurate prediction of seizures remains a challenge due to the complex and variable nature of EEG signals.

View Article and Find Full Text PDF

Recent advancements in spatially transcriptomics (ST) technologies have enabled the comprehensive measurement of gene expression profiles while preserving the spatial information of cells. Combining gene expression profiles and spatial information has been the most commonly used method to identify spatial functional domains and genes. However, most existing spatial domain decipherer methods are more focused on spatially neighboring structures and fail to take into account balancing the self-characteristics and the spatial structure dependency of spots.

View Article and Find Full Text PDF

Cell type annotation refers to the process of categorizing and labeling cells to identify their specific cell types, which is crucial for understanding cell functions and biological processes. Although many methods have been developed for automated cell type annotation, they often encounter challenges such as batch effects due to variations in data distribution across platforms and species, thereby compromising their performance. To address batch effects, in this study, a pre-trained domain adaptation model based on structural similarity, named pscAdapt, is proposed for cell type annotation.

View Article and Find Full Text PDF
Article Synopsis
  • The development of single-cell RNA sequencing (scRNA-seq) technology allows researchers to analyze biological processes at the cellular level, focusing on tasks like cell type identification through unsupervised clustering.
  • Despite numerous clustering methods existing, many fail to fully leverage the relationships between cells, leading to less effective clustering outcomes.
  • The proposed scGAMF framework integrates graph autoencoder-based analysis with a multi-level kernel space to improve clustering accuracy by utilizing multiple feature sets and a consensus affinity matrix, demonstrating superior performance in real dataset validations compared to other methods.
View Article and Find Full Text PDF

Circular RNAs (circRNAs) have emerged as a novel class of non-coding RNAs with regulatory roles in disease pathogenesis. Computational models aimed at predicting circRNA-disease associations offer valuable insights into disease mechanisms, thereby enabling the development of innovative diagnostic and therapeutic approaches while reducing the reliance on costly wet experiments. In this study, SGFCCDA is proposed for predicting potential circRNA-disease associations based on scale graph convolutional networks and feature convolution.

View Article and Find Full Text PDF
Article Synopsis
  • The study focuses on improving seizure prediction for individuals with epilepsy through a novel hybrid deep learning model that combines DenseNet and Vision Transformer (ViT).
  • DenseNet effectively captures features and reduces parameters, while ViT provides self-attention and represents global features, with an attention fusion layer merging both for better predictions.
  • Using the CHB-MIT dataset for evaluation, the model shows high accuracy and efficiency in prediction, suggesting that this approach could lead to better therapeutic interventions for epilepsy patients.
View Article and Find Full Text PDF

Alzheimer's disease (AD) is a highly inheritable neurological disorder, and brain imaging genetics (BIG) has become a rapidly advancing field for comprehensive understanding its pathogenesis. However, most of the existing approaches underestimate the complexity of the interactions among factors that cause AD. To take full appreciate of these complexity interactions, we propose BIGFormer, a graph Transformer with local structural awareness, for AD diagnosis and identification of pathogenic mechanisms.

View Article and Find Full Text PDF
Article Synopsis
  • GWAS is a study that helps scientists understand the genetics behind complex diseases but often misses interactions between different genetic changes called SNPs.
  • To find these important interactions, researchers use genome-wide interaction studies, but they face challenges like not having good models and datasets to test their methods.
  • The paper compares different tools used for simulating SNP data, which helps improve interactions studies, and discusses the pros and cons of each tool to help scientists create better research methods.
View Article and Find Full Text PDF

Since genomics was proposed, the exploration of genes has been the focus of research. The emergence of single-cell RNA sequencing (scRNA-seq) technology makes it possible to explore gene expression at the single-cell level. Due to the limitations of sequencing technology, the data contains a lot of noise.

View Article and Find Full Text PDF

Numerous scientific studies have found a link between diverse microorganisms in the human body and complex human diseases. Because traditional experimental approaches are time-consuming and expensive, using computational methods to identify microbes correlated with diseases is critical. In this paper, a new microbe-disease association prediction model is proposed that combines a multi-view multi-modal network and a multi-scale feature fusion mechanism, called MHOGAT.

View Article and Find Full Text PDF
Article Synopsis
  • The DPC algorithm is effective for sample distribution and noise point identification but lacks adaptability and struggles with high time complexity.
  • This study introduces improvements to DPC using maximum nearest neighbor distance and K-nearest neighbors, and proposes a new method based on delayed spiking neural P systems (DSN P systems).
  • The new DSNP-ANDPC algorithm demonstrates superior performance in most cases when evaluated against various synthetic and real-world datasets.
View Article and Find Full Text PDF
Article Synopsis
  • Feature selection is essential in data mining but often faces issues of redundancy and errors due to complex mutual information methods, prompting the development of new approaches.
  • The proposed FSCME method combines Copula correlation and the maximum information coefficient using entropy weights to improve relevance between features and labels while managing redundancy.
  • Experimental results show that FSCME outperforms six other feature selection methods by providing a more effective feature subset, enhancing classification performance in clustering tasks.
View Article and Find Full Text PDF

Neonatal epilepsy is a common emergency phenomenon in neonatal intensive care units (NICUs), which requires timely attention, early identification, and treatment. Traditional detection methods mostly use supervised learning with enormous labeled data. Hence, this study offers a semi-supervised hybrid architecture for detecting seizures, which combines the extracted electroencephalogram (EEG) feature dataset and convolutional autoencoder, called Fd-CAE.

View Article and Find Full Text PDF
Article Synopsis
  • Alzheimer's disease (AD) is complex and difficult to treat, but analyzing varied data types can help in early diagnosis by understanding AD progression.* -
  • The proposed deep self-reconstruction fusion similarity hashing (DS-FSH) method enhances the identification of AD-related biomarkers through multi-modal data analysis and utilizes a deep self-reconstruction model for better data relationships.* -
  • Experiments show DS-FSH performs better than existing classification methods, helping to uncover crucial features related to AD and potentially improving our understanding of its pathogenesis.*
View Article and Find Full Text PDF

The roles of brain region activities and genotypic functions in the pathogenesis of Alzheimer's disease (AD) remain unclear. Meanwhile, current imaging genetics methods are difficult to identify potential pathogenetic markers by correlation analysis between brain network and genetic variation. To discover disease-related brain connectome from the specific brain structure and the fine-grained level, based on the Automated Anatomical Labeling (AAL) and human Brainnetome atlases, the functional brain network is first constructed for each subject.

View Article and Find Full Text PDF

CircRNA has been proved to play an important role in the diseases diagnosis and treatment. Considering that the wet-lab is time-consuming and expensive, computational methods are viable alternative in these years. However, the number of circRNA-disease associations (CDAs) that can be verified is relatively few, and some methods do not take full advantage of dependencies between attributes.

View Article and Find Full Text PDF

Accumulating evidence indicates that microRNAs (miRNAs) can control and coordinate various biological processes. Consequently, abnormal expressions of miRNAs have been linked to various complex diseases. Recognizable proof of miRNA-disease associations (MDAs) will contribute to the diagnosis and treatment of human diseases.

View Article and Find Full Text PDF

Identification of disease-associated long non-coding RNAs (lncRNAs) is crucial for unveiling the underlying genetic mechanisms of complex diseases. Multiple types of similarity networks of lncRNAs (or diseases) can complementary and comprehensively characterize their similarities. Hence, in this study, we presented a computational model iLncDA-RSN based on reliable similarity networks for identifying potential lncRNA-disease associations (LDAs).

View Article and Find Full Text PDF

Comprehensive analysis of multiple data sets can identify potential driver genes for various cancers. In recent years, driver gene discovery based on massive mutation data and gene interaction networks has attracted increasing attention, but there is still a need to explore combining functional and structural information of genes in protein interaction networks to identify driver genes. Therefore, we propose a network embedding framework combining functional and structural information to identify driver genes.

View Article and Find Full Text PDF

The development of single-cell RNA sequencing (scRNA-seq) technology has opened up a new perspective for us to study disease mechanisms at the single cell level. Cell clustering reveals the natural grouping of cells, which is a vital step in scRNA-seq data analysis. However, the high noise and dropout of single-cell data pose numerous challenges to cell clustering.

View Article and Find Full Text PDF

Advances in omics technology have enriched the understanding of the biological mechanisms of diseases, which has provided a new approach for cancer research. Multi-omics data contain different levels of cancer information, and comprehensive analysis of them has attracted wide attention. However, limited by the dimensionality of matrix models, traditional methods cannot fully use the key high-dimensional global structure of multi-omics data.

View Article and Find Full Text PDF
Article Synopsis
  • Determining drug-disease associations (DDAs) is crucial for drug development, but current prediction methods lack diversity in feature extraction.
  • A new graph representation method using a light gradient boosting machine (GRLGB) was introduced, which incorporates both network topology and biological knowledge to extract features from a heterogeneous network.
  • GRLGB showed promising results on two datasets through 10-fold cross-validation and successfully identified novel DDAs in case studies involving anxiety disorders and the drug clozapine.
View Article and Find Full Text PDF

The analysis of cancer data from multi-omics can effectively promote cancer research. The main focus of this article is to cluster cancer samples and identify feature genes to reveal the correlation between cancers and genes, with the primary approach being the analysis of multi-view cancer omics data. Our proposed solution, the Multi-View Enhanced Tensor Nuclear Norm and Local Constraint (MVET-LC) model, aims to utilize the consistency and complementarity of omics data to support biological research.

View Article and Find Full Text PDF

The development of single-cell transcriptome sequencing technologies has opened new ways to study biological phenomena at the cellular level. A key application of such technologies involves the employment of single-cell RNA sequencing (scRNA-seq) data to identify distinct cell types through clustering, which in turn provides evidence for revealing heterogeneity. Despite the promise of this approach, the inherent characteristics of scRNA-seq data, such as higher noise levels and lower coverage, pose major challenges to existing clustering methods and compromise their accuracy.

View Article and Find Full Text PDF