Publications by authors named "Xindong Wu"

We developed an intelligent innovative orientation method to improve the accuracy of polarization compasses in harsh conditions: weak skylight polarization patterns resulting from unfavorable weather conditions (e.g., haze, sandstorms) or locally destroyed skylight polarization conditions caused by occlusions (e.

View Article and Find Full Text PDF
Article Synopsis
  • The paper introduces Enhanced Multimodal Low-rank Embedding (EMLE), a new method for diagnosing Alzheimer's disease using various types of neuroimaging data.
  • EMLE tackles challenges like redundant features and corrupted images by using a unique ℓ-norm regularization approach and a similarity graph to enhance data robustness and feature selection.
  • Experimental results demonstrate that EMLE effectively identifies crucial features in multi-modal datasets, improving the accuracy of Alzheimer's diagnosis compared to previous methods.
View Article and Find Full Text PDF
Article Synopsis
  • - Generative AI tools like ChatGPT are being increasingly used to create articles, prompting this study to explore the unique characteristics of AI-generated content compared to scientific publications.
  • - The research involves creating articles on various diseases using prompt engineering and developing a new algorithm, xFakeSci, which can differentiate between AI-generated and authentic scientific articles through a rigorous training process.
  • - xFakeSci outperformed traditional data mining algorithms in accuracy, achieving F1 scores of 80 to 94%, thanks to its innovative calibration methods and proximity distance heuristics, highlighting its effectiveness in identifying fake science.
View Article and Find Full Text PDF

In Alzheimer's disease (AD) diagnosis, joint feature selection for predicting disease labels (classification) and estimating cognitive scores (regression) with neuroimaging data has received increasing attention. In this paper, we propose a model named Shared Manifold regularized Joint Feature Selection (SMJFS) that performs classification and regression in a unified framework for AD diagnosis. For classification, unlike the existing works that build least squares regression models which are insufficient in the ability of extracting discriminative information for classification, we design an objective function that integrates linear discriminant analysis and subspace sparsity regularization for acquiring an informative feature subset.

View Article and Find Full Text PDF

Bone cement leakage from the femoral medullary cavity is a rare complication following hip replacement. Currently, there are no reports of bone cement leakage into the heart. Here, we report an 81-year-old female patient with right femoral neck fracture.

View Article and Find Full Text PDF
Article Synopsis
  • The rise of transformer-based and generative AI technologies brings up significant concerns about the authenticity and explainability of AI-generated content in various fields.
  • The authors argue that it's crucial to establish robust detection and verification methods to combat issues like disinformation and unreliable scientific claims.
  • They advocate for proactive measures, such as fact-checking and clear explainability policies, to build trust and promote ethical standards in AI's application within science and society.
View Article and Find Full Text PDF

Background: Bladder cancer (BLCA) is a prevalent urinary system malignancy. Understanding the interplay of immunological and metabolic genes in BLCA is crucial for prognosis and treatment.

Methods: Immune/metabolism genes were extracted, their expression profiles analyzed.

View Article and Find Full Text PDF

Density peaks clustering (DPC) is a popular clustering algorithm, which has been studied and favored by many scholars because of its simplicity, fewer parameters, and no iteration. However, in previous improvements of DPC, the issue of privacy data leakage was not considered, and the "Domino" effect caused by the misallocation of noncenters has not been effectively addressed. In view of the above shortcomings, a horizontal federated DPC (HFDPC) is proposed.

View Article and Find Full Text PDF

Few-shot knowledge graph completion (FKGC), which aims to infer new triples for a relation using only a few reference triples of the relation, has attracted much attention in recent years. Most existing FKGC methods learn a transferable embedding space, where entity pairs belonging to the same relations are close to each other. In real-world knowledge graphs (KGs), however, some relations may involve multiple semantics, and their entity pairs are not always close due to having different meanings.

View Article and Find Full Text PDF

Streaming data mining can be applied in many practical applications, such as social media, market analysis, and sensor networks. Most previous efforts assume that all training instances except for the novel class have been completely labeled for novel class detection in streaming data. However, a more realistic situation is that only a few instances in the data stream are labeled.

View Article and Find Full Text PDF

MicroRNAs (miRNAs) feature prominently in regulating the progression of chronic heart failure (CHF). This study was performed to investigate the role of miR-8485 in the injury of cardiomyocytes and CHF. It was found that miR-8485 level was markedly reduced in the plasma of CHF patients, compared with the healthy controls.

View Article and Find Full Text PDF

Myocardin-related transcription factor A (MRTF-A) has an inhibitory effect on myocardial infarction; however, the mechanism is not clear. This study reveals the mechanism by which MRTF-A regulates autophagy to alleviate myocardial infarct-mediated inflammation, and the effect of silent information regulator 1 (SIRT1) on the myocardial protective effect of MRTF-A was also verified. MRTF-A significantly decreased cardiac damage induced by myocardial ischemia.

View Article and Find Full Text PDF

Graph Convolutional Networks (GCNs), as a prominent example of graph neural networks, are receiving extensive attention for their powerful capability in learning node representations on graphs. There are various extensions, either in sampling and/or node feature aggregation, to further improve GCNs' performance, scalability and applicability in various domains. Still, there is room for further improvements on learning efficiency because performing batch gradient descent using the full dataset for every training iteration, as unavoidable for training (vanilla) GCNs, is not a viable option for large graphs.

View Article and Find Full Text PDF

Traditional sequential pattern mining methods were designed for symbolic sequence. As a collection of measurements in chronological order, a time series needs to be discretized into symbolic sequences, and then users can apply sequential pattern mining methods to discover interesting patterns in time series. The discretization will not only cause the loss of some important information, which partially destroys the continuity of time series, but also ignore the order relations between time-series values.

View Article and Find Full Text PDF

Nonoverlapping sequential pattern mining, as a kind of repetitive sequential pattern mining with gap constraints, can find more valuable patterns. Traditional algorithms focused on finding all frequent patterns and found lots of redundant short patterns. However, it not only reduces the mining efficiency, but also increases the difficulty in obtaining the demand information.

View Article and Find Full Text PDF

Adverse drug-drug interaction (ADDI) is a significant life-threatening issue, posing a leading cause of hospitalizations and deaths in healthcare systems. This paper proposes a unified Multi-Attribute Discriminative Representation Learning (MADRL) model for ADDI prediction. Unlike the existing works that equally treat features of each attribute without discrimination and do not consider the underlying relationship among drugs, we first develop a regularized optimization problem based on CUR matrix decomposition for joint representative drug and discriminative feature selection such that the selected drugs and features can well approximate the original feature spaces and the critical factors discriminative to ADDIs can be properly explored.

View Article and Find Full Text PDF

Recently, causal feature selection (CFS) has attracted considerable attention due to its outstanding interpretability and predictability performance. Such a method primarily includes the Markov blanket (MB) discovery and feature selection based on Granger causality. Representatively, the max-min MB (MMMB) can mine an optimal feature subset, i.

View Article and Find Full Text PDF

Myocardial energy metabolism (MEM) is an important factor of myocardial injury. Trimetazidine (TMZ) provides protection against myocardial ischemia/reperfusion injury. The current study set out to evaluate the effect and mechanism of TMZ on MEM disorder induced by myocardial infarction (MI).

View Article and Find Full Text PDF

For sequence classification, an important issue is to find discriminative features, where sequential pattern mining (SPM) is often used to find frequent patterns from sequences as features. To improve classification accuracy and pattern interpretability, contrast pattern mining emerges to discover patterns with high-contrast rates between different categories. To date, existing contrast SPM methods face many challenges, including excessive parameter selection and inefficient occurrences counting.

View Article and Find Full Text PDF

Domain adaptation aims to facilitate the learning task in an unlabeled target domain by leveraging the auxiliary knowledge in a well-labeled source domain from a different distribution. Almost existing autoencoder-based domain adaptation approaches focus on learning domain-invariant representations to reduce the distribution discrepancy between source and target domains. However, there is still a weakness existing in these approaches: the class-discriminative information of the two domains may be damaged while aligning the distributions of the source and target domains, which makes the samples with different classes close to each other, leading to performance degradation.

View Article and Find Full Text PDF

Traditional feature selection methods assume that all data instances and features are known before learning. However, it is not the case in many real-world applications that we are more likely faced with data streams or feature streams or both. Feature streams are defined as features that flow in one by one over time, whereas the number of training examples remains fixed.

View Article and Find Full Text PDF

Learning with streaming data has received extensive attention during the past few years. Existing approaches assume that the feature space is fixed or changes by following explicit regularities, limiting their applicability in real-time applications. For example, in a smart healthcare platform, the feature space of the patient data varies when different medical service providers use nonidentical feature sets to describe the patients' symptoms.

View Article and Find Full Text PDF

Sequential pattern mining (SPM) has been applied in many fields. However, traditional SPM neglects the pattern repetition in sequence. To solve this problem, gap constraint SPM was proposed and can avoid finding too many useless patterns.

View Article and Find Full Text PDF

Due to object detection's close relationship with video analysis and image understanding, it has attracted much research attention in recent years. Traditional object detection methods are built on handcrafted features and shallow trainable architectures. Their performance easily stagnates by constructing complex ensembles that combine multiple low-level image features with high-level context from object detectors and scene classifiers.

View Article and Find Full Text PDF
Article Synopsis
  • Anti-community detection identifies negative relationships in networks, but existing methods often overlook node degree and have high computational costs.
  • The paper introduces a Degree-based Block Model (DBM) that incorporates node degree into its evaluation, along with a new objective function Q(C).
  • A Local Expansion Optimization Algorithm (LEOA) is then proposed, consisting of three stages to enhance anti-community detection, and experiments show its effectiveness on a synthetic benchmark and real-world networks.
View Article and Find Full Text PDF