Complex categorical data is often hierarchically coupled with heterogeneous relationships between attributes and attribute values and the couplings between objects. Such value-to-object couplings are heterogeneous with complementary and inconsistent interactions and distributions. Limited research exists on unlabeled categorical data representations, ignores the heterogeneous and hierarchical couplings, underestimates data characteristics and complexities, and overuses redundant information, etc. The deep representation learning of unlabeled categorical data is challenging, overseeing such value-to-object couplings, complementarity and inconsistency, and requiring large data, disentanglement, and high computational power. This work introduces a shallow but powerful UNsupervised heTerogeneous couplIng lEarning (UNTIE) approach for representing coupled categorical data by untying the interactions between couplings and revealing heterogeneous distributions embedded in each type of couplings. UNTIE is efficiently optimized w.r.t. a kernel k-means objective function for unsupervised representation learning of heterogeneous and hierarchical value-to-object couplings. Theoretical analysis shows that UNTIE can represent categorical data with maximal separability while effectively represent heterogeneous couplings and disclose their roles in categorical data. The UNTIE-learned representations make significant performance improvement against the state-of-the-art categorical representations and deep representation models on 25 categorical data sets with diversified characteristics.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2020.3010953DOI Listing

Publication Analysis

Top Keywords

categorical data
28
value-to-object couplings
12
categorical
9
data
9
unsupervised heterogeneous
8
heterogeneous coupling
8
coupling learning
8
couplings
8
unlabeled categorical
8
heterogeneous hierarchical
8

Similar Publications

Importance: Retrieval strategies for children, adolescents, and young adults with relapsed classic Hodgkin lymphoma (cHL) aim to maintain efficacy while minimizing long-term toxic effects. Children, adolescents, and young adults with low-risk, relapsed cHL may benefit from replacing high-dose chemotherapy and autologous stem cell transplant with less intensive involved-site radiotherapy (ISRT).

Objective: To evaluate a risk-stratified, response-adapted, transplant-free approach for treatment of children, adolescents, and young adults with low-risk relapsed cHL with nivolumab plus brentuximab vedotin (BV) followed by BV plus bendamustine for patients with suboptimal response and ISRT (30.

View Article and Find Full Text PDF

Importance: Influenza vaccination remains the most important intervention to prevent influenza morbidity and mortality among nursing home residents. The additional effectiveness of recombinant influenza vaccine vs standard dose vaccines was demonstrated in outpatient older adults but has not been evaluated in nursing home populations.

Objective: To compare hospitalization rates among residents in nursing homes immunized with a recombinant vs a standard dose egg-based influenza vaccine.

View Article and Find Full Text PDF

Computational Methods for Lineage Reconstruction.

Methods Mol Biol

January 2025

Centro Nacional de Análisis Genómico, Barcelona, Spain.

The recent development of genetic lineage recorders, designed to register the genealogical history of cells using induced somatic mutations, has opened the possibility of reconstructing complete animal cell lineages. To reconstruct a cell lineage tree from a molecular recorder, it is crucial to use an appropriate reconstruction algorithm. Current approaches include algorithms specifically designed for cell lineage reconstruction and the repurposing of phylogenetic algorithms.

View Article and Find Full Text PDF

Measurements of cell phylogeny based on natural or induced mutations, known as lineage barcodes, in conjunction with molecular phenotype have become increasingly feasible for a large number of single cells. In this chapter, we delve into Quantitative Fate Mapping (QFM) and its computational pipeline, which enables the interrogation of the dynamics of progenitor cells and their fate restriction during development. The methods described here include inferring cell phylogeny with the Phylotime model, and reconstructing progenitor state hierarchy, commitment time, population size, and commitment bias with the ICE-FASE algorithm.

View Article and Find Full Text PDF

Water quality assessment of Johor River Basin, Malaysia, using multivariate analysis and spatial interpolation method.

Environ Sci Pollut Res Int

January 2025

Center for Environmental Sustainability and Water Security (IPASA), Research Institute for Sustainable Environment (RISE), Universiti Teknologi Malaysia, 81310, Johor Bahru, Johor, Malaysia.

In the Johor River Basin, a comprehensive analysis was conducted on 24 water environmental parameters across 33 sampling sites over 3 years, encompassing both dry and wet seasons. A total of 396 water samples were collected and analyzed to calculate the Water Quality Index (WQI). To further assess water quality and pinpoint potential pollution sources, multivariate techniques such as principal component analysis (PCA) and cluster analysis (CA), alongside spatial analysis using inverse distance weighted (IDW) interpolation, were employed.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!