Development and evaluation of cost-sensitive universum-SVM.

IEEE Trans Cybern

Published: April 2015

Many machine learning applications involve analysis of high-dimensional data, where the number of input features is larger than/comparable to the number of data samples. Standard classification methods may not be sufficient for such data, and this provides motivation for nonstandard learning settings. One such new learning methodology is called learning through contradiction or Universum-support vector machine (U-SVM). Recent studies have shown U-SVM to be quite effective for sparse high-dimensional data sets. However, all these earlier studies have used balanced data sets with equal misclassification costs. This paper extends the U-SVM formulation to problems with different misclassification costs, and presents practical conditions for the effectiveness of this cost-sensitive U-SVM. Several empirical comparisons are presented to validate the proposed approach.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TCYB.2014.2336876	DOI Listing

Publication Analysis

Top Keywords

high-dimensional data

data sets

misclassification costs

data

development evaluation

evaluation cost-sensitive

cost-sensitive universum-svm

universum-svm machine

learning

machine learning

Similar Publications

scSMD: a deep learning method for accurate clustering of single cells based on auto-encoder.

BMC Bioinformatics

January 2025

Department of Surgery, Shanghai Key Laboratory of Gastric Neoplasms, Shanghai Institute of Digestive Surgery, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.

Xiaoxu Cui Renkai Wu Yinghao Liu Peizhan Chen Qing Chang

Background: Single-cell RNA sequencing (scRNA-seq) has transformed biological research by offering new insights into cellular heterogeneity, developmental processes, and disease mechanisms. As scRNA-seq technology advances, its role in modern biology has become increasingly vital. This study explores the application of deep learning to single-cell data clustering, with a particular focus on managing sparse, high-dimensional data.

View Article and Find Full Text PDF

Similar Publications

A hybrid deep learning air pollution prediction approach based on neighborhood selection and spatio-temporal attention.

Sci Rep

January 2025

School of Architecture and Urban Planning, Beijing University of Civil Engineering and Architecture, Beijing, 100055, China.

Gang Chen Shen Chen Dong Li Cai Chen

Air pollution is a critical global environmental issue, further exacerbated by rapid industrialization and urbanization. Accurate prediction of air pollutant concentrations is essential for effective pollution prevention and control measures. The complex nature of pollutant data is influenced by fluctuating meteorological conditions, diverse pollution sources, and propagation processes, underscores the crucial importance of the spatial and temporal feature extraction for accurately predicting air pollutant concentrations.

View Article and Find Full Text PDF

Similar Publications

CRAmed: a conditional randomization test for high-dimensional mediation analysis in sparse microbiome data.

Bioinformatics

January 2025

Department of Statistics, School of Mathematical Sciences, Shanghai Jiao Tong University, Shanghai, 200240, China.

Tiantian Liu Xiangnan Xu Tao Wang Peirong Xu

Motivation: Numerous microbiome studies have revealed significant associations between the microbiome and human health and disease. These findings have motivated researchers to explore the causal role of the microbiome in human complex traits and diseases. However, the complexities of microbiome data pose challenges for statistical analysis and interpretation of causal effects.

View Article and Find Full Text PDF

Similar Publications

Tensor neural networks for high-dimensional Fokker-Planck equations.

Neural Netw

January 2025

Division of Applied Mathematics, Brown University, Providence, RI 02912, USA; Advanced Computing, Mathematics and Data Division, Pacific Northwest National Laboratory, Richland, WA, United States. Electronic address:

Taorui Wang Zheyuan Hu Kenji Kawaguchi Zhongqiang Zhang George Em Karniadakis

We solve high-dimensional steady-state Fokker-Planck equations on the whole space by applying tensor neural networks. The tensor networks are a linear combination of tensor products of one-dimensional feedforward networks or a linear combination of several selected radial basis functions. The use of tensor feedforward networks allows us to efficiently exploit auto-differentiation (in physical variables) in major Python packages while using radial basis functions can fully avoid auto-differentiation, which is rather expensive in high dimensions.

View Article and Find Full Text PDF

Similar Publications

Modeling Latent Neural Dynamics with Gaussian Process Switching Linear Dynamical Systems.

ArXiv

January 2025

Amber Hu David Zoltowski Aditya Nair David Anderson Lea Duncker

Understanding how the collective activity of neural populations relates to computation and ultimately behavior is a key goal in neuroscience. To this end, statistical methods which describe high-dimensional neural time series in terms of low-dimensional latent dynamics have played a fundamental role in characterizing neural systems. Yet, what constitutes a successful method involves two opposing criteria: (1) methods should be expressive enough to capture complex nonlinear dynamics, and (2) they should maintain a notion of interpretability often only warranted by simpler linear models.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!