Data streams pose additional challenges for statistical classification because the distributions of the training and target samples may diverge as time passes. Such a distribution change in streaming data is called concept drift. Numerous histogram-based distribution change detection methods have been proposed to detect drift. Most of these histograms are built on grid-based or tree-based space partitioning algorithms, which makes the partitions arbitrary and hard to explain and may create drift blind spots. There is therefore a need to improve the drift detection accuracy of histogram-based methods in the unsupervised setting. To address this problem, we propose a cluster-based histogram called equal intensity k-means space partitioning (EI-kMeans). In addition, we introduce a heuristic method to improve the sensitivity of drift detection. The fundamental idea behind the sensitivity improvement is to minimize the risk of creating partitions in distribution offset regions. Pearson's chi-square test is used as the statistical hypothesis test so that the test statistic remains independent of the sample distribution. The number of bins and their shapes, which strongly influence the ability to detect drift, are determined dynamically from the sample based on an asymptotic constraint in the chi-square test. Accordingly, three algorithms are developed to implement concept drift detection: a greedy centroids initialization algorithm, a cluster amplify-shrink algorithm, and a drift detection algorithm. For drift adaptation, we recommend retraining the learner if a drift is detected. Experiments on synthetic and real-world datasets demonstrate the advantages of EI-kMeans and show its efficacy in detecting concept drift.
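
The general idea of testing a cluster-based histogram with Pearson's chi-square test can be sketched as follows. This is a minimal illustration under stated assumptions, not the EI-kMeans method itself: plain Lloyd's k-means stands in for the equal-intensity partitioning, the number of bins `k` is fixed rather than chosen dynamically, and the greedy initialization and amplify-shrink steps are not reproduced. The test is assumed to be a two-sample chi-square test of homogeneity on the 2 x K table of bin counts. All function names (`kmeans`, `chi2_sf`, `detect_drift`) are hypothetical.

```python
import math
import random


def nearest(point, centroids):
    """Index of the centroid closest to `point` (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda j: sum((p - c) ** 2 for p, c in zip(point, centroids[j])),
    )


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means (assumption: a stand-in for EI-kMeans)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        # Recompute each centroid as its cluster mean; keep the old one if empty.
        centroids = [
            tuple(sum(col) / len(c) for col in zip(*c)) if c else centroids[j]
            for j, c in enumerate(clusters)
        ]
    return centroids


def chi2_sf(x, df):
    """Upper-tail probability of the chi-square distribution.

    Uses the power series of the regularized lower incomplete gamma function.
    """
    if x <= 0:
        return 1.0
    a, half = df / 2.0, x / 2.0
    if half > 600:  # avoid float overflow; the tail is negligible for small df
        return 0.0
    term = total = 1.0 / a
    for n in range(1, 10000):
        term *= half / (a + n)
        total += term
        if term < total * 1e-15:
            break
    lower = total * math.exp(-half + a * math.log(half) - math.lgamma(a))
    return max(0.0, 1.0 - lower)


def detect_drift(train, test, k=4, alpha=0.05):
    """Bin both samples by nearest centroid and run a homogeneity test."""
    centroids = kmeans(train, k)
    train_counts, test_counts = [0] * k, [0] * k
    for p in train:
        train_counts[nearest(p, centroids)] += 1
    for p in test:
        test_counts[nearest(p, centroids)] += 1
    # Drop bins that are empty in both samples; they carry no information.
    bins = [(a, b) for a, b in zip(train_counts, test_counts) if a + b > 0]
    df = len(bins) - 1
    if df < 1:
        return 0.0, 1.0, False
    n1, n2 = len(train), len(test)
    stat = 0.0
    for a, b in bins:
        pooled = (a + b) / (n1 + n2)
        for observed, n in ((a, n1), (b, n2)):
            expected = pooled * n
            stat += (observed - expected) ** 2 / expected
    p_value = chi2_sf(stat, df)
    return stat, p_value, p_value < alpha


if __name__ == "__main__":
    rng = random.Random(1)
    train = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(300)]
    shifted = [(rng.gauss(3, 1), rng.gauss(3, 1)) for _ in range(300)]
    print(detect_drift(train, shifted))
```

A common rule of thumb for the chi-square approximation is that every expected bin count should be at least 5; the sketch does not enforce this, so `k` should be kept small relative to the sample size.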

Source
http://dx.doi.org/10.1109/TCYB.2020.2983962

Publication Analysis

Top Keywords

drift detection: 20
concept drift: 16
space partitioning: 12
drift: 11
equal intensity: 8
distribution change: 8
detect drift: 8
chi-square test: 8
algorithm drift: 8
detection: 6
