A Bayesian Scoring Technique for Mining Predictive and Non-Spurious Rules.

Mach Learn Knowl Discov Databases

Department of Biomedical Informatics, University of Pittsburgh.

Published: January 2012

Rule mining is an important class of data mining methods for discovering interesting patterns in data. The success of a rule mining method heavily depends on the evaluation function that is used to assess the quality of the rules. In this work, we propose a new rule evaluation score - the Predictive and Non-Spurious Rules (PNSR) score. This score relies on Bayesian inference to evaluate the quality of the rules and considers the structure of the rules to filter out spurious rules. We present an efficient algorithm for finding rules with high PNSR scores. The experiments demonstrate that our method is able to cover and explain the data with a much smaller rule set than existing methods.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4416489PMC
http://dx.doi.org/10.1007/978-3-642-33486-3_17DOI Listing

Publication Analysis

Top Keywords

predictive non-spurious
8
non-spurious rules
8
rule mining
8
quality rules
8
rules
7
bayesian scoring
4
scoring technique
4
mining
4
technique mining
4
mining predictive
4

Similar Publications

Different individuals respond differently to emotional stimuli in their environment. Therefore, to understand how emotions are represented mentally will ultimately require investigations into individual-level information. Here we tasked participants with freely arranging emotionally charged images on a computer screen according to their subjective emotional similarity (yielding a unique affective space for each participant) and subsequently sought external validity of the layout of the individuals' affective spaces through the five-factor personality model (Neuroticism, Extraversion, Openness to Experience, Agreeableness, Conscientiousness) assessed via the NEO Five-Factor Inventory.

View Article and Find Full Text PDF

NON-LOCAL PRIORS FOR HIGH-DIMENSIONAL ESTIMATION.

J Am Stat Assoc

May 2017

UCLA, DEPARTMENT OF BIOSTATISTICS.

Jointly achieving parsimony and good predictive power in high dimensions is a main challenge in statistics. Non-local priors (NLPs) possess appealing properties for model choice, but their use for estimation has not been studied in detail. We show that for regular models NLP-based Bayesian model averaging (BMA) shrink spurious parameters either at fast polynomial or quasi-exponential rates as the sample size increases, while non-spurious parameter estimates are not shrunk.

View Article and Find Full Text PDF

This work proposes a pattern mining approach to learn event detection models from complex multivariate temporal data, such as electronic health records. We present Recent Temporal Pattern mining, a novel approach for efficiently finding predictive patterns for event detection problems. This approach first converts the time series data into time-interval sequences of temporal abstractions.

View Article and Find Full Text PDF

We study the problem of learning classification models from complex multivariate temporal data encountered in electronic health record systems. The challenge is to define a good set of features that are able to represent well the temporal aspect of the data. Our method relies on temporal abstractions and temporal pattern mining to extract the classification features.

View Article and Find Full Text PDF

Background: Several cross-sectional studies have shown an association between exposure to life events and psychological problems in adults with intellectual disability (ID). To establish life events as a risk factor, prospective designs are needed.

Methods: Support staff informants provided data on the psychological problems of 68 adults with ID and their recent exposure to life events.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!