Healthcare insurance fraud detection using data mining.

BMC Med Inform Decis Mak

Software Engineering Department, College of Computing and Information Sciences, King Saud University, Riyadh, Saudi Arabia.

Published: April 2024

Background: Healthcare programs and insurance initiatives play a crucial role in ensuring that people have access to medical care. There are many benefits of healthcare insurance programs but fraud in healthcare continues to be a significant challenge in the insurance industry. Healthcare insurance fraud detection faces challenges from evolving and sophisticated fraud schemes that adapt to detection methods. Analyzing extensive healthcare data is hindered by complexity, data quality issues, and the need for real-time detection, while privacy concerns and false positives pose additional hurdles. The lack of standardization in coding and limited resources further complicate efforts to address fraudulent activities effectively.

Methodolgy: In this study, a fraud detection methodology is presented that utilizes association rule mining augmented with unsupervised learning techniques to detect healthcare insurance fraud. Dataset from the Centres for Medicare and Medicaid Services (CMS) 2008-2010 DE-SynPUF is used for analysis. The proposed methodology works in two stages. First, association rule mining is used to extract frequent rules from the transactions based on patient, service and service provider features. Second, the extracted rules are passed to unsupervised classifiers, such as IF, CBLOF, ECOD, and OCSVM, to identify fraudulent activity.

Results: Descriptive analysis shows patterns and trends in the data revealing interesting relationship among diagnosis codes, procedure codes and the physicians. The baseline anomaly detection algorithms generated results in 902.24 seconds. Another experiment retrieved frequent rules using association rule mining with apriori algorithm combined with unsupervised techniques in 868.18 seconds. The silhouette scoring method calculated the efficacy of four different anomaly detection techniques showing CBLOF with highest score of 0.114 followed by isolation forest with the score of 0.103. The ECOD and OCSVM techniques have lower scores of 0.063 and 0.060, respectively.

Conclusion: The proposed methodology enhances healthcare insurance fraud detection by using association rule mining for pattern discovery and unsupervised classifiers for effective anomaly detection.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11046758PMC
http://dx.doi.org/10.1186/s12911-024-02512-4DOI Listing

Publication Analysis

Top Keywords

healthcare insurance
20
insurance fraud
16
fraud detection
16
association rule
16
rule mining
16
anomaly detection
12
detection
9
healthcare
8
proposed methodology
8
frequent rules
8

Similar Publications

Background: Modeling studies suggest that hundreds of thousands of U.S. children have lost caregivers since the COVID-19 pandemic began.

View Article and Find Full Text PDF

Trends and determinants of maternal health services utilization in India from 2015 to 2021.

Sci Rep

January 2025

Department of Public Health and Community Medicine, Central University of Kerala, Tejaswini Hills, Periya, Kasaragod, Kerala, 671320, India.

Continuum of care (CoC) in maternal health services refers to a pathway spanning from pregnancy and childbirth to post-pregnancy, covering routine antenatal care (ANC), institutional delivery (ID), and post-natal services (PNC). The current study aims to investigate the distribution, trends, dropouts, and determinants of maternal health services (ANC, ID, and PNC) utilization along the CoC pathway using NFHS-4 and NFHS-5 datasets from 2015 to 2021. The binary logistic regression examined the association between the continuum of maternal health services utilization and the predictor variables.

View Article and Find Full Text PDF

Introduction: A significant portion of posterior urethral valve patients continue to progress to end stage renal disease despite improvements in medical care. Socioeconomic status has been connected to various healthcare outcomes but has not been evaluated in relation to longitudinal outcomes of posterior urethral valves.

Objective: To evaluate the effect of socioeconomic status on the progression to renal failure among patients with posterior urethral valves.

View Article and Find Full Text PDF

Self-reported determinants for subjective financial distress: a qualitative interview study with German cancer patients.

BMJ Open

January 2025

Department of Medical Oncology, Section Translational Medical Ethics, National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and Heidelberg University Hospital, Medical Faculty Heidelberg, Heidelberg University, Heidelberg, Germany.

Objectives: Patient-reported financial effects of a tumour disease in a universal healthcare setting are a multidimensional phenomenon. Actual and anticipated objective financial burden caused by direct medical and non-medical costs as well as indirect costs such as loss of income can lead to subjective financial distress. To better understand subjective financial distress, the presented study explores self-reported determinants for subjective financial distress in German patients with cancer, aiming to inform a new German-language patient-reported outcome measure for determining the financial effects of a tumour disease.

View Article and Find Full Text PDF

Context.—: Chronic myeloid leukemia (CML) is a myeloproliferative disorder characterized by proliferation of the granulocytic cell line. The incidence of CML in Kenya is estimated at near 2000 cases annually.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!