While noting the importance of data quality, existing process mining methodologies (i) do not provide details on how to assess the quality of event data, (ii) do not consider how the identification of data quality issues can be exploited in the planning, data extraction and log building phases of any process mining analysis, and (iii) do not highlight potential impacts of poor quality data on different types of process analyses. As our key contribution, we develop a process-centric, data quality-driven approach to preparing for a process mining analysis which can be applied to any existing process mining methodology. Our approach, adapted from elements of the well-known CRISP-DM data mining methodology, includes conceptual data modeling, quality assessment at both attribute and event level, and trial discovery and conformance to develop understanding of system processes and data properties to inform data extraction. We illustrate our approach in a case study involving the Queensland Ambulance Service (QAS) and Retrieval Services Queensland (RSQ). We describe the detailed preparation for a process mining analysis of retrieval and transport processes (ground and aero-medical) for road-trauma patients in Queensland. Sample datasets obtained from QAS and RSQ are utilised to show how quality metrics, data models and exploratory process mining analyses can be used to (i) identify data quality issues, (ii) anticipate and explain certain observable features in process mining analyses, (iii) distinguish between systemic and occasional quality issues, and (iv) reason about the mechanisms by which identified quality issues may have arisen in the event log. We contend that this knowledge can be used to guide the data extraction and pre-processing stages of a process mining case study to properly align the data with the case study research questions.
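
To make the idea of attribute-level and event-level quality assessment concrete, the following is a minimal sketch in Python. It is not the authors' metric set: the field names (`case_id`, `activity`, `timestamp`), the sample records, and the three functions are all hypothetical and chosen only for illustration. It computes attribute completeness, flags duplicated events, and reports the share of cases affected by a missing attribute, which is one rough way to tell a systemic issue from an occasional one.

```python
from collections import Counter

# Hypothetical event log; field names and values are illustrative assumptions,
# not taken from the QAS or RSQ datasets.
EVENTS = [
    {"case_id": "c1", "activity": "Call received", "timestamp": "2019-01-01T10:00:00"},
    {"case_id": "c1", "activity": "Unit dispatched", "timestamp": "2019-01-01T10:05:00"},
    {"case_id": "c1", "activity": "Unit dispatched", "timestamp": "2019-01-01T10:05:00"},
    {"case_id": "c2", "activity": "Call received", "timestamp": None},
    {"case_id": "c2", "activity": "Arrive at scene", "timestamp": "2019-01-01T11:00:00"},
    {"case_id": "c3", "activity": None, "timestamp": "2019-01-01T12:00:00"},
]


def is_missing(value):
    """Treat None and the empty string as missing values."""
    return value is None or value == ""


def attribute_completeness(events, attributes):
    """Attribute level: fraction of events with a non-missing value, per attribute."""
    total = len(events)
    if total == 0:
        return {a: 0.0 for a in attributes}
    return {
        a: sum(1 for e in events if not is_missing(e.get(a))) / total
        for a in attributes
    }


def duplicate_events(events):
    """Event level: (case, activity, timestamp) triples that occur more than once."""
    counts = Counter((e["case_id"], e["activity"], e["timestamp"]) for e in events)
    return {key: n for key, n in counts.items() if n > 1}


def affected_case_share(events, attribute):
    """Share of cases with at least one event missing the given attribute."""
    cases = {e["case_id"] for e in events}
    if not cases:
        return 0.0
    affected = {e["case_id"] for e in events if is_missing(e.get(attribute))}
    return len(affected) / len(cases)


if __name__ == "__main__":
    print(attribute_completeness(EVENTS, ["case_id", "activity", "timestamp"]))
    print(duplicate_events(EVENTS))
    print(affected_case_share(EVENTS, "timestamp"))
```

A high affected-case share for an attribute would suggest a systemic cause, such as a field that a source system never populates, whereas a low share points to occasional recording errors. Any threshold separating the two is a judgment call and is not specified in the abstract.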

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6479847
DOI: http://dx.doi.org/10.3390/ijerph16071138

