Evaluating whether machines improve on human performance is one of the central questions of machine learning. However, there are many domains where the data is selectively labeled, in the sense that the observed outcomes are themselves a consequence of the existing choices of the human decision-makers. For instance, in the context of judicial bail decisions, we observe whether a defendant fails to return for their court appearance only if the human judge decides to release the defendant on bail. This selective labeling makes it harder to evaluate predictive models, because the instances for which outcomes are observed do not represent a random sample of the population. Here we propose a novel framework for evaluating the performance of predictive models on selectively labeled data. We develop an approach that allows us to compare the performance of predictive models and human decision-makers without resorting to counterfactual inference. Our methodology harnesses the heterogeneity of human decision-makers and facilitates effective evaluation of predictive models even in the presence of unmeasured confounders (unobservables) that influence both human decisions and the resulting outcomes. Experimental results on real-world datasets spanning diverse domains such as health care, insurance, and criminal justice demonstrate the utility of our evaluation metric in comparing human decisions and machine predictions.
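
The selective labeling effect described above can be illustrated with a small simulation. This is a toy sketch only, not the evaluation method proposed in the paper: the function name `simulate`, the uniform risk model, and the fixed release threshold are all assumptions made for illustration. A hypothetical judge sees both an observed feature and an unobservable one, releases only lower-risk defendants, and outcomes are recorded only for those released.

```python
import random


def simulate(n=20000, seed=0):
    """Toy model of selective labels (illustrative assumptions throughout).

    Returns (population_failure_rate, labeled_failure_rate), where the
    labeled rate is computed only over defendants the judge released.
    """
    rng = random.Random(seed)
    all_outcomes = []
    labeled_outcomes = []
    for _ in range(n):
        x = rng.random()            # feature recorded in the data
        z = rng.random()            # unobservable, seen only by the judge
        p_fail = 0.5 * x + 0.5 * z  # assumed true failure probability
        fails = rng.random() < p_fail
        released = p_fail < 0.5     # assumed judge policy: release lower risk
        all_outcomes.append(fails)
        if released:
            # The outcome is only observed when the defendant is released.
            labeled_outcomes.append(fails)
    population_rate = sum(all_outcomes) / len(all_outcomes)
    labeled_rate = sum(labeled_outcomes) / len(labeled_outcomes)
    return population_rate, labeled_rate


population_rate, labeled_rate = simulate()
print(f"failure rate, whole population: {population_rate:.3f}")
print(f"failure rate, labeled subset:   {labeled_rate:.3f}")
```

Under these assumptions the labeled subset has a noticeably lower failure rate than the population as a whole, so a model scored only on labeled instances would be evaluated on a sample that is not representative, and part of the gap is driven by a variable the model never sees.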

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5958915
DOI: http://dx.doi.org/10.1145/3097983.3098066
