Publications by authors named "P Godau"

Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. In biomedical image analysis, chosen performance metrics often do not reflect the domain interest, and thus fail to adequately measure scientific progress and hinder translation of ML techniques into practice. To overcome this, we created Metrics Reloaded, a comprehensive framework guiding researchers in the problem-aware selection of metrics.


Validation metrics are key for tracking scientific progress and bridging the current chasm between artificial intelligence research and its translation into practice. However, increasing evidence shows that, particularly in image analysis, metrics are often chosen inadequately. Although taking into account the individual strengths, weaknesses and limitations of validation metrics is a critical prerequisite to making educated choices, the relevant knowledge is currently scattered and poorly accessible to individual researchers.

Article Synopsis
  • Formalizing surgical activities as triplets of instruments, actions, and target anatomies helps enhance the understanding of tool-tissue interactions, improving AI assistance in image-guided surgeries.
  • The CholecTriplet2022 challenge expands on previous work by adding weakly-supervised localization of surgical tools and modeling their activities as ‹instrument, verb, target› triplets.
  • The paper outlines a baseline method and presents 10 new deep learning algorithms, compares their effectiveness, and analyzes the results to provide insights for future surgical research.

Purpose: Validation metrics are a key prerequisite for the reliable tracking of scientific progress and for deciding on the potential clinical translation of methods. While recent initiatives aim to develop comprehensive theoretical frameworks for understanding metric-related pitfalls in image analysis problems, there is a lack of experimental evidence on the concrete effects of common and rare pitfalls on specific applications. We address this gap in the literature in the context of colon cancer screening.


Challenges have become the state-of-the-art approach to benchmark image analysis algorithms in a comparative manner. While validation on identical data sets was a great step forward, the analysis of results is often restricted to pure ranking tables, leaving relevant questions unanswered. Specifically, little effort has been put into systematically investigating what characterizes the images on which state-of-the-art algorithms fail.
