Minimizing bias is critical to adoption and implementation of machine learning (ML) in clinical practice. Systematic mathematical biases produce consistent and reproducible differences between the observed and expected performance of ML systems, resulting in suboptimal performance. Such biases can be traced back to various phases of ML development: data handling, model development, and performance evaluation. This report presents 12 suboptimal practices during data handling of an ML study, explains how those practices can lead to biases, and describes what may be done to mitigate them. Authors employ an arbitrary and simplified framework that splits ML data handling into four steps: data collection, data investigation, data splitting, and feature engineering. Examples from the available research literature are provided. A Google Colaboratory Jupyter notebook includes code examples to demonstrate the suboptimal practices and steps to prevent them. Data Handling, Bias, Machine Learning, Deep Learning, Convolutional Neural Network (CNN), Computer-aided Diagnosis (CAD) © RSNA, 2022.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9533091 | PMC |
http://dx.doi.org/10.1148/ryai.210290 | DOI Listing |
Sensors (Basel)
December 2024
LAPLACE Laboratory-UMR5213, National Polytechnic Institute of Toulouse, 31077 Toulouse, France.
This paper introduces a novel methodology for evaluating communication performance in rotating electric machines using Received Signal Strength Indication (RSSI) measurements coupled with artificial intelligence. The proposed approach focuses on assessing the quality of wireless signals in the complex, dynamic environment inside these machines, where factors like reflections, metallic surfaces, and rotational movements can significantly impact communication. RSSI is used as a key parameter to monitor real-time signal behavior, enabling a detailed analysis of communication reliability.
View Article and Find Full Text PDFSensors (Basel)
December 2024
Institute of Mechanical Intelligence, Scuola Superiore Sant'Anna, Via G. Moruzzi 1, 56124 Pisa, Italy.
CloudSim is a versatile simulation framework for modeling cloud infrastructure components that supports customizable and extensible application provisioning strategies, allowing for the simulation of cloud services. On the other hand, Distributed Acoustic Sensing (DAS) is a ubiquitous technique used for measuring vibrations over an extended region. Data handling in DAS remains an open issue, as many applications need continuous monitoring of a volume of samples whose storage and processing in real time require high-capacity memory and computing resources.
View Article and Find Full Text PDFSensors (Basel)
December 2024
School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China.
Makeup modifies facial textures and colors, impacting the precision of face anti-spoofing systems. Many individuals opt for light makeup in their daily lives, which generally does not hinder face identity recognition. However, current research in face anti-spoofing often neglects the influence of light makeup on facial feature recognition, notably the absence of publicly accessible datasets featuring light makeup faces.
View Article and Find Full Text PDFSensors (Basel)
December 2024
Department of Semiconductor Systems Engineering, Sejong University, Seoul 05006, Republic of Korea.
Recently, AI systems such as autonomous driving and smart homes have become integral to daily life. Intelligent multi-sensors, once limited to single data types, now process complex text and image data, demanding faster and more accurate processing. While integrating NPUs and sensors has improved processing speed and accuracy, challenges like low resource utilization and long memory latency remain.
View Article and Find Full Text PDFSensors (Basel)
December 2024
Department of Computer Engineering, Gachon University, Sujeong-gu, Seongnam-si 13120, Republic of Korea.
Generating accurate and contextually rich captions for images and videos is essential for various applications, from assistive technology to content recommendation. However, challenges such as maintaining temporal coherence in videos, reducing noise in large-scale datasets, and enabling real-time captioning remain significant. We introduce MIRA-CAP (Memory-Integrated Retrieval-Augmented Captioning), a novel framework designed to address these issues through three core innovations: a cross-modal memory bank, adaptive dataset pruning, and a streaming decoder.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!