AI Article Synopsis

  • - Detection and diagnosis of colon polyps are crucial for preventing colorectal cancer, and AI technologies can improve colonoscopy effectiveness through computer-aided detection (CADe) and diagnosis (CADx) systems.
  • - The REAL-Colon dataset offers a large collection of 2.7 million native video frames from real-world colonoscopy, featuring 350,000 expert-annotated bounding boxes, which provide a more realistic dataset compared to existing down-sampled images.
  • - This dataset includes comprehensive patient and procedural data, promoting transparency and enabling researchers to develop and benchmark more accurate AI algorithms for improved colonoscopy outcomes.

Article Abstract

Detection and diagnosis of colon polyps are key to preventing colorectal cancer. Recent evidence suggests that AI-based computer-aided detection (CADe) and computer-aided diagnosis (CADx) systems can enhance endoscopists' performance and boost colonoscopy effectiveness. However, most available public datasets primarily consist of still images or video clips, often at a down-sampled resolution, and do not accurately represent real-world colonoscopy procedures. We introduce the REAL-Colon (Real-world multi-center Endoscopy Annotated video Library) dataset: a compilation of 2.7 M native video frames from sixty full-resolution, real-world colonoscopy recordings across multiple centers. The dataset contains 350k bounding-box annotations, each created under the supervision of expert gastroenterologists. Comprehensive patient clinical data, colonoscopy acquisition information, and polyp histopathological information are also included in each video. With its unprecedented size, quality, and heterogeneity, the REAL-Colon dataset is a unique resource for researchers and developers aiming to advance AI research in colonoscopy. Its openness and transparency facilitate rigorous and reproducible research, fostering the development and benchmarking of more accurate and reliable colonoscopy-related algorithms and models.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11127922PMC
http://dx.doi.org/10.1038/s41597-024-03359-0DOI Listing

Publication Analysis

Top Keywords

real-colon dataset
8
real-world colonoscopy
8
colonoscopy
6
dataset developing
4
real-world
4
developing real-world
4
real-world applications
4
applications colonoscopy
4
colonoscopy detection
4
detection diagnosis
4

Similar Publications

Background: Ensuring accurate polyp detection during colonoscopy is essential for preventing colorectal cancer (CRC). Recent advances in deep learning-based computer-aided detection (CADe) systems have shown promise in enhancing endoscopists' performances. Effective CADe systems must achieve high polyp detection rates from the initial seconds of polyp appearance while maintaining low false positive (FP) detection rates throughout the procedure.

View Article and Find Full Text PDF
Article Synopsis
  • - Detection and diagnosis of colon polyps are crucial for preventing colorectal cancer, and AI technologies can improve colonoscopy effectiveness through computer-aided detection (CADe) and diagnosis (CADx) systems.
  • - The REAL-Colon dataset offers a large collection of 2.7 million native video frames from real-world colonoscopy, featuring 350,000 expert-annotated bounding boxes, which provide a more realistic dataset compared to existing down-sampled images.
  • - This dataset includes comprehensive patient and procedural data, promoting transparency and enabling researchers to develop and benchmark more accurate AI algorithms for improved colonoscopy outcomes.
View Article and Find Full Text PDF

Few-shot Tumor Bud Segmentation Using Generative Model in Colorectal Carcinoma.

Proc SPIE Int Soc Opt Eng

February 2024

Center for Artificial Intelligence Research, Wake Forest University, School of Medicine, Winston-Salem, NC, USA.

Current deep learning methods in histopathology are limited by the small amount of available data and time consumption in labeling the data. Colorectal cancer (CRC) tumor budding quantification performed using H&E-stained slides is crucial for cancer staging and prognosis but is subject to labor-intensive annotation and human bias. Thus, acquiring a large-scale, fully annotated dataset for training a tumor budding (TB) segmentation/detection system is difficult.

View Article and Find Full Text PDF

Patient classification through feature selection (FS) based on gene expression data (GED) has already become popular to the research communities. T-test is the well-known statistical FS method in GED analysis. However, it produces higher false positives and lower accuracies for small sample sizes or in presence of outliers.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!