Background And Objective: Recent advances in neural networks and temporal image processing have provided new results and opportunities for vision-based bronchoscopy tracking. However, such progress has been hindered by the lack of comparative experimental data conditions. We address the issue by sharing a novel synthetic dataset, which allows for a fair comparison of methods. Moreover, as incorporating deep learning advances in temporal structures is not yet explored in bronchoscopy navigation, we investigate several neural network architectures for learning temporal information at different levels of subject personalization, providing new insights and results.

Methods: Using our own shared synthetic dataset for bronchoscopy navigation and tracking, we explore deep learning temporal information architectures (Recurrent Neural Networks and 3D convolutions), which have not been fully explored on bronchoscopy tracking, putting a special focus on network efficiency by using a modern backbone (EfficientNet-B0) and ShuffleNet blocks. Finally, we provide a study of different losses for rotation tracking and population modeling schemes (personalized vs. population) for bronchoscopy tracking.

Results: Temporal information architectures provide performance improvements, both in position and angle estimation. Additionally, population scheme analysis illustrates the benefits of offering a personalized model, while loss analysis indicates the benefits of using an adequate metric, improving results. We finally compare with a state-of-the-art model obtaining better results both in performance, with  12.2% and  18.7% improvement for position and rotation respectively, and around  67.6% reduction in memory consumption.

Conclusions: Proposed advances in temporal information architectures, loss configuration, and population scheme definition allow for improving the current state of the art in bronchoscopy analysis. Moreover, the publication of the first synthetic dataset allows for further improving bronchoscopy research by enabling proper comparison grounds among methods.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.cmpb.2022.107241DOI Listing

Publication Analysis

Top Keywords

synthetic dataset
12
temporal architectures
12
bronchoscopy
8
vision-based bronchoscopy
8
neural networks
8
bronchoscopy tracking
8
dataset allows
8
deep learning
8
advances temporal
8
explored bronchoscopy
8

Similar Publications

Improved hardware and processing techniques such as synthetic aperture sonar have led to imaging sonar with centimeter resolution. However, practical limitations and old systems limit the resolution in modern and legacy datasets. This study proposes using single image super resolution based on a conditioned diffusion model to map between images at different resolutions.

View Article and Find Full Text PDF

Multiple sclerosis (MS) is an autoimmune disease of the brain and spinal cord with both inflammatory and neurodegenerative features. Although advances in imaging techniques, particularly magnetic resonance imaging (MRI), have improved the process of diagnosis, its cause is unknown, a cure remains elusive and the evidence base to guide treatment is lacking. Computational techniques like machine learning (ML) have started to be used to understand MS.

View Article and Find Full Text PDF

Introduction: Subsea applications recently received increasing attention due to the global expansion of offshore energy, seabed infrastructure, and maritime activities; complex inspection, maintenance, and repair tasks in this domain are regularly solved with pilot-controlled, tethered remote-operated vehicles to reduce the use of human divers. However, collecting and precisely labeling submerged data is challenging due to uncontrollable and harsh environmental factors. As an alternative, synthetic environments offer cost-effective, controlled alternatives to real-world operations, with access to detailed ground-truth data.

View Article and Find Full Text PDF

Object pose estimation is essential for computer vision applications such as quality inspection, robotic bin picking, and warehouse logistics. However, this task often requires expensive equipment such as 3D cameras or Lidar sensors, as well as significant computational resources. Many state-of-the-art methods for 6D pose estimation depend on deep neural networks, which are computationally demanding and require GPUs for real-time performance.

View Article and Find Full Text PDF

Objectives: Deidentification of personally identifiable information in free-text clinical data is fundamental to making these data broadly available for research. However, there exist gaps in the deidentification landscape with regard to the functionality and flexibility of extant tools, as well as suboptimal tradeoffs between deidentification accuracy and speed. To address these gaps and tradeoffs, we develop a new Python-based deidentification software, pyDeid.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!