AI Article Synopsis

  • The paper introduces a navigation and collision avoidance system for visually impaired individuals using vision transformers and multimodal feedback, focusing on real-time object detection and identification.
  • The system employs semantic segmentation and algorithms to create trajectory vectors for detected objects, predicting potential collisions, and integrates audio and vibrotactile alerts for warning users.
  • Based on a diverse dataset of 27,867 images, the model achieved 95% accuracy, indicating strong performance, with further testing planned to validate the usability and effectiveness of the feedback mechanisms.

Article Abstract

This paper presents a system that utilizes vision transformers and multimodal feedback modules to facilitate navigation and collision avoidance for the visually impaired. By implementing vision transformers, the system achieves accurate object detection, enabling the real-time identification of objects in front of the user. Semantic segmentation and the algorithms developed in this work provide a means to generate a trajectory vector of all identified objects from the vision transformer and to detect objects that are likely to intersect with the user's walking path. Audio and vibrotactile feedback modules are integrated to convey collision warning through multimodal feedback. The dataset used to create the model was captured from both indoor and outdoor settings under different weather conditions at different times across multiple days, resulting in 27,867 photos consisting of 24 different classes. Classification results showed good performance (95% accuracy), supporting the efficacy and reliability of the proposed model. The design and control methods of the multimodal feedback modules for collision warning are also presented, while the experimental validation concerning their usability and efficiency stands as an upcoming endeavor. The demonstrated performance of the vision transformer and the presented algorithms in conjunction with the multimodal feedback modules show promising prospects of its feasibility and applicability for the navigation assistance of individuals with vision impairment.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10455554PMC
http://dx.doi.org/10.3390/jimaging9080161DOI Listing

Publication Analysis

Top Keywords

multimodal feedback
16
feedback modules
16
vision transformer
12
visually impaired
8
vision transformers
8
collision warning
8
vision
6
feedback
5
transformer customized
4
customized environment
4

Similar Publications

Background: Entropy quantifies the level of disorder within a system. Low entropy reflects increased rigidity of homeostatic feedback systems possibly reflecting failure of protective physiological mechanisms like cerebral autoregulation. In traumatic brain injury (TBI), low entropy of heart rate and intracranial pressure (ICP) predict unfavorable outcome.

View Article and Find Full Text PDF

Different whole-brain computational models have been recently developed to investigate hypotheses related to brain mechanisms. Among these, the Dynamic Mean Field (DMF) model is particularly attractive, combining a biophysically realistic model that is scaled up via a mean-field approach and multimodal imaging data. However, an important barrier to the widespread usage of the DMF model is that current implementations are computationally expensive, supporting only simulations on brain parcellations that consider less than 100 brain regions.

View Article and Find Full Text PDF

This literature review explores the role of biofeedback therapy (BFT) in managing functional fecal incontinence (FFI) in children - a common condition with a substantial impact on the quality of life. FFI diagnosis relies primarily on medical history and thorough physical examination and is categorized by the Rome IV criteria into functional constipation (FC) and functional nonretentive fecal incontinence (FNRFI). Treatment options for FFI remain limited, particularly for FNRFI.

View Article and Find Full Text PDF

Aims: Health care transition (HCT) to adult care and young adult disease self-management is a multi-step process involving three major stakeholders - the adolescent, the caregiver, and the provider. Preparation gaps exist within each of these stakeholder groups. This paper presents the development of the Intervention to Promote Autonomy and Competence in Transition-aged Youth (IPACT), a multi-level (adolescent, caregiver, provider), multi-modal (interactive skill building sessions, educational materials, videos) intervention to address gaps in all three stakeholder groups simultaneously and help support achieving the three core elements of HCT planning.

View Article and Find Full Text PDF

Development of Multimodal Training Tool for Preterm Infant Care.

Indian J Pediatr

December 2024

Department of Pediatrics, WHO Collaborating Center for Newborn Care and Research, All India Institute of Medical Sciences, New Delhi, 110 029, India.

Objectives: Comprehensive packages for training healthcare professionals who care for premature newborns are scarce. AIIMS WHO CC, New Delhi, partnered with India Association of Neonatal Nurses (IANN) and National Neonatology Forum (NNF) and 35 national leaders in the field of newborn care to develop a multimodal educational package for preterm infants.

Methods: Findings from evaluation of four representative special newborn care units in the state of Madhya Pradesh informed the content of the package.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!