Assistive technologies for people with visual impairments (PVI) have made significant advancements, particularly with the integration of artificial intelligence (AI) and real-time sensor technologies. However, current solutions often require PVI to switch between multiple apps and tools for tasks like image recognition, navigation, and obstacle detection, which can hinder a seamless and efficient user experience. In this paper, we present NaviGPT, a high-fidelity prototype that integrates LiDAR-based obstacle detection, vibration feedback, and large language model (LLM) responses to provide a comprehensive, real-time navigation aid for PVI. Unlike existing applications such as Be My AI and Seeing AI, NaviGPT combines image recognition and contextual navigation guidance into a single system, offering continuous feedback on the user's surroundings without the need for app-switching. In addition, NaviGPT compensates for the response delays of the LLM by using location and sensor data, aiming to provide practical and efficient navigation support for PVI in dynamic environments.
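The abstract does not include implementation details, but the described architecture pairs a fast, sensor-driven feedback channel (LiDAR obstacle detection with vibration cues) with a slower, asynchronous LLM channel whose latency is masked using location and sensor data. The following Python sketch illustrates one way such a pipeline could be structured under those assumptions; every function and name here (`read_lidar`, `llm_description`, `moved_too_far`, etc.) is hypothetical and not taken from the paper.

```python
# Hypothetical sketch of a NaviGPT-style feedback pipeline:
# a fast LiDAR-driven haptic loop runs continuously, while slower
# LLM scene descriptions arrive asynchronously and are discarded
# if the user has moved too far since the request was made.
# All names and values are illustrative, not from the paper.

import asyncio
import random
import time
from dataclasses import dataclass


@dataclass
class SensorSnapshot:
    timestamp: float
    position: tuple            # (x, y) in metres, hypothetical local frame
    nearest_obstacle_m: float


def read_lidar() -> SensorSnapshot:
    """Stand-in for a LiDAR depth read; returns simulated values."""
    return SensorSnapshot(
        timestamp=time.time(),
        position=(random.uniform(0, 5), random.uniform(0, 5)),
        nearest_obstacle_m=random.uniform(0.2, 4.0),
    )


def vibration_intensity(distance_m: float) -> float:
    """Map obstacle distance to a haptic intensity in [0, 1]; closer = stronger."""
    return max(0.0, min(1.0, 1.0 - distance_m / 3.0))


async def haptic_loop(state: dict, period_s: float = 0.1) -> None:
    """Fast loop: continuous obstacle feedback, independent of the LLM."""
    while state["running"]:
        snap = read_lidar()
        state["latest"] = snap
        intensity = vibration_intensity(snap.nearest_obstacle_m)
        print(f"[haptic] obstacle {snap.nearest_obstacle_m:.1f} m -> intensity {intensity:.2f}")
        await asyncio.sleep(period_s)


async def llm_description(image_id: int) -> str:
    """Stand-in for an LLM scene-description call with realistic latency."""
    await asyncio.sleep(random.uniform(1.5, 3.0))
    return f"Scene {image_id}: crosswalk ahead, bench on the right."


def moved_too_far(at_request: SensorSnapshot, now: SensorSnapshot,
                  limit_m: float = 2.0) -> bool:
    """Treat an LLM answer as stale if the user moved more than `limit_m`."""
    dx = now.position[0] - at_request.position[0]
    dy = now.position[1] - at_request.position[1]
    return (dx * dx + dy * dy) ** 0.5 > limit_m


async def guidance_loop(state: dict, queries: int = 3) -> None:
    """Slow loop: request LLM descriptions and drop answers that went stale."""
    for i in range(queries):
        at_request = state["latest"]
        text = await llm_description(i)
        if moved_too_far(at_request, state["latest"]):
            print(f"[llm] answer {i} is stale (user moved); skipping")
        else:
            print(f"[llm] {text}")
    state["running"] = False


async def main() -> None:
    state = {"running": True, "latest": read_lidar()}
    await asyncio.gather(haptic_loop(state), guidance_loop(state))


if __name__ == "__main__":
    asyncio.run(main())
```

The key design point this sketch tries to capture is that obstacle feedback never waits on the LLM: the haptic loop keeps running at sensor rate, and the slower LLM channel is reconciled against the latest position before its output is delivered.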
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11727231
DOI: http://dx.doi.org/10.1145/3688828.3699636