Towards Efficient Scene Understanding via Squeeze Reasoning.

Xiangtai Li Xia Li Ansheng You Li Zhang Guangliang Cheng Kuiyuan Yang Yunhai Tong Zhouchen Lin

IEEE Trans Image Process

Published: August 2021

Graph-based convolutional model such as non-local block has shown to be effective for strengthening the context modeling ability in convolutional neural networks (CNNs). However, its pixel-wise computational overhead is prohibitive which renders it unsuitable for high resolution imagery. In this paper, we explore the efficiency of context graph reasoning and propose a novel framework called Squeeze Reasoning. Instead of propagating information on the spatial map, we first learn to squeeze the input feature into a channel-wise global vector and perform reasoning within the single vector where the computation cost can be significantly reduced. Specifically, we build the node graph in the vector where each node represents an abstract semantic concept. The refined feature within the same semantic category results to be consistent, which is thus beneficial for downstream tasks. We show that our approach can be modularized as an end-to-end trained block and can be easily plugged into existing networks. Despite its simplicity and being lightweight, the proposed strategy allows us to establish the considerable results on different semantic segmentation datasets and shows significant improvements with respect to strong baselines on various other scene understanding tasks including object detection, instance segmentation and panoptic segmentation. Code is available at https://github.com/lxtGH/SFSegNets.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TIP.2021.3099369	DOI Listing

Publication Analysis

Top Keywords

scene understanding

squeeze reasoning

efficient scene

understanding squeeze

reasoning

reasoning graph-based

graph-based convolutional

convolutional model

model non-local

non-local block

Similar Publications

From the backroom to the boardroom: Health care chief information officers, stereotypes, and strategic leadership in the digital transformation era.

Health Care Manage Rev

January 2025

Michele L Heath Geoffrey A Silvera Tracy H Porter

Issue: The digital transformation of the U.S. health care system is underway, but the role of health care chief information officers (HCIOs) in that transformation has been unclear.

View Article and Find Full Text PDF

Similar Publications

Use of reference markers in the speech of people with schizophrenia spectrum disorders: Evidence from two referential communication tasks manipulating common ground with the interaction partner.

Schizophr Res Cogn

June 2025

Institut des sciences logopédiques, Université de Neuchâtel, Rue Pierre-à-Mazel 7, CH-2000 Neuchâtel, Switzerland.

Amélie M Achim Dominique Knutsen Marc-André Roy Souleymane Gadio Marion Fossard

Introduction: People with schizophrenia spectrum disorders present with language dysfunctions, yet we know little about their use of reference markers (indefinite markers, definite markers, pronouns or names), a fundamental aspect of efficient speech production.

Methods: Twenty-five (25) participants with a recent-onset schizophrenia spectrum disorder (SZ) and 25 healthy controls (HC) completed two referential communication tasks. The tasks involved presenting to an interaction partner a series of movie characters (character identification task) and movie scenes composed of six images (narration task).

View Article and Find Full Text PDF

Similar Publications

Priors for natural image statistics inform confidence in perceptual decisions.

Conscious Cogn

January 2025

University of the Sunshine Coast.

Rebecca K West Emily J A-Izzeddin David K Sewell William J Harrison

Decision confidence plays a critical role in humans' ability to make adaptive decisions in a noisy perceptual world. Despite its importance, there is currently little consensus about the computations underlying confidence judgements in perceptual decisions. To better understand these mechanisms, we addressed the extent to which confidence is informed by a naturalistic prior distribution.

View Article and Find Full Text PDF

Similar Publications

Improvement of flipped classroom teaching in colleges and universities based on virtual reality assisted by deep learning.

Sci Rep

January 2025

School of Electronic and Information Engineering, Changsha Institute of Technology, Changsha, 410200, China.

Wenxia Dai Qinqing Kang

In order to solve the limitations of flipped classroom in personalized teaching and interactive effect improvement, this paper designs a new model of flipped classroom in colleges and universities based on Virtual Reality (VR) by combining the algorithm of Contrastive Language-Image Pre-Training (CLIP). Through cross-modal data fusion, the model deeply combines students' operation behavior with teaching content, and improves teaching effect through intelligent feedback mechanism. The test data shows that the similarity between video and image modes reaches 0.

View Article and Find Full Text PDF

Similar Publications

Enhancing doctor-patient communication using large language models for pathology report interpretation.

BMC Med Inform Decis Mak

January 2025

Department of Thoracic Surgery, Guizhou Provincial People's Hospital, No. 83, Zhongshan East Road, Guiyang, Guizhou, 550000, China.

Xiongwen Yang Yi Xiao Di Liu Yun Zhang Huiyin Deng

Background: Large language models (LLMs) are increasingly utilized in healthcare settings. Postoperative pathology reports, which are essential for diagnosing and determining treatment strategies for surgical patients, frequently include complex data that can be challenging for patients to comprehend. This complexity can adversely affect the quality of communication between doctors and patients about their diagnosis and treatment options, potentially impacting patient outcomes such as understanding of their condition, treatment adherence, and overall satisfaction.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!