We can visually discriminate and recognize a wide range of materials. Meanwhile, we use language to express our subjective understanding of visual input and communicate relevant information about the materials. Here, we investigate the relationship between visual judgment and language expression in material perception to understand how visual features relate to semantic representations. We use deep generative networks to construct an expandable image space to systematically create materials of well-defined and ambiguous categories. From such a space, we sampled diverse stimuli and compared the representations of materials from two behavioral tasks: visual material similarity judgments and free-form verbal descriptions. Our findings reveal a moderate but significant correlation between vision and language on a categorical level. However, analyzing the representations with an unsupervised alignment method, we discover structural differences that arise at the image-to-image level, especially among materials morphed between known categories. Moreover, visual judgments exhibit more individual differences compared to verbal descriptions. Our results show that while verbal descriptions capture material qualities on the coarse level, they may not fully convey the visual features that characterize the material's optical properties. Analyzing the image representation of materials obtained from various pre-trained data-rich deep neural networks, we find that human visual judgments' similarity structures align more closely with those of the text-guided visual-semantic model than purely vision-based models. Our findings suggest that while semantic representations facilitate material categorization, non-semantic visual features also play a significant role in discriminating materials at a finer level. This work illustrates the need to consider the vision-language relationship in building a comprehensive model for material perception. Moreover, we propose a novel framework for quantitatively evaluating the alignment and misalignment between representations from different modalities, leveraging information from human behaviors and computational models.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10849714PMC
http://dx.doi.org/10.1101/2024.01.25.577219DOI Listing

Publication Analysis

Top Keywords

material perception
12
visual features
12
verbal descriptions
12
vision language
8
visual
8
semantic representations
8
materials
7
material
6
representations
5
probing link
4

Similar Publications

The human visual nervous system excels at recognizing and processing external stimuli, essential for various physiological functions. Biomimetic visual systems leverage biological synapse properties to improve memory encoding and perception. Optoelectronic devices mimicking these synapses can enhance wearable electronics, with layered heterojunction materials being ideal materials for optoelectronic synapses due to their tunable properties and biocompatibility.

View Article and Find Full Text PDF

Background: Existing studies on breast cancer survivors (BCS) have primarily focused on individual aspects of either diet or exercise preferences and barriers. Our study aims to examine BCS' perceptions toward diet and exercise combined. Given the transformative impact of COVID-19, there is a crucial need for insights in the post-pandemic era to address the distinct challenges faced by BCS in maintaining their health and well-being.

View Article and Find Full Text PDF

Objective: To investigate the dynamics of collaborative learning in team-based learning (TBL) through students' reflections and feedback.

Methods: A phenomenological mixed-methods approach was adopted where the survey and reflections were conducted concurrently after the TBL session and the results were analyzed. The study employed a mini-cluster technique to include all first-year MBBS students of batch 2023-24 with an age range between 19 and 22 years.

View Article and Find Full Text PDF

Introduction: Pharmacy-based vaccination services are now available in 56 countries, including Romania, that started administering the flu-vaccines in the community pharmacies from 2022. Assessing how pharmacists managed this new pharmaceutical service in Romania is the subject of this study.

Methods: A cross-sectional study was conducted among all the pharmacies from Romania that were authorized to provide this service (442 pharmacies, from which 53 were in rural areas).

View Article and Find Full Text PDF

A cross-sectional study was conducted to determine the seroprevalence and potential risk factors of camel brucellosis and to assess public health awareness of the disease in the selected kebele of Arero District, Borena Zone, Southern Ethiopia. A total of 313 blood samples were collected from selected camels using a systematic random sampling technique. The serum samples underwent initial screening for brucellosis using the rose Bengal plate test (RBPT), with further confirmation through the indirect enzyme-linked immunosorbent Assay (i-ELISA).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!