Bottom-up and top-down as well as low-level and high-level factors influence where we fixate when viewing natural scenes. However, the importance of each of these factors and how they interact remains a matter of debate. Here, we disentangle these factors by analyzing their influence over time. For this purpose, we develop a saliency model that is based on the internal representation of a recent early spatial vision model to measure the low-level, bottom-up factor. To measure the influence of high-level, bottom-up features, we use a recent deep neural network-based saliency model. To account for top-down influences, we evaluate the models on two large data sets with different tasks: first, a memorization task and, second, a search task. Our results lend support to a separation of visual scene exploration into three phases: the first saccade, an initial guided exploration characterized by a gradual broadening of the fixation density, and a steady state that is reached after roughly 10 fixations. Saccade-target selection during the initial exploration and in the steady state is related to similar areas of interest, which are better predicted when including high-level features. In the search data set, fixation locations are determined predominantly by top-down processes. In contrast, the first fixation follows a different fixation density and contains a strong central fixation bias. Nonetheless, first fixations are guided strongly by image properties, and as early as 200 ms after image onset, fixations are better predicted by high-level information. We conclude that any low-level, bottom-up factors are mainly limited to the generation of the first saccade. All saccades are better explained when high-level features are considered, and later, this high-level, bottom-up control can be overruled by top-down influences.

Download full-text PDF

Source
http://dx.doi.org/10.1167/19.3.1DOI Listing

Publication Analysis

Top Keywords

saliency model
8
low-level bottom-up
8
high-level bottom-up
8
top-down influences
8
fixation density
8
steady state
8
better predicted
8
high-level features
8
high-level
7
top-down
5

Similar Publications

Background: Although it is well established that lower cognitive performance, on average, is associated with a greater risk of developing Alzheimer's disease (AD) dementia, it is unclear whether distinct cognitively-defined subgroups exist among non-demented older adults and whether such profiles map onto distinct AD neuroimaging measure profiles.

Method: The sample consisted of 167 non-demented older adults from the BIOCARD study with comprehensive neuropsychological and clinical evaluations, amyloid PET and brain MRI scans. The MRI measure included: global cortical volume in AD-signature regions and a medial-temporal lobe composite; resting-state functional connectivity within 5 large-scale cognitive networks; global white matter microstructure, index by fractional anisotropy (FA) and radial diffusion (RD) on DTI scans; and global white matter hyperintensity (WMH) volume on FLAIR scans.

View Article and Find Full Text PDF

The salient object detection task based on deep learning has made significant advances. However, the existing methods struggle to capture long-range dependencies and edge information in complex images, which hinders precise prediction of salient objects. To this end, we propose a salient object detection method with non-local feature enhancement and edge reconstruction.

View Article and Find Full Text PDF

The intelligent identification of wear particles in ferrography is a critical bottleneck that hampers the development and widespread adoption of ferrography technology. To address challenges such as false detection, missed detection of small wear particles, difficulty in distinguishing overlapping and similar abrasions, and handling complex image backgrounds, this paper proposes an algorithm called TCBGY-Net for detecting wear particles in ferrography images. The proposed TCBGY-Net uses YOLOv5s as the backbone network, which is enhanced with several advanced modules to improve detection performance.

View Article and Find Full Text PDF

The atypical static brain functions related to the executive control network (ECN), default mode network (DMN), and salience network (SN) in people with autism spectrum disorder (ASD) has been widely reported. However, their transient functions in ASD are not clear. We aim to identify transient network states (TNSs) using coactivation pattern (CAP) analysis to characterize the age-related atypical transient functions in ASD.

View Article and Find Full Text PDF

Attention-based image segmentation and classification model for the preoperative risk stratification of thyroid nodules.

World J Surg

December 2024

Monash University Endocrine Surgery Unit, Department of General Surgery, Alfred Hospital, Melbourne, Victoria, Australia.

Background: Despite widespread use of standardized classification systems, risk stratification of thyroid nodules is nuanced and often requires diagnostic surgery. Genomic sequencing is available for this dilemma however, costs and access restricts global applicability. Artificial intelligence (AI) has the potential to overcome this issue nevertheless, the need for black-box interpretability is pertinent.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!