Multimodal emotion recognition research is gaining attention because of the emerging trend of integrating information from different sensory modalities to improve performance. Electroencephalogram (EEG) signals are considered objective indicators of emotions and provide precise insights despite their complex data collection. In contrast, eye movement signals are more susceptible to environmental and individual differences but offer convenient data collection. Conventional emotion recognition methods typically use separate models for different modalities, potentially overlooking their inherent connections. This study introduces a cross-modal guiding neural network designed to fully leverage the strengths of both modalities. The network includes a dual-branch feature extraction module that simultaneously extracts features from EEG and eye movement signals. In addition, the network includes a feature guidance module that uses EEG features to direct eye movement feature extraction, reducing the impact of subjective factors. This study also introduces a feature reweighting module to explore emotion-related features within eye movement signals, thereby improving emotion classification accuracy. The empirical findings from both the SEED-IV dataset and our collected dataset substantiate the commendable performance of the model, thereby confirming its efficacy.

Download full-text PDF

Source
http://dx.doi.org/10.1109/JBHI.2024.3419043DOI Listing

Publication Analysis

Top Keywords

eye movement
20
movement signals
16
emotion recognition
12
cross-modal guiding
8
guiding neural
8
neural network
8
multimodal emotion
8
eeg eye
8
data collection
8
study introduces
8

Similar Publications

End-range movements are among the most demanding but least understood in the sport of tennis. Using male Hawk-Eye data from match-play during the 2021-2023 Australian Open tournaments, we evaluated the speed, deceleration, acceleration, and shot quality characteristics of these types of movement in men's Grand Slam tennis. Lateral end-range movements that incorporated a change of direction (CoD) were identified for analysis using k-means (end-range) and random forest (CoD) machine learning models.

View Article and Find Full Text PDF

Increased attention towards progress information near a goal state.

Psychon Bull Rev

January 2025

Department of Psychology, McGill University, 2001 Av. McGill College, Montréal, QC, H3A 1G1, Canada.

A growing body of evidence across psychology suggests that (cognitive) effort exertion increases in proximity to a goal state. For instance, previous work has shown that participants respond more quickly, but not less accurately, when they near a goal-as indicated by a filling progress bar. Yet it remains unclear when over the course of a cognitively demanding task do people monitor progress information: Do they continuously monitor their goal progress over the course of a task, or attend more frequently to it as they near their goal? To answer this question, we used eye-tracking to examine trial-by-trial changes in progress monitoring as participants completed blocks of an attentionally demanding oddball task.

View Article and Find Full Text PDF

How are arbitrary sequences of verbal information retained and manipulated in working memory? Increasing evidence suggests that serial order in verbal WM is spatially coded and that spatial attention is involved in access and retrieval. Based on the idea that brain areas controlling spatial attention are also involved in oculomotor control, we used eye tracking to reveal how the spatial structure of serial order information is accessed in verbal working memory. In two experiments, participants memorized a sequence of auditory words in the correct order.

View Article and Find Full Text PDF

We conducted two experiments to examine the lexical and sub-lexical processing of Chinese two-character words in reading. We used a co-registration electroencephalogram (EEG) for the first fixation on target words. In Experiment 1, whole-word occurrence frequency and initial constituent character frequency were orthogonally manipulated, while in Experiment 2, whole-word occurrence frequency and end constituent character frequency were orthogonally manipulated.

View Article and Find Full Text PDF

Visual search becomes slower with aging, particularly when targets are difficult to discriminate from distractors. Multiple distractor rejection processes may contribute independently to slower search times: dwelling on, skipping of, and revisiting of distractors, measurable by eye-tracking. The present study investigated how age affects each of the distractor rejection processes, and how these contribute to the final search times in difficult (inefficient) visual search.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!