An Electroencephalography (EEG) dataset utilizing rich text stimuli can advance the understanding of how the brain encodes semantic information and contribute to semantic decoding in brain-computer interface (BCI). Addressing the scarcity of EEG datasets featuring Chinese linguistic stimuli, we present the ChineseEEG dataset, a high-density EEG dataset complemented by simultaneous eye-tracking recordings. This dataset was compiled while 10 participants silently read approximately 13 hours of Chinese text from two well-known novels. This dataset provides long-duration EEG recordings, along with pre-processed EEG sensor-level data and semantic embeddings of reading materials extracted by a pre-trained natural language processing (NLP) model. As a pilot EEG dataset derived from natural Chinese linguistic stimuli, ChineseEEG can significantly support research across neuroscience, NLP, and linguistics. It establishes a benchmark dataset for Chinese semantic decoding, aids in the development of BCIs, and facilitates the exploration of alignment between large language models and human cognitive processes. It can also aid research into the brain's mechanisms of language processing within the context of the Chinese natural language.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11137001PMC
http://dx.doi.org/10.1038/s41597-024-03398-7DOI Listing

Publication Analysis

Top Keywords

eeg dataset
16
chinese linguistic
12
dataset
8
semantic decoding
8
linguistic stimuli
8
stimuli chineseeeg
8
natural language
8
language processing
8
eeg
7
semantic
5

Similar Publications

Enhancing motor disability assessment and its imagery classification is a significant concern in contemporary medical practice, necessitating reliable solutions to improve patient outcomes. One promising avenue is the use of brain-computer interfaces (BCIs), which establish a direct communication pathway between users and machines. This technology holds the potential to revolutionize human-machine interaction, especially for individuals diagnosed with motor disabilities.

View Article and Find Full Text PDF

The objective was to test the generalisability of electroencephalography (EEG) markers of future pain using two independent datasets. Datasets, A [N = 20] and B [N = 35], were collected from participants with subacute spinal cord injury who did not have neuropathic pain at the time of recording. In both datasets, some participants developed pain within six months, (PDP) will others did not (PNP).

View Article and Find Full Text PDF

Zipper Pattern: An Investigation into Psychotic Criminal Detection Using EEG Signals.

Diagnostics (Basel)

January 2025

Department of Digital Forensics Engineering, Technology Faculty, Firat University, Elazig 23119, Turkey.

Electroencephalography (EEG) signal-based machine learning models are among the most cost-effective methods for information retrieval. In this context, we aimed to investigate the cortical activities of psychotic criminal subjects by deploying an explainable feature engineering (XFE) model using an EEG psychotic criminal dataset. In this study, a new EEG psychotic criminal dataset was curated, containing EEG signals from psychotic criminal and control groups.

View Article and Find Full Text PDF

Emotion recognition is an advanced technology for understanding human behavior and psychological states, with extensive applications for mental health monitoring, human-computer interaction, and affective computing. Based on electroencephalography (EEG), the biomedical signals naturally generated by the brain, this work proposes a resource-efficient multi-entropy fusion method for classifying emotional states. First, Discrete Wavelet Transform (DWT) is applied to extract five brain rhythms, i.

View Article and Find Full Text PDF

Background: Decoding motor intentions from electroencephalogram (EEG) signals is a critical component of motor imagery-based brain-computer interface (MI-BCIs). In traditional EEG signal classification, effectively utilizing the valuable information contained within the electroencephalogram is crucial.

Objectives: To further optimize the use of information from various domains, we propose a novel framework based on multi-domain feature rotation transformation and stacking ensemble for classifying MI tasks.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!