Using cascade CNN-LSTM-FCNs to identify AI-altered video based on eye state sequence.

PLoS One

Faculty of Electrical and Electronics Engineering Technology, Universiti Malaysia Pahang, Pekan Campus, Pekan, Pahang, Malaysia.

Published: December 2022

Deep learning is notably successful in data analysis, computer vision, and human control. Nevertheless, this approach has inevitably allowed the development of DeepFake video sequences and images that could be altered so that the changes are not easily or explicitly detectable. Such alterations have been recently used to spread false news or disinformation. This study aims to identify Deepfaked videos and images and alert viewers to the possible falsity of the information. The current work presented a novel means of revealing fake face videos by cascading the convolution network with recurrent neural networks and fully connected network (FCN) models. The system detection approach utilizes the eye-blinking state in temporal video frames. Notwithstanding, it is deemed challenging to precisely depict (i) artificiality in fake videos and (ii) spatial information within the individual frame through this physiological signal. Spatial features were extracted using the VGG16 network and trained with the ImageNet dataset. The temporal features were then extracted in every 20 sequences through the LSTM network. On another note, the pre-processed eye-blinking state served as a probability to generate a novel BPD dataset. This newly-acquired dataset was fed to three models for training purposes with each entailing four, three, and six hidden layers, respectively. Every model constitutes a unique architecture and specific dropout value. Resultantly, the model optimally and accurately identified tampered videos within the dataset. The study model was assessed using the current BPD dataset based on one of the most complex datasets (FaceForensic++) with 90.8% accuracy. Such precision was successfully maintained in datasets that were not used in the training process. The training process was also accelerated by lowering the computation prerequisites.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9754287PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0278989PLOS

Publication Analysis

Top Keywords

eye-blinking state
8
features extracted
8
bpd dataset
8
training process
8
dataset
5
cascade cnn-lstm-fcns
4
cnn-lstm-fcns identify
4
identify ai-altered
4
ai-altered video
4
video based
4

Similar Publications

Background: Diagnostic and prognostic decision-making in patients with Disorders of Consciousness (DoC) is challenging. It has been suggested that spontaneous eye blink rate is an index of patients' level of consciousness easy to detect in clinical practice. Further blinking features (i.

View Article and Find Full Text PDF

Blinking contributes to the health and protection of the eye and also holds potential in the context of muscle or nerve disorder diagnosis. Traditional methods of classifying eye blinking as open or closed are insufficient, as they do not capture medical-relevant aspects like closure speed, duration, or percentage. The issue could be solved by reliably detecting blinking intervals in high-temporal recordings.

View Article and Find Full Text PDF

Accurately evaluating cognitive load during work-related tasks in complex real-world environments is challenging, leading researchers to investigate the use of eye blinking as a fundamental pacing mechanism for segmenting EEG data and understanding the neural mechanisms associated with cognitive workload. Yet, little is known about the temporal dynamics of eye blinks and related visual processing in relation to the representation of task-specific information. Therefore, we analyzed EEG responses from two experiments involving simulated driving (re-active and pro-active) with three levels of task load for each, as well as operating a steam engine (active vs.

View Article and Find Full Text PDF

In recent years, limited works on EOG (electrooculography)-based biometric authentication systems have been carried out with eye movements or eye blinking activities in the current literature. EOGs have permanent and unique traits that can separate one individual from another. In this work, we have investigated FSST (Fourier Synchrosqueezing Transform)-ICA (Independent Component Analysis)-EMD (Empirical Mode Decomposition) robust framework-based EOG-biometric authentication ( verification) performances using ensembled RNN (Recurrent Neural Network) deep models voluntary eye blinkings movements.

View Article and Find Full Text PDF

Drowsy driving can significantly affect driving performance and overall road safety. Statistically, the main causes are decreased alertness and attention of the drivers. The combination of deep learning and computer-vision algorithm applications has been proven to be one of the most effective approaches for the detection of drowsiness.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!