Publications by Leyuan Qu

Publications by authors named "Leyuan Qu"

Page 1 of 1

Emphasizing unseen words: New vocabulary acquisition for end-to-end speech recognition.

Leyuan Qu Cornelius Weber Stefan Wermter

Neural Netw

April 2023

Due to the dynamic nature of human language, automatic speech recognition (ASR) systems need to continuously acquire new vocabulary. Out-Of-Vocabulary (OOV) words, such as trending words and new named entities, pose problems to modern ASR systems that require long training times to adapt their large numbers of parameters. Different from most previous research focusing on language model post-processing, we tackle this problem on an earlier processing level and eliminate the bias in acoustic modeling to recognize OOV words acoustically.

View Article and Find Full Text PDF

LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction and Lip Reading.

Leyuan Qu Cornelius Weber Stefan Wermter

IEEE Trans Neural Netw Learn Syst

February 2024

The aim of this work is to investigate the impact of crossmodal self-supervised pre-training for speech reconstruction (video-to-audio) by leveraging the natural co-occurrence of audio and visual streams in videos. We propose LipSound2 that consists of an encoder-decoder architecture and location-aware attention mechanism to map face image sequences to mel-scale spectrograms directly without requiring any human annotations. The proposed LipSound2 model is first pre-trained on ∼ 2400 -h multilingual (e.

View Article and Find Full Text PDF