With the development of artificial intelligence, speech recognition and prediction have become one of the important research domains with wild applications, such as intelligent control, education, individual identification, and emotion analysis. Chinese poetry reading contains rich features of continuous pronunciations, such as mood, emotion, rhythm schemes, lyric reading, and artistic expression. Therefore, the prediction of the pronunciation characteristics of a Chinese poetry reading is the significance for the presentation of high-level machine intelligence and has the potential to create a high-level intelligent system for teaching children to read Tang poetry. Mel frequency cepstral coefficient (MFCC) is currently used to present important speech features. Due to the complexity and high degree of nonlinearity in poetry reading, however, there is a tough challenge facing accurate pronunciation feature prediction, that is, how to model complex spatial correlations and time dynamics, such as rhyme schemes. As for many current methods, they ignore the spatial and temporal characteristics in MFCC presentation. In addition, these methods are subjected to certain limitations on prediction for long-term performance. In order to solve these problems, we propose a novel spatial-temporal graph model (STGM-MHA) based on multihead attention for the purpose of pronunciation feature prediction of Chinese poetry. The STGM-MHA is designed using an encoder-decoder structure. The encoder compresses the data into a hidden space representation, while the decoder reconstructs the hidden space representation as output. In the model, a novel gated recurrent unit (GRU) module (AGRU) based on multihead attention is proposed to extract the spatial and temporal features of MFCC data effectively. The evaluation comparison of our proposed model versus state-of-the-art methods in six datasets reveals the clear advantage of the proposed model.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2022.3165554DOI Listing

Publication Analysis

Top Keywords

chinese poetry
16
pronunciation feature
12
feature prediction
12
poetry reading
12
spatial-temporal graph
8
graph model
8
prediction chinese
8
spatial temporal
8
based multihead
8
multihead attention
8

Similar Publications

With the advancement of internet of things (IoT) and artificial intelligence (AI) technology, access to large-scale bilingual parallel data has become more efficient, thereby accelerating the development and application of machine translation. Given the increasing cultural exchanges between China and Japan, many scholars have begun to study the Chinese translation of Japanese waka poetry. Based on this, the study first explores the structure of waka and the current state of its Chinese translations, analyzing existing translation disputes and introducing a data collection method for waka using IoT.

View Article and Find Full Text PDF

Classical poetry, chinese literati, chinese physicians, and pediatric neurosurgeons.

Childs Nerv Syst

November 2024

Department of Neurosurgery, Hebei Children's Hospital, Hebei Medical University, Shijiazhuang, Hebei, China.

View Article and Find Full Text PDF

Visual analysis of geographical distribution of poets in Song China based on Complete Song Poetry.

PLoS One

September 2024

School of Chinese Languages and Literatures, Lanzhou University, Lanzhou, P. R. China.

This article analyzes the geographical distribution of poets in Song China based on Complete Song Poetry (Quansongshi), which includes poems from over 9000 poets, with 6056 of them having clear native place. Visualization strategy is used to present the geographical distribution of these 6056 poets, and its formation factors are also analyzed. The results reveal that majority of the poets come from Zhejiang, Henan and Sichuan provinces, with Guangdong province also exceeding a hundred poets despite its remote location.

View Article and Find Full Text PDF

From a multiple-histories perspective, this paper attempts to restore the feature framework of Tang Dynasty gardens by describing the environment panorama of that era. Tang Dynasty gardens have their own unique and complex environment features, which is crucial for understanding Chinese classical gardens. This research developed a massive text mining method of historical-document to detect all garden-related elements in Tang poetry.

View Article and Find Full Text PDF

As an important carrier of culture, poetry plays a significant role in deepening language learners' understanding of the target language culture as well as enhancing their language skills; however, the effect of the target language culture on language learners' enjoyment of poetry remains unclear. The study served as an attempt to shed light on the point of whether the target language culture has different effects on high- and low-level Chinese Arabic learners' fondness for Arabic poetry with the use of pictures related to Arabic culture and those not related to Arabic culture. In the current study, 40 Arabic learners (20 high-level and 20 low-level) scored the Arabic poem line based on their fondness for it after viewing two kinds of picture with electroencephalogram (EEG) recording.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!