VOWEL DURATION MEASUREMENT USING DEEP NEURAL NETWORKS.

IEEE Int Workshop Mach Learn Signal Process

Dept. of Linguistics, Northwestern University, Evanston, IL, USA.

Published: September 2015

Vowel durations are most often utilized in studies addressing specific issues in phonetics. Thus far this has been hampered by a reliance on subjective, labor-intensive manual annotation. Our goal is to build an algorithm for automatic accurate measurement of vowel duration, where the input to the algorithm is a speech segment contains one vowel preceded and followed by consonants (CVC). Our algorithm is based on a deep neural network trained at the frame level on manually annotated data from a phonetic study. Specifically, we try two deep-network architectures: convolutional neural network (CNN), and deep belief network (DBN), and compare their accuracy to an HMM-based forced aligner. Results suggest that CNN is better than DBN, and both CNN and HMM-based forced aligner are comparable in their results, but neither of them yielded the same predictions as models fit to manually annotated data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5636193PMC
http://dx.doi.org/10.1109/MLSP.2015.7324331DOI Listing

Publication Analysis

Top Keywords

vowel duration
8
deep neural
8
neural network
8
manually annotated
8
annotated data
8
hmm-based forced
8
forced aligner
8
vowel
4
duration measurement
4
measurement deep
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!