Publications by Kun-Ching Wang

Robust Audio Content Classification Using Hybrid-Based SMD and Entropy-Based VAD.

Entropy (Basel)

February 2020

This paper proposes a robust approach to audio content classification (ACC), particularly under variable noise-level conditions. Speech, music, and background noise (also called silence) are usually mixed in a noisy audio signal. Based on this observation, we propose a hierarchical ACC approach consisting of three parts: voice activity detection (VAD), speech/music discrimination (SMD), and post-processing.

Time-frequency feature representation using multi-resolution texture analysis and acoustic activity detector for real-life speech emotion recognition.

Kun-Ching Wang

Sensors (Basel)

January 2015

The classification of emotional speech is mostly considered in speech-related research on human-computer interaction (HCI). This paper presents a novel feature extraction method based on multi-resolution texture image information (MRTII). The MRTII feature set is derived from multi-resolution texture analysis to characterize and classify different emotions in a speech signal.

The feature extraction based on texture image information for emotion sensing in speech.

Kun-Ching Wang

Sensors (Basel)

September 2014

In this paper, we present a novel texture image feature for Emotion Sensing in Speech (ESS). The idea is based on the fact that texture images carry emotion-related information. The features are derived from the time-frequency representation of spectrogram images.

A novel voice sensor for the detection of speech signals.

Kun-Ching Wang

Sensors (Basel)

December 2013

To develop a novel voice sensor that detects human voices, it is important to use features that are more robust to noise. A voice sensor is also called voice activity detection (VAD). Because the formant structure inherently occurs only in the speech spectrogram (commonly known as a voiceprint), Wu et al.
