The classification of emotional speech is mostly considered in speech-related research on human-computer interaction (HCI). In this paper, the purpose is to present a novel feature extraction based on multi-resolutions texture image information (MRTII). The MRTII feature set is derived from multi-resolution texture analysis for characterization and classification of different emotions in a speech signal. The motivation is that we have to consider emotions have different intensity values in different frequency bands. In terms of human visual perceptual, the texture property on multi-resolution of emotional speech spectrogram should be a good feature set for emotion classification in speech. Furthermore, the multi-resolution analysis on texture can give a clearer discrimination between each emotion than uniform-resolution analysis on texture. In order to provide high accuracy of emotional discrimination especially in real-life, an acoustic activity detection (AAD) algorithm must be applied into the MRTII-based feature extraction. Considering the presence of many blended emotions in real life, in this paper make use of two corpora of naturally-occurring dialogs recorded in real-life call centers. Compared with the traditional Mel-scale Frequency Cepstral Coefficients (MFCC) and the state-of-the-art features, the MRTII features also can improve the correct classification rates of proposed systems among different language databases. Experimental results show that the proposed MRTII-based feature information inspired by human visual perception of the spectrogram image can provide significant classification for real-life emotional recognition in speech.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4327087 | PMC |
http://dx.doi.org/10.3390/s150101458 | DOI Listing |
J Imaging
August 2024
Computer Science and Engineering Department, Thapar Institute of Engineering and Technology, Patiala 147004, Punjab, India.
Sci Rep
March 2024
Department of Petroleum and Mining Engineering, South Tehran Branch, Islamic Azad University, Tehran, Iran.
Electrofacies analysis conducted the distribution effects throughout the reservoir despite the difficulty of characterizing stratigraphic relationships. Clustering methods quantitatively define the reservoir zone from non-reservoir considering electrofacies. Asmari Formation is the most significant reservoir of the Mansouri oilfield in SW Iran, generally composed of carbonate and sandstone layers.
View Article and Find Full Text PDFComput Biol Med
March 2024
School of Information Science and Engineering, Shandong Normal University, Jinan, 250358, China. Electronic address:
Given the constraints posed by hardware capacity, scan duration, and patient cooperation, the reconstruction of magnetic resonance imaging (MRI) images emerges as a pivotal aspect of medical imaging research. Currently, deep learning-based super-resolution (SR) methods have been widely discussed in medical image processing due to their ability to reconstruct high-quality, high resolution (HR) images from low resolution (LR) inputs. However, most existing MRI SR methods are designed for specific magnifications and cannot generate MRI images at arbitrary scales, which hinders the radiologists from fully visualizing the lesions.
View Article and Find Full Text PDFCancers (Basel)
September 2023
Vision Lab, Department of Electrical & Computer Engineering, Old Dominion University, Norfolk, VA 23529, USA.
Recent clinical research describes a subset of glioblastoma patients that exhibit REP prior to the start of radiation therapy. Current literature has thus far described this population using clinicopathologic features. To our knowledge, this study is the first to investigate the potential of conventional radiomics, sophisticated multi-resolution fractal texture features, and different molecular features (MGMT, IDH mutations) as a diagnostic and prognostic tool for prediction of REP from non-REP cases using computational and statistical modeling methods.
View Article and Find Full Text PDFSensors (Basel)
August 2023
Laboratory of Photogrammetry, School of Rural, Surveying and Geoinformatics Engineering, National Technical University of Athens, 15780 Athens, Greece.
In the context of web augmented reality (AR), 3D rendering that maintains visual quality and frame rate requirements remains a challenge. The lack of a dedicated and efficient 3D format often results in the degraded visual quality of the original data and compromises the user experience. This paper examines the integration of web-streamable view-dependent representations of large-sized and high-resolution 3D models in web AR applications.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!