[Speaker gender identification based on audio fractal dimension and pitch feature].

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi

Biomedical Engineering & Instrument Institute, Hangzhou Dianzi University, Hangzhou 310018, China.

Published: August 2008

AI Article Synopsis

  • This study focuses on improving automatic speaker gender identification by incorporating non-linear parameters, specifically fractal dimension, along with traditional linear parameters like pitch.
  • The research utilizes the lifting scheme to analyze pitch and applies Takens theory for reconstructing the phase space of the fractal dimension, enhancing the feature space used for identification.
  • Experimental results indicate that the combination of multiple dimensional features (pitch, fractal dimension, and fractal dimension complexity) yields more accurate and robust gender identification compared to using single-dimensional linear parameters.

Article Abstract

Automatic speaker gender identification based on voice feature is an important task in voice processing and analysis fields. In this paper non-linear parameters such as fractal dimension are applied to be one part of feature space for improving the ability of describing speaker gender feature through conventional linear parameters method. Pitch is picked using lifting scheme, and audio fractal dimension is extracted. Then based on Takens theory, the time delay method is used to reconstruct the phase space of fractal dimension sequence. And fractal dimension complexity is obtained by calculating Approximate Entropy. Three dimension feature vectors, including the pitch, the fractal dimension and the fractal dimension complexity, are applied to speaker gender identification. Experiment results show that through adding non-linear parameters, compared with the linear parameter using one dimension only such as pitch, the proposed method is more accurate and robust, and thus provides a new way for speaker gender identification.

Download full-text PDF

Source

Publication Analysis

Top Keywords

fractal dimension
28
gender identification
16
speaker gender
16
dimension
9
identification based
8
audio fractal
8
dimension pitch
8
non-linear parameters
8
dimension complexity
8
fractal
7

Similar Publications

Aberrant Cortical Morphological Networks in First-Episode Schizophrenia.

Schizophr Bull

January 2025

Institute for Brain Research and Rehabilitation, South China Normal University, Guangzhou 510631, China.

Background And Hypothesis: Population-based morphological covariance networks are widely reported to be altered in schizophrenia. Individualized morphological brain network approaches have emerged recently. We hypothesize that individualized morphological brain networks are disrupted in schizophrenia.

View Article and Find Full Text PDF

Incorporating nanomaterials into hydrogels allows for the creation of versatile materials with properties that can be precisely tailored by manipulating their nanoscale structures, leading to a wide range of bulk properties. Investigating the structural and property characteristics of composite hydrogels is crucial in tailoring their performance for specific applications. This study focuses on investigating the correlation between the structural arrangement and properties of a composite hydrogel of thermoresponsive polymer, gelatin, and light-responsive antimicrobial porous gold nanorods (PAuNRs).

View Article and Find Full Text PDF

The left ventricular trabecular fractal dimension (LVTFD) derived from cardiac magnetic resonance reflects myocardial trabecular complexity, which is associated with cardiovascular disease risk. Baseline risk stratification of cancer therapy-related cardiac dysfunction (CTRCD) in patients with breast cancer who received anthracycline is a very important clinical issue. In this study, we used the Cox model to derive and validate a new score system based on LVTFD for baseline risk stratification of CTRCD in breast cancer patients receiving anthracycline.

View Article and Find Full Text PDF

Background: Recent studies have focused on evaluating the biomarker value of textural features in radiological images. Our study investigated whether or not a texture analysis of computed tomographic colonography (CTC) images could be a novel biomarker for colorectal cancer (CRC).

Methods: This retrospective study investigated 263 patients with CRC who underwent contrast-enhanced CTC (CE-CTC) before curative surgery between January 2014 and December 2017.

View Article and Find Full Text PDF

The pore structure of shale is a key factor affecting the occurrence and flow of shale gas, and fractal dimensions can be used to quantitatively describe the complexity of the shale pore structure. In this study, the Leping Formation shale in the Junlian block of the southern Sichuan Basin was investigated. The pore structure characteristics of this shale were examined via low-pressure CO adsorption (LP-COA) and low-temperature N adsorption (LT-NA) methods via field emission scanning electron microscopy (FE-SEM), shale geochemistry, and mineral composition analysis.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!