Publications by authors named "Sishir Kalita"

Imprecise articulation is the major issue reported in various types of dysarthria. Detection of articulation errors can help in diagnosis. The cues derived from both the burst and the formant transitions contribute to the discrimination of place of articulation of stops.

View Article and Find Full Text PDF

Assessment of intelligibility is required to characterize the overall speech production capability and to measure the speech outcome of different interventions for individuals with cleft lip and palate (CLP). Researchers have found that articulation error and hypernasality have a significant effect on the degradation of CLP speech intelligibility. Motivated by this finding, the present work proposes an objective measure of sentence-level intelligibility by combining the information of articulation deficits and hypernasality.

View Article and Find Full Text PDF

The present work explores the acoustic characteristics of articulatory deviations near g(lottis) landmarks to derive the correlates of cleft lip and palate speech intelligibility. The speech region around the g landmark is used to compute two different acoustic features, namely, two-dimensional discrete cosine transform based joint spectro-temporal features, and Mel-frequency cepstral coefficients. Sentence-specific acoustic models are built using these features extracted from the normal speakers' group.

View Article and Find Full Text PDF

Intelligibility is considered as one of the primary measures for speech rehabilitation of individuals with a cleft lip and palate (CLP). Currently, speech processing and machine-learning-based objective methods are gaining more research interest as a way to quantify speech intelligibility. In this work, joint spectro-temporal features computed from a time-frequency representation of speech are explored to derive speech representations based on Gaussian posteriograms.

View Article and Find Full Text PDF

In this paper, acoustic analysis of misarticulated trills in cleft lip and palate speakers is carried out using excitation source based features: strength of excitation and fundamental frequency, derived from zero-frequency filtered signal, and vocal tract system features: first formant frequency (F1) and trill frequency, derived from the linear prediction analysis and autocorrelation approach, respectively. These features are found to be statistically significant while discriminating normal from misarticulated trills. Using acoustic features, dynamic time warping based trill misarticulation detection system is demonstrated.

View Article and Find Full Text PDF