Publications by authors named "Prithwijit Guha"

Simultaneous speech of multiple speakers is known as overlapped speech, which causes problems for speech recognition and speaker diarization systems. The present work uses previously less utilized signal phase information in the task of overlapped speech detection. In this context, Instantaneous Frequency Cosine Coefficient (IFCC) and Modified Group Delay Cepstral Coefficient (MGDCC) features are explored.

View Article and Find Full Text PDF

Discrimination between shouted and normal speech is an essential prerequisite for many speech processing applications. Existing works have established that excitation source information plays a significant role in shouted speech production. In speech processing literature, various features have been proposed to model different aspects of the excitation source.

View Article and Find Full Text PDF