Activities

What We Do.

Research in information extraction of audio and speech signals. Determining the robust representations which can succintly capture speech characteristics while being invariant to noise. Learning models which describe the multiple realizations of the data. Understanding and modeling the auditory system.

Why We Do.

Fundamental research in this direction to develop novel algorithms for processing audio signals. Combining signal procesing and machine learning for real world applications.

Applications

Large Language Models

Multimodal emotion recognition

Emotion understanding

AI for Healthcare

Representation learning

Robust speech recognition

End-to-end modeling

EEG analysis for decoding speech perception

Speech synthesis