Google Scholar page can be found here
Thesis
A.Soman, "Investigating Neural Mechanisms of Word Learning and Speech Perception", Indian Institute of Science, April. 2024.
P.Singh, "Graph Clustering Approaches for Speaker Diarization of Conversational Speech", Indian Institute of Science, Feb. 2024.
Anurenjan P.R., "Dereverberation of Speech Using Autoregressive Models of Sub-band Envelopes", Indian Institute of Science, Sep. 2023.
S.Ramoji, "Supervised Learning Approaches for Language and Speaker Recognition", Indian Institute of Science, July 2023.
P. Agrawal, " Neural Representation Learning for Speech and Audio Signals", Indian Institute of Science, Jan. 2021.
S. Ganapathy, "Signal Analysis using Autoregressive Models of Amplitude Modulation ", Johns Hopkins University, Jan. 2012.
Tutorials, Keynotes, Defense and Colloquia
A.Soman, "Investigating Neural Mechanisms of Word Learning and Speech Perception", Thesis Defense Talk, February 2024. [Slides]
P.Singh, "Graph Clustering Approaches for Speaker Diarization of Conversational Speech", Thesis Defense Talk, February 2024. [Slides]
Anurenjan P.R., "Dereverberation of Speech Using Autoregressive Models of Sub-band Envelopes", Thesis Defense Talk, September 2023. [Slides]
P. Singh, "Graph Clustering approaches for Speaker Diarization of Conversational Speech", Thesis Colloquium Talk, July 2023.[Slides]
A. Soman, "Investigating Neural Mechanisms of Word Learning and Speech Perception", Thesis Colloquium Talk, July 2023.
S. Ramoji, "Supervised Approaches for Language and Speaker Recognition", Thesis Defense Talk, July 2023. [Slides]
P. Agrawal, "Neural Representation Learning for Speech and Audio Signals", Thesis Defense Talk, January 2021. [Video]
P. Agrawal, "Neural Representation Learning of Speech and Audio Signals", Thesis Colloquium Talk, July 2020. [Video] [Slides]
S. Ganapathy, "Speaker and Language Recognition - From Laboratory Technologies to the Wild", Invited Perspective Keynote Talk, Interspeech 2018.
S. Ganapathy and S. Thomas, "The Art and Science of Speech Feature Engineering", Interspeech, Singapore, Sept. 2014.
Journals
Debarpan Bhattacharya, Amir H. Poorjam, Deepak Mittal, Sriram Ganapathy, "Gradient-free Post-hoc Explainability Using Distillation Aided Learnable Approach ," IEEE Journal of Selected Topics in Signal Processing (JSTSP)- Special Series on AI in Signal & Data Science, 2024.
Baghel, Shikha, Shreyas Ramoji, Somil Jain, Pratik Roy Chowdhuri, Prachi Singh, Deepu Vijayasenan, and Sriram Ganapathy, "Summary of the DISPLACE challenge 2023-DIarization of SPeaker and LAnguage in Conversational Environments," Elsevier Speech Communication 161 (2024): 103080.
Varun Krishna, Tarun Sai, Sriram Ganapathy, "Representation Learning With Hidden Unit
Clustering For Low Resource Speech Applications," IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023.
Anurenjan Purushothaman, Debottam Dutta, Rohit Kumar, Sriram Ganapathy, "Speech Dereverberation with Frequency Domain Autoregressive Modeling," IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023.
S. R. Chetupalli, P. Krishnan, N.K.Sharma, A.Muguli, R.Kumar, V.Nanda, L.M.Pinto, P.K.Ghosh, S. Ganapathy, "Multi-Modal Point-of-Care Diagnostics for COVID-19 Based on Acoustics and Symptoms," IEEE Journal of Translational Engineering in Health and Medicine, 2023.
D. Bhattacharya, N. K. Sharma, D. Dutta, S. R. Chetupalli, P. Mote, S. Ganapathy, S. Nori, S. Gonuguntla, M. Alagesan, "Coswara: A respiratory sounds and symptoms dataset for remote screening of SARS-CoV-2 infection," Nature Scientific Data, 2023.
Ramoji, Shreyas, Prashant Krishnan, and Sriram Ganapathy, " PLDA inspired Siamese networks for speaker verification ," Computer Speech and Language, 2022.
Akshara Soman, Prathibha Ramachandran, and Sriram Ganapathy, " ERP Evidences of Rapid Semantic Learning In Foreign Language Word Comprehension ," Frontiers in Neuroscience, 2022. [Data]
Neeraj Kumar Sharma, Ananya Muguli, Prashant Krishnan, Rohit Kumar, Srikanth Raj Chetupalli, Sriram Ganapathy, " Towards sound based testing of COVID-19 -Summary of the first Diagnostics of COVID-19 using Acoustics (DiCOVA) Challenge ," Elsevier Journal on Computer, Speech and Language, 2022.
Purushothaman, A., Sreeram, A., Kumar, R., & Ganapathy, S, " Dereverberation of Autoregressive Envelopes for Far-field Speech Recognition ," Elsevier Journal on Computer, Speech and Language, 2021. [Code]
JR Katthi, S. Ganapathy, " Deep Correlation Analysis for Audio-EEG Decoding ," IEEE Transactions on Neural Systems and Rehabilitation Engineering, 2021. [Code]
P. Singh, S. Ganapathy, "Self-supervised Representation Learning With Path Integral Clustering For Speaker Diarization," IEEE Transactions and Audio, Speech and Language Processing, 2021. [Code]
N. Sharma, V. Krishnamohan, S. Ganapathy, A. Gangopadhyay, L. Fink "Acoustic and linguistic features influence talker change detection", Journal of Acoustic Society of America (JASA) - Express Letters, EL414, Oct. 2020.
P. Agrawal, S. Ganapathy, "Interpretable Representation Learning for Speech and Audio Signals Based on Relevance Weighting," IEEE Transactions and Audio, Speech and Language Processing, 2020. [Code]
Shareef Babu Kalluri, Deepu Vijayasenan and S. Ganapathy, "Automatic Speaker Profiling from Short Duration Speech Data", Elsevier Speech Communications, April 2020.
B. Padi, A. Mohan and S. Ganapathy, "Towards Relevance and Sequence Modeling in Language Recognition", IEEE Transactions on Audio, Speech and Language Processing, March, 2020.[Code]
S. Ramoji and S. Ganapathy, "Supervised I-vector Modeling for Language and Accent Recognition", Elsevier Journal on Computer, Speech and Language, Oct. 2019.
A. Kanagasundaram, S. Sridharan, S. Ganapathy and C. Fookes, "A Study on Pairwise LDA for X-vector based Speaker Recognition", IET Electronic Letters, (2019).
P. Agrawal and S. Ganapathy, "Modulation Filter Learning Using Deep Variational Networks for Robust Speech Recognition", IEEE Journal of Selected Topics in Signal Processing (J-STSP), Special Issue on Data Science: Machine Learning for Audio Signal Processing, April 2019. [Code]
A. Soman, Madhavan C. R., K. Sarkar, and S. Ganapathy, "An EEG Study On The Brain Representations in Language Learning", IOP Journal on Biomedical Physics and Engineering Express, 5(2), 25041, (2019).
N. Sharma, S. Ganesh, S. Ganapathy and L. Holt, "Talker change detection: A comparison of human and machine performance", Journal of Acoustical Society of America, December 2018. [Code]
V. S. Kadimesetty, S. Gutta, S. Ganapathy, and P. K. Yalavarthy, "Convolutional Neural Network based Robust Denoising of Low-Dose Computed Tomography Perfusion Maps", IEEE Transactions on Radiation and Plasma Medical Sciences, August 2018.
S. Gutta, V.S. Kadimesetty, S. K. Kalva, M. Pramanik, S. Ganapathy and P. K. Yalavarthy, "Deep Neural Network Based Bandwidth Enhancement of Photoacoustic Data", Journal of Biomedical Optics, October 2017.
G. Kocavs, L. Toth, D. V. Compernolle and S. Ganapathy , "Increasing the Robustness of CNN Acoustic Models using ARMA Spectrogram Features and Channel Dropout", Elsevier Pattern Recognition Letters, September 2017.
P. Agrawal and S. Ganapathy, "Unsupervised Modulation Filter Learning
for Noise-Robust Speech Recognition", Journal of Acoustical Society of America, Sept. 2017. [Code]
S. Ganapathy, "Multi-variate Autoregressive Spectrogram Modeling for Noisy Speech Recognition", IEEE Signal Processing Letters, July 2017.
S. Ganapathy and M. Omar, "Auditory Motivated Front-end for Noisy Speech Using Spectro-temporal Modulation Filtering", Journal of Acoustical Society of America, EL343-349, Vol. 136(5), Nov. 2014.
S. Ganapathy, H. Mallidi and H. Hermansky, "Robust Feature Extraction Using Modulation Filtering of Autoregressive Models", IEEE Transactions on Audio, Speech and Language Processing, Vol. 22(8), pp. 1285-1295, Aug. 2014.
S. Ganapathy and J. Pelecanos, "Enhancing Frequency Shifted Speech Signals in Single Side Band Communication", IEEE Signal Processing Letters, Vol. 20(12), pp. 1231-1234, Oct. 2013.
S. Ganapathy and H. Hermansky, "Temporal Resolution Analysis in Frequency Domain Linear Prediction", Journal of Acoustical Society of America, EL436-442, Vol. 132(5), Oct. 2012.
S. Ganapathy, S. Thomas and H. Hermansky, "Temporal envelope compensation for robust phoneme recognition using modulation spectrum ", Journal of Acoustical Society of America, Vol. 128(6), pp. 3769-3780, Dec. 2010.
S. Ganapathy, P. Motlicek and H. Hermansky, "Autoregressive Models Of Amplitude Modulations In Audio Compression", IEEE Transactions on Audio, Speech and Language Processing, Vol. 18(6), pp.1624-1631, Aug. 2010.
P. Motlicek, S. Ganapathy, H. Hermansky and H. Garudadri,"Wide-Band Audio Coding based on Frequency Domain Linear Prediction", EURASIP Journal on Audio, Speech, and Music Processing, Vol. 2010 (3), pp. 1-14, Jan. 2010.
S. Ganapathy, S. Thomas and H. Hermansky, "Modulation Frequency Features For Phoneme Recognition In Noisy Speech", Journal of Acoustical Society of America, EL8-12, Vol. 125(1), Jan. 2009.
S. Thomas, S. Ganapathy and H. Hermansky, "Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction", IEEE Signal Processing Letters, Vol. 15, pp. 681-684, Dec 2008.
Conferences
Darshan Prabhu, Abhishek Gupta, Omkar Nitsure, Preethi Jyothi, Sriram Ganapathy,"Improving Self-supervised Pre-training using Accent-Specific Codebooks", Interspeech 2024, Kos Island, Greece.
Bansal, Rachit, Bidisha Samanta, Siddharth Dalmia, Nitish Gupta, Sriram Ganapathy, Abhishek Bapna, Prateek Jain, and Partha Talukdar, "LLM Augmented LLMs: Expanding Capabilities through Composition.", In The Twelfth International Conference on Learning Representations (ICLR) 2024, Vienna Austria.
Shareef Babu Kalluri, P.Singh, P.R.Chowdhuri, A.Kulkarni, S.Baghel, P.Hegde, S.Sontakke, Deepak K T, S.R.M.Prasanna, D.Vijayasenan, S.Ganapathy, "The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environment", Interspeech 2024, Kos Island, Greece.
Soumya Dutta and Sriram Ganapathy, "Zero Shot Audio to Audio to Audio Emotion Transfer with Speaker Disentanglement",ICASSP 2024, Seoul, South Korea.[Code]
M.Thakkar, T.Bolukbasi, S.Ganapathy, S.Vashishth, S.Chandar, P.Talukdar, "Self-Influence Guided Data Reweighting for Language Model Pre-training",EMNLP 2023, Singapore.
D.Prabhu, P.Jyothi, S.Ganapathy, V.Unni, "Accented Speech Recognition With Accent-specific Codebooks",EMNLP 2023, Singapore.
A.Raj, S. Bharadwaj, S. Ganapathy, M. Ma, S.Vashishth, "MASR:Multi-Label Aware Speech Representation",IEEE ASRU 2023, Taiwan.
Varun Krishna and Sriram Ganapathy, "Pseudo-Label Based Supervised Contrastive Loss for Robust Speech Representations",IEEE ASRU 2023, Taiwan.
S. Vashishth, S. Bharadwaj, S. Ganapathy, A. Bapna, M. Ma, W. Han, V. Axelrod, P. Talukdar, "Label Aware Speech Representation Learning For Language Identification",Interspeech 2023, Dublin, Ireland.
Akshara Soman, Vidhi Sinha, and Sriram Ganapathy, "Enhancing the EEG Speech Match-Mismatch Tasks With Word Boundaries",Interspeech 2023, Dublin, Ireland.[Code]
S. Baghel, S. Ramoji, Sidharth, Ranjana H, P. Singh, S. Jain, P. R. Chowdhuri, K. Kulkarni, S. Padhi, D. Vijayasenan and S. Ganapathy, "DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments",Interspeech 2023, Dublin, Ireland.
Prachi Singh , Amrit Kaul and Sriram Ganapathy, "Supervised Hierarchical Clustering Using Graph Neural Networks for Speaker Diarization",ICASSP 2023, Rhodes Island, Greece.
Debottam Dutta , Debarpan Bhattacharya , Sriram Ganapathy, Amir H. Poorjam, Deepak Mittal, and Maneesh Singh, "Interpretable Acoustic Representation Learning on Breathing and Speech Signals for COVID-19 Detection",Interspeech 2022, Incheon, South Korea.
Debarpan Bhattacharya, Debottam Dutta, Neeraj Kumar Sharma , Srikanth Raj Chetupalli , Pravin Mote, Sriram Ganapathy , Chandrakiran C , Sahiti Nori , Suhail K K , Sadhana Gonuguntla , and Murali Alagesan, "Analyzing the impact of SARS-CoV-2 variants on respiratory sound signals",Interspeech 2022, Incheon, South Korea.
Debarpan Bhattacharya, Debottam Dutta, Neeraj Kumar Sharma , Srikanth Raj Chetupalli , Pravin Mote, Sriram Ganapathy , Chandrakiran C , Sahiti Nori , Suhail K K , Sadhana Gonuguntla , and Murali Alagesan, "Coswara: A website application enabling COVID-19 screening by analysing respiratory sound samples and health symptoms",Interspeech 2022, Incheon, South Korea.
Soumya Dutta and Sriram Ganapathy, "Multimodal Transformer with Learnable Frontend and Self Attention for Emotion Recognition", ICASSP 2022, Singapore.[Code]
Neeraj Kumar Sharma, Srikanth Raj Chetupalli, Debarpan Bhattacharya, Debottam Dutta, Pravin Mote, and Sriram Ganapathy, "The Second DiCOVA Challenge: Dataset and performance analysis for Diagnosis of COVID-19 using acoustics ", ICASSP 2022, Singapore.
Rohit Kumar, Anurenjan Purushothaman, Anirudh Sreeram, and Sriram Ganapathy, "End-to-end speech recognition with joint dereverberation of sub-band autoregressive envelopes",ICASSP 2022, Singapore.[Code]
Varun Krishna PS and Sriram Ganapathy, "Self Supervised Representation Learning with Deep Clustering for Acoustic Unit Discovery from Raw Speech", ICASSP 2022, Singapore. [Code]
P. Singh and S. Ganapathy, "Self-Supervised Metric Learning With Graph Clustering For Speaker Diarization", ASRU 2021, Cartagena.[Code]
A. Flavio, A. Poorjam, D. Mittal, C. Dognin, A. Muguli, R. Kumar, S. R. Chetupalli, S. Ganapathy and M. Singh, "Investigating the Feature Selection and Explainability of COVID-19 Diagnostics from Cough Sounds", Interspeech 2021, Brno, Czech Republic.
P. Singh, R. Varma, V. Krishnamohan, S. R. Chetupalli and S. Ganapathy, "LEAP Submission for the Third DIHARD Diarization Challenge", Interspeech 2021, Brno, Czech Republic.[Video][PPT]
P. R. Gudepu, R. Kumar, M. K. Jayesh, A. Purushothaman, S. Ganapathy and M. A. Basha, "SRIB-LEAP lab submission to Far-field Multi-Channel Speech Enhancement Challenge for Video Conferencing", Interspeech 2021, Brno, Czech Republic.
N. Ryant, P. Singh, V. Krishnamohan, R. Varma, K. Church, C. Cieri, J. Du, S. Ganapathy and M. Liberman, "The Third DIHARD Diarization Challenge", Interspeech 2021, Brno, Czech Republic.
A. Muguli, L. Pinto, R. Nirmala, N. Sharma, P. Krishnan, P. Ghosh, R. Kumar, S. Bhat, S. R. Chetupalli, S. Ganapathy, S. Ramoji and V. Nanda,"“DiCOVA Challenge: Dataset, task, and baseline system for COVID-19 diagnosis using acoustics”", Interspeech 2021, Brno, Czech Republic.
D. Dutta, P. Agrawal, and S. Ganapathy,"“A Multi-Head Relevance Weighting Framework for Learning Raw Waveform Audio Representations”, ", WASPAA 2021, New York, USA.
Shareef Babu Kalluri, Deepu Vijayasenan, Ganapathy, S., & Krishnan, P. ,"NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling", ICASSP 2021, Toronto.
Katthi, J. R., & Ganapathy, S.,"Deep Multiway Canonical Correlation Analysis For Multi-Subject Eeg Normalization.", ICASSP 2021, Toronto.
Basak, S., Agarwal, S., Ganapathy, S., & Takahashi, N.,"End-to-End Lyrics Recognition with Voice to Singing Style Transfer ", ICASSP 2021, Toronto.
P. Agrawal and S. Ganapathy,"Representation Learning For Speech Recognition Using Feedback Based Relevance Weighting", ICASSP 2021, Toronto, 2020.
N. Sharma, P. Krishnan, R. Kumar, S. Ramoji, S. R. Chetupalli, R. Nirmala, P. K. Ghosh and S. Ganapathy,"Coswara -- A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis ", Interspeech 2020, Beijing, October 2020 [Dataset]
S. Ramoji, P. Krishnan and S. Ganapathy,"Neural PLDA Modeling for End-to-end Speaker Verification", Interspeech 2020, Beijing, October 2020 [Code]
P. Agrawal and S. Ganapathy,"Robust Raw Waveform Speech Recognition Using Relevance Weighted Representations", Interspeech 2020, Beijing, October 2020.
V. Krishnamohan, A. Soman, A. Gupta and S. Ganapathy,"Audiovisual Correspondence Learning in Humans And Machines", Interspeech 2020, Beijing, October 2020.
A. Purushothaman, A. Sreeram, R. Kumar and S. Ganapathy,"Deep Learning Based Dereverberation of Temporal Envelopes for Robust Speech Recognition", Interspeech 2020, Beijing, October 2020.
P. Singh and S. Ganapathy,"Deep Self-Supervised Hierarchical Clustering for Speaker Diarization", Interspeech 2020, Beijing, October 2020.
S. R. Chetupalli and S. Ganapathy,"Context Dependent RNNLM for Automatic Transcription of Conversations", Interspeech 2020, Beijing, October 2020.
J. Reddy and S. Ganapathy,"Deep Canonical Correlation Analysis For Decoding The Auditory Brain", IEEE EMBC, Toronto, Canada, July 2020.
S. Ramoji, P. Krishnan, and S. Ganapathy,"NPLDA: A Deep Neural PLDA Model for Speaker Verification", Speaker Odyssey Workshop, November, 2020.
S. Ramoji, P. Krishnan, B. Mysore, P. Singh and S. Ganapathy,"LEAP System for SRE19 Challenge -- Improvements and Error Analysis", Speaker Odyssey Workshop, November, 2020.
N. Sharma, V. Krishnamohan, S. Ganapathy, A. Gangopadhayay and L. Fink "On The Impact of Language Familiarity In Talker Change Detection", ICASSP 2020.
A. Purushothaman, A. Sreeram and S. Ganapathy "3-D Feature and Acoustic Modeling for Far-Field Speech Recognition", ICASSP 2020.
R. Kumar, A. Sreeram, A. Purushothaman and S. Ganapathy "Unsupervised Neural Mask Estimator For Generalized Eigen-Value Beamforming Based ASR", ICASSP 2020.
N. Takahashi, M. Singh, S. Basak, P. Sudarsanam, S. Ganapathy, Y. Mitsufuji "Improving Voice Separation by Incorporating End-to-end Speech Recognition", ICASSP 2020.
K. Praveen, A. Gupta, A. Soman and S. Ganapathy "Second Language Transfer Learning in Humans and Machines Using Image Supervision", IEEE ASRU, Dec. 2019.
S. Bansal, K. Malhotra, S. Ganapathy"Speaker and Language Aware Training for End-to-End ASR", IEEE ASRU, Dec. 2019.
N. Ryant, K. Church, C. Cieri, A. Cristia, J. Du, S. Ganapathy and M. Liberman, "The Second DIHARD Diarization Challenge: Dataset - task - and baselines", INTERSPEECH, Sept. 2019, Austria.
P. Singh, Harsha Vardhan M A, S. Ganapathy and A. Kanagasundaram, "LEAP Diarization System for the Second DIHARD Challenge", INTERSPEECH, Sept. 2019, Austria.
B. Padi, A. Mohan and S. Ganapathy, "Attention based Hybrid I-vector BLSTM Model for Language Recognition", INTERSPEECH, Sept. 2019, Austria.
K. Malhotra, S. Bansal and S. Ganapathy, "Active Learning Methods for Low Resource End-To-End Automatic Speech Recognition", INTERSPEECH, Sept. 2019, Austria.
P. Agrawal and S. Ganapathy, "Unsupervised Raw Waveform Representation Learning for ASR", INTERSPEECH, Sept. 2019, Austria.
A. Kanagasundaram, S. Sridharan, S. Ganapathy and P. Singh, "A Study of X-vector Based Speaker Recognition on Short Utterances", INTERSPEECH, Sept. 2019, Austria.
S. Ramoji, A. Mohan, B. Mysore, A. Bhatia, P. Singh, Harsha Vardhan M A and S. Ganapathy, "The LEAP Speaker Recognition System for NIST SRE 2018 Challenge", ICASSP, 2019.
N. Sharma, S. Ganesh, S. Ganapathy and L. Holt, "Analyzing human reaction time for talker change detection", ICASSP, 2019.[Code]
P. Agrawal and S. Ganapathy, "Deep variational filter learning models for speech recognition", ICASSP, 2019.
Shareef Babu Kalluri, Deepu Vijayasenan and S. Ganapathy, "A Deep Neural Network Based End-to-End Model for Joint Height And Age Estimation From Short Duration Speech", ICASSP, 2019.
B. Padi, A. Mohan and S. Ganapathy, "End-to-end language recognition using attention based hierarchical gated recurrent unit models", ICASSP, 2019.[Code]
S. Ramoji and S. Ganapathy, "Supervised i-vector modeling - Theory and Applications", INTERSPEECH, 2018.
P. Agrawal and S. Ganapathy, "Comparison of unsupervised modulation filter learning methods for ASR", INTERSPEECH, 2018.
N. Takahashi, P. Agrawal, N. Goswami and Y. Mitsufuji, "PhaseNet: Discretized phase modeling with deep neural networks for audio source separation", INTERSPEECH, 2018.
A. Cristia, S. Ganesh, M. Casillas and S. Ganapathy, "Talker diarization in the wild: The case of child-centered daylong audio-recordings", INTERSPEECH, 2018.
R. Kumar, V. Yeruva and S. Ganapathy, "On Convolutional LSTM Modeling for Joint Wake-Word Detection and Text Dependent Speaker Verification", INTERSPEECH, 2018.
S. Ganapathy and M. Harish, "Far-Field Speech Recognition Using Multivariate Autoregressive Models", INTERSPEECH, 2018.
B. Padi, S. Ramoji, V. Yeruva, S. Kumar and S. Ganapathy, "The LEAP Language Recognition System for LRE 2017 Challenge - Improvements and Error Analysis", Odyssey: The speaker and language recognition workshop, 2018.
N. Sajjan, S. Ganesh, N. Sharma, S. Ganapathy and N. Ryant "Leveraging LSTM Models for Overlap Detection in Multi-Party Meetings", ICASSP, Calgary Canada, April 2018.
S. Ganapathy and V. Peddinti "3-D CNN Models for Far-Field Multi-Channel Speech Recognition", ICASSP, Calgary Canada, April 2018.
N. Ryant et al. "Enhancement and Analysis of Conversational Speech: JSALT 2017", ICASSP, Calgary Canada, April 2018.
Ansari T, R. Kumar, S. Singh and S. Ganapathy "Unsupervised HMM Posteriograms for Language Independent Acoustic Modeling in Zero Resource Conditions", IEEE ASRU, Dec. 2017.
Ansari T, R. Kumar, S. Singh, S. Ganapathy "Deep Learning Methods For Unsupervised Acoustic Modeling - LEAP Submission to ZeroSpeech Challenge 2017", IEEE ASRU, Dec. 2017.
A. Siddhant, P. Jyothi and S. Ganapathy "Leveraging Native Language Speech For Accent Identfication Using Deep Siamese Networks", IEEE ASRU, Dec. 2017.
P. Agrawal and S. Ganapathy "Speech representation learning using unsupervised data-driven modulation filtering for robust ASR", Interspeech, Stockholm, Sweden, Aug. 2017.
N. Kumar, R. K. Das, S. Jelil, Dhanush B K, H. Kashyap, K. S. R. Murthy, S. Ganapathy, R. Sinha and S. R. M. Prasanna "IITG-Indigo system for NIST 2016 SRE challenge", Interspeech, Stockholm, Sweden, Aug. 2017.
Dhanush B, Suparna S., Aarthy R., Likhita C., Shashank D., Harish H. and S. Ganapathy "Factor Analysis Methods for Joint Speaker Verification and Spoof Detection", ICASSP, New Orleans, USA, 2017.
S. Sadjadi, J. Pelecanos and S. Ganapathy "The IBM Speaker Recognition System: Recent Advances and Error Analysis", Interspeech, San Francisco, September, 2016.
D. Dimitriadis, S. Thomas and S. Ganapathy, "An investigation on the use of ivectors for improved ASR robustness", Interspeech, San Francisco, Sept. 2016.
S. Sadjadi, S. Ganapathy and J. Pelecanos, "The IBM 2016 Speaker Recognition System", Odyssey, Spain, June, 2016.
S. Sadjadi, S. Ganapathy and J. Pelecanos, "Speaker Age Estimation On Conversational Telephone Speech Using Senone Posterior Based I-vectors", ICASSP, Shanghai, March, 2016.
S. Ganapathy, S. Thomas, D. Dimitriadis, S. Rennie "Investigating Factor Analysis Features for Deep Neural Networks In Noisy Speech Recognition", Interspeech, Dresden, Germany, Sept. 2015.
S. Ganapathy, "Robust Speech Processing Using ARMA Spectrograms", ICASSP, Brisbane, April, 2015.
S. Sadjadi, J. Pelecanos and S. Ganapathy, "Nearest Neighbor Discriminant Analysis for Language Recognition", ICASSP, Brisbane, April, 2015.
S. Ganapathy, K. J. Han, S. Thomas, M. Omar, M. V. Segbroeck and S. Narayanan, ""Robust Language Identification Using Convolutional Neural Networks", Interspeech, Singapore, Sept. 2014.
M. Omar and S. Ganapathy, "Shift-Invariant Features for Speech Activity Detection in Adverse Radio-Frequency Channel Conditions", ICASSP, Florence, Italy, May, 2014.
K. J. Han, S. Ganapathy, M Li, M. Omar and S. Narayanan, "Analyzing Convolutional Neural Networks for Speech Activity Detection in Mismatched Acoustic Conditions", ICASSP, Florence, Italy, May, 2014.
G. Saon, S. Thomas, H. Soltau, S. Ganapathy and B. Kingsbury, "The IBM Speech Activity Detection System for the DARPA RATS Program", Interspeech, Lyon, Aug. 2013.
K. J. Han, S. Ganapathy, M Li, M. Omar and S. Narayan, "TRAP Language Identification System for RATS Phase II Evaluation", Interspeech, Lyon, Aug. 2013.
H. Mallidi, S. Ganapathy and H. Hermansky, "Robust Speaker Recognition Using Spectro-Temporal Autoregressive Models", Interspeech, Lyon, Aug. 2013.
S. Ganapathy, M. Omar and J. Pelecanos, "Unsupervised Channel Adaptation For Language Identification Using Co-training", ICASSP, Vancouver, May, 2013.
S. Ganapathy, M. Omar and J. Pelecanos, "Noisy Channel Adaptation in Language Identification", IEEE SLT, Miami, Dec, 2012.
S. Ganapathy and H. Hermansky, "Robust Phoneme Recognition Using High Resolution Temporal Envelopes", Interspeech, Portland, Sept. 2012.
S. Thomas, S. Ganapathy, A. Jansen and H. Hermansky, "Data-driven Posterior Features for Low Resource Speech Recognition Applications", Interspeech, Portland, Sept. 2012.
S. Ganapathy, S. Thomas and H. Hermansky, "Feature Extraction Using 2-D Autoregressive Models For Speaker Recognition", ISCA Speaker Odyssey, June 2012.
S. Thomas, H. Mallidi, S. Ganapathy and H. Hermansky, "Adaptation Transforms of Auto-Associative Neural Networks as Features for Speaker Verification", ISCA Speaker Odyssey, June 2012.
D. Gomero et al. "The UMD-JHU 2011 Speaker Recognition System", ICASSP, Japan, Mar. 2012.
S. Thomas, S. Ganapathy and H. Hermansky, "Multilingual MLP Features For Low-resource LVCSR Systems", ICASSP, Japan, Mar. 2012.
S. Ganapathy, P. Rajan and H. Hermansky, "Multi-layer Perceptron Based Speech Activity Detection for Speaker Verification", IEEE WASPAA, Oct. 2011.
H. Mallidi, S. Ganapathy and H. Hermansky, "Modulation spectrum analysis for recognition of reverberant speech", Interspeech, Italy, Aug. 2011.
S. Ganapathy, J. Pelecanos and M. Omar, "Feature Normalization for Speaker Verification in Room Reverberation", ICASSP, Prague, May 2011.
S. Garimella, S. Ganapathy and H. Hermansky, "Sparse Auto-associative Neural Networks: Theory and Application to Speech Recognition", Interspeech, Japan, Sept. 2010.
S. Thomas, S. Ganapathy and H. Hermansky, "Cross-lingual and Multi-stream Posterior Features for Low-resource LVCSR Systems", Proc. of Interspeech, Japan, Sept. 2010.
S. Thomas, K. Patil, S. Ganapathy, N. Mesgarani, H. Hermansky, "A Phoneme Recognition Framework based on Auditory Spectro-Temporal Receptive Fields", Proc. of Interspeech, Japan, Sept. 2010.
S. Ganapathy, S. Thomas and H. Hermansky, "Robust Spectro-Temporal Features Based on Autoregressive Models of Hilbert Envelopes", ICASSP, Dallas, USA, March 2010.
S. Ganapathy, S. Thomas and H. Hermansky, "Comparison of Modulation Features For Phoneme Recognition", ICASSP, Dallas, USA, March 2010.
S. Ganapathy, S. Thomas, and H. Hermansky, "Temporal Envelope Subtraction for Robust Speech Recognition Using Modulation Spectrum", IEEE ASRU, 2009.
S. Ganapathy, S. Thomas, P. Motlicek and H. Hermansky, "Applications of Signal Analysis Using Autoregressive Models for Amplitude Modulation", IEEE WASPAA 2009.
S. Ganapathy, S. Thomas and H. Hermansky, "Static and Dynamic Modulation Spectrum for Speech Recognition", Proc. of Interspeech, Brighton, UK, Sept. 2009.
S. Thomas, S. Ganapathy and H. Hermansky, "Tandem Representations of Spectral Envelope and Modulation Frequency Features for ASR", Proc. of Interspeech, Brighton, UK, Sept. 2009.
S. Thomas, S. Ganapathy and H. Hermansky, "Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features", ICASSP, Taiwan, April 2009.
S. Ganapathy, S. Thomas and H. Hermansky, "Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction", Proc. of INTERSPEECH, Brisbane, Australia, Sep 2008.
S. Thomas, S. Ganapathy and H. Hermansky, "Hilbert Envelope Based Specto-Temporal Features for Phoneme Recognition in Telephone Speech", Proc. of INTERSPEECH, Brisbane, Australia, Sep 2008.
S. Ganapathy, P. Motlicek, H. Hermansky and H. Garudadri, "Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in
Spectral Domain", Proc. of INTERSPEECH, Brisbane, Australia, Sep 2008.
P. Motlicek, S. Ganapathy, H. Hermansky, H. Garudadri and Marios
Athineos, "Perceptually motivated Sub-band Decomposition for FDLP Audio Coding", in
Lecture Notes In Artificial Intelligence, Springer-Verlag Berlin, Heidelberg, 2008.
S. Thomas, S. Ganapathy and H. Hermansky, "Spectro-Temporal Features
for Automatic Speech Recognition using Linear Prediction in Spectral Domain",
Proc. of EUSIPCO, Lausanne, Switzerland, Aug 2008.
S. Ganapathy, P. Motlicek, H. Hermansky and H. Garudadri,
"Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding", AES 124th Convention, Audio Engineering Society, May 2008.
S. Ganapathy, P. Motlicek, H. Hermansky and H. Garudadri, ""Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction", Proc. of ICASSP, April 2008.
S. Thomas, S. Ganapathy and H. Hermansky, "Hilbert Envelope Based Features for Far-Field Speech Recognition", Lecture Notes in Computer Science, Springer Berlin, Heidelberg 2008.
P. Motlicek, H. Hermansky, S. Ganapathy and H. Garudadri, "Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding", Lecture Notes in Computer Science, Springer Berlin, Heidelberg 2007.
P. Motlicek, H. Hermansky, S. Ganapathy and H. Garudadri, "Non-
Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes", Lecture Notes in Computer Science, Springer Berlin, Heidelberg
2007.
Patents
"Adaptive System Combination in Language Recognition", Oct. 2018.
"Spectral Noise Shaping in Audio Coding Based on Spectral Dynamics in Frequency Sub-bands", Nov. 2011.
Temporal Masking in Audio Coding Based on Spectral Dynamics in Frequency Sub-bands", Aug. 2009.