Publications


Thesis

  • S. Ganapathy, "Signal Analysis using Autoregressive Models of Amplitude Modulation", Johns Hopkins University, Jan. 2012.

Tutorials

  • S. Ganapathy and S. Thomas, "The Art and Science of Speech Feature Engineering", Interspeech, Singapore, Sept. 2014.

Journals

  • S. Gutta, V. S. Kadimesetty, S. K. Kalva, M. Pramanik, S. Ganapathy and P. K. Yalavarthy, "Deep Neural Network Based Bandwidth Enhancement of Photoacoustic Data", Journal of Biomedical Optics, October 2017.

  • G. Kovacs, L. Toth, D. V. Compernolle and S. Ganapathy, "Increasing the Robustness of CNN Acoustic Models using ARMA Spectrogram Features and Channel Dropout", Elsevier Pattern Recognition Letters, September 2017.

  • P. Agrawal and S. Ganapathy, "Unsupervised Modulation Filter Learning for Noise-Robust Speech Recognition", Journal of Acoustical Society of America, September 2017. [Code]

  • S. Ganapathy, "Multi-variate Autoregressive Spectrogram Modeling for Noisy Speech Recognition", IEEE Signal Processing Letters, July 2017.

  • S. Ganapathy and M. Omar, "Auditory Motivated Front-end for Noisy Speech Using Spectro-temporal Modulation Filtering", Journal of Acoustical Society of America, EL343-349, Vol. 136(5), Nov. 2014.

  • S. Ganapathy, H. Mallidi and H. Hermansky, "Robust Feature Extraction Using Modulation Filtering of Autoregressive Models", IEEE Transactions on Audio, Speech and Language Processing, Vol. 22(8), pp. 1285-1295, Aug. 2014.

  • S. Ganapathy and J. Pelecanos, "Enhancing Frequency Shifted Speech Signals in Single Side Band Communication", IEEE Signal Processing Letters, Vol. 20(12), pp. 1231-1234, Oct. 2013.

  • S. Ganapathy and H. Hermansky, "Temporal Resolution Analysis in Frequency Domain Linear Prediction", Journal of Acoustical Society of America, EL436-442, Vol. 132(5), Oct. 2012.

  • S. Ganapathy, S. Thomas and H. Hermansky, "Temporal envelope compensation for robust phoneme recognition using modulation spectrum", Journal of Acoustical Society of America, Vol. 128(6), pp. 3769-3780, Dec. 2010.

  • S. Ganapathy, P. Motlicek and H. Hermansky, "Autoregressive Models Of Amplitude Modulations In Audio Compression", IEEE Transactions on Audio, Speech and Language Processing, Vol. 18(6), pp. 1624-1631, Aug. 2010.

  • P. Motlicek, S. Ganapathy, H. Hermansky and H. Garudadri, "Wide-Band Audio Coding based on Frequency Domain Linear Prediction", EURASIP Journal on Audio, Speech, and Music Processing, Vol. 2010 (3), pp. 1-14, Jan. 2010.

  • S. Ganapathy, S. Thomas and H. Hermansky, "Modulation Frequency Features For Phoneme Recognition In Noisy Speech", Journal of Acoustical Society of America, EL8-12, Vol. 125(1), Jan. 2009.

  • S. Thomas, S. Ganapathy and H. Hermansky, "Recognition Of Reverberant Speech Using Frequency Domain Linear Prediction", IEEE Signal Processing Letters, Vol. 15, pp. 681-684, Dec 2008.

Conferences

  • N. Sajjan, S. Ganesh, N. Sharma, S. Ganapathy and N. Ryant, "Leveraging LSTM Models for Overlap Detection in Multi-Party Meetings", ICASSP, Calgary, Canada, April 2018.

  • S. Ganapathy and V. Peddinti, "3-D CNN Models for Far-Field Multi-Channel Speech Recognition", ICASSP, Calgary, Canada, April 2018.

  • N. Ryant et al., "Enhancement and Analysis of Conversational Speech: JSALT 2017", ICASSP, Calgary, Canada, April 2018.

  • T. Ansari, R. Kumar, S. Singh and S. Ganapathy, "Unsupervised HMM Posteriograms for Language Independent Acoustic Modeling in Zero Resource Conditions", IEEE ASRU, Dec. 2017.

  • T. Ansari, R. Kumar, S. Singh and S. Ganapathy, "Deep Learning Methods For Unsupervised Acoustic Modeling - LEAP Submission to ZeroSpeech Challenge 2017", IEEE ASRU, Dec. 2017.

  • A. Siddhant, P. Jyothi and S. Ganapathy, "Leveraging Native Language Speech For Accent Identification Using Deep Siamese Networks", IEEE ASRU, Dec. 2017.

  • P. Agrawal and S. Ganapathy, "Speech representation learning using unsupervised data-driven modulation filtering for robust ASR", Interspeech, Stockholm, Sweden, Aug. 2017.

  • N. Kumar, R. K. Das, S. Jelil, Dhanush B K, H. Kashyap, K. S. R. Murthy, S. Ganapathy, R. Sinha and S. R. M. Prasanna, "IITG-Indigo system for NIST 2016 SRE challenge", Interspeech, Stockholm, Sweden, Aug. 2017.

  • Dhanush B, Suparna S., Aarthy R., Likhita C., Shashank D., Harish H. and S. Ganapathy, "Factor Analysis Methods for Joint Speaker Verification and Spoof Detection", ICASSP, New Orleans, USA, March 2017.

  • S. Sadjadi, J. Pelecanos and S. Ganapathy, "The IBM Speaker Recognition System: Recent Advances and Error Analysis", Interspeech, San Francisco, September, 2016.

  • D. Dimitriadis, S. Thomas and S. Ganapathy, "An investigation on the use of ivectors for improved ASR robustness", Interspeech, San Francisco, Sept. 2016.

  • S. Sadjadi, S. Ganapathy and J. Pelecanos, "The IBM 2016 Speaker Recognition System", Odyssey, Spain, June, 2016.

  • S. Sadjadi, S. Ganapathy and J. Pelecanos, "Speaker Age Estimation On Conversational Telephone Speech Using Senone Posterior Based I-vectors", ICASSP, Shanghai, March, 2016.

  • S. Ganapathy, S. Thomas, D. Dimitriadis and S. Rennie, "Investigating Factor Analysis Features for Deep Neural Networks In Noisy Speech Recognition", Interspeech, Dresden, Germany, Sept. 2015.

  • S. Ganapathy, "Robust Speech Processing Using ARMA Spectrograms", ICASSP, Brisbane, April, 2015.

  • S. Sadjadi, J. Pelecanos and S. Ganapathy, "Nearest Neighbor Discriminant Analysis for Language Recognition", ICASSP, Brisbane, April, 2015.

  • S. Ganapathy, K. J. Han, S. Thomas, M. Omar, M. V. Segbroeck and S. Narayanan, "Robust Language Identification Using Convolutional Neural Networks", Interspeech, Singapore, Sept. 2014.

  • M. Omar and S. Ganapathy, "Shift-Invariant Features for Speech Activity Detection in Adverse Radio-Frequency Channel Conditions", ICASSP, Florence, Italy, May, 2014.

  • K. J. Han, S. Ganapathy, M. Li, M. Omar and S. Narayanan, "Analyzing Convolutional Neural Networks for Speech Activity Detection in Mismatched Acoustic Conditions", ICASSP, Florence, Italy, May, 2014.

  • G. Saon, S. Thomas, H. Soltau, S. Ganapathy and B. Kingsbury, "The IBM Speech Activity Detection System for the DARPA RATS Program", Interspeech, Lyon, Aug. 2013.

  • K. J. Han, S. Ganapathy, M. Li, M. Omar and S. Narayanan, "TRAP Language Identification System for RATS Phase II Evaluation", Interspeech, Lyon, Aug. 2013.

  • H. Mallidi, S. Ganapathy and H. Hermansky, "Robust Speaker Recognition Using Spectro-Temporal Autoregressive Models", Interspeech, Lyon, Aug. 2013.

  • S. Ganapathy, M. Omar and J. Pelecanos, "Unsupervised Channel Adaptation For Language Identification Using Co-training", ICASSP, Vancouver, May, 2013.

  • S. Ganapathy, M. Omar and J. Pelecanos, "Noisy Channel Adaptation in Language Identification", IEEE SLT, Miami, Dec, 2012.

  • S. Ganapathy and H. Hermansky, "Robust Phoneme Recognition Using High Resolution Temporal Envelopes", Interspeech, Portland, Sept. 2012.

  • S. Thomas, S. Ganapathy, A. Jansen and H. Hermansky, "Data-driven Posterior Features for Low Resource Speech Recognition Applications", Interspeech, Portland, Sept. 2012.

  • S. Ganapathy, S. Thomas and H. Hermansky, "Feature Extraction Using 2-D Autoregressive Models For Speaker Recognition", ISCA Speaker Odyssey, June 2012.

  • S. Thomas, H. Mallidi, S. Ganapathy and H. Hermansky, "Adaptation Transforms of Auto-Associative Neural Networks as Features for Speaker Verification", ISCA Speaker Odyssey, June 2012.

  • D. Garcia-Romero et al., "The UMD-JHU 2011 Speaker Recognition System", ICASSP, Japan, Mar. 2012.

  • S. Thomas, S. Ganapathy and H. Hermansky, "Multilingual MLP Features For Low-resource LVCSR Systems", ICASSP, Japan, Mar. 2012.

  • S. Ganapathy, P. Rajan and H. Hermansky, "Multi-layer Perceptron Based Speech Activity Detection for Speaker Verification", IEEE WASPAA, Oct. 2011.

  • H. Mallidi, S. Ganapathy and H. Hermansky, "Modulation spectrum analysis for recognition of reverberant speech", Interspeech, Italy, Aug. 2011.

  • S. Ganapathy, J. Pelecanos and M. Omar, "Feature Normalization for Speaker Verification in Room Reverberation", ICASSP, Prague, May 2011.

  • S. Garimella, S. Ganapathy and H. Hermansky, "Sparse Auto-associative Neural Networks: Theory and Application to Speech Recognition", Interspeech, Japan, Sept. 2010.

  • S. Thomas, S. Ganapathy and H. Hermansky, "Cross-lingual and Multi-stream Posterior Features for Low-resource LVCSR Systems", Proc. of Interspeech, Japan, Sept. 2010.

  • S. Thomas, K. Patil, S. Ganapathy, N. Mesgarani, H. Hermansky, "A Phoneme Recognition Framework based on Auditory Spectro-Temporal Receptive Fields", Proc. of Interspeech, Japan, Sept. 2010.

  • S. Ganapathy, S. Thomas and H. Hermansky, "Robust Spectro-Temporal Features Based on Autoregressive Models of Hilbert Envelopes", ICASSP, Dallas, USA, March 2010.

  • S. Ganapathy, S. Thomas and H. Hermansky, "Comparison of Modulation Features For Phoneme Recognition", ICASSP, Dallas, USA, March 2010.

  • S. Ganapathy, S. Thomas, and H. Hermansky, "Temporal Envelope Subtraction for Robust Speech Recognition Using Modulation Spectrum", IEEE ASRU, 2009.

  • S. Ganapathy, S. Thomas, P. Motlicek and H. Hermansky, "Applications of Signal Analysis Using Autoregressive Models for Amplitude Modulation", IEEE WASPAA 2009.

  • S. Ganapathy, S. Thomas and H. Hermansky, "Static and Dynamic Modulation Spectrum for Speech Recognition", Proc. of Interspeech, Brighton, UK, Sept. 2009.

  • S. Thomas, S. Ganapathy and H. Hermansky, "Tandem Representations of Spectral Envelope and Modulation Frequency Features for ASR", Proc. of Interspeech, Brighton, UK, Sept. 2009.

  • S. Thomas, S. Ganapathy and H. Hermansky, "Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features", ICASSP, Taiwan, April 2009.

  • S. Ganapathy, S. Thomas and H. Hermansky, "Front-end for Far-field Speech Recognition based on Frequency Domain Linear Prediction", Proc. of INTERSPEECH, Brisbane, Australia, Sep 2008.

  • S. Thomas, S. Ganapathy and H. Hermansky, "Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech", Proc. of INTERSPEECH, Brisbane, Australia, Sep 2008.

  • S. Ganapathy, P. Motlicek, H. Hermansky and H. Garudadri, "Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain", Proc. of INTERSPEECH, Brisbane, Australia, Sep 2008.

  • P. Motlicek, S. Ganapathy, H. Hermansky, H. Garudadri and M. Athineos, "Perceptually motivated Sub-band Decomposition for FDLP Audio Coding", in Lecture Notes In Artificial Intelligence, Springer-Verlag Berlin, Heidelberg, 2008.

  • S. Thomas, S. Ganapathy and H. Hermansky, "Spectro-Temporal Features for Automatic Speech Recognition using Linear Prediction in Spectral Domain", Proc. of EUSIPCO, Lausanne, Switzerland, Aug 2008.

  • S. Ganapathy, P. Motlicek, H. Hermansky and H. Garudadri, "Autoregressive Modelling of Hilbert Envelopes for Wide-band Audio Coding", AES 124th Convention, Audio Engineering Society, May 2008.

  • S. Ganapathy, P. Motlicek, H. Hermansky and H. Garudadri, "Temporal Masking for Bit-rate Reduction in Audio Codec Based on Frequency Domain Linear Prediction", Proc. of ICASSP, April 2008.

  • S. Thomas, S. Ganapathy and H. Hermansky, "Hilbert Envelope Based Features for Far-Field Speech Recognition", Lecture Notes in Computer Science, Springer Berlin, Heidelberg 2008.

  • P. Motlicek, H. Hermansky, S. Ganapathy and H. Garudadri, "Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding", Lecture Notes in Computer Science, Springer Berlin, Heidelberg 2007.

  • P. Motlicek, H. Hermansky, S. Ganapathy and H. Garudadri, "Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes", Lecture Notes in Computer Science, Springer Berlin, Heidelberg 2007.

Patents

  • "Method for System Combination in Audio Analytics Applications", Filed July 2015.

  • "Spectral Noise Shaping in Audio Coding Based on Spectral Dynamics in Frequency Sub-bands", Nov. 2011.

  • "Temporal Masking in Audio Coding Based on Spectral Dynamics in Frequency Sub-bands", Aug. 2009.

Publications By Year

    2018

  • N. Sajjan, S. Ganesh, N. Sharma, S. Ganapathy and N. Ryant, "Leveraging LSTM Models for Overlap Detection in Multi-Party Meetings", ICASSP, Calgary, Canada, April 2018.

  • S. Ganapathy and V. Peddinti, "3-D CNN Models for Far-Field Multi-Channel Speech Recognition", ICASSP, Calgary, Canada, April 2018.

  • N. Ryant et al., "Enhancement and Analysis of Conversational Speech: JSALT 2017", ICASSP, Calgary, Canada, April 2018.

    2017

  • T. Ansari, R. Kumar, S. Singh and S. Ganapathy, "Unsupervised HMM Posteriograms for Language Independent Acoustic Modeling in Zero Resource Conditions", IEEE ASRU, Dec. 2017.

  • T. Ansari, R. Kumar, S. Singh and S. Ganapathy, "Deep Learning Methods For Unsupervised Acoustic Modeling - LEAP Submission to ZeroSpeech Challenge 2017", IEEE ASRU, Dec. 2017.

  • A. Siddhant, P. Jyothi and S. Ganapathy, "Leveraging Native Language Speech For Accent Identification Using Deep Siamese Networks", IEEE ASRU, Dec. 2017.

  • S. Gutta, V. S. Kadimesetty, S. K. Kalva, M. Pramanik, S. Ganapathy and P. K. Yalavarthy, "Deep Neural Network Based Bandwidth Enhancement of Photoacoustic Data", Journal of Biomedical Optics, October 2017.

  • G. Kovacs, L. Toth, D. V. Compernolle and S. Ganapathy, "Increasing the Robustness of CNN Acoustic Models using ARMA Spectrogram Features and Channel Dropout", Elsevier Pattern Recognition Letters, September 2017.

  • P. Agrawal and S. Ganapathy, "Unsupervised Modulation Filter Learning for Noise-Robust Speech Recognition", Journal of Acoustical Society of America, September 2017. [Code]

  • P. Agrawal and S. Ganapathy, "Speech representation learning using unsupervised data-driven modulation filtering for robust ASR", Interspeech, Stockholm, Sweden, Aug. 2017.

  • N. Kumar, R. K. Das, S. Jelil, Dhanush B K, H. Kashyap, K. S. R. Murthy, S. Ganapathy, R. Sinha and S. R. M. Prasanna, "IITG-Indigo system for NIST 2016 SRE challenge", Interspeech, Stockholm, Sweden, Aug. 2017.

  • S. Ganapathy, "Multi-variate Autoregressive Spectrogram Modeling for Noisy Speech Recognition", IEEE Signal Processing Letters, July 2017.

  • Dhanush B, Suparna S., Aarthy R., Likhita C., Shashank D., Harish H. and S. Ganapathy, "Factor Analysis Methods for Joint Speaker Verification and Spoof Detection", ICASSP, New Orleans, USA, March 2017.

    2016

  • S. Sadjadi, J. Pelecanos and S. Ganapathy, "The IBM Speaker Recognition System: Recent Advances and Error Analysis", Interspeech, San Francisco, September, 2016.

  • D. Dimitriadis, S. Thomas and S. Ganapathy, "An investigation on the use of ivectors for improved ASR robustness", Interspeech, San Francisco, Sept. 2016.

  • S. Sadjadi, S. Ganapathy and J. Pelecanos, "The IBM 2016 Speaker Recognition System", Odyssey, Spain, June, 2016.

  • S. Sadjadi, S. Ganapathy and J. Pelecanos, "Speaker Age Estimation On Conversational Telephone Speech Using Senone Posterior Based I-vectors", ICASSP, Shanghai, March, 2016.

    2015 and before

    See the Journals and Conferences lists above.