MLSP publications

Books

Dan Ellis, Tuomas Virtanen, Mark D Plumbley, Bhiksha Raj, Future Perspectives, Chapter in Computational Analysis of Sound Scenes and Events, 2017

Manas A. Pathak, Privacy-Preserving Machine Learning for Speech Processing. Monograph published in the Springer best thesis series, 2012.

Tuomas Virtanen, Rita Singh, Bhiksha Raj (Eds). Techniques for Noise Robustness in Automatic Speech Recognition,, Wiley, 2012.

2018

Anurag Kumar and Bhiksha Raj, Classifier Risk Estimation under Limited Labeling Resources, in 22nd Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD) , 2018

Pranay Manocha, Rohan Badlani, Anurag Kumar, Ankit Shah, Benjamin Elizalde and Bhiksha Raj, Content-Based Representations Of Audio Using Siamese Neural Networks, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018

Yandong Wen, Tianyan Zhou, Rita Singh and Bhiksha Raj, A Corrective Learning Approach For Text-Independent Speaker Verification, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018

Yang Gao, Rita Singh and Bhiksha Raj, Voice Impersonation Using Generative Adversial Networks, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018

Abelino Jimenez, Benjamin Elizalde and Bhiksha Raj, Acoustic Scene Classification Using Discrete Random Hashing for Laplacian Kernel Machines, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018

Rohan Badlani, Ankit Shah, Benjamin Elizalde, Anurag Kumar and Bhiksha Raj, Framework For Evaluation Of Sound Event Detection in Web Videos, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018

2017

G Friedland, P Smaragdis, J McDermott, Bhiksha Raj, Audition for multimedia computing, in Frontiers of Multimedia Research, 2017

Abelino Jiménez, Bhiksha Raj, A two factor transformation for speaker verification through ℓ1 comparison in IEEE International Workshop on nformation Forensics and Security (WIFS), 2017

Anurag Kumar, Bhiksha Raj, Deep CNN Framework for Audio Event Recognition using Weakly Labeled Web Data, in NIPS Workshop on Machine Learning for Audio, 2017

Bhiksha Raj, Benjamin Elizalde, Ankit Shah, Rohan Badlani and Anurag Kumar, Never-Ending Learner of Sounds, in NIPS Workshop on Machine Learning for Audio, 2017

Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, Le Song, SphereFace: Deep Hypersphere Embedding for Face Recognition, in Computer Vision and Pattern Recognition (CVPR), 2017

A. Mesaros, T. Heittola, A. Diment, B. Elizalde, A. Shah, E. Vincent, B. Raj, and T. Virtanen, IEEE DCASE 2017 challenge setup: tasks, datasets and baseline system, in IEEE Detection and Classification of Acoustic Scenes and Events Workshop 2017 (DCASE2017).

Abelino Jimenez, Benjamin Elizalde and Bhiksha Raj, DCASE 2017 Task 1: Acoustic Scene Classification Using Shift-Invariant Kernels and Random Features, in IEEE Detection and Classification of Acoustic Scenes and Events Workshop 2017 (DCASE2017)

Janek Ebbers, Jahn Heymann, Lukas Drude, Thomas Glarner, Reinhold Haeb-Umbach, Bhiksha Raj, Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery, in Interspeech, 2017 ( (Best paper award))

Anurag Kumar, Benjamin Elizalde and Bhiksha Raj, Audio Content based Geotagging in Multimedia, in Interspeech, 2017

Anurag Kumar, Bhiksha Raj, Audio Event and Scene Recognition: A Unified Approach using Strongly and Weakly Labeled Data , in International Joint Conference on Neural Networks (IJCNN), 2017

Anurag Kumar, Bhiksha Raj, Ndapandula Nakashole, Discovering Sound Concepts and Acoustic Relations In Text, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017

Abelino Jiménez, Bhiksha Raj Privacy Preserving Distance Computation using Somewhat-Trusted Third Parties in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017

Keiichi Osako, Yuki Mitsufuji, Rita Singh, Bhiksha Raj, Supervised Monoaural Source Separation Based on Autoencoders, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017

Ankit Shah, Rohan Badlani, Anurag Kumar, Benjamin Elizalde and Bhiksha Raj, An Approach for Self-Training Audio Event Detectors Using Web Data in European 25th European Signal Processing Conference (EUSIPCO), 2017

Nia Peters, Griffin Romigh, George Bradley, Bhiksha Raj, When to Interrupt: A Comparative Analysis of Interruption Timings Within Collaborative Communication Tasks, in Advances in Human Factors and System Interactions, 2017

2016

Anurag Kumar, Bhiksha Raj, Weakly supervised scalable audio content analysis, in IEEE International Conference on Multimedia and Expo (ICME), 2016

Agha Ali Raza, Rajat Kulshreshtha, Spandana Gella, Sean Blagsvedt, Maya Chandrasekaran, Bhiksha Raj, Roni Rosenfeld, Viral Spread via Entertainment and Voice-Messaging Among Telephone Users in India, in Eighth International Conference on Information and Communication Technologies and Development, ACM, 2016

Anurag Kumar, Bhiksha Raj, Audio Event Detection using Weakly Labeled Data, in 24th ACM International Conference on Multimedia (ACM MM), 2016

Joana Correia, Isabel Trancoso, Bhiksha Raj, Adaptation of SVM for MIL for infering the polarity of movies and movie reviews, in IEEE Workshop on Spoken Language Technology, 2016

Sohail Bahmani, Petros T Boufounos, Bhiksha Raj, Learning model-based sparsity via projected gradient descent, in IEEE Transactions on Information Theory, 2016

Rahul Radhakrishnan Iyer, Sanjeel Parekh, Vikas Mohandoss, Anush Ramsurat, Bhiksha Raj, Rita Singh, Content-based Video Indexing and Retrieval Using Corr-LDA arXiv preprint arXiv:1602.08581

Afsaneh Asaei, Mohammad Javad Taghizadeh, Saeid Haghighatshoar, Bhiksha Raj, Hervé Bourlard, Volkan Cevher, Binary Sparse Coding of Convolutive Mixtures for Sound Localization and Separation via Spatialization, in IEEE Transactions on Signal Processing, 2016

Suyoun Kim, Bhiksha Raj, Ian Lane, Environmental Noise Embeddings for Robust Speech Recognition, arXiv preprint arXiv:1601.02553

Lukas Drude, Bhiksha Raj, Reinhold Haeb-Umbach, On the Appropriateness of Complex-Valued Neural Networks for Speech Enhancement, in Interspeech 2016

Rita Singh, Mereological algebras as mechanisms for reasoning about sound, in IEEE International Conference on Machine Learning for Signal Processing, 2016

Rita Singh, Bhiksha Raj, James Baker, Short-Term Analysis for estimating physical parameters of speakers, in 4th International Workshop on Biometrics and Forensics (IWBF), 2016

Rita Singh, Joseph Keshet, Deniz Gencaga, Bhiksha Raj, The relationship of voice onset time and voice offset time to physical age, in IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2016

Jill Fain Lehman and Rita Singh, Estimation of childrens physical characteristics from their voices, in Interspeech, 2016

Rita Singh, Joseph Keshet and Eduard Hovy, Profiling Hoax Callers, in IEEE International Symposium on Technologies for Homeland Security, 2016

Rita Singh, Deniz Gencaga and Bhiksha Raj, Formant manipulations in voice disguise by mimicry, in 4th International Workshop on Biometrics and Forensics (IWBF), ,2016 ( (Best paper award))

Rita Singh, Deniz Gencaga and Bhiksha Raj, Forensic anthropometry from voice: an articulatory-phonetic approach, in 39th International Convention on Information and Communication Technology, Electronics and Microelectronics: Special session on Biometrics, Forensics and De-identification, 2016

2015

Abelino Jiménez, Bhiksha Raj, Jose Portelo, Isabel Trancoso. Secure Modular Hashing in IEEE International Workshop on nformation Forensics and Security (WIFS), 2015

Zhenzhong Lan, Shoou-I Yu, Ming Lin, Bhiksha Raj, Alexander G Hauptmann Handcrafted local features are convolutional neural networks in arXiv preprint arXiv:1511.05045, 2015

Keiichi Osako, Rita Singh, Bhiksha Raj. Complex recurrent neural networks for denoising speech signals in IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2015

Haohan Wang, Bhiksha Raj. A survey: Time travel in deep learning space: An introduction to deep learning models and how deep learning models evolved from the initial ideas, arXiv preprint arXiv:1510.04781, 2015

Wenbo Liu, Xhiding Yu, Bhiksha Raj, Li Yi, Xiaobing Zou, Ming Li. Efficient autism spectrum disorder prediction with eye movement: A machine learning framework in International Conference on Affective Computing and Intelligent Interaction (ACII), 2015

Luís Marujo, José Portêlo, Wang Ling, David Martins de Matos, João P Neto, Anatole Gershman, Jaime Carbonell, Isabel Trancoso, Bhiksha Raj. Privacy-preserving multi-document summarization arXiv preprint arXiv:1508.01420, 2015

Anurag Kumar, Bhiksha Raj. A novel ranking method for multiple classifier systems in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015

José Portêlo, Alberto Abad, Bhiksha Raj, Isabel Trancoso. Privacy-preserving Query-by-Example Speech Search in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015

Anders Øland, Bhiksha Raj. Reducing communication overhead in distributed learning by an order of magnitude (almost) in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015

Rita Singh, Kenichi Kumatani. Free energy for speech recognition in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015

Rita Singh Minimizing free energy of stochastic functions of Markov chains in International Conference on NonLinear Speech Processing, 2015

Muhammad Haris Usmani, Ramon Cepeda Jr, Thomas M Sullivan, Bhiksha Raj. Improving headphone spatialization for stereo music in The Journal of the Acoustical Society of America, 2015

José Portêlo, Bhiksha Raj, Isabel Trancoso. Logsum Using Garbled Circuits in Public Library of Science (PloS one), 2015

Tuomas Virtanen, Jort Florent Gemmeke, Bhiksha Raj, Paris Smaragdis. Compositional models for audio processing: Uncovering the structure of sound mixtures in IEEE Signal Processing Magazine, 2015

Anurag Kumar, Bhiksha Raj. Unsupervised fusion weight learning in multiple classifier systems in arXiv preprint arXiv:1502.01823, 2015

Nikolas Wolfe, Juneki Hong, Agha Ali Raza, Bhiksha Raj, Roni Rosenfeld. Rapid development of public health education systems in low-literacy multilingual environments: Combating ebola through voice messaging in ISCA Special Interest Group on Speech and Language Technology in Education (SLaTE), 2015

Wenbo Liu, Zhiding Yu, Bhiksha Raj, Ming Li. Locality Constrained Transitive Distance Clustering on Speech Data in 16th Annual Conference of the International Speech Communication Association (Interspeech), 2015

Harshavardhan Sundar, Jill Fain Lehman, Rita Singh Keyword spotting in multi-player voice driven games for children in 16th Annual Conference of the International Speech Communication Association (Interspeech), 2015

Zhengzhong Lan, Ming Lin, Xuanchong Li, Alex G Hauptmann, Bhiksha Raj. Beyond gaussian pyramid: Multi-skip feature stacking for action recognition in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015

2014

I Liu, Bhiksha Ramakrishnan. Bach in 2014: Music Composition with Recurrent Neural Network arXiv preprint arXiv:1412.3191, 2014

José Portêlo, Bhiksha Raj, Alberto Abad, Isabel Trancoso. Privacy-preserving speaker verification using garbled GMMS in 22nd European Signal Processing Conference (EUSIPCO), 2014

Anurag Kumar, Rita Singh, Bhiksha Raj. Detecting sound objects in audio recordings in 22nd European Signal Processing Conference (EUSIPCO), 2014

Luís Marujo, José Portêlo, David Martins De Matos, Joao P Neto, Anatole Gershman, Jaime Carbonell, Isabel Trancoso, Bhiksha Raj. Privacy-preserving important passage retrieval , arXiv preprint arXiv:1407.5416, 2014

José Portêlo, Bhiksha Raj, Alberto Abad, Isabel Trancoso. Privacy-preserving speaker verification using secure binary embeddings in 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2014

Tuomas Virtanen, Bhiksha Raj, Jort F Gemmeke. Active-set newton algorithm for non-negative sparse coding of audio, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014

Jahn Heymann, Oliver Walter, Reinhold Haeb-Umbach, Bhiksha Raj. in Iterative Bayesian word segmentation for unsupervised vocabulary discovery from phoneme lattices in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014

Tuomas Virtanen, Jon Barker, Shrikanth Narayanan, Alexandros Potamianos, Bhiksha Raj, Gaël Richard, Rita Singh, Paris Smaragdis, Stefano Squartini, Shiva Sundaram. Unsupervised Learning for Audio in Computational Audio Analysis, 2014

CMU Aladdin Team. Informedia@ trecvid 2014 med and mer in NIST TRECVID Video Retrieval Evaluation Workshop , 2014

Amir R Moghimi, Bhiksha Raj, Richard M Stern. Post-masking: a hybrid approach to array processing for speech recognition in Interspeech 2014.

2013

Oliver Walter, Timo Korthals, Reinhold Haeb-Umbach, Bhiksha Raj. A hierarchical system for word discovery exploiting DTW-based initialization. ASRU, pp. 386-391, 2013. (best student paper award)

Jahn Heymann, Oliver Walter, Reinhold Haeb-Umbach, Bhiksha Raj. Unsupervised word segmentation from noisy input. ASRU, pp. 458-463, 2013.

Parul Agarwal, Harish Karnik, Bhiksha Raj. A comparative study of Indian and Western musical forms. Conference of the International Society for Music Information Retrieval (ISMIR), 2013.

Pranay Dighe, Harish Karnik, Bhiksha Raj. Swara histogram based structural analysis and identification of Indian classical ragas. Conference of the International Society for Music Information Retrieval (ISMIR), 2013.

Anurag Kumar, Rajesh Hegde, Rita Singh, Bhiksha Raj. Event detection in short-duration audio using Gaussian-mixture model and Random-Forest classifier. European Signal Processing Conference (EUSIPCO), 2013.

Jose Portelo, Bhiksha Raj, Petros Boufounos, Isabel Trancoso, Alberto Abad. Speaker verification using Secure Binary Embeddings. European Signal Processing Conference (EUSIPCO), 2013.

Benjamin Lambert, Bhiksha Raj, Rita Singh. Discriminatively Trained Dependency Language Modeling for Conversational Speech Recognition. Interspeech, 2013.

Leibny Paola Garcia Perera, Bhiksha Raj, Juan Arturo Nolazco Flores. Ensemble approach in Speaker Verification. Interspeech, 2013.

Jose Portelo, Alberto Abad, Bhiksha Raj, Isabel Trancoso. Secure Binary Embeddings of Front-end Factor Analysis for Privacy Preserving Speaker Verification. Interspeech, 2013.

Shubhranshu Barnwal, Rohit Barnwal, Rajesh Hegde, Rita Singh, Bhiksha Raj. Doppler-based Speed Estimation using a Passive Sensor. IEEE International Conference on Multimedia and Expo, 2013.

Pranay Dighe, Pulkit Agarwal, Rajesh Hedge, S. Thota, Bhiksha Raj. Scale-independent Raga Identification using Chromagram features and Swara based features. IEEE International Conference on Multimedia and Expo, 2013.

Afsaneh Asaei, Bhiksha Raj, Herve Bourlard, Volkan Cevher. A multi-path Sparse Beamforming Method. Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2013.

K. Kumatani, R. Singh, F. Faubel, J. McDonough, Y. Oualil.Speech Separation and Enhancement. IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2013.

Sourish Chaudhuri, Bhiksha Raj. Unsupervised Hierarchical Structure Induction For Deeper Semantic Analysis of Audio. IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2013.

John McDonough, Kenichi Kumatani, Takayuki Arakawa, Kazumasa Yamamoto, Bhiksha Raj. Speaker tracking with spherical microphone arrays. IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2013.

Liebny Paola Garcia Perera, Juan Arturo Nolazco Flores, Bhiksha Raj. Optimization of the DET curve in Speaker Verification under noisy conditions. IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2013.

Oliver Walter, Reinhold Haeb-Umbach, Sourish Chaudhuri, Bhiksha Raj. Unsupervised Word Discovery from Phonetic Input Using Nested Pitman-Yor Language Modeling. IEEE International Conference on Robotics and Automation (ICRA) Workshop on Autonomous Learning, 2013.

Gahgene Gweon, Mahaveer Jain, John McDonough, Bhiksha Raj, Carolyn Rose. Measuring prevalance of other-oriented transactive contributions using an automated measure of speech style accommodation. International Journal on Computer Supported Collaborative Learning, 2013.

Sohail Bahmani, Petros Boufounos, Bhiksha Raj. Greedy Sparsity-Constrained Optimization. Journal of Machine Learning Research (JMLR), 14(Mar):807-841, 2013.

Sohail Bahmani, Bhiksha Raj. A Unifying Analysis of Projected Gradient Descent for lp-constrained Least Squares. Applied and Computational Harmonic Analysis, Vol 34 (2013), pp. 366-378, 2013.

Manas Pathak, Bhiksha Raj, Shantanu D. Rane, Paris Smaragdis. Privacy-preserving speech processing: Cryptographic and String-Matching frameworks show promise. IEEE Signal Processing Magazine, Vol 30:2, pp. 62-74, March 2013.

Tuomas Virtanen, Jort Gemmeke, Bhiksha Raj. Active-Set Newton Algorithm for Overcomplete Non-Negative Representations of Audio. IEEE Transactions on Audio, Speech and Language Processing, 2013.

Kenichi Kumatani, John McDonough, Bhiksha Raj. Microphone Array Processing for Distant Speech Recognition: From Close-Talking Microphones to Far-field Sensors. IEEE Signal Processing Magazine,

2013.
Manas Pathak, Bhiksha Raj. Privacy-Preserving Speaker Verification and Identification using Gaussian Mixture Models. IEEE Transactions on Audio, Speech, and Language Processing, Vol 21, pp. 397-406, Feb. 2013.
2012

Sourish Chaudhuri, Bhiksha Raj. Unsupervised Structure Discovery for Semantic Analysis of Audio, Neural Information Processing Systems (NIPS), 2012.

José Portelo, Bhiksha Raj, Petros Boufounos, Alberto Abad, Isabel Trancoso. Privacy-preserving speaker authentication. Workshop on Information Forensics and Security, Tenerife, 2012.

Leibny Paola Garcia Perera, Juan Arturo Nolazco Flores, Bhiksha Raj, Richard Stern. Optimization of the DET curve in speaker verification. Spoken Language Technologies Conference, 2012.

Manas Pathak, José Portelo, Bhiksha Raj, Isabel Trancoso. Privacy-preserving speaker authentication. Information Security Conference, Passau 2012.

John McDonough, Kenichi Kumatani and Bhiksha Raj Microphone Array Processing for Distant Speech Recognition: Spherical Arrays. Asian-Pacific Signal and Information Processing Association (APSIPA), 2012.

Kenichi Kumatani, Takayuki Arakawa, Kazumasa Yamamoto, John McDonough, Bhiksha Raj, Rita Singh and Ivan Tashev. Microphone Array Processing for Distant Speech Recognition: towards Real-World Deployment. Asian-Pacific Signal and Information Processing Association (APSIPA), 2012.

Victor Finomore, John Stewart, Rita Singh, Bhiksha Raj, Ron Dallman. Demonstration of Advanced Multi-Modal, Network-Centric Communication Management Suite. Proc. Interspeech, 2012.

Rita Singh, Kenichi Kumatani, John McDonough, Liu Chen. A signal-separation-based array postfilter for distant speech recognition. Proc. Interspeech, 2012.

Kenichi Kumatani, Bhiksha Raj, Rita Singh, John McDonough. Microphone array post-filter based on spatially-correlated noise measurements for distant speech recognition. Proc. Interspeech, 2012.

Soham De, Inradyumna Roy, Tarunima Prabhakar, Kriti Suneja, Sourish Chaudhuri, Rita Singh and Bhiksha Raj. Plagiarism detection in polyphonic music using monaural signal separation. Proc. Interspeech, 2012.

Sourish Chaudhuri, Rita Singh, Bhiksha Raj. Exploiting temporal sequence structure for semantic analysis of multimedia. Proc. Interspeech, 2012.

Kamal Sahni, Pranay Dighe, Rita Singh, Bhiksha Raj. Language identification using spectro-temporal patch features. Proc. 5th ISCA workshop on statistical and perceptual audition (SAPA2012), 2012.

Afsaneh Asaei, Bhiksha Raj, Volkan Cevher. Structured sparse coding for microphone array location calibration. Proc. 5th ISCA workshop on statistical and perceptual audition (SAPA2012) 2012.

Manas Pathak, Bhiksha Raj. Large Margin Gaussian Mixture Models with Differential Privacy, IEEE Transactions on dependable and secure computing, 2012.

Gahgene Gweon, Mahaveer Jain, John McDonough, Carolyn Rosé, Bhiksha Raj Predicting Idea Co-Construction in Speech Data using Insights from Sociolinguistics. International Conference of the Learning Sciences, 2012.

Mahaveer Jain, John McDonough, Gahgene Gweon, Bhiksha Raj, Carolyn Rosé An Unsupervised Dynamic Bayesian Network Approach to Measuring Speech Style Accommodation. 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2012.

Rita Singh. Compensating for denoising artifacts. IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2012.

Shubhranshu Barnwal, Kamal Sahni, Rita Singh, Bhiksha Raj. Spectrographic seam patterns for discriminative word spotting. IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2012.

Anurag Kumar, Pranay Dighe, Sourish Chaudhuri, Rita Singh, Bhiksha Raj. Audio event detection from acoustic unit occurrence patterns. IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2012.

José Portelo, Bhiksha Raj and Isabel Trancoso. Attacking a Privacy Preserving Music Matching Algorithm. IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2012.

Yu-Hsiang (Bosco) Chiu, Bhiksha Raj, Richard Stern. Learning-based Auditory Encoding for Robust Speech Recognition. IEEE Transactions on Audio Speech and Language Processing, Vol 20., 900-914, March 2012.

Manas Pathak and Bhiksha Raj. Privacy preserving speaker verification as password matching. IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2012.

Bhiksha Raj, Kaustubh Kalgaonkar, Chris Harrison, Paul Dietz. Properties and Applications of Ultrasonic Doppler Sensing in Human-Computer Interaction. IEEE Pervasive Computing, special issue on pervasive I/O, 2012.

Paris Smaragdis, Bhiksha Raj. "The Markov Selection Model for concurrent speech recognition", Neurocomputing, pp. 64-72, March 2012.

2011

Manas Pathak, Bhiksha Raj. Privacy Preserving Protocols for Eigenvector Computation, Transactions on Data Privacy, foundations and technologies, 2011.

Baji Babu, Ronanki Srikanth, Sathya Adithya Thati, Bhiksha Raj, Bayya Yegnanarayana, Kishore Prahallad. A comparison of prosody modification using instants of significant excitation and mel-cepstral vocoder, Centenary Confererence of the Indian Institute of Science, 14-17 Dec 2011.

Petros Boufounos, Paris Smaragdis, Bhiksha Raj. Joint sparsity models for wideband array processing, Wavelets and Sparsity XIV, SPIE Optics and Photonics, 2011.

Kenichi Kumatani, John McDonough, Bhiksha Raj, Maximum kurtosis beamforming with a subspace filter for distant speech recognition, Automatic Speech Recognition and Understanding (ASRU), 2011.

Sourish Chaudhuri, Bhiksha Raj. "Learning contextual relevance of audio segments using discriminative models over AUD sequences," IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

John McDonough, Bhiksha Raj, Kenichi Kumatani. On the combination of voice prompt suppression with maximum kurtosis beamforming. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011.

John McDonough, Kenichi Kumatani, Bhiksha Raj. Block-wise incremental adaptation algorithm for maximum-kurtosis beamforming. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011.

Sohail Bahmani, Petros Boufounos, Bhiksha Raj. "Greedy Sparsity-Constrained Optimization", Asilomar Conference on Signals, Systems, and Computers, 2011.

John McDonough, Kenichi Kumatani, Bhiksha Raj, Jill Lehman. "An information filter for voice prompt suppression", Asilomar Conference on Signals, Systems, and Computers, 2011.

Sourish Chaudhuri, Mark Harvilla, Bhiksha Raj. "Unsupervised Learning of Acoustic Unit Descriptors for Audio Content Representation and Classification", Interspeech, 2011.

Sourish Chaudhuri, Bhiksha Raj, Tony Ezzat. "A paradigm for small vocabulary speech recognition based on redundant spectro-temporal feature sets", Interspeech, 2011.

Manas Pathak, Bhiksha Raj. Privacy preseving speaker verification using adapted GMMs, Interspeech, 2011.

Bhiksha Raj, Rita Singh, Tuomas Virtanen. "Phoneme-dependent NMF for speech enhancement in monaural mixtures", Interspeech, 2011.

Evandro Gouvea. Hybrid speech recognition for voice search; a comparative study. Interspeech, Florence, 2011.

José Portelo, Alberto Abad, Bhiksha Raj, Isabel Tranciso. "On the Implementation of a Secure Musical Database Matching". 19th European Signal Processing Conference (EUSIPCO) 2011.

Gahgene Gweon, Pulkit Agarwal, Mikesh Udani, Bhiksha Raj, Carolyn Rosé. "The automatic assessment of knowledge integration processes in project teams", 9th International Conference on Computer Supported Collaborative Learning (CSCL). 2011.(best student paper award)

Sourish Chaudhuri, Bhiksha Raj. "A Comparison of Latent Variable Models for Conversation Analysis," SIGDIAL, 2011. (best paper award)

Kenichi Kumatani, John McDonough, Bhiksha Raj, Jill Lehman. Channel Selection based on Multichannel Cross-Correlation Coefficients for Distant Speech Recognition. Hands-free speech communication and microphone arrays (HSCMA. June 2011.

Kshitiz Kumar, Bhiksha Raj, Rita Singh, Richard Stern. "An iterative least-squares technique for dereverberation", IEEE International Conference on Acoustics Speech and Signal Processing. Prague, 2011.

Kshitiz Kumar, Bhiksha Raj, Rita Singh, Richard Stern. "Gammatone sub-band magnitude-domain dereverberation for ASR", IEEE International Conference on Acoustics Speech and Signal Processing. Prague, 2011.

Manas Pathak, Shantanu Rane, Wei Sun, Bhiksha Raj. "Privacy-preserving probabilistic inference with hidden Markov models", IEEE International Conference on Acoustics Speech and Signal Processing. Prague, 2011.

Bhiksha Raj, Rita Singh, James Baker. "A paired test for recognizer selection with untranscribed data", IEEE International Conference on Acoustics Speech and Signal Processing. Prague, 2011.

2010

Manas Pathak, Shantanu Rane, and Bhiksha Raj. "Multiparty differential privacy via aggregation of locally trained classifiers." Neural Information Processing Systems (NIPS). Vancouver, Canada, December 2010.

Manas Pathak and Bhiksha Raj. "Large Margin Multiclass Gaussian Classification with Differential Privacy." ECML/PKDD Workshop on Privacy and Security issues in Data Mining and Machine Learning (PSDML). Barcelona, Spain, September 2010.

Manas Pathak and Bhiksha Raj. "Privacy-Preserving Protocols for Eigenvector Computation." ECML/PKDD Workshop on Privacy and Security issues in Data Mining and Machine Learning (PSDML). Barcelona, Spain, September 2010.

A. Toth, M. Wand, S. Chen, S. Jou, T. Schultz, B. Rj, K. Kalgaonkar and T. Ezzat. "Synthesizing speech from surface electromyography and acoustic Doppler sonar". J. Acoust. Soc. Am. Volume 127, Issue 3, pp. 1816-1816 (2010)

Paris Smaragdis, Bhiksha Raj, Madhusudana Shashanka. "Missing data imputation for time-frequency representations of audio signals." Journal of Signal Processing Systems (Springer). To appear

Rita Singh, Benjamin Lambert, and Bhiksha Raj. "The use of sense in unsupervised training of acoustic models for HMM-based ASR systems." Interspeech, 2010.

Benjamin Lambert, Rita Singh, and Bhiksha Raj. "Creating a semantic coherence dataset with non-expert annotators." Interspeech, 2010.

Bhiksha Raj, Tuomas Virtanen, Sourish Chaudhuri, and Rita Singh. "Non-negative matrix factorization based compensation of music for automatic speech recognition." Interspeech, 2010.

Bhiksha Raj, Kevin Wilson, Alexander Krüger, Reinhold Häb-Umbach. "Ungrounded non-negative independent factor analysis." Interspeech, 2010.

Paris Smaragdis and Bhiksha Raj. The Markov Selection Model for concurrent speech recognition. IEEE workshop on machine learning for signal processing. Kittilä, Finland, 2010.

Gautham Mysore, Paris Smaragdis, and Bhiksha Raj. Non-negative Hidden-Markov modeling of audio with application to source separation. 9th international conference on latent variable analysis and source separation. (best student paper) St. Malo, France, 2010.

Yu-Hsiang (Bosco) Chiu, Bhiksha Raj, and Richard M. Stern. Learning-based auditory encoding for robust speech recognition. IEEE International Conference on Acoustics, Speech, and Signal Processing. March 2010, Dallas, Texas.

Kevin Wilson and Bhiksha Raj. Spectrogram Dimensionality Reduction with Independence Constraints. IEEE International Conference on Acoustics, Speech, and Signal Processing. March 2010, Dallas, Texas.

Ziad Al-Bawab, Bhiksha Raj, and Richard M. Stern. A Hybrid Physical and Statistical Dynamic Articulatory Framework Incorporating Analysis-by-Synthesis for Improved Phone Classification. IEEE International Conference on Acoustics, Speech, and Signal Processing. March 2010, Dallas, Texas.

Sundararajan Srinivasan, Bhiksha Raj, and Tony Ezzat. Ultrasonic Sensing for Robust Speech Recognition. IEEE International Conference on Acoustics, Speech, and Signal Processing. March 2010, Dallas, Texas.

Rita Singh, Bhiksha Raj, and Paris Smaragdis. Latent-Variable Decomposition Based Dereverberation of Monaural and Multi-Channel Signals. IEEE International Conference on Acoustics, Speech, and Signal Processing. March 2010, Dallas, Texas.

Arthur Toth, Kaustubh Kalgaonkar, and Bhiksha Raj. Synthesizing Speech from Doppler Signals. IEEE International Conference on Acoustics, Speech, and Signal Processing. March 2010, Dallas, Texas.

2009

Paris Smaragdis, Madhusudana Shashanka, and Bhiksha Raj. Topic Models for Audio Mixture Analysis. NIPS workshop on applications for topic models: text and beyond. 2009.

Paris Smaragdis, Madhusudana Shashanka, and Bhiksha Raj. A Sparse Non-Parameteric Approach for Single Channel Separation of Known Sounds. NIPS. 2009.

Paris Smaragdis, Bhiksha Raj, and Madhusudana Shashanka. Missing Data Imputation for Spectral Audio Signals. IEEE International Workshop for Machine Learning in Signal Processing. September 2009.

Chanwoo Kim, Kshitiz Kumar, and Bhiksha Raj. Signal Separation for Robust Speech Recognition based on Phase Difference Information obtained in the Frequency Domain. Interspeech. 2009.

Yu-Hsiang (Bosco) Chiu, Bhiksha Raj, and Richard M. Stern. Towards Fusion of Feature Extraction and Acoustic Model Training: A Top Down Process for Robust Speech Recognition. Interspeech. 2009.

Ziad Al-Bawab, Lorenzo Turicchia, and Bhiksha Raj. Towards Speech Synthesis from Elecetromagnetic Articulograph Data using a Physical Model of the Vocal Tract. Interspeech. 2009.

Paris Smaragdis, Bhiksha Raj, and Gautham Mysore. Probabilistic Factorization of Non-Negative Data with Co-occurrence Constraints. 8th International Conference on Independent Component Analysis and Signal Separation. 2009.

Evandro Gouvêa and Bhiksha Raj. Word Particles applied to Information Retrieval. European Conference on Information Retrieval (ECIR). 2009.

Kaustubh Kalgaonkar and Bhiksha Raj. One-handed Gesture Recognition using Ultrasonic Doppler Sonar. IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP). 2009.

Dhananjay Bansal, Nishanth Nair, Rita Singh and Bhiksha Raj. A Joint Decoding Algorithm for Multiple-Example-Based addition of Words to a Pronunciation Lexicon. IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP). 2009.

News