Publications

2012

Ma, N., Barker, J., Christensen, H. and Green, P. (submitted) A hearing-inspired approach for distant-microphone speech recognition in the presence of multiple sources. Computer Speech and Language
Barker, J., Vincent, E., Ma, N., Christensen, H. and Green, P. (submitted) The PASCAL CHiME Speech Separation and Recognition Challenge. Computer Speech and Language
Gonzalez, J., Peinado, A., Ma, N., Gomez, A. and Barker, J. (submitted) MMSE-based missing-feature reconstruction with temporal modeling for robust speech recognition. IEEE Transactions on Audio, Speech and Language Processing
Gonzalez, J., Peinado, A., Gomez, A., Ma, N. and Barker, J. (accepted) Combining missing-data reconstruction and uncertainty decoding for robust speech recognition. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Kyoto
Ma, N., Barker, J., Christensen, H. and Green, P. (2012) Combining speech fragment decoding and adaptive noise floor modelling. IEEE Transactions on Audio, Speech and Language Processing, 20(3): 818--827

Ma, N., Barker, J., Christensen, H. and Green, P. (2011) Binaural cues for fragment-based speech recognition in reverberant multisource environments. In: Proceedings of Interspeech, pp. 1657--1660, Florence
Ma, N., Barker, J., Christensen, H. and Green, P. (2011) Recent advances in fragment-based speech recognition in reverberant multisource environments. In: Proceedings of Workshop on Machine Listening in Multisource Environments, pp. 68--73, Florence
Morales, J., Ma, N., Sanchez, V., Carmona, J., Peinado, A. and Barker, J. (2011) A pitch based noise estimation technique for robust speech recognition with missing data. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 4808--4811, Prague
Ma, N., Barker, J., Christensen, H. and Green, P. (2011) Incorporating localisation cues in a fragment decoding framework for distant binaural speech recognition. In: Proceedings of IEEE Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA'11), pp. 207--212, Edinburgh

Ma, N., Barker, J., Christensen, H. and Green, P. (2010) Distant microphone speech recognition in a noisy indoor environment: combining soft missing data and speech fragment decoding. ISCA Tutorial and Research Workshop on Statistical And Perceptual Audition (SAPA 2010), Makuhari, Japan, September 2010.

Christensen, H., Barker, J., Ma, N. and Green, P. (2010) The CHiME corpus: a resource and a challenge for Computational Hearing in Multisource Environments. Interspeech'10, Makuhari, Japan, September 2010.

Barker, J., Ma, N., Coy, A. and Cooke, M. (2010) Speech fragment decoding techniques for simultaneous speaker identification and speech recognition. Computer Speech and Language, 24(1): 94--111.

Barker, J. and Shao, X. (2009) Energetic and informational masking effects in an audio-visual speech recognition system. IEEE Transactions on Audio, Speech and Language Processing, 17(3): 446--458.
Christensen, H., Ma, N., Wrigley, S.N. and Barker, J. (2009) A Speech Fragment Approach to Localising Multiple Speakers in Reverberant Environments. In: Proceedings of ICASSP 2009, pp. 4593--4596, Taipei, Taiwan.

Ma, N., Green, P., Barker, J. and Coy, A. (2007) Exploiting correlogram structure for robust speech recognition with multiple speech sources. Speech Communication, 49(12): 874--891.

Coy, A. and Barker, J. (2007) An automatic speech recognition system based on the scene analysis account of auditory perception. Speech Communication, 49(5): 384--401.

Barker, J., Cooke, M.P. and Ellis, D.P.W. (2005) Decoding speech in the presence of other sources. Speech Communication, 45: 5--25.

Cooke, M.P., Green, P.D., Josifovski, L. and Vizinho, A. (2001) Robust automatic speech recognition with missing and unreliable acoustic data. Speech Communication, 34: 267--285.