Publications

2012

  • Ma, N., Barker, J., Christensen, H., Green P. (submitted) A hearing-inspired approach for distant-microphone speech recognition in the presence of multiple sources. Computer Speech and Language
  • Barker, J., Vincent, E., Ma, N., Christensen, H., Green P. (submitted) The PASCAL CHiME Speech Separation and Recognition Challenge. Computer Speech and Language
  • Gonzalez, J., Peinado, A., Ma, N., Gomez, A. and Barker, J. (submitted) MMSE-based missing-feature reconstruction with temporal modeling for robust speech recognition. IEEE Transactions on Audio, Speech and Language Processing
  • Gonzalez, J., Peinado, A., Gomez, A., Ma, N. and Barker, J. (accepted) Combining missing-data reconstruction and uncertainty decoding for robust speech recognition. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Kyoto
  • Ma, N., Barker, J., Christensen, H., Green P. (2012) Combining speech fragment decoding and adaptive noise floor modelling. IEEE Transactions on Audio, Speech and Language Processing, 20(3): 818--827 [PDF]

2011

  • Ma, N., Barker, J., Christensen, H. and Green, P. (2011) Binaural cues for fragment-based speech recognition in reverberant multisource environments. In: Proceedings of Interspeech, pp. 1657--1660, Florence [PDF]
  • Ma, N., Barker, J., Christensen, H. and Green, P. (2011) Recent advances in fragment-based speech recognition in reverberant multisource environments. In: Proceedings of Workshop on Machine Listening in Multisource Environments, pp. 68--73, Florence
  • Morales, J., Ma, N., Sanchez, V., Carmona, J., Peinado, A., and Barker, J. (2011) A pitch based noise estimation technique for robust speech recognition with missing data. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 4808--4811, Prague [PDF]
  • Ma, N., Barker, J., Christensen, H. and Green, P. (2011) Incorporating localisation cues in a fragment decoding framework for distant binaural speech recognition. In: Proceedings of IEEE Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA'11), pp. 207--212, Edinburgh

2010

  • Ma, N., Barker, J., Christensen, H. and Green, P. (2010) Distant microphone speech recognition in a noisy indoor environment: combining soft missing data and speech fragment decoding. ISCA Tutorial and Research Workshop on Statistical And Perceptual Audition (SAPA 2010), Makuhari, Japan, September 2010.
  • Christensen, H., Barker, J., Ma, N., and Green, P. (2010) The CHiME corpus: a resource and a challenge for Computational Hearing in Multisource Environments. Interspeech'10, Makuhari, Japan, September 2010. PDF
  • Barker, J., Ma, N., Coy, A. and Cooke, M. (2010) Speech fragment decoding techniques for simultaneous speaker identification and speech recognition. Computer Speech and Language, 24(1): 94--111. PDF

Background Reading

2009

  • Barker, J. and Shao, X. (2009), Energetic and informational masking effects in an audio-visual speech recognition system. IEEE Trans. Audio, Speech and Language Processing, 17(3):446-458. PDF
  • Christensen, H., Ma, N., Wrigley, S.N. and Barker, J. (2009), A Speech Fragment Approach to Localising Multiple Speakers in Reverberant Environments. In Proc. of ICASSP 2009, page 4593-96, Taipei, Taiwan. PDF

2007

  • Ma, N., Green, P., Barker, J. and Coy, A. (2007), Exploiting correlogram structure for robust speech recognition with multiple speech sources. Speech Communication, 49(12): 874--891. PDF
  • Coy, A. and Barker, J. (2007), An automatic speech recognition system based on the scene analysis account of auditory perception. Speech Communication, 49(5):384-401. PDF

2005

  • Barker, J., Cooke, M.P. and Ellis, D.P.W. (2005), Decoding speech in the presence of other sources. Speech Communication, 45:5-25. PDF

2001

  • Cooke, M.P., Green, P.D., Josifovski, L. and Vizinho, A. (2001), Robust automatic speech recognition with missing and unreliable acoustic data. Speech Communication, 34:267-285. PDF