Perceptual constancy in real-room listening by humans and machines


Current Publications

Keronen, S., Kallasjoki, H., Remes, U., Brown, G.J., Gemmeke, J.F. and Palomäki, K.J. (2012). Mask estimation and imputation methods for missing data speech recognition in multisource reverberant environments. Computer Speech and Language, in press
Brown, G.J., Beeston, A.V. and Palomäki, K.J. (2012). Perceptual compensation for the effects of reverberation on consonant identification: A comparison of human and machine performance. Proceedings of Interspeech 2012, Portland, Oregon, in press
Watkins, A.J., Raimond, A.P. and Makin, S.J. (2011). Temporal-envelope constancy of speech in rooms and the perceptual weighting of frequency bands. Journal of the Acoustical Society of America, 130 (5), pp. 2777-2788.
Beeston, A. and Brown, G.J. (2011). Low-level and high-level models of perceptual compensation for reverberation. Poster presented at Computational Audition workshop, Delmenhorst, Germany, Oct. 24-26.
Brown, G.J., Palomäki, K.J. and Beeston, A. (2011). Compensation for the effects of reverberation on automatic speech recognition: a perceptually-inspired approach based on weighting of parallel acoustic models. Poster presented at British Society of Audiology Annual Conference, Nottingham, Sept. 7-9.
Palomäki, K.J. and Brown, G.J. (2011). A computational model of binaural speech recognition: role of across-frequency vs. within-frequency processing and internal noise. Speech Communication, 53 (6), pp. 924-940
Raimond, A.P. and Watkins, A.J. (2011). Cross-frequency effects on loudness asymmetry in real-room reverberation. Poster presented at British Society of Audiology Annual Conference, Nottingham, Sept. 7-9.
Makin, S.J., Watkins, A.J. and Raimond, A.P. (2011). The dominance of ILD cues when tracking talkers in real-room reverberation. Poster presented at First International Conference on Cognitive Hearing Science for Communication, Linkoping, Sweden, June 19-22.
Watkins, A.J., Raimond, A.P. and Makin, S.J. (2011). Room reflections, speech identification and the perceptual weighting of frequency bands. Poster presented at First International Conference on Cognitive Hearing Science for Communication, Linkoping, Sweden, June 19-22.
Kallasjoki, H., Keronen, S., Brown, G.J., Gemmeke, J.F., Remes, U. and Palomäki, K.J. (2011). Mask estimation and sparse imputation for missing data speech recognition in multisource reverberant environments. CHiME International Workshop on Machine Listening in Multisource Environments, Florence, Italy, 1st September.
Watkins, A.J., Makin, S.J. and Raimond, A.P. (2010). Constancy in the perception of speech when the level of room-reflections varies. In Buchholz, J., Dau, T., Dalsgaard, J. and Poulsen, T. (eds.) Binaural processing and spatial hearing. ISAAR - International Symposium on Auditory and Audiological Research. The Danavox Jubilee Foundation, Ballerup, Denmark, pp 371-380. ISBN 8799001322.
Brown, G.J., Ferry, R.F. and Meddis, R. (2010). A computer model of auditory efferent suppression: Implications for the coding of speech in noise. Journal of the Acoustical Society of America, 127 (2), pp. 943-954
Beeston, A. and Brown, G.J. (2010). Perceptual compensation for effects of reverberation in speech identification: A computer model based on auditory efferent processing. Interspeech, Makuhari, Chiba, Japan. Proceedings, pp 2462-2465.
Beeston, A., Brown, G.J., Watkins, A.J. and Makin, S.J. (2010). Perceptual compensation for reverberation: human identification of stop consonants in reverberated speech contexts. Poster presented at British Society of Audiology Annual Conference, Manchester. Programme and Abstracts, pp 109-110.
Brown, G.J. and Beeston, A. (2010). A computer model of perceptual compensation for reverberation: evaluation on a consonant identification task. Poster presented at British Society of Audiology Annual Conference, Manchester. Programme and Abstracts, pp 108-109.
Beeston, A. and Brown, G.J. (2010). Perceptual compensation for adverse effects of room reverberation on speech recognition: A model based on auditory efferent processing. Poster presented at Psycholinguistic approaches to speech recognition in adverse conditions, Bristol, 8-10th March 2010.
Watkins, A.J., Makin, S.J. and Raimond, A.P. (2009). Perceptual constancy, reverberation, and grouping: Within-band effects. Poster presented at Acoustical Society of America meeting, San Antonio, Oct. 26-30th.
Makin, S.J., Watkins, A.J. and Raimond, A.P. (2009). Spectral- and temporal-envelope room-acoustic cues in attentional tracking. Poster presented at Acoustical Society of America meeting, San Antonio, Oct. 26-30th.
Beeston, A. and Brown, G.J. (2009). A computer model of perceptual compensation for reverberation based on auditory efferent suppression. Poster presented at British Society of Audiology short papers meeting, Southampton, Sept. 17-18th.
Watkins, A.J., Makin, S.J. and Raimond, A.P. (2009). Room reflections, perceptual grouping and constancy in speech-like sounds. Poster presented at British Society of Audiology short papers meeting, Southampton, Sept. 17-18th.
Raimond, A.P. and Watkins, A.J. (2009). Monaural and dichotic effects of real-room reverberation on loudness perception. Poster presented at British Society of Audiology short papers meeting, Southampton, Sept. 17-18th.
Watkins, A.J., Raimond, A.P. and Makin, S.J. (2009). Room reflections and constancy in speech-like sounds: Within-band effects. In Lopez-Poveda, E., Palmer, A. and Meddis, R. (eds.) Auditory Physiology, Perception and Models, Springer, NY. ISH - 15th International Symposium on Hearing, Salamanca, Spain, June 1-5th. ISBN 1441956859.
Beeston, A. and Brown, G.J. (2009). A computer model of perceptual compensation for reverberation. Poster. In PhD Workshop: Professional Opportunities for Hearing Scientists, Programme and Abstracts, pp. 20-21.

Some relevant recent work

Watkins, A.J., Makin, S.J. and Raimond, A.P. (2008). Room-acoustic factors in attentional tracking. Journal of the Acoustical Society of America, 123, pp. 2976-2977.
Raimond, A.P. and Watkins, A.J. (2008). Effect of reverberation on loudness perception. Acta Acustica united with Acustica, 94, S332.
Watkins, A.J. and Makin, S.J. (2007). Perceptual compensation for reverberation in speech identification: Effects of single-band, multiple-band and wideband contexts. Acta Acustica united with Acustica, 93, pp. 403-410.
Watkins, A.J. and Makin, S.J. (2007). Steady-spectrum contexts and perceptual compensation for reverberation in speech identification. Journal of the Acoustical Society of America, 121 (1) pp. 257-266.
Watkins, A.J. and Makin, S.J. (2007). Perceptual compensation for reverberation: Effects of 'noise-like' and 'tonal' contexts. In Hearing - from basic research to applications, edited by B. Kollmeier, G. Klump, V. Hohmann, U. Langemann, M. Mauermann, S. Uppenkamp and J. Verhey. Springer Verlag, New York, pp. 533-540.
Palomäki, K.J. and Brown, G.J. (2006). Recognition of reverberant speech using full cepstral features and spectral missing data. Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP-2006), Toulouse, 15th-19th May, I, pp. 289-292.
Makin, S.J., Watkins, A.J. and Raimond, A.P. (2006). Perceptual compensation for reverberation: Effects of noise-context bandwidth. Journal of the Acoustical Society of America, 120, p. 3358.
Watkins, A.J. (2005). Perceptual compensation for effects of reverberation in speech identification. Journal of the Acoustical Society of America, 118 (1) pp. 249-262.
Watkins, A.J. (2005). Perceptual compensation for effects of echo and of reverberation on speech identification. Acta Acustica united with Acustica, 91, pp. 892-901.
Palomäki, K.J., Brown, G.J. and Wang, D.L. (2004). A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation. Speech Communication, 43 (4) pp. 361-378.
Palomäki, K.J., Brown, G.J. and Barker, J.P. (2004). Techniques for handling convolutional distortion with 'missing data' automatic speech recognition. Speech Communication, 43 pp. 123-142.
Brown, G.J., Palomäki, K.J. and Barker, J.P. (2004). A missing data approach to robust automatic speech recognition in the presence of reverberation. Proceedings of the 18th International Congress on Acoustics (ICA-2004), Kyoto, 4th-9th April, pp. 449-452.
Brown, G.J. and Palomäki, K.J. (2004). Techniques for robust speech recognition in noisy and reverberant conditions. In Speech Separation by Humans and Machines, edited by P. Divenyi. Norwell, MA: Kluwer Academic Publishers, pp. 213-220.
Palomäki, K.J., Brown, G.J. and Barker, J.P. (2002). Missing data speech recognition in reverberant conditions. Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP-2002), Orlando, 13th-17th May, pp. 65-68.