Perceptual constancy in real-room listening by humans and machines

Publications

Current Publications

Keronen, S., Kallasjoki, H., Remes, U., Brown, G.J., Gemmeke, J.F. and Palomäki, K.J. (2012). Mask estimation and imputation methods for missing data speech recognition in multisource reverberant environments. Computer Speech and Language, in press
Brown, G.J., Beeston, A.V. and Palomäki, K.J. (2012). Perceptual compensation for the effects of reverberation on consonant identification: A comparison of human and machine performance. Proceedings of Interspeech 2012, Portland, Oregon, in press
Watkins, A.J., Raimond, A.P. and Makin, S.J. (2011). Temporal-envelope constancy of speech in rooms and the perceptual weighting of frequency bands. Journal of the Acoustical Society of America, 130 (5), pp. 2777-2788.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/Watkins_Raimond_Makin_JASA2011.pdf
Beeston, A. and Brown, G.J. (2011). Low-level and high-level models of perceptual compensation for reverberation. Poster presented at Computational Audition workshop, Delmenhorst, Germany, Oct., 24-26.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/Poster_BSA_September2010_human.pdf
Brown, G.J., Palomäki, K.J. and Beeston, A. (2011). Compensation for the effects of reverberation on automatic speech recognition: a perceptually-inspired approach based on weighting of parallel acoustic models. Poster presented at British Society of Audiology Annual Conference, Nottingham, Sept., 7-9.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/brown-bsa11.pdf
Palomäki, K.J. and Brown, G.J. (2011). A computational model of binaural speech recognition: role of across-frequency vs. within-frequency processing and internal noise. Speech Communication, 53 (6), pp. 924-940
Raimond, A.P. and Watkins, A.J. (2011). Cross-frequency effects on loudness asymmetry in real-room reverberation. Poster presented at British Society of Audiology Annual Conference, Nottingham, Sept., 7-9.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/Raimond_NottinghamBSA2011_Final.pdf
Makin, S.J., Watkins, A.J. and Raimond, A.P. (2011). The dominance of ILD cues when tracking talkers in real-room reverberation. Poster presented at First International Conference on Cognitive Hearing Science for Communication, Linkoping, Sweden, June 19-22.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/Makin_Linkoping_2011.pdf
Watkins, A.J., Raimond, A.P. and Makin, S.J. (2011). Room reflections, speech identification and the perceptual weighting of frequency bands. Poster presented at First International Conference on Cognitive Hearing Science for Communication, Linkoping, Sweden, June 19-22.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/watkins Linkoping 2010.pdf
Kallasjoki, H., Keronen, S., Brown, G.J., Gemmeke, J.F., Remes, U. and Palomäki, K.J. (2012). Mask estimation and sparse imputation for missing data speech recognition in a multisource reverberant environments. CHiME International Workshop on Machine Listening in Multisource Environments, Florence, Italy, 1st September. 
Watkins, A.J., Makin, S.J. and Raimond, A.P. (2010). Constancy in the perception of speech when the level of room-reflections varies. In Buchholz, J., Dau, T., Dalsgaard, J. and Poulsen, T. (eds.) Binaural processing and spatial hearing. ISAAR - International Symposium on Auditory and Audiological Research. The Danavox Jubilee Foundation, Ballerup, Denmark, pp 371-380. ISBN 8799001322.
Brown, G.J., Ferry, R.F. and Meddis, R. (2010). A computer model of auditory efferent suppression: Implications for the coding of speech in noise. Journal of the Acoustical Society of America, 127 (2), pp. 943-954
Beeston, A. and Brown, G.J. (2010). Perceptual compensation for effects of reverberation in speech identification: A computer model based on auditory efferent processing. Interspeech, Makuhari, Chiba, Japan. Proceedings, pp 2462-2465.
Beeston, A., Brown, G.J., Watkins, A.J. and Makin, S.J. (2010). Perceptual compensation for reverberation: human identification of stop consonants in reverberated speech contexts. Poster presented at British Society of Audiology Annual Conference, Manchester. Programme and Abstracts, pp 109-110.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/Poster_BSA_September2010_human.pdf
Brown, G.J. and Beeston, A. (2010). A computer model of perceptual compensation for reverberation: evaluation on a consonant identification task. Poster presented at British Society of Audiology Annual Conference, Manchester. Programme and Abstracts, pp 108-109.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/Poster_BSA_September2010_machine.pdf
Beeston, A. and Brown, G.J. (2010). Perceptual compensation for adverse effects of room reverberation on speech recognition: A model based on auditory efferent processing. Poster presented at Psycholinguistic approaches to speech recognition in adverse conditions, Bristol, 8-10th March 2010.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/BeestonBrown_Bristol2010.pdf
Watkins, A.J., Makin, S.J. and Raimond, A.P. (2009). Perceptual constancy, reverberation, and grouping: Within-band effects. Poster presented at Acoustical Society of America meeting, San Antonio, Oct. 26-30th.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/watkins_ASA_09_awOK.pdf
Makin, S.J., Watkins, A.J. and Raimond, A.P. (2009). Spectral- and temporal-envelope room-acoustic cues in attentional tracking. Poster presented at Acoustical Society of America meeting, San Antonio, Oct. 26-30th.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/Spectral-_and_temporal-envelope_room-acoustic_cues_in_attentional_tracking_ASA_v10.pdf
Beeston, A. and Brown, G.J. (2009). A computer model of perceptual compensation for reverberation based on auditory efferent suppression. Poster presented at British Society of Audiology short papers meeting, Southampton, Sept. 17-18th.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/Poster_BSA_September2009.pdf
Watkins, A.J., Makin, S.J. and Raimond, A.P. (2009). Room reflections, perceptual grouping and constancy in speech-like sounds. Poster presented at British Society of Audiology short papers meeting, Southampton, Sept. 17-18th.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/watkins_BSA_09_Room_reflections_NEW_awOK.pdf
Raimond, A.P. and Watkins, A.J. (2009). Monaural and dichotic effects of real-room reverberation on loudness perception. Poster presented at British Society of Audiology short papers meeting, Southampton, Sept. 17-18th.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/Raimond_Watkins_BSA_2009_v4.pdf
Watkins, A.J., Raimond, A.P. and Makin, S.J. (2009). Room reflections and constancy in speech-like sounds: Within-band effects. In Lopez-Poveda E, Palmer A, Meddis R, (eds.) Auditory Physiology, Perception and Models, Springer, NY. ISH - 15th International Symposium on Hearing, Salamanca, Spain, June, 1-5thISBN 1441956859. <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/watkins_ish2009_with_figs.pdf
Beeston, A. and Brown, G.J. (2009). A computer model of perceptual compensation for reverberation. Poster. In PhD Workshop: Professional Opportunities for Hearing Scientists, Programme and Abstracts, pp. 20-21.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/beeston_brown_ihr_09.pdf

Some relevant recent work

Watkins, A.J., Makin, S.J. and Raimond, A.P. (2008). Room-acoustic factors in attentional tracking. Journal of the Acoustical Society of America, 123, pp. 2976-2977.
Raimond, A.P. and Watkins, A.J. (2008). Effect of reverberation on loudness perception. Acta Acustica united with Acustica, 94, S332.
Watkins, A.J. and Makin, S.J. (2007). Perceptual compensation for reverberation in speech identification: Effects of single-band, multiple-band and wideband contexts. Acta Acustica united with Acustica, 93, pp. 403-410.
Watkins, A.J. and Makin, S.J. (2007). Steady-spectrum contexts and perceptual compensation for reverberation in speech identification. Journal of the Acoustical Society of America, 121 (1) pp. 257-266.
Watkins, A.J. and Makin, S.J. (2007). Perceptual compensation for reverberation: Effects of 'noise-like' and 'tonal' contexts. In Hearing - from basic research to applications B. Kollmeier, G. Klump, V. Hohmann, U Langemann, M. Mauermann, S. Uppenkamp, and J. Verhey. Springer Verlag, New York, pp. 533-540.
Palomäki, K.J. and Brown, G.J. (2006). Recognition of reverberant speech using full cepstral features and spectral missing data. Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP-2006), Toulouse, 15th-19th May, I, pp. 289-292.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/palomaki-icassp06.pdf
Makin, S.J., Watkins, A.J. and Raimond, A.P. (2006). Perceptual compensation for reverberation: Effects of noise-context bandwidth. Journal of the Acoustical Society of America, 120, pp. 3358.
Watkins, A.J. (2005). Perceptual compensation for effects of reverberation in speech identification. Journal of the Acoustical Society of America, 118 (1) pp. 249-262.
Watkins, A.J. (2005). Perceptual compensation for effects of echo and of reverberation on speech identification. Acta Acustica united with Acustica, 91, pp. 892-901.
Palomäki, K.J., Brown, G.J., and wang D.L. (2004). A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation. Speech Communication, 43 (4) pp. 361-378.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/palomaki_brown_wang_spcomm_04.pdf
Palomäki, K.J., Brown, G.J., and Barker J.P. (2004). Techniques for handling convolutional distortion with 'missing data' automatic speech recognition. Speech Communication, 43 pp. 123-142.
Brown, G.J., Palomäki, K.J., and Barker J.P. (2004). A missing data approach to robust automatic speech recognition in the presence of reverberation. Proceedings of the 18th International Congress on Acoustics (ICA-2004), Kyoto, 4th-9th April, pp. 449-452.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/brown-palomaki-barker-ica04.pdf
Brown, G.J. and Palomäki, K.J. (2004). Techniques for robust speech recognition in noisy and reverberant conditions. In Speech Separation by Humans and Machines, edited by P. Divenyi. Norwell, MA: Kluwer Academic Publishers, pp. 213-220.
Palomäki, K.J., Brown, G.J., and Barker J.P. (2002). Missing data speech recognition in reverberant conditions. Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP-2002), Orlando, 13th-17th May, pp. 65-68.  <pdf> | www.personal.reading.ac.uk/~sxs04sjm/constancy/papers/palomaki_brown_barker_icassp_02.pdf