SPHEAR TMR Network

References

S. Ahmad and V. Tresp (1993), 'Some solutions to the missing feature problem in vision', in S.J. Hanson et al, eds, 'Advances in Neural Information Processing Systems 5', Morgan Kaufmann, 1993.
W.A. Ainsworth (1995), 'Perception of formant transitions in synthesised vowel pairs', Proc. ICPhS, 2, 666-669.
W.A. Ainsworth (1995b), 'Effects of preceding noise duration on the perception of voiced plosives and vowels', Proc. Eurospeech, 1, 741-744.
J. Allen (1994), 'How do humans process and recognize speech?', IEEE Trans. Speech and Audio Processing, vol. 2, no. 4, pp. 567-577, 1994.
F. Berthommier and G.F. Meyer (1995), 'Source Separation by a Functional Model of Amplitude Demodulation'. 4th European Conference on Speech Communication and Technology, EUROSPEECH'95, Madrid (Espagne), Vol. 1, 135-138.
J. Blauert (1996), 'Spatial Hearing - The Psychophysics of Human Sound Localization (revised edition)', The MIT-Press, Cambridge MA.
M. Bodden (1993), 'Modeling Human Sound Source Localization and the Cocktail-Party-Effect', Acta Acustica (1), 43-55.
M. Bodden and K. Rateitschek (1996), 'Noise-robust speech recognition based on a binaural auditory model', ESCA workshop on the "Auditory Basis of Speech Perception", Keele, UK, July 1996.
H. Bourlard, H. Hermansky and N. Morgan (1996), 'Towards Increasing Speech Recognition Error Rates', Speech Communication, vol. 18, no. 3, pp. 205-231, 1996.
H. Bourlard and S. Dupont (1996), 'A new ASR approach based on independent processing and re-combination of partial frequency bands', Proc. ICSLP-96, pp. 426-429, October 1996.
H. Bourlard and S. Dupont (1996b), 'Multi-stream speech recognition', IDIAP Research Report No. 96-07
G.J. Brown & M.P. Cooke (1994), 'Computational auditory scene analysis', Computer Speech and Language 8, 297-336.
G.J. Brown, M.P. Cooke and E. Mousset (1996), 'Are neural oscillations the substrate of auditory grouping', ESCA workshop on the "Auditory Basis of Speech Perception", Keele, UK, July 1996, pp 174-179.
A.S. Bregman (1990), 'Auditory Scene analysis', MIT Press.
R. Campbell, B. Dodd & D. Burnham (eds.) (1997), 'Hearing by eye, II. Perspectives and directions in research on audiovisual aspects of language processing', Erlbaum / Psychology Press. In press.
R.K. Clifton (1987), 'Breakdown of echo suppression in the precedence effect', J. Acoust. Soc. Am. 82, 1834-1835.
M.P. Cooke (1993), 'Modelling auditory Processing and Organisation', Cambridge University Press.
M.P. Cooke & G.J. Brown (1993), 'Computational auditory scene analysis: exploiting principles of perceived continuity', Speech Communication, 13, 391-399.
M.P. Cooke, A.C. Morris and P.D. Green (1996), 'Recognising Occluded Speech', Proc. ESCA Workshop on the 'Auditory Basis of Speech Perception', Keele, July 1996, pp 297-300.
F. Crick (1984), 'Function of the thalamic reticular complex: the searchlight hypothesis', Proc. Natl. Acad. Sci. USA 81, 4586-4590.
H. Fletcher (1953), 'Speech and Hearing in Communication', New-York: Krieger, 1953.
M.J.F. Gales and S.J. Young (1993), 'Cepstral Parameter Compensation for HMM Recognition in Noise', Speech Communication, Vol.12, No.3, 1993.
Y. Gong (1995), 'Speech Recognition in Noisy Environments: A Survey', Speech communication 16, pp 261-291.
P.D. Green, M.P. Cooke and M.D. Crawford (1995), 'Auditory Scene Analysis and HMM-Recognition of Speech in Noise', Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-95), Detroit, pp. 401-404.
S. Greenberg (1995), 'The ears have it: The auditory basis of speech perception', Proc. ICPhS, 3, 34-41.
K. Hartung and S. Sterbing (1996), 'Generation of virtual sound sources for electrophysiological characterization of auditory spatial tuning in the guinea pig', Acoustical Signal Processing in the Central Auditory System, Plenum Press, New York, London.
H. Hermansky, S. Tibrewala and M. Pavel (1996), 'Towards ASR on Partially Corrupted Speech', Proc ICSLP 96, Philadelphia.
H.G. Hirsch (1990), 'Robust Speech Recognition in Noisy and Reverberant Environment', Proc NATO ASI 1990.
H.G. Hirsch, P. Meyer and H.W. Ruehl (1991), 'Improved Speech Recognition using High-Pass Filtering of Subband Envelopes', Proc. Eurospeech 1991, Genoa.
H.G. Hirsch and C. Ehrlicher (1995), 'Noise Estimation Techniques for Robust Speech Recognition', Proc. ICASSP 95, Detroit.
H.G. Hirsch (1997), 'Automatic Recognition of Noisy Speech in the Telephone Channel', to be presented at and published in the proceedings of the Aachen Symposium on Signal Theory, March 1997.
M. Hochberg, S. Renals, A. Robinson and G. Cook (1995), 'Recent improvements to the ABBOT large vocabulary CSR system', Proc. ICASSP-95, pp. 69-72.
M.T. Lallouache (1990), 'Un poste "visage-parole". Acquisition et traitement de contours labiaux', Proceedings of the XVIII Journees d'Etudes sur la Parole, Montreal, pp. 282-286.
R.P. Lippmann (1996), 'Speech Perception by Humans and Machines', Proc. ESCA Workshop on the 'Auditory Basis of Speech Perception', Keele, July 1996, pp 309-316.
P. Lockwood and J. Boudy (1992), 'Experiments with a Nonlinear Spectral Subtractor, Hidden Markov Models and the Projection for Robust Speech Recognition in Cars', Speech Communication, Vol. 11, No.2-3, 1992.
G.F. Meyer & F. Berthommier (1996), 'Vowel segregation with amplitude modulation maps: a re-evaluation of place and place-time models', Proc. ESCA Workshop on the 'Auditory Basis of Speech Perception', Keele, July 1996, pp 212-215.
A.C. Morris, J.-L. Schwartz and P. Escudier (1993), 'An information-theoretical investigation into the distribution of phonetic information across the auditory spectrogram', Computer Speech and Language (2), pp. 121-136.
N. Morgan and H. Bourlard (1995), 'Neural Networks for Statistical Recognition of Continuous Speech', Proceedings of the IEEE, Invited Paper, vol.83, no.5, pp.741-770, May 1995.
M. Paraskevas and J. Mourjopoulos (1995), 'A Differential Perceptual Audio Coding Method with Reduced Bitrate Requirements', IEEE Trans. Speech and Audio Proc., Nov. 1995.
R.D. Patterson, T.R. Anderson and K.Francis (1996), 'Binaural Auditory Images and a Noise-Resistant, Binaural Auditory Spectrogram for Speech Recognition', Proc. ESCA Workshop on the 'Auditory Basis of Speech Perception', Keele, July 1996, pp 245-252.
J. Robert-Ribes, M. Piquemal, J.L. Schwartz and P. Escudier (1996), 'Exploiting sensor fusion architectures and stimuli complementarity in AV speech recognition', in D.G. Stork & M.E. Hennecke (Eds.), 'Speechreading by Man and Machine: Models, Systems and Applications' (pp. 193-210). NATO ASI Series, Springer.
D. Tsoukalas, J. Mourjopoulos and G. Kokkinakis (1997), 'Perceptual Filters for Audio Signal Enhancement', Journal of the Audio Engineering Society, to appear Jan/Feb. 1997.
D. Van Compernolle (1989), 'Noise Adaptation in a Hidden Markov Model Speech Recognition System', Computer Speech and Language, Vol.3, 1989.
A. Varga and R. Moore (1990), 'Hidden Markov Model Decomposition of Speech and Noise', Proc IEEE ICASSP pp 845-848, 1990.
Wolf (1991), 'Lokalisation von Schallquellen in geschlossenen Räumen [Localisation of sound sources in enclosed spaces]', Dissertation, Univ. Bochum.
WS96 Workshop Page, WWW site http://www.cslp.jhu.edu/ws96/ws96_workshop.html, July 1996.
S.J. Young (1996), 'A review of large-vocabulary continuous speech recognition', IEEE Signal Processing Magazine. 13, 5, pp. 45-57.