SPHEAR TMR Network
References
- S. Ahmad and V. Tresp (1993),
'Some solutions to the missing feature problem in vision', in S.J. Hanson
et al, eds, 'Advances in Neural Information Processing Systems 5', Morgan
Kaufmann, 1993.
- W.A.Ainsworth (1995), 'Perception
of formant transitions in synthesised vowel pairs', Proc. ICPhS, 2, 666-669.
- W.A.Ainsworth (1995b), 'Effects
of preceding noise duration on the perception of voiced plosives and vowels',
Proc. Eurospeech, 1, 741-744. Jont Allen (1994), `How do humans process
and recognize speech?', IEEE Trans. Speech and Audio Processing, vol. 2,
no. 4, pp. 567-577, 1994.
- F. Berthommier and
G.F. Meyer (1995), 'Source Separation by a Functional Model of Amplitude
Demodulation'. 4th European Conference on Speech Communication and Technology,
EUROSPEECH'95, Madrid (Espagne), Vol. 1, 135-138.
- J. Blauert (1996), 'Spatial Hearing
- The Psychophysics of Human Sound Localization (revised edition)', The
MIT-Press, Cambridge MA.
- M. Bodden (1993), 'Modeling Human Sound
Source Localization and the Cocktail-Party-Effect', Acta Acustica (1),
43-55.
- M. Bodden and K. Rateitschek(1996),
'Noise-robust speech recognition based on a binaural auditory model', ESCA
workshop on the "Auditory Basis of Speech Perception", Keele,
UK, July 1996.
- H. Bourlard,
H. Hermansky and N. Morgan (1996), 'Towards Increasing Speech Recognition
Error Rates', Speech Communication, vol. 18, no. 3, pp. 205-231, 1996.
- H. Bourlard, and S.
Dupont (1996), `A new ASR approach based on independent processing and
re-combination of partial frequency bands', Proc. ICSLP-96, pp. 426-429,
October 1996.
- H. Bourlard and S.
Dupont (1996b), 'Multi-stream speech recognition', IDIAP Research Report
No. 96-07
- G.J. Brown & M.P.
Cooke (1994) `Computational auditory scene analysis'. Computer Speech and
Language 8, 297-336.
- G.J. Brown,
M.P. Cooke and E. Mousset (1996), `Are neural oscillations the substrate
of auditory grouping', ESCA workshop on the "Auditory Basis of Speech
Perception", Keele, UK, July 1996, pp 174-179.
- A.S. Bregman (1990), 'Auditory Scene
analysis', MIT Press.
- R. Campbell,
B. Dodd & D. Burnham (eds.), (1997) `Hearing by eye, II. Perspectives
and directions in research on audiovisual aspects of language processing',
Erlbaum / Psychology Press. In press.
- R.K. Clifton (1987), 'Breakdown of
echo suppression in the precedence effect', J. Acoust. Soc. Am. 82, 1834-1835.
- M.P. Cooke (1993), 'Modelling auditory
Processing and Organisation', Cambridge University Press.
- M.P. Cooke & G.J.
Brown (1993) `Computational auditory scene analysis: exploiting principles
of perceived continuity'. Speech Communication, 13, 391-399
- M.P. Cooke,
A.C. Morris and P.D. Green (1996), 'Recognising Occluded Speech', Proc.
ESCA Workshop on the 'Auditory Basis of Speech Perception', Keele, July
1996, pp 297-300.
- F. Crick (1984) `Function of the thalamic
reticular complex: the searchlight hypothesis'. Proc. Natl. Acad. Sci.
USA 81, 4586-4590.
- H. Fletcher (1953), 'Speech and Hearing
in Communication', New-York: Krieger, 1953.
- M.J.F. Gales and S.J.
Young (1993), 'Cepstral Parameter Compensation for HMM Recognition in Noise',
Speech Communication, Vol.12, No.3, 1993.
- Y. Gong (1995), 'Speech Recognition in
Noisy Environments: A Survey', Speech communication 16, pp 261-291.
- PD Green, MP
Cooke and MD Crawford (1995), 'Auditory Scene Analysis and HMM-Recognition
of Speech in Noise', proc IEEE international conference on Acoustics, Speech
and Signal Processing (ICASSP-95), Detroit, p401-404.
- S.Greenberg (1995), 'The ears have
it: The auditory basis of speech perception', Proc. ICPhS, 3, 34-41. K.
Hartung and S. Sterbing (1996), 'Generation of virtual sound sources for
electrophysiological characterization of auditory spatial tuning in the
guinea pig', Acoustical Signal Processing in the Central Auditory System,
Plenum Press, New York, London.
- H. Hermansky,
S. Tibrewala and M. Pavel (1996), 'Towards ASR on Partially Corrupted Speech',
Proc ICSLP 96, Philadelphia.
- H.G. Hirsch (1990), 'Robust Speech
Recognition in Noisy and Reverberant Environment', Proc NATO ASI 1990.
- H.G. Hirsch,
P. Meyer and H.W. Ruehl (1991), 'Improved Speech Recognition using High-Pass
Filtering of Subband Envelopes', Proc. Eurospeech 1991, Genoa.
- H.G. Hirsch and C.
Ehrlicher (1995),'Noise Estimation Techniques for Robust Speech Recognition',
Proc. ICASSP 95, Detroit.
- H.G. Hirsch (1997), 'Automatic Recognition
of Noisy Speech in the Telephone Channel', to be presented and published
on the Aachener symposium on signal theory in March 97.
- M.
Hochberg, S. Renals, A. Robinson and G. Cook (1995), `Recent improvements
to the ABBOT large vocabulary CSR system'', proc ICASSP-95, pp69--72.
- M.T. Lallouache (1990), `Un poste
'visage-parole'. Acquisition et traitement de contours labiaux', Proceedings
of the XVIII Journees d'Etudes sur la Parole, Montreal, pp. 282-286
- R.P. Lippmann (1996), 'Speech Perception
by Humans and Machines', Proc. ESCA Workshop on the 'Auditory Basis of
Speech Perception', Keele, July 1996, pp 309-316.
- P. Lockwood and J. Boudy
(1992), 'Experiments with a Nonlinear Spectral Subtractor, Hidden Markov
Models and the Projection for Robust Speech Recognition in Cars', Speech
Communication, Vol. 11, No.2-3, 1992.
- G.F Meyer & F.Berthommier
(1996), 'Vowel segregation with amplitude modulation maps: a re-evaluation
of place and place-time models', Proc. ESCA Workshop on the 'Auditory Basis
of Speech Perception, Keele, July 1996, 212-215.
- A.C
Morris, J.-L. Schwartz and P. Escudier (1993) `An information-theoretical
investigation into the distribution of phonetic information across the
auditory spectrogram', Computer Speech and Language (2), Pages.121-136.
- N. Morgan and H. Bourlard
(1995), 'Neural Networks for Statistical Recognition of Continuous Speech',
Proceedings of the IEEE, Invited Paper, vol.83, no.5, pp.741-770, May 1995.
- M.Paraskevas and
J.Mourjopoulos (1995), 'A Differential Perceptual Audio Coding Method with
Reduced Bitrate Requirements', IEEE Trans. Speech and Audio Proc., Nov.
1995.
- R.D.
Patterson, T.R. Anderson and K.Francis (1996), 'Binaural Auditory Images
and a Noise-Resistant, Binaural Auditory Spectrogram for Speech Recognition',
Proc. ESCA Workshop on the 'Auditory Basis of Speech Perception', Keele,
July 1996, pp 245-252.
- J.
Robert-Ribes, M. Piquemal, J.L. Schwartz and P. Escudier (1996), 'Exploiting
sensor fusion architectures and stimuli complementarity in AV speech recognition',
in D.G. Stork & M.E. Hennecke (Eds.), 'Speechreading by Man and Machine:
Models, Systems and Applications' (pp. 193-210). NATO ASI Series, Springer.
- D.Tsoukalas,
J.Mourjopoulos and G.Kokkinakis (1997), 'Perceptual Filters for Audio Signal
Enhancement', Journal of the Audio Engineering Society, to appear Jan/Feb.
1997.
- D. Van Campernolle (1989),
'Noise Adaptation in a Hidden Markov Model Speech Recognition System',
Computer Speech and Language, Vol.3, 1989.
- A. Varga and R. Moore (1990),
'Hidden Markov Model Decomposition of Speech and Noise', Proc IEEE ICASSP
pp 845-848, 1990. S.J. Young (1996), 'A Review of Large Vocabulary Continuous-speech
Recognition', IEEE Signal Processing Magazine, 13, 5, pp 45, 57.
- Wolf (1991), 'Lokalisation von Schallquellen
in geschlossenen Riumen [Localisation of sound sources in enclosed spaces]',
Dissertation, Univ. Bochum.
- WS96 Workshop Page, WWW site http://www.cslp.jhu.edu/ws96/ws96_workshop.html,
July 1996.
- S.J. Young (1996), 'A review of large-vocabulary
continuous speech recognition', IEEE Signal Processing Magazine. 13, 5,
pp. 45-57.