Resynthesis Demos

Demos of the HMM-based Analysis/Resynthesis approach to speech enhancement. For full details see,

  • Carmona, J.L., Barker, J., Gomez, A.,M. and Ma, N. (2013) Speech Spectral Envelope Enhancement by HMM-based Analysis/Resynthesis, IEEE Signal Processing Letters, 20(3):563-566

In the table below the abbreviations have the following meaning (see paper for details)

  • MI = mean imputation
  • CI = constrained imputation
  • G0 = no grammar or dictionary; G1 = using dictionary and sentence grammar
  • SD = speaker dependent models
Enhancement male female
None
Spectral Subtraction
MMSE
MI+G0
CI+G0
CI+G0+SD
CI+G1+SD
Clean

The technique is currently only enhancing the spectral envelope so there is a roughness in the quality of the enhancement that is arising from noise in the excitation. If the excitation of the original speech signal is used then this disappears as demonstrated below. A small amount of musical noise be caused by errors in the mask estimation. Note this is not present when using the oracle mask.

Noisy
Spectral Subtraction
G0
G1
Oracle excitation
Oracle mask
Oracle mask + excitation
Clean