It is known that applying a time-frequency binary mask to very noisy speech can improve its intelligibility but results in poor perceptual quality. In this paper we propose a new approach to applying a binary mask that combines the intel- ligibility gains of conventional binary masking with the per- ceptual quality gains of a classical speech enhancer. The bi- nary mask is not applied directly as a time-frequency gain as in most previous studies. Instead, the mask is used to sup- ply prior information to a classical speech enhancer about the probability of speech presence in different time-frequency re- gions. Using an oracle ideal binary mask, we show that the proposed method results in a higher predicted quality than other methods of applying a binary mask whilst preserving the improvements in predicted intelligibility.
展开▼