首页>
外文OA文献
>EM-based Phoneme Confusion Matrix Generation for Low-resource Spoken Term Detection
【2h】
EM-based Phoneme Confusion Matrix Generation for Low-resource Spoken Term Detection
展开▼
机译:基于EM的音素混淆矩阵生成,用于低资源口语词检测
展开▼
免费
页面导航
摘要
著录项
引文网络
相似文献
相关主题
摘要
The idea of using a data-driven phoneme confusion matrix (PCM) to enhance speech recognition and retrieval performance is not new to the speech community. Although empirical results show various degrees of improvements brought by introducing a PCM, the underlying data-driven processes introduced in most papers are rather ad-hoc and lack rigorous statistical justifications. In this paper we will focus on the statistical aspects of PCM generation, propose and justify a novel expectation-maximization based algorithm for data-driven PCM generation. We will evaluate the performance of the generated PCMs under the context of low-resource spoken term detection, with primary focus on out-of-vocabulary keywords.
展开▼