
Speech data retrieval system constructed on a universal phonetic code domain


Abstract

We propose a novel speech processing framework in which all speech data are encoded into universal phonetic code (UPC) sequences, and speech processing systems such as speech recognition, retrieval, and digesting are constructed on this UPC domain. As the first step, we introduce a sub-phonetic segment (SPS) set, based on the IPA (International Phonetic Alphabet), to deal with multilingual speech, and we develop a procedure to estimate acoustic models of the SPS from IPA-like phone models. The key point of the framework is to incorporate environment adaptation into the SPS encoding stage. This makes it possible to normalize acoustic variations and to extract the language factor contained in speech signals as encoded SPS sequences. We confirm these characteristics by constructing a speech retrieval system on the SPS domain. The system can retrieve key phrases, given by speech, from speech data recorded in different environments, under a vocabulary-free condition. We show several preliminary experimental results on this system, using Japanese and English sentence speech sets.
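The abstract does not state how a spoken key phrase is matched against the encoded data. As a rough illustration only, the sketch below assumes that both the query and the stored speech have already been encoded as symbol sequences, and it locates the query with approximate substring matching by edit distance. This is a swapped-in, generic technique, not the authors' method. The function name `spot_phrase`, the `max_cost` threshold, and the symbols in the usage example are hypothetical placeholders and are not the paper's SPS inventory.

```python
from typing import List, Sequence, Tuple


def spot_phrase(
    query: Sequence[str], stream: Sequence[str], max_cost: int
) -> List[Tuple[int, int]]:
    """Return (end_index, cost) for each stream position where the query
    matches some substring ending there with edit distance <= max_cost.

    Assumption: unit costs for substitution, insertion and deletion.
    """
    m = len(query)
    # prev[i] = cheapest alignment of query[:i] against a substring of the
    # stream ending just before the current symbol (initially: empty stream).
    prev = list(range(m + 1))
    hits: List[Tuple[int, int]] = []
    for j, sym in enumerate(stream):
        # curr[0] stays 0 so that a match may start at any stream position.
        curr = [0] * (m + 1)
        for i in range(1, m + 1):
            substitute = prev[i - 1] + (query[i - 1] != sym)
            extra_in_stream = prev[i] + 1       # stream has an extra symbol
            missing_in_stream = curr[i - 1] + 1  # query symbol not found
            curr[i] = min(substitute, extra_in_stream, missing_in_stream)
        if curr[m] <= max_cost:
            hits.append((j, curr[m]))
        prev = curr
    return hits


if __name__ == "__main__":
    # Hypothetical symbol sequences, for illustration only.
    query = ["k", "o", "n", "i"]
    stream = ["a", "k", "o", "m", "i", "t", "a"]
    print(spot_phrase(query, stream, max_cost=1))
```

Because the match operates on symbol sequences rather than on word entries, a search of this kind needs no lexicon, which is one way to read the "vocabulary-free" condition; the tolerance `max_cost` stands in for whatever mechanism absorbs residual encoding differences between environments.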
