首页> 外国专利> Automatic generation of statistical laguage models for interactive voice response applacation

Automatic generation of statistical laguage models for interactive voice response applacation

机译:自动生成用于交互式语音响应应用程序的统计语言模型

摘要

A Statistical Language Model (SLM) that can be used in an ASR for Interactive Voice Response (IVR) systems in general and Natural Language Speech Applications (NLSAs) in particular can be created by first manually producing a brief description in text for each task that can be performed in an NLSA. These brief descriptions are then analyzed, in one embodiment, to generate spontaneous speech utterances based pre-filler patterns and a skeletal set of content words. The pre-filler patterns are in turn used with Part-of-Speech (POS) tagged conversations from a spontaneous speech corpus to generate a set of pre-filler phrases. The skeletal set of content words is used with an electronic lexico-semantic database and with a thesaurus-based content word extraction process to generate a more extensive list of content words. The pre-filler phrases and content words set, thus generated, are combined into utterances using a lexico-semantic resource based process. In one embodiment, a lexico-semantic statistical validation process is used to correct and/or add the automatically generated utterances to the database of expected utterances. The system requires a minimum amount of human intervention and no prior knowledge regarding the expected user utterances, and the WWW is used to validate the word models. The system requires a minimum amount of human intervention and no prior knowledge regarding the expected user utterances in response to a particular prompt.
机译:可以通过首先手动为每个任务在文本中生成简短的描述来创建可以在通用语言(尤其是自然语言语音应用程序(NLSA))的交互式语音响应(IVR)系统的ASR中使用的统计语言模型(SLM)。可以在NLSA中执行。然后,在一个实施例中,对这些简要描述进行分析,以基于预填充模式和内容词的骨架集来生成自发语音。预填充器模式又与自发语音语料库的词性(POS)标记的会话一起使用,以生成一组预填充器短语。内容词的骨架集与电子词汇语义数据库以及基于词库的内容词提取过程一起使用,以生成内容词的更广泛列表。如此生成的预填充短语和内容单词集使用基于词汇语义资源的过程组合成语音。在一个实施例中,使用词汇语义统计验证过程来将自动生成的话语校正和/或添加到期望话语数据库中。该系统需要最少的人工干预,并且不需要有关预期用户话语的先验知识,并且WWW用于验证单词模型。该系统需要最少的人工干预,并且不需要响应于特定提示的有关预期用户话语的先验知识。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号