首页> 外文会议>European conference on speech communication and technology >Extracting Caller Information from Voicemail
【24h】

Extracting Caller Information from Voicemail

机译:从语音邮件中提取呼叫者信息

获取原文

摘要

In this paper we address the problem of extracting the identities and phone numbers of the callers in voicemail messages. Previous work in information extraction from speech includes spoken document retrieval and named entity detection. This task differs from the named entity task in that the information we are interested in is a subset of the named entities in the message, and consequently, the need to pick the correct subset makes the problem more difficult. Also, the caller's identity may include information that is not typically associated with a named entity. In this work, we present two information extraction methods, one based on hand-crafted rules, and one based on a maximum entropy model. We find that both systems give good performance when applied to manually-derived transcriptions, and that the maximum entropy system can reliably identify the time intervals containing phone numbers, even in the presence of significant decoding errors.
机译:在本文中,我们解决了在语音邮件消息中提取呼叫者的身份和电话号码的问题。以前的信息提取来自语音的信息包括口语文档检索和命名实体检测。此任务与命名实体任务不同,因为我们感兴趣的信息是消息中的命名实体的子集,因此,需要选择正确的子集的需要使问题更加困难。此外,呼叫者的身份可以包括通常与命名实体相关联的信息。在这项工作中,我们介绍了两个信息提取方法,一个基于手工制作的规则,以及一个基于最大熵模型。我们发现,在应用于手动衍生的转录时,两个系统都会表现出良好的性能,并且即使在存在显着解码错误的情况下,最大熵系统也可以可靠地识别包含电话号码的时间间隔。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号