IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Automatic error region detection and characterization in LVCSR transcriptions of TV news shows


Abstract

This paper addresses the detection and characterization of error regions in large-vocabulary continuous speech recognition (LVCSR) transcriptions. It is well known that errors in automatic transcriptions are not independent and tend to co-occur. We are interested in automatically detecting these so-called error regions. Additionally, in the context of information extraction from TV broadcast news (TVBN) shows, being able to automatically characterize detected error regions is a crucial step towards defining suitable recovery strategies. In this paper we propose to classify error regions into four classes, with a particular focus on errors on person names. We propose several sequential detection + classification approaches and an integrated sequence labeling approach. We show that our best classification system reaches 70% classification accuracy on automatically detected error regions. Additionally, the overall system detects and correctly characterizes 29.6% of the error regions corresponding to a person name, with a precision of 61.9%.
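
To make the notion of an error region and the two reported figures concrete, here is a minimal Python sketch. It rests on assumptions the abstract does not confirm: an error region is taken to be a maximal run of consecutive misrecognized words (the paper's own definition may differ), the 29.6% figure is read as recall, and the function names `error_regions` and `precision_recall` are hypothetical, not the authors' code.

```python
from typing import List, Tuple


def error_regions(is_error: List[bool]) -> List[Tuple[int, int]]:
    """Group per-word error flags into regions.

    Assumption (not from the paper): a region is a maximal run of
    consecutive erroneous words. Returns (start, end) pairs, end exclusive.
    """
    regions: List[Tuple[int, int]] = []
    start = None
    for i, flag in enumerate(is_error):
        if flag and start is None:
            start = i                      # a new region opens here
        elif not flag and start is not None:
            regions.append((start, i))     # the region closed at the previous word
            start = None
    if start is not None:                  # region running to the end of the transcript
        regions.append((start, len(is_error)))
    return regions


def precision_recall(n_correct: int, n_predicted: int, n_reference: int) -> Tuple[float, float]:
    """Precision = correct / predicted; recall = correct / reference.

    Here "correct" would count regions that are both detected and assigned
    the right class (for example, person name).
    """
    precision = n_correct / n_predicted if n_predicted else 0.0
    recall = n_correct / n_reference if n_reference else 0.0
    return precision, recall
```

Under this reading, the abstract's last sentence says that, for the person-name class, recall is 29.6% and precision is 61.9%: the system finds and correctly labels fewer than a third of such regions, but most of the regions it labels as person names are right.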
