首页> 外文会议>International Workshop on Evaluation of Natural Language and Speech Tool for Italian >A Simple Yet Effective Approach for Named Entity Recognition from Transcribed Broadcast News
【24h】

A Simple Yet Effective Approach for Named Entity Recognition from Transcribed Broadcast News

机译:一种简单但有效的方法,用于指定转录广播新闻的实体识别

获取原文

摘要

Automatic speech transcriptions pose serious challenges for NLP systems due to various peculiarities in the data. In this paper, we propose a simple approach for NER on speech transcriptions which achieves good results despite the peculiarities. The novelty of our approach is that it emphasizes on the maximum exploitation of the tokens, as they are, in the data. We developed a system for participating in the "NER on Transcribed Broadcast News" (closed) task of the EVALITA 2011 evaluation campaign where it was one of the best systems obtaining an F1-score of 57.02 on the automatic speech transcription test data. On the manual transcriptions of the same test data (although having no sentence boundary and punctuation symbol), the system achieves an F1-score of 73.54 which is quite high considering the fact that the system is language independent and uses no external dictionaries, gazetteers or ontologies.
机译:由于数据中的各种特点,自动语音转录对NLP系统构成了严重挑战。在本文中,尽管存在特殊性,我们提出了一种关于语音转录的简单方法,尽管存在良好的结果。我们的方法的新颖之处在于它强调了对令牌的最大利用,因为它们在数据中。我们开发了一个用于参与“单位转录的广播新闻”(已关闭)任务的系统(已关闭)任务,其中它是在自动语音转录测试数据上获得57.02的F1分数的最佳系统之一。在相同测试数据的手动转录(尽管没有句子边界和标点符号),该系统实现了73.54的F1分数,这非常高,考虑到系统是语言独立的事实,并且不使用外部词典,缩进者或本体。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号