首页> 外文会议>Evaluation of natural language and speech tools for Italian >The Tanl Tagger for Named Entity Recognition on Transcribed Broadcast News at Evalita 2011
【24h】

The Tanl Tagger for Named Entity Recognition on Transcribed Broadcast News at Evalita 2011

机译:在Evalita 2011上,用于转录广播新闻的命名实体识别的Tanl Tagger

获取原文
获取原文并翻译 | 示例

摘要

The Tanl tagger is a flexible sequence labeller based on Conditional Markov Model that can be configured to use different classifiers and to extract features according to feature templates expressed through patterns provided in a configuration file. The Tanl Tagger was applied to the task of Named Entity Recognition (NER) on Transcribed Broadcast News of Evalita 2011. The goal of the task was to identify named entities within texts produced by an Automatic Speech Recognition (ASR) system. Since such texts do not provide capitalization, punctuation or even sentence segmentation and transcription is often noisy, this represents a challenge for state of the art NER tools. We report on the results of our experiments using the Tanl Tagger as well as another widely available tagger in both the closed and open modalities.
机译:Tanl标记器是基于条件马尔可夫模型的灵活序列标记器,可以配置为使用不同的分类器,并根据通过配置文件中提供的模式表示的特征模板提取特征。 Tanl Tagger用于Evalita 2011转录广播新闻上的命名实体识别(NER)任务。该任务的目的是在自动语音识别(ASR)系统产生的文本中识别命名实体。由于此类文本不提供大写字母,标点符号或什至句子分段,并且转录通常比较吵杂,因此这代表了最新的NER工具的挑战。我们报告了使用Tanl Tagger以及另一种广泛使用的封闭式和开放式标签机的实验结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号