首页> 外文会议>7th International Conference on Application of Information and Communication Technologies >Named entity recognition on real data: A preliminary investigation for Turkish
【24h】

Named entity recognition on real data: A preliminary investigation for Turkish

机译:指定实体对真实数据的识别:土耳其的初步调查

获取原文
获取原文并翻译 | 示例

摘要

Named Entity Recognition (NER) is a well-studied area in natural language processing (NLP) and the reported results in the literature are generally very high (~>%95) for most of the languages. Today, the focus area of most practical natural language applications (i.e. web mining, sentiment analysis, machine translation) is real natural language data such as Web2.0 or speech data. Nevertheless, the NER task is rarely investigated on this type of data which differs severely from formal written text. In this paper, we present 3 new Turkish data sets from different domains (on this focused area; namely from Twitter, a Speech-to-Text Interface and a Hardware Forum) annotated specifically for NER and report our first results on them. We believe, the paper draws light to the difficulty of these new domains for NER and the possible future work.
机译:命名实体识别(NER)是自然语言处理(NLP)领域中经过深入研究的领域,对于大多数语言,文献中报告的结果通常很高(〜> 95%)。如今,大多数实际自然语言应用程序(例如,网络挖掘,情感分析,机器翻译)的重点领域是诸如Web2.0或语音数据之类的真实自然语言数据。然而,很少对这种类型的数据进行NER任务调查,这与正式书面文本有很大不同。在本文中,我们提出了3个来自不同领域的土耳其新数据集(在这个重点领域;即来自Twitter,语音到文本界面和硬件论坛),这些数据集专门针对NER进行了注释,并报告了他们的第一批结果。我们相信,本文为NER这些新领域的困难以及未来可能的工作提供了启示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号