首页> 外文会议>Signal Processing and Communication Application Conference >A named entity recognition dataset for Turkish
【24h】

A named entity recognition dataset for Turkish

机译:土耳其语的命名实体识别数据集

获取原文

摘要

Named entity recognition is one of the important topics in the research area of natural language processing. Named entity recognition studies conducted on Turkish texts are quite limited, compared to the studies on other languages. Besides, the lack of common data sets makes the comparison of different approaches harder. In this study, a dataset comprising news articles in Turkish annotated with named entities is presented. The annotations comprise the basic named entity types of person, location, and organization names. Additionally, to be used as reference in future studies, a rule-based named entity recognition system is evaluated on the final form of this data set and the corresponding evaluation results are presented. It is envisioned that our study will contribute to the advancement of named entity recognition studies on Turkish texts.
机译:命名实体识别是自然语言处理研究领域的重要主题之一。与其他语言的研究相比,在土耳其语文本上进行的命名实体识别研究非常有限。此外,缺乏通用数据集使比较不同方法变得更加困难。在这项研究中,提供了一个数据集,该数据集包含土耳其新闻报道的带有命名实体的新闻报道。注释包括人员,位置和组织名称的基本命名实体类型。另外,为了在将来的研究中用作参考,在此数据集的最终形式上评估了基于规则的命名实体识别系统,并给出了相应的评估结果。可以预见,我们的研究将有助于土耳其文本上的命名实体识别研究的发展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号