【24h】

Enhancing Sumerian Lemmatization by Unsupervised Named-Entity Recognition

机译:通过无监督的命名实体识别增强苏美尔人的合法化

获取原文

摘要

Lemmatization for the Sumerian language, compared to the modern languages, is much more challenging due to that it is a long dead language, highly skilled language experts are extremely scarce and more and more Sumerian texts are coming out. This paper describes how our unsupervised Sumerian named-entity recognition (NER) system helps to improve the lemmatization of the Cuneiform Digital Library Initiative (CDLI), a specialist database of cuneiform texts, from the Ur Ⅲ period. Experiments show that a promising improvement in personal name annotation in such texts and a substantial reduction in expert annotation effort can be achieved by leveraging our system with minimal seed annotation.
机译:与现代语言相比,苏美尔语的合法化更具挑战性,因为它是一种已死的语言,熟练的语言专家极为匮乏,越来越多的苏美尔语文本问世。本文介绍了我们的无监督苏美尔命名实体识别(NER)系统如何帮助改善UrⅢ时期的楔形文字数字图书馆计划(CDLI)(楔形文字的专业数据库)的词形化。实验表明,通过利用我们的系统使用最少的种子注释,可以在此类文本中对人名注释进行有希望的改进,并大大减少专家注释的工作量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号