首页> 外文会议>Culture and Computing 2011 >Term Extraction from Japanese Ancient Writings Using Probability of Character N-grams
【24h】

Term Extraction from Japanese Ancient Writings Using Probability of Character N-grams

机译:使用字符N-gram的概率从日本古代著作中提取术语

获取原文

摘要

Currently, there are no tools available to separate ancient Japanese sentence into words. Therefore, it is difficult to extract archaic Japanese terms from Japanese ancient writings. In this paper, we propose a method of term extraction for ancient Japanese documents. We calculate the likelihood of character n-grams to be a word, and extract character n-grams with higher likelihood as archaic Japanese terms. We conducted experiments of term separation using the term likelihood by the proposed method.
机译:当前,没有可用的工具将古日语句子分成单词。因此,很难从日本古代著作中提取古老的日语术语。在本文中,我们提出了一种用于古代日本文献的术语提取方法。我们计算字符n-gram为一个单词的可能性,并提取具有较高可能性的字符n-gram作为古日语术语。我们通过提出的方法使用术语似然进行了术语分离实验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号