【24h】

Recognition of Domain-Specific Terms with D-bigram Model

机译:用D-bigram模型识别领域特定术语

获取原文
获取原文并翻译 | 示例

摘要

There have been many researches on the use of corpora in natural language processing (NLP). In this paper, first we describe d-bigram model, which is a statistical model used in NLP. And then we explain the experiments and their results on the influence of corpus size. We take the unknown term recognition as an application, and examine it with various corpus sizes. Empirical results showed that the corpus size changes the reducing ratio of unknown terms. And also, the result indicated that the difference between closed corpora and open corpora should be considered in term recognition.
机译:语料库在自然语言处理(NLP)中的使用已有许多研究。在本文中,我们首先描述d-bigram模型,这是用于NLP的统计模型。然后我们解释了实验及其结果对语料库大小的影响。我们将未知术语识别作为一种应用程序,并以各种语料库大小对其进行检查。实证结果表明,语料库大小改变了未知词的减少率。并且,结果表明,在术语识别中应考虑封闭语料库和开放语料库之间的差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号