Journal: High Technology Letters

Chinese multi-document personal name disambiguation


Abstract

This paper presents a new approach to determining whether occurrences of a personal name of interest in different documents refer to the same entity. First, three vectors are formed for each text: a personal-name Boolean vector denoting whether a personal name occurs in the text, a biographical-word Boolean vector representing title, occupation, and so forth, and a real-valued feature vector. Then, by combining a heuristic strategy based on the Boolean vectors with an agglomerative clustering algorithm based on the feature vectors, the approach seeks to resolve multi-document personal name coreference. Experiments on the 'Wang Gang' corpus show that the approach performs well.
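
The two-stage idea in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the actual heuristic rule, similarity measure, linkage criterion, or stopping condition, so everything specific here is an assumption made for illustration: the heuristic merges two documents up front when they share at least one co-occurring personal name and at least one biographical word, the feature vectors are compared with cosine similarity, clusters are merged by average linkage, and merging stops below a hypothetical `threshold`. The function name `disambiguate` and the toy data are likewise hypothetical.

```python
from itertools import combinations
from math import sqrt


def cosine(u, v):
    """Cosine similarity of two real-valued feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def overlap(u, v):
    """Number of positions where both Boolean vectors are set."""
    return sum(1 for a, b in zip(u, v) if a and b)


def disambiguate(docs, threshold=0.5):
    """Cluster documents mentioning the same name into entities.

    docs: list of (name_vec, bio_vec, feat_vec) triples, one per document.
    Returns a sorted list of clusters, each a sorted list of document indices.
    """
    n = len(docs)

    # Stage 1 (assumed heuristic): link documents that share both a
    # co-occurring personal name and a biographical word.
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in combinations(range(n), 2):
        if overlap(docs[i][0], docs[j][0]) and overlap(docs[i][1], docs[j][1]):
            parent[find(i)] = find(j)

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    clusters = list(groups.values())

    # Stage 2: average-link agglomerative clustering on feature vectors,
    # stopping when the best pair falls below the threshold.
    def average_link(c1, c2):
        sims = [cosine(docs[i][2], docs[j][2]) for i in c1 for j in c2]
        return sum(sims) / len(sims)

    while len(clusters) > 1:
        best_sim, best_pair = -1.0, None
        for a, b in combinations(range(len(clusters)), 2):
            sim = average_link(clusters[a], clusters[b])
            if sim > best_sim:
                best_sim, best_pair = sim, (a, b)
        if best_sim < threshold:
            break
        a, b = best_pair  # a < b, so deleting b leaves index a valid
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]

    return sorted(sorted(c) for c in clusters)


# Toy data: (name Boolean vector, biographical Boolean vector, feature vector)
docs = [
    ([1, 0, 0], [1, 0], [1.0, 0.0, 0.0]),
    ([1, 0, 0], [1, 0], [0.0, 1.0, 0.0]),  # joined to doc 0 by the heuristic
    ([0, 1, 0], [0, 1], [0.0, 0.0, 1.0]),
    ([0, 0, 1], [0, 0], [0.0, 0.1, 0.9]),  # joined to doc 2 by clustering
]
print(disambiguate(docs))  # [[0, 1], [2, 3]]
```

In this toy run, documents 0 and 1 have orthogonal feature vectors and would not be merged by clustering alone; the Boolean-vector heuristic links them. Documents 2 and 3 share no Boolean evidence and are merged only because their feature vectors are similar.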
