Conference: Chinese Intelligent Automation Conference

A Chinese Expert Name Disambiguation Approach Based on Spectral Clustering with the Expert Page-Associated Relationships


Abstract

To address the problems of name repetition and representational diversity among Chinese experts, a Chinese expert name disambiguation approach based on spectral clustering with expert page-associated relationships is proposed. First, the TF-IDF algorithm is used to calculate word-based feature weights, and cosine similarity is then computed between evidence pages to obtain the initial similarity matrix of expert evidence pages. Second, the expert page-associated relationship features are used as semi-supervised constraint information to correct the initial similarity matrix, and a spectral clustering-based method is then used to build the expert disambiguation model. Finally, contrast experiments on a manually labeled corpus of Chinese expert evidence pages show that semi-supervised spectral clustering with the expert page-associated relationships achieves an F-value that is on average 9.02% higher than the same method without the associated constraint information.
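
The first two steps of the abstract (TF-IDF weighting, cosine similarity, and correction of the similarity matrix with constraints) can be sketched roughly as follows. This is an illustrative sketch and not the authors' implementation. The abstract does not say how the correction is performed, so the rule used here (similarity forced to 1 for linked pages and to 0 for pages known to differ) is an assumption, as are the smoothed IDF formula, the function names, and the toy tokenized pages.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one {term: weight} dict per tokenized page.

    Uses a smoothed IDF, log((1 + n) / (1 + df)) + 1, which is an
    assumption; the abstract does not specify the TF-IDF variant.
    """
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * (math.log((1 + n) / (1 + df[term])) + 1)
            for term, count in tf.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * v[t] for t, w in u.items() if t in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def similarity_matrix(docs):
    """Initial page-to-page similarity matrix from TF-IDF and cosine."""
    vecs = tfidf_vectors(docs)
    n = len(vecs)
    return [[1.0 if i == j else cosine(vecs[i], vecs[j]) for j in range(n)]
            for i in range(n)]


def apply_constraints(sim, must_link=(), cannot_link=()):
    """Return a corrected copy of sim (hypothetical correction rule).

    must_link pairs are set to 1.0 and cannot_link pairs to 0.0,
    keeping the matrix symmetric. The input matrix is not modified.
    """
    corrected = [row[:] for row in sim]
    for i, j in must_link:
        corrected[i][j] = corrected[j][i] = 1.0
    for i, j in cannot_link:
        corrected[i][j] = corrected[j][i] = 0.0
    return corrected


# Toy, already-tokenized evidence pages for one ambiguous name (made up).
pages = [
    ["wang", "wei", "kunming", "nlp"],
    ["wang", "wei", "nlp", "clustering"],
    ["wang", "wei", "beijing", "chemistry"],
]
sim = similarity_matrix(pages)
# Assume an associated-page relationship links pages 0 and 1,
# and other evidence separates pages 0 and 2.
corrected = apply_constraints(sim, must_link=[(0, 1)], cannot_link=[(0, 2)])
```

The corrected matrix would then serve as the affinity input to a spectral clustering step. With scikit-learn, for instance, a precomputed affinity matrix can be passed to `SpectralClustering(affinity="precomputed")`, although the abstract does not say which implementation the authors used.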
