Clustering Technique in Multi-Document Personal Name Disambiguation

机译：多文档人名消歧中的聚类技术

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Focusing on multi-document personal name disambiguation, this paper develops an agglo-merative clustering approach to resolving this problem. We start from an analysis of point-wise mutual information between feature and the ambiguous name, which brings about a novel weight computing method for feature in clustering. Then a trade-off measure between within-cluster compactness and among-cluster separation is proposed for stopping clustering. After that, we apply a labeling method to find representative feature for each cluster. Finally, experiments are conducted on word-based clustering in Chinese dataset and the result shows a good effect.

机译：针对多文档人名歧义消除，本文提出了一种聚类聚类方法来解决此问题。我们从特征和模棱两可的名称之间的点向互信息的分析开始，这为聚类中的特征带来了一种新颖的权重计算方法。然后，提出了一种在集群内部的紧密度和集群之间的分离之间权衡的措施，以阻止聚类。之后，我们应用标记方法为每个聚类找到代表性特征。最后，对中文数据集中基于词的聚类进行了实验，结果显示了良好的效果。

著录项

来源
《Joint conference of the annual meeting of the Association for Computational Linguistics;International joint conference on natural language processing of the Asian Federation of Natural Languages Processing;ACL 2009;IJCNLP 2009》|2009年|P.A88-A95|共8页
会议地点
作者
Chen Chen; Hu Junfeng; Wang Houfeng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序语言、算法语言;
关键词

相似文献

外文文献
中文文献
专利

1. Chinese multi-document personal name disambiguation [J] . Wang Houfeng, Mei Zheng High Technology Letters . 2005,第3期

机译：中文多文档个人名称消除歧义
2. CLUSTERING TECHNIQUES AND DISCRETE PARTICLE SWARM OPTIMIZATION ALGORITHM FOR MULTI-DOCUMENT SUMMARIZATION [J] . Ramiz M. Aliguliyev Computational Intelligence . 2010,第4期

机译：多文档摘要的聚类技术和离散粒子群优化算法
3. Chinese Personal Name Disambiguation Based on Clustering [J] . Chao Fan, Yu Li Wireless communications & mobile computing . 2021,第a期

机译：基于聚类的中国个人名称消歧
4. Clustering Technique in Multi-Document Personal Name Disambiguation [C] . Joint conference of the annual meeting of the Association for Computational Linguistics . 2009

机译：多文件个人名称歧义中的聚类技术
5. Multi-document Summarization Based on Document Clustering and Neural Sentence Fusion [D] . Fuad, Tanvir Ahmed. 2018

机译：基于文档聚类和神经句子融合的多文件摘要
6. Fast max-margin clustering for unsupervised word sense disambiguation in biomedical texts [O] . Weisi Duan, Min Song, Alexander Yates 2009

机译：快速最大边距聚类用于生物医学文本中无监督的词义消歧
7. Clustering Technique in Multi-Document Personal Name Disambiguation [O] . Chen Chen, Hu Junfeng, Wang Houfeng 2010

机译：多文档人名消歧中的聚类技术

Clustering Technique in Multi-Document Personal Name Disambiguation

摘要

著录项

相似文献

相关主题

期刊订阅