Weakly Supervised Learning for Cross-document Person Name Disambiguation Supported by Information Extraction

机译：信息提取支持的跨文档人员歧义的弱化学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

It is fairly common that different people are associated with the same name. In tracking person entities in a large document pool, it is important to determine whether multiple mentions of the same name across documents refer to the same entity or not. Previous approach to this problem involves measuring context similarity only based on co-occurring words. This paper presents a new algorithm using information extraction support in addition to co-occurring words. A learning scheme with minimal supervision is developed within the Bayesian framework. Maximum entropy modeling is then used to represent the probability distribution of context similarities based on heterogeneous features. Statistical annealing is applied to derive the final entity coreference chains by globally fitting the pairwise context similarities. Benchmarking shows that our new approach significantly outperforms the existing algorithm by 25 percentage points in overall F-measure.

机译：它与同名相关的不同人员相当普遍。在跟踪大型文档池中的人员实体中，重要的是要确定跨文档跨文档的多个提交是否参考相同的实体。先前的此问题的方法涉及仅基于共同发生的单词测量上下文相似度。本文除了共同出现的单词之外，还提供了一种使用信息提取支持的新算法。在贝叶斯框架内开发了一个具有最小监督的学习方案。然后，使用最大熵建模来表示基于异构特征的上下文相似性的概率分布。应用统计退火以通过全局拟合这对成对上下文相似性来导出最终实体芯必芯链。基准表明，我们的新方法显着优于现有的算法在整体F测量中的25个百分点。

著录项

来源
《Association for Computational Linguistics Annual Meeting》|2004年||共8页
会议地点
作者
Cheng Niu; Wei Li; Rohini K. Srihari; Association for Computational Linguistics(ACL)(US);
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算机软件;
关键词

相似文献

外文文献
中文文献
专利

1. Weakly Supervised Person Re-ID: Differentiable Graphical Learning and a New Benchmark [J] . Wang Guangrun, Wang Guangcong, Zhang Xujie, Neural Networks and Learning Systems, IEEE Transactions on . 2021,第5期

机译：弱监督人员重新ID：可差异的图形学习和新的基准
2. Relation extraction with weakly supervised learning based on process-structure-property-performance reciprocity [J] . Takeshi Onishi, Takuya Kadohira, Ikumu Watanabe Science and technology of advanced materials . 2018,第1期

机译：基于过程 - 结构 - 财产性能互惠的弱监督学习的关系提取
3. Weakly supervised learning of biomedical information extraction from curated data [J] . Suvir Jain, Kashyap R., Tsung-Ting Kuo, BMC Bioinformatics . 2016,第1期

机译：从策划数据中弱监督学习生物医学信息提取
4. Weakly Supervised Learning for Cross-document Person Name Disambiguation Supported by Information Extraction [C] . Cheng Niu, Wei Li, Rohini K. Srihari, Association for Computational Linguistics Annual Meeting . 2004

机译：信息提取支持的跨文档人员歧义的弱化学习
5. Visual Learning with Weak Supervision: Applications in Video Summarization and Person Re-identification [D] . Panda, Rameswar. 2018

机译：具有弱监督的视觉学习：视频摘要和人员重新识别中的应用
6. Relation extraction with weakly supervised learning based on process-structure-property-performance reciprocity [O] . Takeshi Onishi, Takuya Kadohira, Ikumu Watanabe 2018

机译：基于过程-结构-性能-性能互惠的弱监督学习关系提取
7. Weakly Supervised Learning for Cross-document Person Name Disambiguation Supported by Information Extraction [O] . 2008

机译：信息提取支持的跨文档人名消歧的弱监督学习

Weakly Supervised Learning for Cross-document Person Name Disambiguation Supported by Information Extraction

摘要

著录项

相似文献

相关主题

期刊订阅