首页> 外文会议>International Conference on Analysis of Images, Social Networks, and Texts >Fighting with the Sparsity of Synonymy Dictionaries for Automatic Synset Induction
【24h】

Fighting with the Sparsity of Synonymy Dictionaries for Automatic Synset Induction

机译:与同义词词典的诽谤进行战斗自动扫描诱导

获取原文

摘要

Graph-based synset induction methods, such as MaxMax and WATSET, induce synsets by performing a global clustering of a synonymy graph. However, such methods are sensitive to the structure of the input synonymy graph: sparseness of the input dictionary can substantially reduce the quality of the extracted synsets. In this paper, we propose two different approaches designed to alleviate the incompleteness of the input dictionaries. The first one performs a pre-processing of the graph by adding missing edges, while the second one performs a post-processing by merging similar synset clusters. We evaluate these approaches on two datasets for the Russian language and discuss their impact on the performance of synset induction methods. Finally, we perform an extensive error analysis of each approach and discuss prominent alternative methods for coping with the problem of sparsity of the synonymy dictionaries.
机译:基于图形的SYNSET诱导方法,例如MAXMAX和Watet,通过执行同义图的全局群集来引导Synsets。然而,这种方法对输入同义词图的结构敏感:输入字典的稀疏可以大大降低提取的拟合的质量。在本文中,我们提出了两种不同的方法,旨在减轻输入词典的不完整性。第一个通过添加缺失的边缘来执行图的预处理,而第二个通过合并类似的SYNSET集群执行后处理。我们评估了两种数据集的两种方法,为俄语语言,并讨论了它们对Synpet诱导方法性能的影响。最后,我们对每个方法进行了广泛的误差分析,并讨论了应对同义词典的稀疏问题的突出替代方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号