首页> 外文会议>International semantic web conference >Identifying Poorly-Defined Concepts in WordNet with Graph Metrics
【24h】

Identifying Poorly-Defined Concepts in WordNet with Graph Metrics

机译:使用图形指标识别WordNet中定义不正确的概念

获取原文

摘要

Princeton WordNet is the most widely-used lexical resource in natural language processing and continues to provide a gold standard model of semantics. However, there are still significant quality issues with the resource and these affect the performance of all NLP systems built on this resource. One major issue is that many nodes are insufficiently defined and new links need to be added to increase performance in NLP. We combine the use of graph-based metrics with measures of ambiguity in order to predict which synsets are difficult for word sense disambiguation, a major NLP task, which is dependent on good lexical information. We show that this method allows use to find poorly defined nodes with a 89.9% precision, which would assist manual annotators to focus on improving the most in-need parts of the WordNet graph.
机译:普林斯顿WordNet是自然语言处理中使用最广泛的词汇资源,并且继续提供语义的黄金标准模型。但是,资源仍然存在严重的质量问题,这些问题会影响基于此资源构建的所有NLP系统的性能。一个主要问题是许多节点定义不充分,需要添加新的链接以提高NLP的性能。我们将基于图的度量与歧义度的度量结合使用,以预测哪些同义词集很难消除词义歧义,这是一项主要的NLP任务,它依赖于良好的词汇信息。我们证明了该方法可以用来以89.9%的精度查找定义不明确的节点,这将有助于手动注释器集中精力改进WordNet图中最需要的部分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号