首页> 外文会议>LREC-2012 >A proposal for improving WordNet Domains
【24h】

A proposal for improving WordNet Domains

机译:改进Wordnet域的提案

获取原文

摘要

WordNet Domains (WND) is a lexical resource where synsets have been semi-automatically annotated with one or more domain labels from a set of 165 hierarchically organized domains. The uses of WND include the power to reduce the polysemy degree of the words, grouping those senses that belong to the same domain. But the semi-automatic method used to develop this resource was far from being perfect. By cross-checking the content of the Multilingual Central Repository (MCR) it is possible to find some errors and inconsistencies. Many are very subtle. Others, however, leave no doubt. Moreover, it is very difficult to quantify the number of errors in the original version of WND. This paper presents a novel semi-automatic method to propagate domain information through the MCR. We also compare both labellings (the original and the new one) allowing us to detect anomalies in the original WND labels.
机译:WordNet域(WND)是一种词汇资源,其中Synsets已半自动注释,其中一个或多个域标签来自一组165个分层组织的域。 WND的用途包括降低单词的多义度的功率,将属于同一域的这些感官分组。但是,用于开发这种资源的半自动方法远非完美。通过交叉检查多语言中央存储库(MCR)的内容,可以找到一些错误和不一致。许多人非常微妙。然而,其他人毫无疑问地留下了。此外,很难量化WND的原始版本中的错误数量。本文介绍了一种新的半自动方法,可以通过MCR传播域信息。我们还比较标签(原始和新的),允许我们检测原始WND标签中的异常。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号