首页> 外文会议>Advances in artificial intelligence >A Supervised Method of Feature Weighting for Measuring Semantic Relatedness
【24h】

A Supervised Method of Feature Weighting for Measuring Semantic Relatedness

机译:一种衡量语义相关性的加权特征加权方法

获取原文
获取原文并翻译 | 示例

摘要

The clustering of related words is crucial for a variety of Natural Language Processing applications. Many known techniques of word clustering use the context of a word to determine its meaning. Words which frequently appear in similar contexts are assumed to have similar meanings. Word clustering usually applies the weighting of contexts, based on some measure of their importance. One of the most popular measures is Pointwise Mutual Information. It increases the weight of contexts where a word appears regularly but other words do not, and decreases the weight of contexts where many words may appear. Essentially, it is unsupervised feature weighting. We present a method of supervised feature weighting. It identifies contexts shared by pairs of words known to be semantically related or unrelated, and then uses Pointwise Mutual Information to weight these contexts on how well they indicate closely related words. We use Roget's Thesaurus as a source of training and evaluation data. This work is as a step towards adding new terms to Roget's Thesaurus automatically, and doing so with high confidence.
机译:相关单词的聚类对于各种自然语言处理应用至关重要。单词聚类的许多已知技术使用单词的上下文来确定其含义。假定在相似上下文中频繁出现的单词具有相似的含义。词聚类通常基于对上下文重要性的某种度量来应用上下文的权重。最受欢迎的措施之一是逐点相互信息。它增加了一个单词经常出现但其他单词没有出现的上下文的权重,并降低了可能出现多个单词的上下文的权重。本质上,它是无监督的特征加权。我们提出了一种监督特征加权的方法。它识别由已知在语义上相关或不相关的单词对共享的上下文,然后使用“点对互惠信息”对这些上下文加权,以指示它们紧密相关的单词的程度。我们使用Roget词库作为培训和评估数据的来源。这项工作是向Roget同义词库自动添加新术语的一步,并且具有很高的信心。

著录项

  • 来源
    《Advances in artificial intelligence》|2011年|p.222-233|共12页
  • 会议地点 St. Johns(CA);St. Johns(CA);St. Johns(CA);St. Johns(CA);St. Johns(CA);St. Johns(CA)
  • 作者单位

    SITE, University of Ottawa, Ottawa, Ontario, Canada;

    SITE, University of Ottawa, Ottawa, Ontario, Canada ,Institute of Computer Science Polish Academy of Sciences, Warsaw, Poland;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 人工智能理论;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号