首页> 外文会议>International joint conference on artificial intelligence;IJCAI-09 >Incorporating User Behaviors in New Word Detection
【24h】

Incorporating User Behaviors in New Word Detection

机译:在新单词检测中纳入用户行为

获取原文

摘要

In this paper, we proposed a novel method to detect new words in domain-specific fields based on user behaviors. First, we select the most representative words from domain-specific lexicon. Then combining with user behaviors, we try to discover the potential experts in this field who use those terminologies frequently. Finally, we make further efforts to identify new words from behaviors of those experts. Words used much more frequently in this community than others are most probably new words. In brief, our method follows a collaborative filtering way: first from words to find professional experts, then from experts to discover new words, which is different from the traditional new word detection methods. Our method achieves up to 0.86 in accuracy on a computer science related data set. Moreover, the proposed method can be easily extended to related words retrieval task. We compare our method with Google Sets and Bayesian Sets. Experiments show that our method and Bayesian Sets gives better results than Google Sets.
机译:在本文中,我们提出了一种基于用户行为检测领域特定字段中的新单词的新方法。首先,我们从特定领域的词典中选择最具代表性的单词。然后结合用户行为,我们尝试发现该领域中经常使用这些术语的潜在专家。最后,我们将进一步努力从这些专家的行为中找出新单词。在这个社区中使用的单词比其他单词使用得更频繁,这很可能是新单词。简而言之,我们的方法遵循协作过滤的方式:首先从单词中寻找专业的专家,然后从专家中发现新的单词,这与传统的新单词检测方法不同。我们的方法在与计算机科学相关的数据集上的精度最高达到0.86。此外,所提出的方法可以容易地扩展到相关单词检索任务。我们将我们的方法与Google Sets和Bayesian Sets进行了比较。实验表明,我们的方法和贝叶斯集比Google集能提供更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号