首页> 外文会议>International joint conference on artificial intelligence >Incorporating User Behaviors in New Word Detection
【24h】

Incorporating User Behaviors in New Word Detection

机译:在新单词检测中纳入用户行为

获取原文

摘要

In this paper, we proposed a novel method to detect new words in domain-specific fields based on user behaviors. First, we select the most representative words from domain-specific lexicon. Then combining with user behaviors, we try to discover the potential experts in this field who use those terminologies frequently. Finally, we make further efforts to identify new words from behaviors of those experts. Words used much more frequently in this community than others are most probably new words. In brief, our method follows a collaborative filtering way: first from words to find professional experts, then from experts to discover new words, which is different from the traditional new word detection methods. Our method achieves up to 0.86 in accuracy on a computer science related data set. Moreover, the proposed method can be easily extended to related words retrieval task. We compare our method with Google Sets and Bayesian Sets. Experiments show that our method and Bayesian Sets gives better results than Google Sets.
机译:在本文中,我们提出了一种基于用户行为的特定于域特定字段中的新词的新方法。首先,我们选择特定于域的Lexicon中最代表性的单词。然后与用户行为组合,我们试图发现经常使用这些术语的该领域中的潜在专家。最后,我们进一步努力识别这些专家的行为的新词。在这个社区中使用的单词比其他人更频繁地是最重要的。简而言之,我们的方法遵循协作过滤方式:首先从单词找专家专家,然后从专家发现新词,这与传统的新词检测方法不同。我们的方法在计算机科学相关数据集的准确性上达到了高达0.86。此外,所提出的方法可以很容易地扩展到相关词检索任务。我们将我们的方法与Google Set和Bayesian集进行比较。实验表明,我们的方法和贝叶斯集提供了比Google集的更好结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号