首页> 外文会议>Conference on empirical methods in natural language processing >A Single Word is not Enough: Ranking Multiword Expressions Using Distributional Semantics
【24h】

A Single Word is not Enough: Ranking Multiword Expressions Using Distributional Semantics

机译:一个单词不够用:使用分布语义对多单词表达进行排名

获取原文

摘要

We present a new unsupervised mechanism, which ranks word n-grams according to their multiwordness. It heavily relies on a new uniqueness measure that computes, based on a distributional thesaurus, how often an n-gram could be replaced in context by a single-worded term. In addition with a downweighting mechanism for incomplete terms this forms a new measure called DRUID. Results show large improvements on two small test sets over competitive baselines. We demonstrate the scalability of the method to large corpora, and the independence of the measure of shallow syntactic filtering.
机译:我们提出了一种新的无监督机制,该机制根据单词n-gram的多单词性对其进行排名。它在很大程度上依赖于一种新的唯一性度量,该度量基于分布词库来计算在上下文中用单个单词替换n-gram的频率。除了针对不完整术语的权重降低机制之外,这还形成了一种称为DRUID的新度量。结果表明,相对于竞争基准而言,两个小型测试集有很大的改进。我们证明了该方法对大型语料库的可扩展性,以及浅层语法过滤措施的独立性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号