首页> 外文会议>European Conference on Artificial Intelligence;Conference on Prestigious Applications of Intelligent Systems >Speeding up Word Mover's Distance and Its Variants via Properties of Distances Between Embeddings
【24h】

Speeding up Word Mover's Distance and Its Variants via Properties of Distances Between Embeddings

机译:通过嵌入之间的距离的特性加速单词移动器的距离及其变体

获取原文

摘要

The Word Mover's Distance (WMD) proposed by Kusner et al. is a distance between documents that takes advantage of semantic relations among words that are captured by their embeddings. This distance proved to be quite effective, obtaining state-of-art error rates for classification tasks, but is also impracticable for large collections/documents due to its computational complexity. For circumventing this problem, variants of WMD have been proposed. Among them, Relaxed Word Mover's Distance (RWMD) is one of the most successful due to its simplicity, effectiveness, and also because of its fast implementations. Relying on assumptions that are supported by empirical properties of the distances between embeddings, we propose an approach to speed up both WMD and RWMD. Experiments over 10 datasets suggest that our approach leads to a significant speed-up in document classification tasks while maintaining the same error rates.
机译:kusner等人提出的移动距离(WMD)。 是在嵌入式捕获的单词之间利用语义关系的文档之间的距离。 该距离被证明是非常有效的,获得用于分类任务的最先进的错误率,但由于其计算复杂性而对大型集合/文件也是不可判断的。 为了避免这个问题,已经提出了WMD的变体。 其中,由于其简单,有效性以及其快速实现,放松的单词移动器距离(RWMD)是最成功的距离之一。 依赖于嵌入区之间距离的经验性质支持的假设,我们提出了一种加快WMD和RWMD的方法。 超过10个数据集的实验表明,我们的方法在文档分类任务中导致了显着的加速,同时保持相同的错误率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号