首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Idiom Token Classification using Sentential Distributed Semantics
【24h】

Idiom Token Classification using Sentential Distributed Semantics

机译:使用句子分布式语义的成语标记分类

获取原文

摘要

Idiom token classification is the task of deciding for a set of potentially idiomatic phrases whether each occurrence of a phrase is a literal or idiomatic usage of the phrase. In this work we explore the use of Skip-Thought Vectors to create distributed representations that encode features that are predictive with respect to idiom token classification. We show that classifiers using these representations have competitive performance compared with the state of the art in idiom token classification. Importantly, however, our models use only the sentence containing the target phrase as input and are thus less dependent on a potentially inaccurate or incomplete model of discourse context. We further demonstrate the feasibility of using these representations to train a competitive general idiom token classifier.
机译:惯用语标记分类是确定一组潜在惯用语短语的任务,而不论短语的每次出现是该短语的字面意义还是惯用用法。在这项工作中,我们探索使用跳过思想向量来创建分布式表示形式,该表示形式编码对成语标记分类具有预测性的特征。我们证明,与成语标记分类中的最新技术相比,使用这些表示法的分类器具有竞争优势。但是重要的是,我们的模型仅使用包含目标短语的句子作为输入,因此较少依赖于话语上下文的潜在不准确或不完整模型。我们进一步证明了使用这些表示来训练竞争性的通用成语令牌分类器的可行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号