...
首页> 外文期刊>Natural language engineering >Adaptive graph walk-based similarity measures for parsed text
【24h】

Adaptive graph walk-based similarity measures for parsed text

机译:基于自适应图行走的解析文本相似度度量

获取原文
获取原文并翻译 | 示例

摘要

We consider a dependency-parsed text corpus as an instance of a labeled directed graph, where nodes represent words and weighted directed edges represent the syntactic relations between them. We show that graph walks, combined with existing techniques of supervised learning that model local and global information about the graph walk process, can be used to derive a task-specific word similarity measure in this graph. We also propose and evaluate a new learning method in this framework, a path-constrained graph walk variant, in which the walk process is guided by high-level knowledge about meaningful edge sequences (paths) in the graph. Empirical evaluation on the tasks of named entity coordinate term extraction and general word synonym extraction show that this framework is preferable to, or competitive with, vector-based models when learning is applied, and using small to moderate size text corpora.
机译:我们将依赖项解析的文本语料库视为带标签的有向图的一个实例,其中节点代表单词,加权有向边代表它们之间的句法关系。我们表明,图行走与结合模型学习有关图行走过程的本地和全局信息的监督学习的现有技术相结合,可以用于在此图中导出特定于任务的单词相似性度量。我们还提出并评估了该框架中的一种新的学习方法,即路径受限的图行走变体,其中,行走过程由有关图中有意义边缘序列(路径)的高级知识指导。对命名实体坐标项提取和通用词同义词提取任务的经验评估表明,当应用学习并使用中小尺寸的文本语料库时,该框架比基于矢量的模型更可取或与之竞争。

著录项

  • 来源
    《Natural language engineering 》 |2014年第3期| 361-397| 共37页
  • 作者

    EINAT MINKOV; WILLIAM W. COHEN;

  • 作者单位

    Department of Information Systems, University of Haifa, Haifa, Israel;

    School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号