首页> 外文会议>International conference on computational linguistics >Multi-way Tensor Factorization for Unsupervised Lexical Acquisition
【24h】

Multi-way Tensor Factorization for Unsupervised Lexical Acquisition

机译:无监督词汇获取的多向张量分解

获取原文

摘要

This paper introduces a novel method for joint unsupervised aquisition of verb subcategorization frame (SCF) and selectional preference (SP) information. Treating SCF and SP induction as a multi-way co-occurrence problem, we use multi-way tensor factorization to cluster frequent verbs from a large corpus according to their syntactic and semantic behaviour. The method extends previous tensor factorization approaches by predicting whether a syntactic argument is likely to occur with a verb lemma (SCF) as well as which lexical items are likely to occur in the argument slot (SP), and integrates a variety of lexical and syntactic features, including co-occurrence information on grammatical relations not explicitly represented in the SCFS. The SCF lexicon that emerges from the clusters achieves an F-score of 68.7 against a gold standard, while the SP model achieves an accuracy of 77.8 in a novel evaluation that considers all of a verb's arguments simultaneously.
机译:本文介绍了一种新的联合动词子分类框架(SCF)和选择偏好(SP)信息的无监督收购方法。将SCF和SP归纳视为一个多向共现问题,我们使用多向张量分解将来自大型语料库的频繁动词根据其句法和语义行为进行聚类。该方法通过预测句法自变量是否可能与动词引理(SCF)一起出现以及哪些词汇项可能在自变量槽(SP)中发生而扩展了先前的张量分解方法,并且将各种词汇和句法进行了集成功能,包括未在SCFS中明确表示的语法关系的共现信息。从聚类中出现的SCF词典相对于黄金标准达到68.7的F分数,而SP模型在同时考虑所有动词自变量的新颖评估中获得77.8的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号