首页> 外文会议>International conference on computational linguistics >Multi-way Tensor Factorization for Unsupervised Lexical Acquisition
【24h】

Multi-way Tensor Factorization for Unsupervised Lexical Acquisition

机译:无监督词汇收购的多通方式张量分解

获取原文

摘要

This paper introduces a novel method for joint unsupervised aquisition of verb subcategorization frame (SCF) and selectional preference (SP) information. Treating SCF and SP induction as a multi-way co-occurrence problem, we use multi-way tensor factorization to cluster frequent verbs from a large corpus according to their syntactic and semantic behaviour. The method extends previous tensor factorization approaches by predicting whether a syntactic argument is likely to occur with a verb lemma (SCF) as well as which lexical items are likely to occur in the argument slot (SP), and integrates a variety of lexical and syntactic features, including co-occurrence information on grammatical relations not explicitly represented in the SCFS. The SCF lexicon that emerges from the clusters achieves an F-score of 68.7 against a gold standard, while the SP model achieves an accuracy of 77.8 in a novel evaluation that considers all of a verb's arguments simultaneously.
机译:本文介绍了一种新颖的动词子类别帧(SCF)的无人监督级联和选择偏好(SP)信息的新方法。将SCF和SP归纳视为一种多种方式的共同发生问题,我们使用多向张解因子分解根据其句法和语义行为来聚类来自大型语料库的频繁动词。该方法通过预测动词引理(SCF)可能会发生句法参数,以及哪个词汇表(SP)中可能发生哪些词汇项目,并集成各种词汇和句法,通过预测句法参数来扩展先前的张量分解方法特征,包括关于在SCFS中未明确表示的语法关系的共同发生信息。从集群中出现的SCF词典达到了黄金标准的F分,而SP模型在新的评估中实现了77.8的准确性,以同时考虑所有动词的争论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号