首页> 外文会议>International Workshop on Evaluation of Natural Language and Speech Tool for Italian >Super-Sense Tagging Using Support Vector Machines and Distributional Features
【24h】

Super-Sense Tagging Using Support Vector Machines and Distributional Features

机译:超声标记使用支持向量机和分布特征

获取原文

摘要

This paper describes our participation in the EVALITA 2011 Super-Sense Tagging (SST) task. SST is the task of annotating each word in a text with a super-sense that defines a general concept such as animal, person or food. Due to the smaller set of concepts involved the task is simpler than Word Sense Disambiguation one which identifies a specific meaning for each word. In this task, we exploit a supervised learning method based on Support Vector Machines. However, supervised approaches are subject to the data-sparseness problem. This side effect is more evident when lexical features are involved, because test data can contain words with low frequency (or absent) in training data. To overcome the sparseness problem, in our supervised strategy, we incorporate information coming from a distributional space of words built on a large corpus, Wikipedia. The results obtained in the task show the effectiveness of our approach.
机译:本文介绍了我们参与评估2011年度超级感知标记(SST)任务。 SST是在文本中注释每个单词的任务,具有超声的超义,定义动物,人或食物等一般概念。由于涉及较小的概念概念,任务比单词义歧义歧义更简单,它标识每个单词的特定含义。在这项任务中,我们利用了基于支持向量机的监督学习方法。但是,监督方法受数据稀疏问题。当涉及词汇特征时,这种副作用更为明显,因为测试数据可以包含训练数据中具有低频(或不存在)的单词。为了克服稀疏问题,在我们的监督战略中,我们纳入了来自在大型语料库,维基百科的词的分布空间。任务中获得的结果表明了我们方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号