TALN at SemEval-2016 Task 11: Modelling Complex Words by Contextual, Lexical and Semantic Features


Abstract

This paper presents the participation of the TALN team in the Complex Word Identification task of SemEval-2016 (Task 11). The purpose of the task was to determine whether a word in a given sentence would be judged complex by a certain target audience. To support experiments with complex word identification approaches, the task organizers provided a training set of 2,237 words, each judged as complex or not by 20 human evaluators, together with the sentence in which each word occurs. In our contribution, we modelled each word to be evaluated as a numeric vector populated with a set of lexical, semantic and contextual features that may help assess its complexity. We trained a Random Forest classifier to automatically decide whether each word is complex. We submitted two runs, trained respectively on unweighted and weighted instances of complex words, where the weight of each instance is proportional to the number of evaluators that judged the word as complex. Our system ranked as the third best performing one.
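
The instance weighting described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The `Instance` type, the rule that a word is labelled complex when at least one evaluator says so, the weight of 1.0 for non-complex words, and the three toy features are all assumptions made for illustration; the abstract does not list the actual features.

```python
from dataclasses import dataclass

NUM_EVALUATORS = 20  # from the abstract: each word was judged by 20 evaluators


@dataclass
class Instance:
    """Hypothetical container for one training example."""
    word: str
    sentence: str
    complex_votes: int  # number of evaluators who judged the word complex


def label(inst: Instance) -> int:
    # Assumption: a word is labelled complex (1) if at least one evaluator said so.
    return 1 if inst.complex_votes > 0 else 0


def weight(inst: Instance, weighted: bool) -> float:
    # Unweighted run: every instance counts equally.
    # Weighted run: a complex instance's weight is proportional to its votes.
    # Assumption: non-complex instances keep a weight of 1.0.
    if not weighted or inst.complex_votes == 0:
        return 1.0
    return float(inst.complex_votes)


def features(inst: Instance) -> list:
    # Toy stand-ins for lexical and contextual features (hypothetical):
    # word length, word position in the sentence, sentence length in tokens.
    tokens = inst.sentence.split()
    position = tokens.index(inst.word) if inst.word in tokens else -1
    return [float(len(inst.word)), float(position), float(len(tokens))]


if __name__ == "__main__":
    example = Instance("ubiquitous", "Phones are ubiquitous today", complex_votes=5)
    print(features(example), label(example), weight(example, weighted=True))
```

Vectors, labels and weights built this way could then be passed to any learner that accepts per-instance weights; scikit-learn's `RandomForestClassifier.fit`, for example, takes a `sample_weight` argument. Which toolkit the authors used is not stated in the abstract.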

Bibliographic details

Source
International Workshop on Semantic Evaluation; Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies | 2016 | pp. 1011-1016 | 6 pages
Authors
Francesco Ronzano; Ahmed Aburaed; Luis Espinosa-Anke; Horacio Saggion
Original format: PDF