UINSUSKA-TiTech at SemEval-2017 Task 3: Exploiting Word Importance Levels for Similarity Features for CQA


Abstract

Most core techniques for solving problems in the Community Question Answering (CQA) task rely on similarity computation. This work focuses on the similarity between two sentences (or two questions, in subtask B) based on word embeddings. We exploit the importance levels of words in sentences or questions to build similarity features for classification and ranking with machine learning. Using only two types of similarity metric, the proposed method achieves results comparable to those of other, more complex systems. On the subtask B 2017 dataset, the method ranks 7th out of 13 participants. On the 2016 dataset, it ranks 8th out of 12, outperforming some complex systems. The approach can be explored further, and it has the potential to serve as a baseline and to be extended to many CQA tasks and other systems based on textual similarity.
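The abstract does not say how word importance levels are assigned or which two similarity metrics are used. As a rough illustration only, and not the authors' method, the sketch below shows one common way to combine word embeddings with per-word importance: each sentence vector is an importance-weighted average of its word vectors, and two sentences are compared by cosine similarity. The toy embeddings, the `IMPORTANCE` table, and all function names are hypothetical.

```python
import math

# Toy 3-dimensional embeddings (assumption); a real system would load
# pretrained word vectors instead.
EMBEDDINGS = {
    "install": [0.9, 0.1, 0.0],
    "setup": [0.8, 0.2, 0.1],
    "python": [0.1, 0.9, 0.2],
    "visa": [0.0, 0.1, 0.9],
    "the": [0.3, 0.3, 0.3],
    "how": [0.3, 0.3, 0.3],
    "to": [0.3, 0.3, 0.3],
}

# Hypothetical importance levels: function words get a low weight,
# every other known word gets DEFAULT_IMPORTANCE.
IMPORTANCE = {"the": 0.1, "how": 0.1, "to": 0.1}
DEFAULT_IMPORTANCE = 1.0


def weighted_sentence_vector(tokens):
    """Importance-weighted average of the embeddings of known tokens."""
    dim = len(next(iter(EMBEDDINGS.values())))
    total = [0.0] * dim
    weight_sum = 0.0
    for token in tokens:
        vector = EMBEDDINGS.get(token)
        if vector is None:
            continue  # skip out-of-vocabulary words
        weight = IMPORTANCE.get(token, DEFAULT_IMPORTANCE)
        for i, value in enumerate(vector):
            total[i] += weight * value
        weight_sum += weight
    if weight_sum == 0.0:
        return total  # all-zero vector when no token is known
    return [value / weight_sum for value in total]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def similarity(question_a, question_b):
    """Similarity feature for a pair of questions (whitespace tokenization)."""
    vec_a = weighted_sentence_vector(question_a.lower().split())
    vec_b = weighted_sentence_vector(question_b.lower().split())
    return cosine(vec_a, vec_b)


related = similarity("How to install Python", "How to setup Python")
unrelated = similarity("How to install Python", "How to get the visa")
print(related > unrelated)
```

In a pipeline like the one the abstract outlines, a score of this kind would be one feature passed to a classifier or ranker rather than the final output.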


Bibliographic details

Source: International Workshop on Semantic Evaluation; Annual Meeting of the Association for Computational Linguistics | 2017 | pp. 370-374 | 5 pages
Authors: Surya Agustian; Hiroya Takamura
Original format: PDF

Similar documents

1. Opposing effects of phonological similarity on item and order memory of words and nonwords in the serial recall task [J]. Arild Lian, Paul Johan Karlsen, Thor Birger Eriksen. Memory. 2004, No. 3
2. A novel approach for modeling non-keyword intervals in a keyword spotter exploiting acoustic similarities of languages [J]. Heracleous P, Shimizu T. Speech Communication. 2005, No. 4
3. Words into action II: A task-oriented system: Harpy is an experimental, continuous-speech recognition system that exploits a low-cost minicomputer [J]. Reddy Raj. Spectrum, IEEE. 1980, No. 6
4. UINSUSKA-TiTech at SemEval-2017 Task 3: Exploiting Word Importance Levels for Similarity Features for CQA [C]. Surya Agustian, Hiroya Takamura. International workshop on semantic evaluation. 2017
5. Improved GloVe Word Embedding Using Linear Weighting Scheme for Word Similarity Tasks [D]. Lu, Qinglan. 2021
6. Representational similarity analysis reveals task-dependent semantic influence of the visual word form area [O]. Xiaosha Wang, Yangwen Xu, Yuwei Wang
7. LIPN-IIMAS at SemEval-2017 Task 1: Subword Embeddings, Attention Recurrent Neural Networks and Cross Word Alignment for Semantic Textual Similarity [O]. Ignacio Arroyo-Fernández, Ivan Vladimir Meza Ruiz. 2017
