首页> 外文会议>International workshop on semantic evaluation;Annual meeting of the Association for Computational Linguistics >UINSUSKA-TiTech at SemEval-2017 Task 3: Exploiting Word Importance Levels for Similarity Features for CQA
【24h】

UINSUSKA-TiTech at SemEval-2017 Task 3: Exploiting Word Importance Levels for Similarity Features for CQA

机译:UINSUSKA-TiTech在SemEval-2017上的任务3:为CQA的相似性功能利用单词重要性级别

获取原文

摘要

The majority of core techniques to solve many problems in Community Question Answering (CQA) task rely on similarity computation. This work focuses on similarity between two sentences (or questions in subtask B) based on word embeddings. We exploit words importance levels in sentences or questions for similarity features, for classification and ranking with machine learning. Using only 2 types of similarity metric, our proposed method has shown comparable results with other complex systems. This method on subtask B 2017 dataset is ranked on position 7 out of 13 participants. Evaluation on 2016 dataset is on position 8 of 12, outperforms some complex systems. Further, this finding is explorable and potential to be used as baseline and extensible for many tasks in CQA and other textual similarity based system.
机译:解决社区问答(CQA)任务中许多问题的大多数核心技术都依赖于相似度计算。这项工作着重于基于单词嵌入的两个句子(或子任务B中的问题)之间的相似性。我们利用句子或问题中单词的重要性级别来寻找相似性特征,以便通过机器学习进行分类和排名。仅使用两种类型的相似性度量,我们提出的方法已显示出与其他复杂系统可比的结果。子任务B 2017数据集上的此方法在13位参与者中排名第7。对2016年数据集的评估位于12的第8位,优于某些复杂的系统。此外,该发现是可探索的,并且有可能被用作基线,并且可以扩展用于CQA和其他基于文本相似性的系统中的许多任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号