首页> 外文会议>International Conference on Applications of Natural Language to Information Systems >Combining Vector Space Model and Multi Word Term Extraction for Semantic Query Expansion
【24h】

Combining Vector Space Model and Multi Word Term Extraction for Semantic Query Expansion

机译:组合矢量空间模型和多字词提取对语义查询扩展

获取原文

摘要

In this paper, we target document ranking in a highly technical field with the aim to approximate a ranking that is obtained through an existing ontology (knowledge structure). We test and combine symbolic and vector space models (VSM). Our symbolic approach relies on shallow NLP and on internal linguistic relations between Multi-Word Terms (MWTs). Documents are ranked based on different semantic relations they share with the query terms, either directly or indirectly after clustering the MWTs using the identified lexico-semantic relations. The VSM approach consisted in ranking documents with different functions ranging from the classical tf.idf to more elaborate similarity functions. Results shows that the ranking obtained by the symbolic approach performs better on most queries than the vector space model. However, the ranking obtained by combining both approaches outperforms by a wide margin the results obtained by methods from each approach.
机译:在本文中,我们针对一个高技术领域的文件排名,目的是近似通过现有本体(知识结构)获得的排名。我们测试并结合符号和矢量空间模型(VSM)。我们的象征方法依赖于浅NLP和多字词之间的内部语言关系(MWTS)。根据使用识别的词典语义关系在聚类MWTS之后直接或间接地与查询术语共享的不同语义关系,根据不同的语义关系排列。 VSM方法组成,具有不同函数的排名文档,范围从古典tf.idf到更详细的相似性功能。结果表明,符号方法获得的排名在大多数查询中比矢量空间模型更好地执行。然而,通过组合两种方法获得的排名通过宽的边距来优于来自每种方法的方法获得的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号