首页> 外文会议>Conference of the International Federation of Classification Societies;IFCS-96 >How to find the nearest by evaluating only few? Clustering techniques used to improve the efficiency of an Information Retrieval sytem based on Distributional Semantics
【24h】

How to find the nearest by evaluating only few? Clustering techniques used to improve the efficiency of an Information Retrieval sytem based on Distributional Semantics

机译:如何通过评估少量评估最接近? 聚类技术用于提高基于分布语义的信息检索系统的效率

获取原文
获取外文期刊封面目录资料

摘要

The first objective of this contribution is to give a description of our textual information retrieval system based on distributional semantics. The central idea of the approach is to represent the retrievable units and the user queries in a unified way as projections in a vectory space of pertinent terms. The projections are derived from a co-occurrence matrix computed on large reference (textual) corpora collecting the distributional semantic information. A similarity computation based on the cosine measure is then used to characterize the semantic proximity between queries and documents.
机译:本贡献的第一个目标是根据分布语义来说描述我们的文本信息检索系统。 该方法的核心思想是以统一的方式表示可检索单元和用户查询作为相关术语的检测空间中的预测。 投影来自于在大参考(文本)语料库上计算的共发生矩阵,收集分配语义信息。 然后,基于余弦度量的相似性计算来表征查询和文档之间的语义接近度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号