Latent word context model for information retrieval

Bernard Brosseau-Villeneuve; Jian-Yun Nie; Noriko Kando

首页> 外文期刊>Information retrieval >Latent word context model for information retrieval

【24h】

Latent word context model for information retrieval

机译：信息检索的潜在词上下文模型

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The application of word sense disambiguation (WSD) techniques to information retrieval (IR) has yet to provide convincing retrieval results. Major obstacles to effective WSD in IR include coverage and granularity problems of word sense inventories, sparsity of document context, and limited information provided by short queries. In this paper, to alleviate these issues, we propose the construction of latent context models for terms using latent Dirichlet allocation. We propose building one latent context per word, using a well principled representation of local context based on word features. In particular, context words are weighted using a decaying function according to their distance to the target word, which is learnt from data in an unsupervised manner. The resulting latent features are used to discriminate word contexts, so as to constrict query's semantic scope. Consistent and substantial improvements, including on difficult queries, are observed on TREC test collections, and the techniques combines well with blind relevance feedback. Compared to traditional topic modeling, WSD and positional indexing techniques, the proposed retrieval model is more effective and scales well on large-scale collections.

机译：词义消歧（WSD）技术在信息检索（IR）中的应用尚未提供令人信服的检索结果。 IR中有效的WSD的主要障碍包括词义清单的覆盖范围和粒度问题，文档上下文的稀疏性以及简短查询提供的有限信息。在本文中，为了缓解这些问题，我们提出了使用潜在狄利克雷分配为术语建立潜在上下文模型的方法。我们建议使用一个基于单词特征的局部上下文的原则，来为每个单词构建一个潜在上下文。特别地，根据上下文单词到目标单词的距离，使用衰减函数对上下文单词进行加权，以无监督的方式从数据中获知。产生的潜在特征用于区分单词上下文，从而限制查询的语义范围。在TREC测试集合中观察到了一致且实质性的改进，包括对困难查询的改进，并且该技术与盲目的相关性反馈很好地结合在一起。与传统主题建模，WSD和位置索引技术相比，所提出的检索模型更加有效，并且可以在大规模馆藏中很好地扩展。

著录项

来源
《Information retrieval》 |2014年第1期|21-51|共31页
作者
Bernard Brosseau-Villeneuve; Jian-Yun Nie; Noriko Kando;
展开▼
作者单位

University of Montreal, CP. 6128 succ. Centre-ville, Montreal, QC H3C 3J7, Canada;

University of Montreal, CP. 6128 succ. Centre-ville, Montreal, QC H3C 3J7, Canada;

National Institute of Informatics, 2-1-2 Hitotsubashi, Chiyoda-ku, Tokyo 101-8430, Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Retrieval models; Word context discrimination (WCD); Word context; Topic models; Word sense disambiguation (WSD);

机译：检索模型;词语境歧视（WCD）;词语境;主题模型;词义消歧（WSD）;

相似文献

外文文献
中文文献
专利

1. Disambiguating context-dependent polarity of words: An information retrieval approach [J] . Olga Vechtomova Information Processing & Management . 2017,第5期

机译：消除上下文依赖的单词极性：一种信息检索方法
2. 3D object retrieval via range image queries in a bag-of-visual-words context [J] . Konstantinos Sfikas, Theoharis Theoharis, 3oannis Pratikakis The Visual Computer . 2013,第12期

机译：在视觉词袋环境中通过范围图像查询进行3D对象检索
3. Orientation to learning context modulates retrieval processing for unrecognized words [J] . GUO ChunYan, CHEN WenJun, TIAN Tian, 中国科学通报：英文版 . 2010,第026期

机译：面向学习上下文的方向调制了无法识别单词的检索处理
4. Latent topic modelling of word co-occurence information for spoken document retrieval [C] . Berlin Chen IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP 2009 . 2009

机译：语音文档检索中单词共现信息的潜在主题建模
5. A machine-aided approach to intelligent index generation: Using natural language processing and latent semantic analysis to determine the contexts and relationships among words in a corpus. [D] . Lukon, Shelly Candita. 2006

机译：一种机器辅助的智能索引生成方法：使用自然语言处理和潜在语义分析来确定语料库中单词之间的上下文和关系。
6. Grounding Statistical Learning in Context: The Effects of Learning and Retrieval Contexts on Cross-situational Word Learning [O] . Chi-hsin Chen, Chen Yu -1

机译：在语境中扎根统计学习：学习和检索语境对跨情境单词学习的影响
7. Retrieval Contexts and the Concreteness Effect: Dissociations in Memory of Concrete and Abstract Words [O] . ter Doest, L., Semin, G.R. 2005

机译：检索语境与具体作用：具体词与抽象词记忆的分离
8. Context as the Building Blocks of Meaning: A Retrieval Model for the Semantic Representation of Words. [R] . Kwantes, P. J. 2003

机译：作为意义构建块的语境：词语语义表征的检索模型。

Latent word context model for information retrieval

摘要

著录项

相似文献

相关主题

期刊订阅