首页> 外文会议>Human language technology >DOCUMENT REPRESENTATION IN NATURAL LANGUAGE TEXT RETRIEVAL

【24h】

DOCUMENT REPRESENTATION IN NATURAL LANGUAGE TEXT RETRIEVAL

机译：自然语言文本检索中的文档表示

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In information retrieval, the content of a document may be represented as a collection of terms: words, stems, phrases, or other units derived or inferred from the text of the document. These terms are usually weighted to indicate their importance within the document which can then be viewed as a vector in a N-dimensional space. In this paper we demonstrate that a proper term weighting is at least as important as their selection, and that different types of terms (e.g., words, phrases, names), and terms derived by different means (e.g., statistical, linguistic) must be treated differently for a maximum benefit in retrieval. We report some observations made during and after the second Text REtrieval Conference (TREC-2).

机译：在信息检索中，文档的内容可以表示为术语的集合：单词，词干，短语或其他从文档文本中得出或推断出的单位。通常对这些术语进行加权，以表明它们在文档中的重要性，然后可以将其视为N维空间中的向量。在本文中，我们证明了适当的术语权重至少与其选择一样重要，并且必须使用不同类型的术语（例如，单词，短语，名称）以及通过不同方式衍生的术语（例如，统计，语言学）区别对待以最大程度地提高检索效率。我们报告第二次文本检索会议（TREC-2）期间和之后的一些观察。

著录项

来源
《Human language technology》|1994年|364-369|共6页
会议地点 Plainsboro NJ(US)
作者
Tomek Strzalkowski;
展开▼
作者单位

Courant Institute of Mathematical Sciences New York University 715 Broadway, rm. 704 New York, NY 10003;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算机软件;
关键词

相似文献

外文文献
中文文献
专利

1. Enhanced Information Retrieval from Narrative German-language Clinical Text Documents using Automated Document Classification [J] . Stephan SPAT, Bruno CADONNA, Ivo RAKOVAC, Studies in Health Technology and Informatics . 2008,第期

机译：使用自动文档分类从叙事德语临床文本文档中增强信息检索
2. The Mechanism Analysis of Natural Language Texts in Order to Construct A Model of the Full-text Document [J] . A.S. Lebedev Science and Technology . 2013,第2A期

机译：自然语言文本的机理分析以构建全文本模型
3. An Event Graph Based Document Representation for Information Retrieval and Summarazing the Text Based on Events [J] . P. Janarthanan, V. Ramachandran Asian Journal of Information Technology . 2016,第18期

机译：基于事件图的文档表示，用于信息检索和基于事件的文本摘要
4. DOCUMENT REPRESENTATION IN NATURAL LANGUAGE TEXT RETRIEVAL [C] . Human language technology workshop . 1994

机译：自然语言文本检索中的文档表示
5. Computer-assisted transformation of design documents from a natural language description to structured modeling languages. [D] . Chen, Lei. 2008

机译：计算机辅助设计文档从自然语言描述到结构化建模语言的转换。
6. Terminology spectrum analysis of natural-language chemical documents: term-like phrases retrieval routine [O] . Boris L. Alperin, Andrey O. Kuzmin, Ludmila Yu. Ilina, 2016

机译：天然语言化学文献的术语谱分析：类词短语检索例程
7. Document Representation in Natural Language Text Retrieval [O] . Tomek Strzalkowski 1994

机译：自然语言文本检索中的文档表示
8. Natural Language Text Retrieval Using a Large Semantic Network [R] . Nelson, P. 1993

机译：利用大型语义网络进行自然语言文本检索

DOCUMENT REPRESENTATION IN NATURAL LANGUAGE TEXT RETRIEVAL

摘要

著录项

相似文献

相关主题

期刊订阅