首页> 外文学位 >A corpus-based approach for cross-lingual information retrieval.
【24h】

A corpus-based approach for cross-lingual information retrieval.

机译:基于语料库的跨语言信息检索方法。

获取原文
获取原文并翻译 | 示例

摘要

With the increasing amount of online information, information retrieval systems must manage large volumes of data and information, often written in different languages and stored in different locations. This provides a challenge for crosslingual semantic interoperability since much of this information may be seemingly unconnected. How to generate an overview of this disparate data and information so that it can be analyzed, searched, summarized and visualized is problematic.; Currently, many approaches to retrieve textual materials from databases depend on a lexical match between words in users' requests and those in or assigned to documents in a database. Because of the tremendous diversity in the words people use to describe the same document, simple lexical matching methods are incomplete and imprecise. How to allow users to retrieve information on the basis of a conceptual topic or meaning of a document is an important issue. The creation of conceptual relationships would allow the system to deduce users' information needs correctly and retrieve relevant documents, even though the documents contain different terms from the queries. In addition, the queries may be written in languages other than English. The translation ambiguity significantly exacerbates the retrieval problem.
机译:随着在线信息量的增加,信息检索系统必须管理大量的数据和信息,通常以不同的语言编写并存储在不同的位置。这为跨语言语义互操作性提出了挑战,因为许多此类信息看似无关紧要。如何生成这些不同的数据和信息的概述,以便可以对其进行分析,搜索,汇总和可视化是一个问题。当前,从数据库检索文本资料的许多方法取决于用户请求中的单词与数据库文档中的单词或分配给文档中的单词之间的词汇匹配。由于人们用来描述同一文档的单词种类繁多,因此简单的词汇匹配方法是不完整且不精确的。如何允许用户根据概念性主题或文档的含义来检索信息是一个重要的问题。概念关系的创建将允许系统正确推断用户的信息需求并检索相关文档,即使这些文档包含与查询不同的术语也是如此。此外,查询可以用英语以外的其他语言编写。翻译的歧义性大大加剧了检索问题。

著录项

  • 作者

    Li, Kar Wing.;

  • 作者单位

    The Chinese University of Hong Kong (People's Republic of China).;

  • 授予单位 The Chinese University of Hong Kong (People's Republic of China).;
  • 学科 Computer Science.; Engineering System Science.
  • 学位 Ph.D.
  • 年度 2004
  • 页码 139 p.
  • 总页数 139
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;系统科学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号