Multilingual Information Retrieval Based on Document Alignment Techniques

机译：基于文档对齐技术的多语种信息检索

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

A multilingual information retrieval method is presented where the user formulates the query in his/her preferred language to retrieve relevant information from a multilingual document collection. This multilingual retrieval method involves mono-language searches as well as merging their resutls. We adopt a corpus based approach where documents of differnet languages are associated if they cover a similar story. The resulting comparable corpus enables two novel techniques we have developed. First, it enables Cross-Language Information Retrieval (CLIR) which does not lack vocabulary coverage as we observed in the case of approaches that are based on automatic Machine Translation (MT). Second, aligned documents of this corpus facilitate to merge the resutls of mono- and cross-language searches. Using hte TREC CLIR data, excellent resuts are obtained. In addition, our evaluation of the document alignments gives us new insights about the usefulness of comparable copora.

机译：提出了一种多语言信息检索方法，其中用户以他/她的首选语言来制定查询，以从多语言文档集中检索相关信息。这种多语言检索方法涉及单语言搜索以及合并其结果。我们采用基于语料库的方法，其中如果不同的网络语言的文档涵盖了相似的故事，则它们将被关联。由此产生的可比语料库实现了我们开发的两种新颖技术。首先，它启用了跨语言信息检索（CLIR），该语言不会缺少词汇覆盖率，正如我们在基于自动机器翻译（MT）的方法中观察到的那样。其次，该语料库的对齐文档有助于合并单语言和跨语言搜索的结果。使用TREC CLIR数据可获得出色的结果。此外，我们对文档对齐方式的评估为我们提供了可比拟的copora实用性的新见解。

著录项

来源
《Research and advanced technology for digital libraries》|1998年|p.183-197|共15页
会议地点 Heraklion(GR);Heraklion(GR)
作者
Martin Braschler; Peter Schaeuble;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Content Based Image Retrieval System for Kannada Query Image from Multilingual Document Image Collection [J] . Thanuja C, Shreedevi G R International Journal of Engineering Research and Applications . 2013,第4期

机译：基于内容的多语种文档图像查询卡纳达语查询图像检索系统
2. Document retrieval using knowledge-based fuzzy information retrieval techniques [J] . Shyi-Ming Chen, Jeng-Yih Wang IEEE Transactions on Systems, Man, and Cybernetics . 1995,第5期

机译：使用基于知识的模糊信息检索技术进行文档检索
3. Term disambiguation techniques based on target document collection for cross-language information retrieval: An empirical comparison of performance between techniques [J] . Kishida K Information Processing & Management . 2007,第1期

机译：基于目标文档集合的术语歧义消除技术用于跨语言信息检索：技术之间性能的实证比较
4. Multilingual Information Retrieval Based on Document Alignment Techniques [C] . Martin Braschler, Peter Schaeuble European conference on research and advanced technology for digital libraries . 1998

机译：基于文档对齐技术的多语言信息检索
5. Robust spoken document retrieval in multilingual and noisy acoustic environments. [D] . Akbacak, Murat. 2009

机译：在多语言和嘈杂的声学环境中进行可靠的语音文档检索。
6. Kinome-wide interaction modelling using alignment-based and alignment-independent approaches for kinase description and linear and non-linear data analysis techniques [O] . Maris Lapins, Jarl ES Wikberg 2010

机译：使用基于比对和比对独立的方法进行激酶描述以及线性和非线性数据分析技术的全基因组相互作用建模
7. Using Content Based Image Retrieval Techniques for the Indexing and Retrieval of Thai Handwritten Documents [O] . Seksan Sangsawad, Chun Che Fung 2015

机译：利用基于内容的图像检索技术进行泰文手写文档的索引和检索

Multilingual Information Retrieval Based on Document Alignment Techniques

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅