Cross-Language High Similarity Search Using a Conceptual Thesaurus

机译：使用概念词库的跨语言高相似度搜索

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This work addresses the issue of cross-language high similarity and near-duplicates search, where, for the given document, a highly similar one is to be identified from a large cross-language collection of documents. We propose a concept-based similarity model for the problem which is very light in computation and memory. We evaluate the model on three corpora of different nature and two language pairs English-German and English-Spanish using the Eurovoc conceptual thesaurus. Our model is compared with two state-of-the-art models and we find, though the proposed model is very generic, it produces competitive results and is significantly stable and consistent across the corpora.

机译：这项工作解决了跨语言高度相似和近重复搜索的问题，对于给定的文档，要从大量的跨语言文档集中识别出高度相似的搜索。针对该问题，我们提出了一个基于概念的相似性模型，该模型在计算和存储方面非常轻便。我们使用Eurovoc概念词库评估了三个不同性质的语料库和两个语言对的英语-德语和英语-西班牙语对模型的评估。我们的模型与两个最新模型进行了比较，我们发现，尽管所提出的模型非常通用，但它产生了竞争性结果，并且在整个语料库中具有显着的稳定性和一致性。

著录项

来源
《International conference of the CLEF initiative》|2012年|67-75|共9页
会议地点
作者
Parth Gupta; Alberto Barron-Cedeno; Paolo Rosso;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Using Web resources to construct multilingual medical thesaurus for cross-language medical information retrieval [J] . Wen-Hsiang Lu, Ray S. Lin, Yi-Che Chan, Decision support systems . 2008,第3期

机译：使用Web资源构建跨语言医学信息检索的多语言医学词库
2. Automatic processing of multilingual medical terminology: applications to thesaurus enrichment and cross-language information retrieval [J] . H. Dejean, E. Gaussier, J.-M. Renders, Artificial intelligence in medicine . 2005,第2期

机译：多语种医学术语的自动处理：在词库丰富和跨语言信息检索中的应用
3. Searching on Health Information Databases: A Search Interface Including Thesaurus Term and Tree Browsers is More Effective than a Simple Search Interface [J] . Joanne L. Jordan Evidence Based Library and Information Practice . 2014,第4期

机译：在健康信息数据库上进行搜索：包括同义词库词和树浏览器的搜索界面比简单的搜索界面更有效
4. Cross-Language High Similarity Search Using a Conceptual Thesaurus [C] . Parth Gupta, Alberto Barron-Cedeno, Paolo Rosso International conference of the CLEF initiative . 2012

机译：使用概念性词库进行跨语言高相似性搜索
5. Resolving quasi-synonym relationships in automatic thesaurus construction using fuzzy rough sets and an inverse term frequency similarity function. [D] . Davault, Julius M., III. 2009

机译：使用模糊粗糙集和逆项频率相似性函数解决自动同义词库构建中的准同义词关系。
6. The Influence of Cross-Language Similarity on within- and between-Language Stroop Effects in Trilinguals [O] . Walter J. B. van Heuven, Kathy Conklin, Emily L. Coderre, 2011

机译：跨语言相似性对三语者语言内和语言间Stroop效应的影响
7. Experiments in Classification Clustering and Thesaurus Expansion for Domain Specific Cross-Language Retrieval [O] . Ray R. Larson 2008

机译：领域特定跨语言检索的分类聚类和同义词库扩展实验
8. English-German Cross-Language Retrieval for the GIRT Collection - Exploiting a Multilingual Thesaurus [R] . Gey, F. C. , Jiang, H. 2006

机译：GIRT集合的英语 - 德语跨语言检索 - 利用多语言词库

Cross-Language High Similarity Search Using a Conceptual Thesaurus

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅