【24h】

Semi-Automatic Indexing of Documents with a Multilingual Thesaurus

机译:多语言词库的半自动索引文档

获取原文

摘要

With the growing significance of digital libraries and the Internet, more and more electronic texts become accessible to a wide and geographically disperse public. This requires adequate tools to facilitate indexing, storage, and retrieval of documents written in different languages. We present a method for semiautomatic indexing of electronic documents and construction of a multilingual thesaurus, which can be used for query formulation and information retrieval. We use special dictionaries and user interaction in order to solve ambiguities and find adequate canonical terms in the language and an adequate abstract language-independent term. The abstract thesaurus is updated incrementally by new indexed documents and is used to search for documents using adequate terms.
机译:随着数字图书馆和互联网的重要性,越来越多的电子文本可供广泛和地理位置的公众访问。这需要适当的工具来促进以不同语言编写的文件的索引,存储和检索。我们提出了一种用于微自意索引的电子文件和构建多语言词库的方法,可用于查询配方和信息检索。我们使用特殊的词典和用户互动,以解决模糊性,并在语言中找到足够的规范术语和足够的抽象语言无关。摘要词库由新索引文件逐步更新,用于使用足够的术语来搜索文档。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号