首页> 外文会议>International Conference on Computer Aided Systems Theory(EUROCAST 2005) >Information Retrieval and Large Text Structured Corpora
【24h】

Information Retrieval and Large Text Structured Corpora

机译:信息检索和大文本结构

获取原文

摘要

First, it is necessary to emphasise that it is mandatory to transform documents of the corpora into a common format when managing large amounts of information. This will allow us to query all documents using a unique query and to improve the performance of the system. By doing so we will avoid problems with performance and result management. Furthermore, nowadays, the technologies used to build IRSs are not prepared to satisfy corpora users' requirements. So, in the near future the development of new add-ons which take them into account is needed. There are some timid attempts to include basic linguistic operations (sensitivity to accents, umlauts, etc., theme searches, etc.) based on localization, but it is time to incorporate Syntactic techniques into commercial systems to enable the building of more versatile IRSs based on corpora.
机译:首先,有必要强调在管理大量信息时,必须在管理大量信息时将Corpor的文档转换为共同格式。这将允许我们使用唯一查询查询所有文档,并提高系统性能。通过这样做,我们将避免性能和结果管理问题。此外,如今,用于构建IRS的技术不准备满足Corpora用户的要求。因此,在不久的将来,需要开发将它们考虑在内的新附加组件。基于本地化,有一些胆小的尝试包括基本语言操作(对重音,重音,OF,主题搜索等)的敏感性,但是是时候将句法技术合并到商业系统中,以便基于更多功能的IRS构建在Corpora。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号