Information Retrieval and Large Text Structured Corpora

机译：信息检索和大文本结构

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

First, it is necessary to emphasise that it is mandatory to transform documents of the corpora into a common format when managing large amounts of information. This will allow us to query all documents using a unique query and to improve the performance of the system. By doing so we will avoid problems with performance and result management. Furthermore, nowadays, the technologies used to build IRSs are not prepared to satisfy corpora users' requirements. So, in the near future the development of new add-ons which take them into account is needed. There are some timid attempts to include basic linguistic operations (sensitivity to accents, umlauts, etc., theme searches, etc.) based on localization, but it is time to incorporate Syntactic techniques into commercial systems to enable the building of more versatile IRSs based on corpora.

机译：首先，有必要强调在管理大量信息时，必须在管理大量信息时将Corpor的文档转换为共同格式。这将允许我们使用唯一查询查询所有文档，并提高系统性能。通过这样做，我们将避免性能和结果管理问题。此外，如今，用于构建IRS的技术不准备满足Corpora用户的要求。因此，在不久的将来，需要开发将它们考虑在内的新附加组件。基于本地化，有一些胆小的尝试包括基本语言操作（对重音，重音，OF，主题搜索等）的敏感性，但是是时候将句法技术合并到商业系统中，以便基于更多功能的IRS构建在Corpora。

著录项

来源
《International Conference on Computer Aided Systems Theory(EUROCAST 2005)》|2005年||共10页
会议地点
作者
Fco. Mario Barcala; Miguel A. Molinero; Eva Dominguez;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类机器辅助技术;
关键词

相似文献

外文文献
中文文献
专利

1. Models and Algorithms of Information Retrieval in a Multilingual Environment on the Basis of Thematic and Dynamic Text Corpora [J] . Aleksey A. Mamchich Cybernetics and information technologies: CIT . 2016,第1期

机译：基于主题和动态文本语料库的多语言环境中信息检索的模型和算法
2. Text-Mining, Structured Queries, and Knowledge Management on Web Document Corpora [J] . Hamid Mousavi, Maurizio Atzori, Shi Gao, SIGMOD record . 2014,第3期

机译：Web文档语料库上的文本挖掘，结构化查询和知识管理
3. Text mining, a race against time? An attempt to quantify possible variations in text corpora of medical publications throughout the years [J] . Wagner Mathias, Vicinus Benjamin, Muthra Sherieda T., Computers in Biology and Medicine . 2016,第Null期

机译：文本挖掘，与时间赛跑？试图量化多年来医学出版物文本语料库的可能变化
4. Information Retrieval and Large Text Structured Corpora [C] . Fco. Mario Barcala, Miguel A. Molinero, Eva Dominguez International Conference on Computer Aided Systems Theory(EUROCAST 2005); 20050207-11; Las Palmas de Gran Canaria(ES) . 2005

机译：信息检索和大文本结构语料库
5. Intelligent agent development using unstructured text corpora and multiple choice questions [D] . Johnson, Joseph. 2016

机译：使用非结构化文本语料库和多项选择题进行智能代理开发
6. Leveraging word embeddings and medical entity extraction for biomedical dataset retrieval using unstructured texts [O] . Yanshan Wang, Majid Rastegar-Mojarad, Ravikumar Komandur-Elayavilli, 2017

机译：利用单词嵌入和医学实体提取来使用非结构化文本检索生物医学数据集
7. Information Retrieval and Large Text Structured Corpora ⋆ [O] . Fco Mario Barcala, Miguel A. Molinero, Eva Domínguez 2011

机译：信息检索和大文本结构语料库⋆

Information Retrieval and Large Text Structured Corpora

摘要

著录项

相似文献

相关主题

期刊订阅