An End-to-End Efficient Lucene-Based Framework of Document/Information Retrieval

Alaidine Ben Ayed; Ismail Biskri; Jean-Guy Meunier

首页> 外文期刊>International journal of information retrieval research >An End-to-End Efficient Lucene-Based Framework of Document/Information Retrieval

【24h】

An End-to-End Efficient Lucene-Based Framework of Document/Information Retrieval

机译：An End-to-End Efficient Lucene-Based Framework of Document/Information Retrieval

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相关主题

摘要

In the context of big data and the Industrial Revolution 4.0 era, enhancing document/information retrieval framework efficiency to handle the ever-growing volume of text data in an ever more digital world is a must. This article describes a double-stage system of document/information retrieval. First, a Lucene-based document retrieval tool is implemented, and a couple of query expansion techniques using a comparable corpus (Wikipedia) and word embeddings are proposed and tested. Second, a retention-fidelity summarization protocol is performed on top of the retrieved documents to create a short, accurate, and fluent extract of a longer retrieved single document (or a set of top retrieved documents). Obtained results show that using word embeddings is an excellent way to achieve higher precision rates and retrieve more accurate documents. Also, obtained summaries satisfy the retention and fidelity criteria of relevant summaries.

著录项

来源
《International journal of information retrieval research》 |2022年第2期|636-649|共14页
作者
Alaidine Ben Ayed; Ismail Biskri; Jean-Guy Meunier;
展开▼
作者单位

Universite du Quebec a Montreal, Canada;

Universite du Quebec a Trois-Rivieres, Canada;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类
关键词
Data and Knowledge Representation; Document Retrieval; Internet and Web Applications; Mono/Multi-Document Summarization;
入库时间 2024-01-25 19:23:40

An End-to-End Efficient Lucene-Based Framework of Document/Information Retrieval

摘要

著录项

相关主题

期刊订阅