首页> 外文会议>International Conference on Information Retrieval and Knowledge Management >A Malay Hadith translated document retrieval using parallel Latent Semantic Indexing (LSI)
【24h】

A Malay Hadith translated document retrieval using parallel Latent Semantic Indexing (LSI)

机译:使用并行潜在语义索引(LSI)的马来圣训译本文档检索

获取原文

摘要

Latent Semantic Indexing (LSI) is one of the well-known searching techniques where documents are retrieved based on the content similarity or meaning of the documents. LSI is an effective method to improve the retrieval performance, however, as the size of documents gets larger; a better technique is needed to process the documents faster. In this paper, a new parallel LSI algorithm which runs on standard multi-core personal computer (PC) is proposed to improve the performance of retrieving relevant documents. The parallel LSI algorithm uses parallel threads to automatically perform the matrix computations using the Fork-Join approach. 2028 text documents extracted from four volumes of the Malay-translated book of Hadith known as Shahih Bukhari were used as the test collections. We compare the time to process LSI space between both sequential and parallel systems. The percentage of recall, precision and effectiveness for retrieving relevant document are also measured for both systems using the Information Retrieval (IR) metrics which are recall, precision, and effectiveness. The results show that the time taken to create LSI space for parallel system is faster than sequential system. Based on recall, precision and effectiveness measures, our proposed parallel LSI system is comparable to sequential LSI system.
机译:潜在语义索引(Latent Semantic Indexing,LSI)是一种众所周知的搜索技术,其中,基于文档的内容相似性或含义来检索文档。 LSI是提高检索性能的有效方法,但是,随着文档的大小变大,LSI变得越来越有用。需要一种更好的技术来更快地处理文档。本文提出了一种在标准多核个人计算机(PC)上运行的新并行LSI算法,以提高检索相关文档的性能。并行LSI算法使用Fork-Join方法使用并行线程自动执行矩阵计算。从四本马来语译本的《圣训》(Shahih Bukhari)中提取的2028篇文本文档被用作测试集。我们比较了顺序系统和并行系统之间处理LSI空间的时间。还使用信息检索(IR)指标(即召回率,准确性和有效性)对两个系统都测量了检索相关文档的召回率,准确性和有效性百分比。结果表明,为并行系统创建LSI空间所需的时间比顺序系统要快。基于召回率,精度和有效性指标,我们提出的并行LSI系统可与顺序LSI系统媲美。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号