Annual Meeting of the Association for Computational Linguistics

SARAL: A Low-Resource Cross-Lingual Domain-Focused Information Retrieval System for Effective Rapid Document Triage



Abstract

With the increasing democratization of electronic media, vast information resources are available in less-frequently-taught languages such as Swahili or Somali. That information, which may be crucially important and not available elsewhere, can be difficult for monolingual English speakers to access effectively. In this paper we present SARAL, an end-to-end cross-lingual information retrieval (CLIR) and summarization system for low-resource languages that 1) enables English speakers to search foreign-language repositories of text and audio using English queries, 2) summarizes the retrieved documents in English with respect to a particular information need, and 3) provides complete transcriptions and translations as needed. The SARAL system achieved the top end-to-end performance in the most recent IARPA MATERIAL CLIR+summarization evaluations.
