首页> 外文会议>International conference on intelligent text processing and computational linguistics >Improving Bilingual Search Performance Using Compact Full-Text Indices
【24h】

Improving Bilingual Search Performance Using Compact Full-Text Indices

机译:使用紧凑的全文索引提高双语搜索性能

获取原文

摘要

Machine Translation tasks must tackle the ever-increasing sizes of parallel corpora, requiring space and time efficient solutions to support them. Several approaches were developed based on full-text indices, such as suffix arrays, with important time and space achievements. However, for supporting bilingual tasks, the search time efficiency of such indices can be improved using an extra layer for the text alignment. Additionally, their space requirements can be significantly reduced using more compact indices. We propose a search procedure on top of a compact bilingual framework that improves bilingual search response time, while having a space efficient representation of aligned parallel corpora.
机译:机器翻译任务必须解决并行语料库不断增长的规模,需要空间和时间高效的解决方案来支持它们。基于全文索引(例如后缀数组)开发了几种方法,它们具有重要的时空成就。但是,为了支持双语任务,可以使用额外的文本对齐层来提高此类索引的搜索时间效率。此外,使用更紧凑的索引可以显着减少其空间需求。我们提出了一种紧凑的双语框架之上的搜索程序,该框架可提高双语搜索的响应时间,同时具有对齐的并行语料库的空间高效表示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号