Boosting Bitext Compression

机译：提升BITEXT压缩

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Bilingual parallel corpora, also know as bitexts, convey the same information in two different languages. This implies that when modelling bi-texts one can take advantage of the fact that there exists a relation between both texts; the text alignment task allow to establish such relationship. In this paper we propose different approaches that use words and biwords (pairs made of two words, each one from a different text) as representation sym-bolic units. The properties of these approaches are analysed from a statis-tical point of view and tested as a preprocessing step to general purpose compressors. The results obtained suggest interesting conclusions concerning the use of both words and biwords. When encoded models are used as com-pression boosters we achieve compression ratios improving state-of-the-art compressors up to 6.5 percentage points, being up to 40% faster.

机译：双语平行的Corpora，也知道为Bitexts，以两种不同的语言传达相同的信息。这意味着当建模双文本时，可以利用两个文本之间存在关系的事实;文本对齐任务允许建立这种关系。在本文中，我们提出了使用单词和吉语（对由两个单词组成的对，每个来自不同文本的对）的不同方法作为表示对齐禁止单元。这些方法的性质从Statis-TiCE的观点分析并作为预处理步骤测试到通用压缩机。得到的结果表明，有关使用单词和吉语的有趣结论。当编码模型用作Com-Coundion Boosters时，我们实现了最高可达6.5个百分点的最新压缩机的压缩比率，更快高达40％。

著录项

来源
《International Conference on Practical Applications of Agents and Multiagent Systems 》|2010年||共8页
会议地点
作者
Joaquin Adiego; Miguel A. Martinez-Prieto; Javier E. Hoyos-Torio; Felipe Sanchez-Martinez;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论 ;
关键词
Compression Boosting; Bitext Compression.;

机译：压缩升压;BITEXT压缩。;

相似文献

外文文献
中文文献
专利

1. Generalized Biwords for Bitext Compression and Translation Spotting [J] . Adiego J., Carrasco R. C., Mart#237, The Journal of Artificial Intelligence Research . 2012 ,第4期

机译：用于双文本压缩和翻译发现的广义双字
2. Generalized Biwords for Bitext Compression and Translation Spotting [J] . Felipe Sanchez-Martinez, Rafael C. Carrasco, Miguel A. Martinez-Prieto, The Journal of Artificial Intelligence Research . 2012 ,第Null期

机译：用于双文本压缩和翻译发现的广义双字
3. Novel hybrid framework for image compression for supportive hardware design of boosting compression [J] . Premachand D. R., U. Eranna International Journal of Electrical and Computer Engineering . 2021 ,第3期

机译：用于升压压缩的支持硬件设计的新型混合框架
4. Boosting Bitext Compression [C] . Joaquin Adiego, Miguel A. Martinez-Prieto, Javier E. Hoyos-Torio, Trends in practical applications of agents and multiagent systems . 2011

机译：促进双文本压缩
5. Experimental and Computational Investigation of Spark Assisted Compression Ignition Combustion under Boosted, Ultra EGR-Dilute Conditions [D] . Triantopoulos, Vasileios. 2018

机译：升压下火花辅助压缩点火燃烧的实验和计算研究，超EGR-稀释条件
6. Boosting Throughput and Efficiency of Hardware Spiking Neural Accelerators Using Time Compression Supporting Multiple Spike Codes [O] . Changqing Xu, Wenrui Zhang, Yu Liu, 2020

机译：使用时间压缩支撑多穗码的硬件尖峰神经加速器的吞吐量和效率
7. Boosting bitext compression [O] . Adiego Rodríguez, Joaquín, Martínez Prieto, Miguel Ángel, Hoyos Torío, Javier E., 2011

机译：增强bitext压缩

Boosting Bitext Compression

摘要

著录项

相似文献

相关主题

期刊订阅