Conference: International Conference on Electronics, Computer and Computation
Analyzing the performance differences between pattern matching and compressed pattern matching on texts

Abstract

This study compares the statistics of pattern matching on text data with those of compressed pattern matching on the compressed form of the same text. A new application was developed to count character comparisons in compressed and uncompressed texts separately. The paper also presents a new text compression algorithm that allows compressed pattern matching with classical pattern matching algorithms, without any modification. The presented compression algorithm, based on digram and trigram substitution, achieves a compression factor of about 30–35%, and compressed pattern matching on the compressed text is measured to take less time than pattern matching on the uncompressed text. The number of character comparisons during compressed pattern matching is also confirmed to be lower than the number on uncompressed texts. The aim of the developed compression algorithm is thus to highlight the difference in text processing between compressed and uncompressed text and to inform other applications.
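
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm, which the abstract does not specify. The assumptions are: a small hand-picked digram table (hypothetical), single-character codes taken from the Unicode private-use area, greedy left-to-right substitution, and a naive matcher standing in for any classical pattern matching algorithm. The names `compress` and `naive_search` are illustrative only. Trigram substitution is omitted for brevity.

```python
# Hypothetical digram table: each digram maps to one unused code character.
DIGRAM_TABLE = {"th": "\ue000", "at": "\ue001", "an": "\ue002"}


def compress(s, table):
    """Greedy left-to-right digram substitution (illustrative, not the paper's method)."""
    out = []
    i = 0
    while i < len(s):
        digram = s[i:i + 2]
        if digram in table:
            out.append(table[digram])  # two characters become one code
            i += 2
        else:
            out.append(s[i])           # copy the character unchanged
            i += 1
    return "".join(out)


def naive_search(text, pattern):
    """Classical naive matching; returns (match positions, character comparisons)."""
    positions = []
    comparisons = 0
    n, m = len(text), len(pattern)
    for i in range(n - m + 1):
        j = 0
        while j < m:
            comparisons += 1
            if text[i + j] != pattern[j]:
                break
            j += 1
        if j == m:
            positions.append(i)
    return positions, comparisons


text = "the cat and the hat"
pattern = "the"

# Compress the text and the pattern with the same table, then run the
# unmodified matcher on both the raw and the compressed form.
c_text = compress(text, DIGRAM_TABLE)
c_pattern = compress(pattern, DIGRAM_TABLE)

raw_positions, raw_comparisons = naive_search(text, pattern)
c_positions, c_comparisons = naive_search(c_text, c_pattern)

print(len(text), len(c_text))
print(raw_positions, raw_comparisons)
print(c_positions, c_comparisons)
```

Because the text and the pattern are compressed with the same table, the matcher runs unchanged on shorter strings and so performs fewer comparisons in this example. One caveat of this simple greedy scheme: a pattern whose boundary falls inside a substituted digram (for instance `he`, when `th` has already been replaced in the text) is not found in the compressed text. How the paper's algorithm deals with such cases is not stated in the abstract.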