首页> 外文期刊>Computer Engineering and Intelligent Systems >Advanced Searching Algorithms and its Behavior on Text Structures
【24h】

Advanced Searching Algorithms and its Behavior on Text Structures

机译:高级搜索算法及其在文本结构上的行为

获取原文
       

摘要

This research investigates the behavior of the Boyer-Moore-Horspool (BMH) and the Boyer-Moore-Raita (BMR) string-matching algorithms using multilingual texts. The performance is computed based on searching for patterns in master strings. Experiments are conducted using a number of pattern lengths with many experiments repetition. The experimental results show that on average the number of comparisons per character passed in the case of the BMR is less than the number encountered by the BMH variant. The improvement is due to properties of the text structures. These experiments may lead to more theoretical and practical studies to develop new variants of algorithms. Using multilingual text structures provide more insight into the theory and structure of algorithms as multilingual text structures have different set of characters and dependencies, and the character properties have different type of structures. Since many applications of today depend on searching algorithms, therefore researchers need to explore every possibility that lead to improving the efficiency of searching and matching mechanisms. The time performance of exact string pattern matching can be greatly improved if an efficient algorithm is used. Considering, for example, the growing amount of text handled in the electronic patient records, it is worth and essential, in these cases and others, to searching for an efficient algorithm to deal with such huge items of information.
机译:这项研究调查了使用多语言文本的Boyer-Moore-Horspool(BMH)和Boyer-Moore-Raita(BMR)字符串匹配算法的行为。性能是根据在主字符串中搜索模式来计算的。使用许多图案长度进行实验,并重复许多实验。实验结果表明,在使用BMR的情况下,每个字符传递的比较次数平均要少于BMH变体遇到的次数。改进归因于文本结构的属性。这些实验可能会导致更多的理论和实践研究,以开发新的算法变体。由于多语言文本结构具有不同的字符集和依存关系,并且字符属性具有不同的结构类型,因此使用多语言文本结构可提供对算法理论和结构的更多了解。由于当今的许多应用都依赖于搜索算法,因此研究人员需要探索导致提高搜索和匹配机制效率的各种可能性。如果使用有效的算法,则可以大大提高精确字符串模式匹配的时间性能。例如,考虑到电子病历中处理的文本数量的增加,在这些情况下以及其他情况下,寻找一种有效的算法来处理如此庞大的信息项是有价值且必不可少的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号