Conference: Learning and Technology Conference

From imagining to the making of a novel and fast search methodology: Thabit's algorithm

Abstract

Search algorithms and methodologies play an important role in databases, data mining, and professional applications in general. Despite their importance, no new search algorithm has been developed in the past decade. For a search methodology to be applicable, it has to be novel and faster than existing algorithms. This paper presents a new search methodology that has been awarded a patent for novelty (United States Patent no. 9009200). The method is called Thabit's algorithm in this paper. The paper also reports test results showing the superiority of this methodology over several search algorithms, including binary search, hashing, and AVL trees. For example, when searching a list of 50 million airline PNR numbers, the search time for Microsoft Hash was 18649 ms, but when the list grew to one billion elements the search time increased to 2070318 ms, with a collision rate of 57%. Binary search was fairly stable, with a search time of 116897 ms for the list of 50 million elements and 2771639 ms for the list of one billion elements. In contrast, searching with Thabit's algorithm took 8517 ms for the list of 50 million elements and 194739 ms for the list of one billion elements. The AVL tree took 249269 ms and 11995298 ms, respectively. Space requirements were also compared: binary search took 44703.6 MB for the one billion elements, while Thabit's algorithm, Microsoft Hash, and the AVL tree required 60883.3 MB, 115578.4 MB, and 104000.3 MB, respectively. To achieve these results, a graduate student was employed and a makerplace lab was set up with a server and two laptops, and Thabit's algorithm was implemented in that lab. The implementation involved processing each item one character at a time, or two, three, four, or more characters at a time, and checking which data structures led to better performance. In Thabit's algorithm, three-dimensional arrays were replaced with one-dimensional arrays.
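The abstract does not describe the patented data structure, so the following is not Thabit's algorithm. It is only a minimal Python sketch of the two general techniques mentioned above: splitting a key into pieces of a chosen width, and storing a three-dimensional table in a single one-dimensional array through row-major indexing. The names `chunks` and `Flat3D`, the 10 x 10 x 10 table size, and the sample keys are all assumptions made for illustration.

```python
def chunks(key, width):
    """Split `key` into consecutive pieces of `width` characters.

    The last piece is shorter when len(key) is not a multiple of `width`.
    """
    if width < 1:
        raise ValueError("width must be at least 1")
    return [key[i:i + width] for i in range(0, len(key), width)]


class Flat3D:
    """A d1 x d2 x d3 table stored in one flat list (row-major order)."""

    def __init__(self, d1, d2, d3, fill=False):
        self.d1, self.d2, self.d3 = d1, d2, d3
        self.data = [fill] * (d1 * d2 * d3)

    def index(self, i, j, k):
        """Map a 3-D coordinate to its position in the flat list."""
        if not (0 <= i < self.d1 and 0 <= j < self.d2 and 0 <= k < self.d3):
            raise IndexError((i, j, k))
        return (i * self.d2 + j) * self.d3 + k

    def get(self, i, j, k):
        return self.data[self.index(i, j, k)]

    def set(self, i, j, k, value):
        self.data[self.index(i, j, k)] = value


# Hypothetical usage: record which 3-digit prefixes occur in a list of keys.
seen = Flat3D(10, 10, 10)
for key in ["4821007", "4829915", "1300042"]:
    first = chunks(key, 3)[0]          # first three characters of the key
    i, j, k = (int(c) for c in first)  # one digit per dimension
    seen.set(i, j, k, True)
```

A flat list avoids the nested-list indirection of a true three-dimensional structure, which is one plausible reason such a replacement could help performance; the abstract itself does not state the reason.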
Furthermore, much study went into data size: checking the effect of increasing the size of the data and finding the point at which thrashing starts. Finally, the programs were tried in parallel, and the measured speed gain was compared with the gain predicted by Amdahl's law. Tests were also done on searching through a large number of 16-digit debit card numbers, with great results using Thabit's algorithm.
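For reference, Amdahl's law bounds the speedup of a program in which a fraction p of the work can be parallelized across n workers: S(n) = 1 / ((1 - p) + p / n). The short Python sketch below evaluates that formula and its inverse, which estimates p from a measured speedup. The function names are hypothetical, and the abstract reports no parallel timings, so no measured values are used here.

```python
def amdahl_speedup(parallel_fraction, workers):
    """Predicted speedup S(n) = 1 / ((1 - p) + p / n) under Amdahl's law."""
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be between 0 and 1")
    if workers < 1:
        raise ValueError("workers must be at least 1")
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / workers)


def estimate_parallel_fraction(measured_speedup, workers):
    """Invert Amdahl's law: p = (1 - 1/S) / (1 - 1/n), valid for n > 1."""
    if workers < 2:
        raise ValueError("need at least 2 workers to estimate p")
    if measured_speedup <= 0:
        raise ValueError("measured_speedup must be positive")
    return (1.0 - 1.0 / measured_speedup) / (1.0 - 1.0 / workers)
```

Comparing a measured speedup against `amdahl_speedup` for the same worker count shows how close a parallel run comes to the theoretical limit for an assumed p.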