首页> 外文会议>44th Annual Midwest Instruction and Computing Symposium 2011. >Item Retention Improvements to Dropsort, a Lossy Sorting Algorithm
【24h】

Item Retention Improvements to Dropsort, a Lossy Sorting Algorithm

机译:改进了对有损排序算法Dropsort的项目保留

获取原文
获取原文并翻译 | 示例

摘要

This paper presents the Dropsort algorithm and suggests improvements. Dropsort is a simple comparison sorting algorithm that performs in θ(n) time by simply removing those elements that are not already in the correct order. Because of this, it is not a true sorting algorithm as the output does not contain all the elements of the input. In other words it is a lossy "sorting algorithm" that performs far more quickly than true sorting algorithms. Much of the original data is lost when using this algorithm unless the numbers to sort are in a mostly ordered format already. If more than a few elements are far from their correct position nearly all of the elements are removed. Two major improvements are proposed that greatly broaden the applicability of the Dropsort algorithm. In the first improvement, a concept of recent memory is added to the algorithm. For example, if several elements are consecutively dropped, the preceding element is assumed to be far from ordered and is removed instead. This is shown to increase the usefulness of Dropsort by increasing the number of retained elements. With a specific type of introduced error, 10,000% more of the original elements are retained over standard Dropsort. In the second improvement, each element is compared to not only determine if it is in sorted order, but if the next element is in order as well. Only if both elements are in order is the item kept. It is important to note that neither of these suggested improvements greatly increase the complexity or execution time of the Dropsort algorithm. Finally, the applications and limitations of these improvements and Dropsort in general are explored. This improved algorithm has many applications to very quickly sort large data sets when specific elements are not as important as the assurance that the list is sorted. This could be the case when a sorted list is critical but the source providing the pre-sorted input is not trustworthy. Also, applications in fields with large data sets and where speed is of utmost importance are discussed.
机译:本文介绍了Dropsort算法并提出了改进建议。 Dropsort是一种简单的比较排序算法,只需删除尚未按正确顺序排列的元素,即可在θ(n)时间内执行。因此,它不是真正的排序算法,因为输出未包含输入的所有元素。换句话说,它是一种有损的“排序算法”,其执行速度远比真正的排序算法快。使用此算法时,许多原始数据都会丢失,除非要排序的数字已经采用了大多数有序格式。如果有多个元素偏离其正确位置,则几乎所有元素都将被删除。提出了两个主要的改进,它们极大地扩展了Dropsort算法的适用性。在第一个改进中,最近存储的概念被添加到算法中。例如,如果连续删除几个元素,则假定前一个元素与有序元素相距甚远,而是将其删除。这表明通过增加保留元素的数量可以提高Dropsort的有用性。对于特定类型的引入错误,与标准Dropsort相比,保留了10,000%的原始元素。在第二个改进中,比较每个元素不仅可以确定它是否处于排序顺序,还可以确定下一个元素是否也处于排序顺序。仅当两个元素都按顺序排列时,才保留该项目。重要的是要注意,这些建议的改进都不会大大增加Dropsort算法的复杂度或执行时间。最后,探讨了这些改进和Dropsort的应用和局限性。当特定元素不如对列表进行排序的保证那么重要时,这种改进的算法具有许多应用程序可以非常快速地对大型数据集进行排序。当排序列表很关键,但是提供预排序输入的来源不可靠时,可能会出现这种情况。此外,还讨论了在数据量大且速度至关重要的领域中的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号