Item Retention Improvements to Dropsort, a Lossy Sorting Algorithm

机译：改进了对有损排序算法Dropsort的项目保留

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents the Dropsort algorithm and suggests improvements. Dropsort is a simple comparison sorting algorithm that performs in θ(n) time by simply removing those elements that are not already in the correct order. Because of this, it is not a true sorting algorithm as the output does not contain all the elements of the input. In other words it is a lossy "sorting algorithm" that performs far more quickly than true sorting algorithms. Much of the original data is lost when using this algorithm unless the numbers to sort are in a mostly ordered format already. If more than a few elements are far from their correct position nearly all of the elements are removed. Two major improvements are proposed that greatly broaden the applicability of the Dropsort algorithm. In the first improvement, a concept of recent memory is added to the algorithm. For example, if several elements are consecutively dropped, the preceding element is assumed to be far from ordered and is removed instead. This is shown to increase the usefulness of Dropsort by increasing the number of retained elements. With a specific type of introduced error, 10,000% more of the original elements are retained over standard Dropsort. In the second improvement, each element is compared to not only determine if it is in sorted order, but if the next element is in order as well. Only if both elements are in order is the item kept. It is important to note that neither of these suggested improvements greatly increase the complexity or execution time of the Dropsort algorithm. Finally, the applications and limitations of these improvements and Dropsort in general are explored. This improved algorithm has many applications to very quickly sort large data sets when specific elements are not as important as the assurance that the list is sorted. This could be the case when a sorted list is critical but the source providing the pre-sorted input is not trustworthy. Also, applications in fields with large data sets and where speed is of utmost importance are discussed.

机译：本文介绍了Dropsort算法并提出了改进建议。 Dropsort是一种简单的比较排序算法，只需删除尚未按正确顺序排列的元素，即可在θ（n）时间内执行。因此，它不是真正的排序算法，因为输出未包含输入的所有元素。换句话说，它是一种有损的“排序算法”，其执行速度远比真正的排序算法快。使用此算法时，许多原始数据都会丢失，除非要排序的数字已经采用了大多数有序格式。如果有多个元素偏离其正确位置，则几乎所有元素都将被删除。提出了两个主要的改进，它们极大地扩展了Dropsort算法的适用性。在第一个改进中，最近存储的概念被添加到算法中。例如，如果连续删除几个元素，则假定前一个元素与有序元素相距甚远，而是将其删除。这表明通过增加保留元素的数量可以提高Dropsort的有用性。对于特定类型的引入错误，与标准Dropsort相比，保留了10,000％的原始元素。在第二个改进中，比较每个元素不仅可以确定它是否处于排序顺序，还可以确定下一个元素是否也处于排序顺序。仅当两个元素都按顺序排列时，才保留该项目。重要的是要注意，这些建议的改进都不会大大增加Dropsort算法的复杂度或执行时间。最后，探讨了这些改进和Dropsort的应用和局限性。当特定元素不如对列表进行排序的保证那么重要时，这种改进的算法具有许多应用程序可以非常快速地对大型数据集进行排序。当排序列表很关键，但是提供预排序输入的来源不可靠时，可能会出现这种情况。此外，还讨论了在数据量大且速度至关重要的领域中的应用。

著录项

来源
《44th Annual Midwest Instruction and Computing Symposium 2011.》|2011年|p.217-226|共10页
会议地点 Duluth MN(US);Duluth MN(US)
作者
Abram Jackson; Ryan McCulloch;
展开▼
作者单位

Computer Science North Dakota State University Fargo, ND 58102;

Computer Science North Dakota State University Fargo, ND 58102;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Optimal Budget Assignment for Service Quality Improvement in Distribution Systems using Non-Dominated Sorting Genetic Algorithm II (NSGAII) and Memetic Algorithm [J] . Rojas-Ríos Luis F, López-Lezama Jesús M, Mu?oz-Galeano Nicolás Informacion Tecnologica . 2016,第1期

机译：使用非支配排序遗传算法II（NSGAII）和Memetic算法优化配电系统服务质量的最佳预算分配
2. Improvement and Evaluation of Sorting Algorithm with FeRAM-Based Functional Memory [J] . HIROAKI WATANABE, NAOKI FUJITA, HIROSHI NOZAWA Integrated Ferroelectrics . 2012,第Null期

机译：基于FeRAM的功能存储器排序算法的改进与评估
3. Improvement and Evaluation of Sorting Algorithm with FeRAM-Based Functional Memory [J] . Hiroaki Watanabea Naoki Fujitab, Hiroshi Nozawaa Integrated Ferroelectrics: An International Journal . 2012,第1期

机译：基于FeRAM的功能存储器排序算法的改进与评估
4. Item Retention Improvements to Dropsort, a Lossy Sorting Algorithm [C] . Abram Jackson, Ryan McCulloch Annual Midwest Instruction and Computing Symposium . 2011

机译：物品保留改进液滴，损失分选算法
5. Optimizing clustering algorithms on heterogeneous parallel architectures for lossy checkpointing mechanisms [D] . Jadhav, Ritesh. 2016

机译：损耗检查点机制的异构并行架构中优化聚类算法
6. Performance evaluation of lossy quality compression algorithms for RNA-seq data [O] . Rongshan Yu, Wenxian Yang, Shun Wang 2020

机译：RNA-SEQ数据有损质量压缩算法的性能评估
7. The improvement research on multi-objective optimization algorithm based on non-dominated sorting [O] . 颜曙林 2016

机译：基于非支配排序的多目标优化算法的改进研究
8. PRIME and PDQ Sorts -Efficient Minimal Storage Sorting Algorithms. [R] . hilbrand,roy r. 1979

机译：pRImE和pDQ排序 - 高效的最小存储排序算法。

Item Retention Improvements to Dropsort, a Lossy Sorting Algorithm

摘要

著录项

相似文献

相关主题

期刊订阅