Algorithms for Discovery of Frequent Superset, Rather than Frequent Subset

机译：发现频繁超集的算法，而不是频繁子集

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a novel mining task: mining frequent superset from the database of itemsets that is useful in bioinformatics, e-learning systems, jobshop scheduling, and so on. A frequent superset means that it contains more transactions than minimum support threshold. Intuitively, according to the Apriori algorithm, the level-wise discovering starts from 1-itemset, 2-itemset, and so forth. However, such steps cannot utilize the property of Apriori to reduce search space, because if an itemset is not frequent, its superset maybe frequent. In order to solve this problem, we propose three methods. The first is the Apriori-based approach, called Apriori-C. The second is Eclat-based approach, called Eclat-C, which is depth-first approach. The last is the proposed data complement technique (DCT) that we utilize original frequent itemset mining approach to mine frequent superset. The experiment study compares the performance of the proposed three methods by considering the effect of the number of transactions, the average length of transactions, the number of different items, and minimum support.

机译：在本文中，我们提出了一种小说挖掘任务：从项目集的数据库中挖掘频繁的超集，这是在生物信息学，电子学习系统，jobshop调度等中。频繁的超集意味着它包含更多的事务而非最小支持阈值。直观地，根据APRIORI算法，级别的发现从1项开始，2项集等开始。但是，这些步骤不能利用APRiori的属性来减少搜索空间，因为如果项目集不频繁，则其超级仪可能频繁。为了解决这个问题，我们提出了三种方法。首先是基于Apriori的方法，称为Apriori-C。第二种是基于Eclat的方法，称为Eclat-C，这是深度第一的方法。最后的是我们利用原始频繁的项目集挖掘方法来实现频繁超级赛的建议的数据补充技术（DCT）。实验研究通过考虑交易数量，交易的平均交易长度，不同项目数和最低支持的效果，比较了提出的三种方法的性能。

著录项

来源
《International Conference on Data Warehousing and Knowledge Discovery》|2004年||共10页
会议地点
作者
Zhung-Xun Liao; Man-Kwan Shan; Lecture Notes in Computer Science 3181;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.13-532;
关键词

相似文献

外文文献
中文文献
专利

1. EFFICIENT SUBSET-LATTICE ALGORITHMS FOR MINING CLOSED FREQUENT ITEMSETS AND MAXIMAL FREQUENT ITEMSETS IN DATA STREAMS [J] . Ye-In Chang, Chia-En Li, Wei-Hau Peng, International Journal of Electrical Engineering: Transactions of the Chinese Institute of Engineers, Series E . 2013,第2期

机译：高效的子格算法，用于挖掘数据流中的封闭频率项和最大频率项
2. Using Frequent Substring Mining Techniques for Indexing Genome Sequences: A Comparison of Frequent Substring and Frequent Max Substring Algorithms [J] . Todsanai Chumwatana Journal of Advances in Information Technology . 2016,第4期

机译：使用频繁子串挖掘技术为基因组序列建立索引：频繁子串算法和最大最大子串算法的比较
3. Discovery of Maximal Frequent Item Sets using Subset Creation [J] . Jnanamurthy HK, Vishesh HV, Vishruth Jain, International Journal of Data Mining & Knowledge Management Process . 2013,第1期

机译：使用子集创建发现最大的频繁项目集
4. Algorithms for Discovery of Frequent Superset, Rather than Frequent Subset [C] . Zhung-Xun Liao, Man-Kwan Shan Data Warehousing and Knowledge Discovery . 2004

机译：发现频繁超集而不是频繁子集的算法
5. Efficient main memory algorithms for significant interval and frequent episode discovery. [D] . Savla, Sagar Hasmukh. 2006

机译：高效的主内存算法，可有效间隔和频繁发现情节。
6. Clinical significance of T lymphocyte subsets immunoglobulin and complement expression in peripheral blood of children with steroid-dependent nephrotic syndrome/frequently relapsing nephrotic syndrome [O] . Shulian Chen, Jianxin Wang, Shishan Liang 2021

机译：患有类固醇依赖性肾病综合征/经常复杂性肾病综合征的儿童外周血外周血中的临床意义免疫球蛋白及其互补表达
7. Algorithms for Discovery of Frequent Superset, Rather than Frequent Subset [O] . Zhung-xun Liao, Man-kwan Shan 2013

机译：发现频繁超集而不是频繁子集的算法
8. GREWA Scalable Frequent Subgraph Discovery Algorithm. [R] . Kuramochi, M., Karypis, G. 2004

机译：GREWa可扩展频繁子图发现算法。

Algorithms for Discovery of Frequent Superset, Rather than Frequent Subset

摘要

著录项

相似文献

相关主题

期刊订阅