首页> 外文会议> >Parallel mining of association rules from text databases on a cluster of workstations

【24h】

Parallel mining of association rules from text databases on a cluster of workstations

机译：从工作站集群上的文本数据库中并行挖掘关联规则

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Summary form only given. We propose a new algorithm named Parallel Multipass with Inverted Hashing and Pruning (PMIHP) for mining association rules between words in text databases. The characteristics of text databases are quite different from those of retail transaction databases, and existing mining algorithms cannot handle text databases efficiently because of the large number of itemsets (i.e., sets of words) that need to be counted. The new PMIHP algorithm is a parallel version of our multipass with inverted hashing and pruning (MIHP) algorithm, which was shown to be quite efficient than other existing algorithms in the context of mining text databases. The PMIHP algorithm reduces the overhead of communication between miners running on different processors because they are mining local databases asynchronously and prune the global candidates by using the inverted hashing and pruning technique.

机译：仅提供摘要表格。我们提出了一种新算法，称为并行多遍历与反向哈希和修剪（PMIHP），用于挖掘文本数据库中单词之间的关联规则。文本数据库的特征与零售交易数据库的特征完全不同，并且由于需要计算大量的项目集（即单词集），因此现有的挖掘算法无法有效地处理文本数据库。新的PMIHP算法是我们的带有反向哈希和修剪（MIHP）算法的多遍算法的并行版本，在挖掘文本数据库的情况下，该算法比其他现有算法具有更高的效率。 PMIHP算法减少了运行在不同处理器上的矿工之间的通信开销，因为它们异步地挖掘本地数据库并通过使用反向哈希和修剪技术来修剪全局候选对象。

著录项

来源
《》|2004年|p.86|共1页
会议地点
作者
Holt; J.D.; Chung; S.M.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术 ;
关键词
data mining; workstation clusters; parallel algorithms; file organisation; distributed databases; text analysis; Parallel Multipass with Inverted Hashing and Pruning algorithm; association rules; text databases; retail transaction databases; parallel mining; workstation clusters;

机译：数据挖掘;工作站集群;并行算法;文件组织;分布式数据库;文本分析;具有反向哈希和修剪算法的并行多遍;关联规则;文本数据库;零售交易数据库;并行挖掘;工作站集群;

相似文献

外文文献
中文文献
专利

1. Parallel mining of association rules from text databases [J] . John D. Holt, Soon M. Chung Journal of supercomputing . 2007 ,第3期

机译：从文本数据库并行挖掘关联规则
2. Multi-objective Genetic Algorithm for Association Rule Mining Using a Homogeneous Dedicated Cluster of Workstations | Science Publications [J] . A. Ghosh, A. K. Jagadev, R. Mall, American journal of applied sciences . 2006 ,第11期

机译：均质专用工作站集群的多目标遗传算法关联规则挖掘科学出版物
3. Parallel Semi-supervised enhanced fuzzy Co-Clustering (PSEFC) and Rapid Association Rule Mining (RARM) based frequent route mining algorithm for travel sequence recommendation on big socialmedia [J] . N. Suresh Kumar, M. Thangamani CONCURRENCY PRACTICE & EXPERIENCE . 2019 ,第14期

机译：大型社交媒体上基于并行半监督增强型模糊联合聚类（PSEFC）和快速关联规则挖掘（RARM）的频繁路线挖掘算法推荐行程
4. Parallel mining of association rules from text databases on a cluster of workstations [C] . Holt J.D., Chung S.M. International Parallel and Distributed Processing Symposium . 2004

机译：从工作站群集的文本数据库中并行挖掘关联规则
5. Efficient sequential and parallel algorithms for mining association rules in text databases [D] . Holt, John D. 2003

机译：用于挖掘文本数据库中关联规则的高效顺序和并行算法
6. Reducing Free-Text Communication Orders Placed by Providers Using Association Rule Mining [O] . Zahra Hajihashemi Master, Paul Pancoast 2012

机译：使用关联规则挖掘减少提供者下达的自由文本通信顺序
7. Multi-objective Genetic Algorithm for Association Rule Mining Using a Homogeneous Dedicated Cluster of Workstations [O] . 2008

机译：基于齐次专用工作站的关联规则挖掘的多目标遗传算法
8. Science and Technology Text Mining: Origins of Database Tomography and Multi-Word Phrase Clustering. [R] . Kostoff, R. N. 2003

机译：科技文本挖掘：数据库层析成像和多词短语聚类的起源。

Parallel mining of association rules from text databases on a cluster of workstations

摘要

著录项

相似文献

相关主题

期刊订阅