Mining association rules in text databases using multipass with inverted hashing and pruning

机译：使用倒置散列和修剪的MultiPass挖掘文本数据库中的挖掘关联规则

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a new algorithm named Multipass with Inverted Hashing and Pruning (MIHP) for mining association rules between words in text databases. The characteristics of text databases are quite different from those of retail transaction databases, and existing mining algorithms cannot handle text databases efficiently because of the large number of itemsets (i.e., words) that need to be counted. Two well-known mining algorithms, the Apriori algorithm [1] and the Direct Hashing and Pruning (DHP) algorithm [8], are evaluated in the context of mining text databases, and are compared with the proposed MIHP algorithm. It has been shown that the MIHP algorithm has better performance for large text databases.

机译：在本文中，我们提出了一种名为MultiPass的新算法，其中具有反相散列和修剪（MIHP），用于文本数据库中的单词之间的挖掘关联规则。文本数据库的特征与零售事务数据库的特征完全不同，并且由于需要计算的大量项目集（即单词），现有的挖掘算法无法有效处理文本数据库。在采矿文本数据库的上下文中，评估了两个众所周知的挖掘算法，APRiori算法[1]和直接散列和修剪（DHP）算法[8]，并与所提出的MIHP算法进行比较。已经表明，MIHP算法对大型文本数据库具有更好的性能。

著录项

来源
《International Conference on Tools with Artifical Intelligence》|2002年||共8页
会议地点
作者
John D. Holt; Soon M. Chung;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
Association rules; Text database; Inverted hashing; Performance analysis;

机译：关联规则;文本数据库;倒散;绩效分析;

相似文献

外文文献
中文文献
专利

1. Mining association rules using inverted hashing and pruning [J] . John D. Holt, Soon M. Chung Information Processing Letters . 2002,第4期

机译：使用反向哈希和修剪挖掘关联规则
2. Multipass Algorithms for Mining Association Rules in Text Databases [J] . John D. Holt, Soon M. Chung Knowledge and Information Systems . 2001,第2期

机译：文本数据库中关联规则的多遍算法
3. A New Perfect Hashing and Pruning Algorithm for Mining Association Rule [J] . Hassan Najadat, Amani Shatnawi, Ghadeer Obiedat IBIMA Communications . 2011,第7期

机译：一种新的完善的关联规则散列和修剪算法
4. Mining association rules in text databases using multipass with inverted hashing and pruning [C] . Holt, J.D., Chung, . 2002

机译：使用带倒置哈希和修剪的多遍来挖掘文本数据库中的关联规则
5. Efficient sequential and parallel algorithms for mining association rules in text databases [D] . Holt, John D. 2003

机译：用于挖掘文本数据库中关联规则的高效顺序和并行算法
6. Reducing Free-Text Communication Orders Placed by Providers Using Association Rule Mining [O] . Zahra Hajihashemi Master, Paul Pancoast 2012

机译：使用关联规则挖掘减少提供者下达的自由文本通信顺序
7. Enhancing association rules algorithms for mining distributed databases. Integration of fast BitTable and multi-agent association rules mining in distributed medical databases for decision support. [O] . Abdo Walid Adly Atteya 2012

机译：增强用于挖掘分布式数据库的关联规则算法。快速BitTable和多代理关联规则挖掘在分布式医疗数据库中的集成，以提供决策支持。
8. Constraint Satisfaction Neural Network Approach for Data Mining Classification and Association Rules in Breast Cancer Databases [R] . Tourassi, G. D. 2003

机译：基于约束满足神经网络的乳腺癌数据挖掘分类与关联规则

Mining association rules in text databases using multipass with inverted hashing and pruning

摘要

著录项

相似文献

相关主题

期刊订阅