【24h】

Using Patterns Co-occurrence Matrix for Cleaning Closed Sequential Patterns for Text Mining

机译：使用模式共现矩阵来清理封闭的顺序模式以进行文本挖掘

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the overwhelming increase in the amount of texts on the web, it is almost impossible for people to keep abreast of up-to-date information. Text mining is a process by which interesting information is derived from text through the discovery of patterns and trends. Text mining algorithms are used to guarantee the quality of extracted knowledge. However, the extracted patterns using text or data mining algorithms or methods leads to noisy patterns and inconsistency. Thus, different challenges arise, such as the question of how to understand these patterns, whether the model that has been used is suitable, and if all the patterns that have been extracted are relevant. Furthermore, the research raises the question of how to give a correct weight to the extracted knowledge. To address these issues, this paper presents a text post-processing method, which uses a pattern co-occurrence matrix to find the relation between extracted patterns in order to reduce noisy patterns. The main objective of this paper is not only reducing the number of closed sequential patterns, but also improving the performance of pattern mining as well. The experimental results on Reuters Corpus Volume 1 data collection and TREC filtering topics show that the proposed method is promising.

机译：随着网络上文本数量的飞速增长，人们几乎无法跟上最新信息。文本挖掘是通过发现样式和趋势从文本中获取有趣信息的过程。文本挖掘算法用于保证所提取知识的质量。但是，使用文本或数据挖掘算法或方法提取的模式会导致噪声模式和不一致。因此，出现了不同的挑战，例如如何理解这些模式，已使用的模型是否合适以及是否已提取的所有模式都相关的问题。此外，研究提出了一个问题，即如何对所提取的知识给予正确的权重。为了解决这些问题，本文提出了一种文本后处理方法，该方法使用模式共现矩阵来查找提取的模式之间的关系，以减少噪声模式。本文的主要目的不仅是减少闭合顺序模式的数量，而且还提高了模式挖掘的性能。 Reuters Corpus第1卷数据收集和TREC过滤主题的实验结果表明，该方法很有希望。

著录项

来源
《IEEE/WIC/ACM International Conference on Web Intelligence;WI 2012;IAT 2012;IEEE/WIC/ACM International Conference on Intelligent Agent Technology;ODMWI 2012;International Workshop on Optimization-based Data Mining and Web Intelligence;BI 2012;International Workshop on Behavior Informatics;IWI-2012;TF'12;International Workshop on Intelligent Web Interaction;NLPOE 2012;International Workshop on Tourism Facilities;NiCaM-WI 2012;WPRSM 2012;International Workshop on Natural Language Processing and Ontology Engineering;WIRSS;International Workshop on Nature-Inspired Computing and Metaheuristics for Web Intelligence;WISS 2012;International Workshop on Web Personalization, Recommender Systems and Social Media;International Workshop on Web Information Retrieval Support Systems;International Symposium on Web Intelligent Systems Services;Combined Workshop on Cross-Cultural and Cross-Linguistic Semantic Web and Software Agent Teamwork for the Semantic Web;International Workshop on Social Networks and Data Processing;SNDP 2012;International Symposium on the Intelligent Campus;IC'12;International Workshop on Green Computing and Sustainable Society;GCSS》|2012年|p.201-205|共5页
会议地点
作者
Albathan Mubarak; Li Yuefeng; Algarni Abdulmohsen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词
Closed Sequential pattern; Information retrieval; Pattern co-occurrence matrix; Text mining;

机译：封闭序列模式;信息检索;模式共现矩阵;文本挖掘;

相似文献

外文文献
中文文献
专利

1. Top-κ closed co-occurrence patterns mining with differential privacy over multiple streams [J] . Jinyan Wang, Shijian Fang, Chen Liu, Future generation computer systems . 2020,第Octa期

机译：Top-κ封闭的共同发生模式采用多个流的差异隐私
2. A novel mapreduce algorithm for distributed mining of sequential patterns using co-occurrence information [J] . Saleti Sumalatha, Subramanyam R. B. V. Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies . 2019,第1期

机译：一种使用共发生信息的分布式挖掘的新型MapReduce算法
3. C3Ro: An efficient mining algorithm of extended-closed contiguous robust sequential patterns in noisy data [J] . Abboud Y., Brun A., Boyer A. Expert Systems with Application . 2019,第OCTa期

机译：C3Ro：噪声数据中扩展-闭合的连续鲁棒顺序模式的有效挖掘算法
4. Using Patterns Co-occurrence Matrix for Cleaning Closed Sequential Patterns for Text Mining [C] . Albathan Mubarak, Li Yuefeng, Algarni Abdulmohsen IEEE/WIC/ACM International Conference on Web Intelligence;IEEE/WIC/ACM International Conference on Intelligent Agent Technology;International Workshop on Optimization-Based Data Mining and Web Intelligence;International Workshop on Behavior Informatics;International Workshop on Intelligent Web Interaction;International Workshop on Tourism Facilities;International Workshop on Natural Language Processing and Ontology Engineering;International Workshop on Web Personalization, Recommender Systems and Social Media;International Workshop on Nature-Inspired Computing and Metaheuristics for Web Intelligence;International Workshop on Web Information Retrieval Support Systems;International Symposium on Web Intelligent Systems Services;Combined Workshop on Cross-Cultural and Cross-Linguistic Semantic Web and Software Agent Teamwork for the Semantic Web;International Workshop on Social Networks and Data Processing;International Symposium on the Intelligent Campus;International Workshop on Green Computing and Sustainable Society . 2012

机译：使用模式共发生矩阵清洁闭合拼接的闭合连续图案
5. Sequential patterns and temporal patterns for text mining. [D] . Hoonlor, Apirak. 2011

机译：文本挖掘的顺序模式和时间模式。
6. NetNCSP: Nonoverlapping closed sequential pattern mining [O] . Youxi Wu, Changrui Zhu, Yan Li, -1

机译：NetNCSP：不重叠的封闭顺序模式挖掘
7. Using patterns co-occurrence matrix for cleaning closed sequential patterns for text mining [O] . Albathan Mubarak, Li Yuefeng, Algarni Abdulmohsen 2012

机译：使用模式共现矩阵来清理封闭的顺序模式以进行文本挖掘

Using Patterns Co-occurrence Matrix for Cleaning Closed Sequential Patterns for Text Mining

摘要

著录项

相似文献

相关主题

期刊订阅