Event Mining Through Clustering

E. Umamaheswari; T. V. Geetha


Abstract

Traditional document clustering algorithms cluster documents using text-based features such as unique word count and concept count. Event mining, by contrast, is the extraction of specific events, their related sub-events, and the associated semantic relations from documents. This work discusses an approach to event mining through clustering. A Universal Networking Language (UNL)-based subgraph, a semantic representation of the document, is used as the input for clustering. Our research explores three different feature sets for event clustering and compares the resulting approaches for specific event mining. In our previous work, the clustering algorithm used UNL-based event semantics to represent event context. However, that approach clustered different events with similar semantics together. Hence, instead of considering only UNL event semantics, we assigned additional weight to the similarity between event contexts that share event-related attributes such as time, place, and persons. Although this places each specific event in a single cluster, the sub-events related to that event are not necessarily in the same cluster. Therefore, to improve our cluster efficiency, connective terms between two sentences and their representation as UNL subgraphs were also considered when determining similarity. By combining UNL semantics, event-specific argument similarity, and connective-term concepts between sentences, we were able to obtain clusters for specific events and their sub-events. We used 112,000 Tamil documents from the Forum for Information Retrieval Evaluation (FIRE) data corpus and achieved good results. We also compared our approach with the previous state-of-the-art approach on the Reuters RCV1 corpus and achieved a 30% improvement in precision.
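The combination of the three feature sets described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not the authors' implementation: the `EventContext` class, the use of Jaccard overlap as a stand-in for UNL subgraph similarity, and the weights `w_sem`, `w_attr`, and `w_conn` are all hypothetical, since the abstract gives no formula or weight values.

```python
from dataclasses import dataclass, field


@dataclass
class EventContext:
    """Toy stand-in for an event context (the paper uses UNL subgraphs)."""
    semantic_edges: set = field(default_factory=set)  # (concept, relation, concept) triples
    attributes: set = field(default_factory=set)      # time, place, and person values
    connectives: set = field(default_factory=set)     # connective-term concepts


def jaccard(a, b):
    """Jaccard overlap of two sets; defined as 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def combined_similarity(x, y, w_sem=0.5, w_attr=0.3, w_conn=0.2):
    """Weighted sum of the three similarities; the weights are hypothetical."""
    return (w_sem * jaccard(x.semantic_edges, y.semantic_edges)
            + w_attr * jaccard(x.attributes, y.attributes)
            + w_conn * jaccard(x.connectives, y.connectives))


# Two made-up events with identical semantics but different time and place.
edges = {("meeting", "agt", "minister"), ("meeting", "plc", "city")}
e1 = EventContext(edges, {"2010-01-05", "city_a"}, {"after"})
e2 = EventContext(edges, {"2011-07-19", "city_b"}, set())

print(jaccard(e1.semantic_edges, e2.semantic_edges))  # semantics-only score
print(combined_similarity(e1, e2))                    # score with all three feature sets
```

In this toy case the semantics-only score treats the two events as identical, while the combined score is lower because their attributes and connective terms differ. That mirrors the motivation stated in the abstract for adding event-related attributes and connective terms to the similarity measure.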


Bibliographic record

Source: Journal of Intelligent Systems, 2014, Issue 1, 15 pages
Authors: E. Umamaheswari; T. V. Geetha
Original format: PDF
Language: English
Chinese Library Classification: Automation systems
Keywords: Event mining; Event-context clustering; Graph-based clustering; Universal Networking Language (UNL); Semantics
Date added to database: 2022-08-19 01:49:15

