A Query Language and Its Processing for Time-Series Document Clusters

机译：时间序列文档簇的查询语言及其处理

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Document clustering methods for time-series documents produce a sequence of snapshots of clustering results over time. Analyzing the contents (topics) and trends in a long sequence of clustering snapshots is hard and requires efforts since there are too many number of clusters; a user may need to access every cluster or read every document contained in each cluster. In this paper, we propose a framework to find clusters of user interest and change patterns called transition patterns involving the clusters. A cluster in a clustering result may persist in another cluster, branch into more than one cluster, merge with other clusters to form one cluster, or disappear in the adjacent clustering result. This research aims at providing users facilities to retrieve specific transition patterns in the clustering results. For this purpose, we propose a query language for time-series document clustering results and an approach to query processing. The first experimental results on TDT2 corpus clustering results are presented.

机译：时间序列文档的文档聚类方法会随时间生成一系列聚类结果的快照。由于群集数量过多，因此很难分析长时间群集快照中的内容（主题）和趋势。用户可能需要访问每个群集或阅读每个群集中包含的每个文档。在本文中，我们提出了一个框架来查找用户兴趣和变化模式的集群，这些变化模式称为涉及集群的过渡模式。聚类结果中的一个聚类可以保留在另一个聚类中，分支成多个聚类，与其他聚类合并形成一个聚类，或者在相邻聚类结果中消失。这项研究旨在为用户提供在聚类结果中检索特定过渡模式的便利。为此，我们提出了一种用于时序文档聚类结果的查询语言和一种查询处理方法。提出了关于TDT2语料库聚类结果的第一个实验结果。

著录项

来源
《Digital Libraries: Universal and Ubiquitous Access to Information》|2008年|82-92|共11页
会议地点 Bali(ID);Bali(ID)
作者
Sophoin Khy; Yoshiharu Ishikawa; Hiroyuki Kitagawa;
展开▼
作者单位

Graduate School of Systems and Information Engineering, University of Tsukuba;

Information Technology Center, Nagoya University;

Graduate School of Systems and Information Engineering, University of Tsukuba Center for Computation Sciences, University of Tsukuba;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算机网络;
关键词
cluster graph; cluster transition; clustering result; graph query; query language; query processing; transition pattern;

机译：聚类图集群过渡；聚类结果；图查询查询语言；查询处理；过渡模式;

相似文献

外文文献
中文文献
专利

1. Natural language processing methods for knowledge management-Applying document clustering for fast search and grouping of engineering documents [J] . Ivar Örn Arnarsson, Otto Frost, Emil Gustavsson, Concurrent engineering: research and applications . 2021,第2期

机译：用于知识管理的自然语言处理方法，用于快速搜索和分组工程文档的文档集群
2. The Opposite of Smoothing: A Language Model Approach to Ranking Query-Specific Document Clusters [J] . Krikon E., Kurland O. The Journal of Artificial Intelligence Research . 2011,第8期

机译：平滑的对立面：一种用于对查询特定的文档簇进行排名的语言模型方法
3. The Opposite of Smoothing: A Language Model Approach to Ranking Query-Specific Document Clusters [J] . Oren Kurland, Eyal Krikon The Journal of Artificial Intelligence Research . 2011,第Null期

机译：平滑的对立面：一种用于对查询特定的文档簇进行排名的语言模型方法
4. A Query Language and Its Processing for Time-Series Document Clusters [C] . Sophoin Khy, Yoshiharu Ishikawa, Hiroyuki Kitagawa International Conference on Asian Digital Libraries . 2008

机译：时序文档集群的查询语言及其处理
5. Multimedia query languages and query processing techniques [D] . Lee, Taekyong 1998

机译：多媒体查询语言和查询处理技术
6. Efficient Queries of Stand-off Annotations for Natural Language Processing on Electronic Medical Records [O] . Yuan Luo, Peter Szolovits 2016

机译：电子病历上自然语言处理的有效注释注释的有效查询
7. Natural language processing methods for knowledge management—Applying document clustering for fast search and grouping of engineering documents [O] . Ivar Örn Arnarsson, Otto Frost, Emil Gustavsson, 2021

机译：用于知识管理的自然语言处理方法，用于应用文档聚类，以便快速搜索和分组工程文档

A Query Language and Its Processing for Time-Series Document Clusters

摘要

著录项

相似文献

相关主题

期刊订阅