首页> 外文会议>Digital Libraries: Universal and Ubiquitous Access to Information >A Query Language and Its Processing for Time-Series Document Clusters
【24h】

A Query Language and Its Processing for Time-Series Document Clusters

机译:时间序列文档簇的查询语言及其处理

获取原文
获取原文并翻译 | 示例

摘要

Document clustering methods for time-series documents produce a sequence of snapshots of clustering results over time. Analyzing the contents (topics) and trends in a long sequence of clustering snapshots is hard and requires efforts since there are too many number of clusters; a user may need to access every cluster or read every document contained in each cluster. In this paper, we propose a framework to find clusters of user interest and change patterns called transition patterns involving the clusters. A cluster in a clustering result may persist in another cluster, branch into more than one cluster, merge with other clusters to form one cluster, or disappear in the adjacent clustering result. This research aims at providing users facilities to retrieve specific transition patterns in the clustering results. For this purpose, we propose a query language for time-series document clustering results and an approach to query processing. The first experimental results on TDT2 corpus clustering results are presented.
机译:时间序列文档的文档聚类方法会随时间生成一系列聚类结果的快照。由于群集数量过多,因此很难分析长时间群集快照中的内容(主题)和趋势。用户可能需要访问每个群集或阅读每个群集中包含的每个文档。在本文中,我们提出了一个框架来查找用户兴趣和变化模式的集群,这些变化模式称为涉及集群的过渡模式。聚类结果中的一个聚类可以保留在另一个聚类中,分支成多个聚类,与其他聚类合并形成一个聚类,或者在相邻聚类结果中消失。这项研究旨在为用户提供在聚类结果中检索特定过渡模式的便利。为此,我们提出了一种用于时序文档聚类结果的查询语言和一种查询处理方法。提出了关于TDT2语料库聚类结果的第一个实验结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号