Journal of Management Information Systems

Preserving User Preferences in Automated Document-Category Management: An Evolution-Based Approach


Abstract

Analysis of prevalent document management practices shows that categories (e.g., folders) are widely used to organize documents for subsequent search and retrieval. The coherence and distinctiveness of an existing document category can diminish considerably as new documents arrive over time. The complexity and effort involved in document-category management favor an automated approach supported by appropriate document-clustering techniques. A review of the extant literature shows that automated document-category management focuses predominantly on document content analysis, which cannot preserve the user's document-grouping preferences. This research develops two advanced evolution-based techniques for preserving user preferences in the management of document categories. The first technique (CE2) supports the automated evolution of a set of flat (i.e., nonhierarchical) document categories. It extends a promising evolution-based technique (category evolution, CE) by addressing the fundamental limitations inherent in CE's use of holistic measures. The second technique, category hierarchy evolution (CHE), builds on CE2 to support scenarios in which document categories are organized hierarchically. Each technique was evaluated empirically against associated salient benchmark techniques in various category evolution scenarios created from two document corpora (news documents from Reuters and research articles from the ACM digital library). The evaluations show that CE2 and CHE outperform their respective benchmark techniques. Their performance is reasonably robust and appears more effective when the quality (coherence) of the previously created categories has not deteriorated excessively. According to our results, the evolution-based approach is viable, appealing, and capable of preserving user preferences in automatic reorganizations of document categories.
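The abstract's point that a category's coherence can diminish as new documents arrive can be illustrated with a small sketch. It is not the CE2 or CHE technique and is not taken from the paper. It assumes, purely for illustration, that coherence is the mean pairwise cosine similarity of bag-of-words vectors. The function names `term_vector`, `cosine`, and `coherence` and the sample documents are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def term_vector(text):
    """Bag-of-words term counts for one document (lowercased, whitespace-split)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def coherence(docs):
    """Assumed coherence measure: mean pairwise cosine similarity.

    Returns 1.0 by convention when the category has fewer than two documents.
    """
    vectors = [term_vector(d) for d in docs]
    pairs = list(combinations(vectors, 2))
    if not pairs:
        return 1.0
    return sum(cosine(a, b) for a, b in pairs) / len(pairs)


# Hypothetical folder: two related documents, then an unrelated arrival.
folder = ["stock market rally", "stock market crash"]
before = coherence(folder)
after = coherence(folder + ["football match result"])
print(before, after)
```

Under this assumed measure, the unrelated third document lowers the folder's coherence. That kind of drift is what motivates reorganizing categories over time.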