首页> 外文会议>International Conference on Engineering Technologies and Computer Science >Automatic Creation Technologies of Declarative Tools for Clustering Media Documents
【24h】

Automatic Creation Technologies of Declarative Tools for Clustering Media Documents

机译:用于群集媒体文档的声明性工具的自动创建技术

获取原文

摘要

The article describes the methods of identifying the conceptual content structure of the dataset of documents for the clustering. It was found that in the automatic extraction of key text concepts it is necessary to use the criteria of semantic significance of words and phrases obtained on the basis of syntactic, statistical and semantic methods. The syntactic criteria are based on the definition of the syntactic role of words and phrases in the text dataset. We accent on those elements of sentences that forms its semantic (predicate-actant) structure. In this research four methods of automatic identification of key text concepts have been elaborated, their comparative analysis is carried out and the technology of automatic creation of declarative means for text clustering of media is developed. The precision assessment of document clustering with and without declarative methods is conducted on test dataset.
机译:本文介绍了识别用于聚类的文档数据集的概念性内容结构的方法。结果发现,在关键文本概念的自动提取中,有必要使用基于句法,统计和语义方法获得的单词和短语的语义重要性标准。句法标准基于文本数据集中单词和短语的句法作用的定义。我们着重强调构成其语义(谓语-actant)结构的句子的那些元素。本研究阐述了四种关键文本概念的自动识别方法,进行了比较分析,并开发了自动创建用于媒体文本聚类的声明性手段的技术。在测试数据集上使用或不使用声明方法对文档聚类进行精度评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号