首页> 外文会议>International conference on computational collective intelligence >Improving Information-Carrying Data Capacity in Text Mining
【24h】

Improving Information-Carrying Data Capacity in Text Mining

机译:在文本挖掘中提高信息承载数据的容量

获取原文

摘要

In this article the relation between the selection of textual data representation and text mining quality has been shown. Due to this, the information-carrying capacity of data has been formalized. Then the procedure of comparing information-carrying data capacity with different structures has been described. Moreover, the method of preparing the y -gram representation of a text involving machine learning methods and ontology created by the domain expert, has been presented. This method integrates expert knowledge and automatic methods to develop the traditional text-mining technology, which cannot understand text semantics. Representation built in this way can improve the quality of text mining, what was shown in the test research.
机译:本文显示了文本数据表示的选择与文本挖掘质量之间的关系。因此,数据的信息承载能力已经正规化。然后描述了比较具有不同结构的信息承载数据容量的过程。此外,已经提出了准备文本的y-gram表示的方法,该方法涉及由领域专家创建的机器学习方法和本体。该方法结合了专家知识和自动方法,开发了无法理解文本语义的传统文本挖掘技术。测试研究表明,以这种方式构建的表示形式可以提高文本挖掘的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号