Improving Information-Carrying Data Capacity in Text Mining


Abstract

This article shows the relation between the choice of textual data representation and text mining quality. To this end, the information-carrying capacity of data is formalized. A procedure for comparing the information-carrying capacity of data with different structures is then described. Moreover, a method of preparing the y-gram representation of a text is presented, which involves machine learning methods and an ontology created by a domain expert. This method integrates expert knowledge with automatic methods to extend traditional text-mining technology, which cannot understand text semantics. A representation built in this way can improve the quality of text mining, as shown in the test research.
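The abstract's central claim, that the chosen representation affects what a text-mining method can distinguish, can be illustrated with a minimal sketch. It assumes a plain word n-gram term-frequency vector space model with cosine similarity; it is not the paper's y-gram method, and it omits the ontology and machine learning steps, which the abstract does not specify. The function names (`ngrams`, `tf_vector`, `cosine`) and the two sample sentences are hypothetical.

```python
import math
from collections import Counter


def ngrams(text, n):
    """Split text on whitespace and return its word n-grams as strings."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def tf_vector(text, n):
    """Sparse term-frequency vector: n-gram -> count."""
    return Counter(ngrams(text, n))


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Two hypothetical documents with the same words in a different order.
doc_a = "text mining improves data quality"
doc_b = "data quality improves text mining"

# Unigrams ignore word order; bigrams keep part of it.
unigram_sim = cosine(tf_vector(doc_a, 1), tf_vector(doc_b, 1))
bigram_sim = cosine(tf_vector(doc_a, 2), tf_vector(doc_b, 2))

print(f"unigram similarity: {unigram_sim:.2f}")
print(f"bigram similarity:  {bigram_sim:.2f}")
```

Under the unigram representation the two documents look identical, while the bigram representation separates them, because only two of the four bigrams in each document are shared. This is one simple sense in which a representation can carry more or less information for a downstream task.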


Bibliographic record

Source: International Conference on Computational Collective Intelligence, 2015, pp. 648-657 (10 pages)
Author: Marcin Gibert
Original format: PDF
Keywords: Text mining; Information-carrying data capacity; Vector space model; Text documents representation


