The Reliable Knowledge Discovery in Textual Database using R Infrastructure

Anu Yadav

首页> 外文期刊>Advances in Computer Science and Information Technology: ACSIT >The Reliable Knowledge Discovery in Textual Database using R Infrastructure

【24h】

The Reliable Knowledge Discovery in Textual Database using R Infrastructure

机译：使用R基础架构的文本数据库中可靠的知识发现

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

ZJn today’s world, the internet and computer technology enormously increased the amount of stored information and unprecedented expansion in the amount of unstructured data in the textual formats, we cannot use the data for any processing to extract useful information, due to the rapid growth of digital data, and Information explosion and availability has changed the nature of information centers. Hence, knowledge discovery and text data mining have attracted an empirical attention with an imminent need for turning such data into useful information, patterns and knowledge. Text mining has become an interesting area in business intelligence application, healthcare, media and research. Text Mining can be defined as a technique which is a process used to analyze text to extract interesting and meaningful information from new or previously unknown information, non-trivial patterns or knowledge of the unstructured text documents or from different resources for particular purposes. The text mining is an interdisciplinary research held utilizing techniques from computer science, computational linguistics, information retrieval, data mining and statistics. Existing toolkits for text mining have low extensibility, lack of availability of application programming interfaces and provide less support for interacting with computing environments. Hence, in this paper, we propose a text mining in R infrastructure or computing environment, it provides intelligent methods for Meta data management and operations on documents, such as preprocessing, data cloud formation, frequency graphs, text clustering and text classification. This paper presents how text mining techniques can be applied in R infrastructure and better utilizing infrastructure features than other text mining products such as dtSearch, SPSS, SAS Text Miner, RapidMiner, weka, etc.

机译：ZJN今天的世界，互联网和计算机技术在文本格式中大大增加了存储信息的数量和非结构化数据量的展望，由于数字的快速增长，我们不能使用任何处理来提取有用信息的数据数据，信息爆炸和可用性改变了信息中心的性质。因此，知识发现和文本数据挖掘引起了实证的关注，即将需要将这些数据转化为有用的信息，模式和知识。文本挖掘已成为商业智能应用，医疗保健，媒体和研究中的一个有趣区域。文本挖掘可以被定义为一种技术，该技术是用于分析文本以从新的或先前未知的信息，非琐事模式或非结构化文本文档的知识或非结构化文本文档的知识或特定资源的不同资源中提取有趣和有意义的信息。文本挖掘是利用计算机科学，计算语言学，信息检索，数据挖掘和统计数据的技术持有跨学科研究。用于文本挖掘的现有工具包具有低的可扩展性，缺乏应用程序编程接口的可用性，并提供对与计算环境进行交互的较少支持。因此，在本文中，我们提出了R基础设施或计算环境中的文本挖掘，它为元数据管理和文档的操作提供了智能方法，例如预处理，数据云形成，频率图，文本群集和文本分类。本文介绍了文本挖掘技术如何应用于R基础设施，而且更好地利用基础设施特征，而不是其他文本挖掘产品，如DTSearch，SPSS，SAS文本矿工，Rapidminer，Weka等。

著录项

来源
《Advances in Computer Science and Information Technology: ACSIT》 |2016年第4期|共7页
作者
Anu Yadav;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. A New Approach for Knowledge Discovery in Distributed Databases Using Fragmented Data Storage Model [J] . Masoud Pesaran Behbahani, Islam Choudhury, Souheil Khaddaj 中国经济评论：英文版 . 2013,第012期
2. Intelligent　Information　Management　and　Knowledge　Discovery　in　Large　Numeric　and　Scientific　Databases [J] . 系统工程与电子技术：英文版 . 1996,第002期
3. Data integration and knowledge discovery in biomedical databases. Reliable information from unreliable sources [J] . A Mitnitski, A Mogilner, C MacKnight, Data science journal . 2003,第2003期

机译：生物医学数据库中的数据集成和知识发现。来自不可靠来源的可靠信息
4. Discovery of textual knowledge flow based on the management of knowledge maps [J] . Xiangfeng Luo, Qingliang Hu, Weimin Xu, Concurrency and Computation . 2008,第15期

机译：基于知识图谱管理的文本知识流发现
5. Automatic discovery of abnormal values in large textual databases [J] . Goekhan Kul Computing reviews . 2016,第12期

机译：在大型文本数据库中自动发现异常值
6. Mining concept associations for knowledge discovery in large textual databases [C] . Xiaowei Xu, Mutlu Mete, Nurcan Yuruk ACM symposium on Applied computing . 2005

机译：大型文本数据库中用于知识发现的挖掘概念关联
7. Knowledge Discovery for Health Informatics from Structured and Textual Data [D] . Al-Bahrani, Reda. 2018

机译：从结构化和文本数据中发现健康信息学知识
8. Biospecimen Repositories and Integrated Databases as Critical Infrastructure for Pathogen Discovery and Pathobiology Research [O] . Jonathan L. Dunnum, Richard Yanagihara, Karl M. Johnson, 2017

机译：生物样本库和集成数据库是病原体发现和病理生物学研究的关键基础设施
9. Knowledge Discovery in Textual Databases: A Concept-Association Mining Approach [O] . Mutlu Mete, Nurcan Yuruk, Xiaowei Xu, 2014

机译：文本数据库中的知识发现：概念 - 关联挖掘方法
10. Computation Infrastructure for Knowledge-Based Development of Reliable Software Systems [R] . Constable, R. , Kreitz, C. 2006

机译：基于知识的可靠软件系统开发的计算基础设施

The Reliable Knowledge Discovery in Textual Database using R Infrastructure

摘要

著录项

相似文献

相关主题

期刊订阅