TEXT INTERPRETATION USING A MODIFIED PROCESS OF THE ONTOLOGY AND SPARSE CLUSTERING

IONIA VERITAWATI; ITO WASITO; T. BASARUDDIN

首页> 外文期刊>Journal of Theoretical and Applied Information Technology >TEXT INTERPRETATION USING A MODIFIED PROCESS OF THE ONTOLOGY AND SPARSE CLUSTERING

【24h】

TEXT INTERPRETATION USING A MODIFIED PROCESS OF THE ONTOLOGY AND SPARSE CLUSTERING

机译：使用本体和稀疏聚类的改进过程进行文本解释

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Many texts in online media consist of various information that need an appropriate way to extract and interpret them clearly. For better understanding of the content in the text collected from any online media, a proper methodology for the interpretation of useful information must be developed. This study offers a modified process of the text interpretation consisting of four stages with a preliminary stage of the text preprocessing and key phrase extraction using the annotated suffix tree (AST) technique and secondary stage of developing sparse clustering method named as iterative scaling of fuzzy additive spectral clustering (is-FADDIS) combined with a sharpening technique for grouping key phrases from the text. An ontology as the ?knowledge base? was developed combining with is-FADDIS method as the third stage. Interpretation from the input text was carried out as the final stage of the text interpretation. The performances of is-FADDIS clustering combined with sharpening technique as high as 96 and 78% were verified for some modeled sparse data and two specific real sparse data from two corpus, respectively, and could be better when comparing with Nonnegative Matrices Factorization (NMF) and K-means. The text interpretation of using the ontology gives a clear graph visualization on the relationship among key phrases even though it has a low correlation with content of the text. The result findings of this study potentially help us in ensuring an automatic process to be used for the interpretation of any topic information collected from online media.

机译：在线媒体中的许多文本包含各种信息，这些信息需要适当的方式来清楚地提取和解释它们。为了更好地理解从任何在线媒体收集的文本中的内容，必须开发一种解释有用信息的适当方法。这项研究提供了一个文本解释的修改过程，包括四个阶段，包括文本预处理和使用带注释后缀树（AST）技术的关键短语提取的初步阶段，以及开发稀疏聚类方法的第二阶段，称为模糊加法的迭代缩放。频谱聚类（is-FADDIS）与锐化技术相结合，可对文本中的关键短语进行分组。本体作为“知识基础”？第三阶段是结合is-FADDIS方法开发的。从输入文本进行的解释是文本解释的最后阶段。分别对来自两个语料库的一些模型化稀疏数据和两个特定的真实稀疏数据，验证了is-FADDIS聚类与锐化技术相结合的性能分别达到96％和78％，与非负矩阵因子分解（NMF）相比可能会更好和K-均值使用本体的文本解释可以清晰显示关键短语之间的关系，即使它与文本内容的相关性较低。这项研究的结果可能有助于我们确保使用自动过程来解释从在线媒体收集的任何主题信息。

著录项

来源
《Journal of Theoretical and Applied Information Technology》 |2017年第5期|共页
作者
IONIA VERITAWATI; ITO WASITO; T. BASARUDDIN;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. An Ontology-Based Representation of Financial Criminology Domain Using Text Analytics Processing [J] . Zulazeze Sahri, Shuhaida Mohammed Shuhidan, Zuraidah Mohd Sanusi International journal of computer science and network security . 2018,第2期

机译：基于文本分析处理的金融犯罪领域的基于本体的表示
2. Ontology population as algebraic information system processing based on multi-agent natural language text analysis algorithms [J] . Garanina N. O., Sidorova E. A. Programming and Computer Software . 2015,第3期

机译：基于多主体自然语言文本分析算法的本体人口作为代数信息系统处理
3. Text Mining Approach for Prediction of Tumor Using Ontology Based Particle Swarm Optimization with Clustering Techniques [J] . Jyotsna, Govindarajulu International journal of computer science and network security . 2018,第5期

机译：基于本体的粒子群优化聚类技术的文本预测文本挖掘方法
4. Text Document Clustering with Ontology Applying Modify Concept Weighting [C] . Hmway Hmway Tar, Myint Myint Khaing International Conference on Genetic and Evolutionary Computing . 2016

机译：文本文档群集与本体应用修改概念加权
5. A comparative study on ontology generation and text clustering using VSM, LSI, and document ontology models. [D] . Taylor, William P., II. 2007

机译：使用VSM，LSI和文档本体模型进行本体生成和文本聚类的比较研究。
6. Natural language processing algorithms for mapping clinical text fragments onto ontology concepts: a systematic review and recommendations for future studies [O] . Martijn G. Kersloot, Florentien J. P. van Putten, Ameen Abu-Hanna, 2020

机译：用于将临床文本碎片映射到本体概念的自然语言处理算法：未来研究的系统审查和建议
7. Ontology Learning Process as a Bottom-up Strategy for Building Domain-specific Ontology from Legal Texts [O] . Mirna El Ghosh, Hala Naja, Habib Abdulrab, 2017

机译：本体学习过程作为从法律文本构建特定域的本体的自下而上策略

TEXT INTERPRETATION USING A MODIFIED PROCESS OF THE ONTOLOGY AND SPARSE CLUSTERING

摘要

著录项

相似文献

相关主题

期刊订阅