Learning Ontology Resolution for Document Representation and its Applications in Text Mining

机译：学习文档代表的本体决议及其在文本挖掘中的应用

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

It is well known that synonymous and polysemous terms often bring in some noises when calculating the similarity between documents. Existing ontology-based document representation methods are static, hence, the chosen semantic concept set for representing a document has a fixed resolution and it is not adaptable to the characteristics of a document collection and the text mining problem in hand. We propose an Adaptive Concept Resolution (ACR) model to overcome this issue. ACR can learn a concept border from an ontology taking into consideration of the characteristics of a particular document collection. Then this border can provide a tailor-made semantic concept representation for a document coming from the same domain. Another advantage of ACR is that it is applicable in both classification task where the groups are given in the training document set, and clustering task where no group information is available. Furthermore, the result of this model is not sensitive to the model parameter. The experimental results show that ACR outperforms an existing static method significantly.

机译：众所周知，同义和多殖民术语通常会在计算文件之间的相似性时带来一些噪音。现有的基于本体的文档表示的方法是静态的，因此，用于表示一个文件选择的语义概念集具有一个固定的分辨率，它是不适合于一个文档集合在手，文本挖掘问题的特性。我们提出了一个自适应概念分辨率（ACR）模型来克服这个问题。考虑到特定文件集合的特征，ACR可以从本体学中学到一个概念边界。然后，此边框可以为来自同一域的文档提供量身定制的语义概念表示。 ACR的另一个优点是它适用于两个分类任务，其中组在训练文件集中给出，并且没有可用组信息的聚类任务。此外，该模型的结果对模型参数不敏感。实验结果表明，ACR显着优于现有的静态方法。

著录项

来源
《ACM conference on information and knowledge management》|2010年||共4页
会议地点
作者
Lidong Bing; Bai Sun; Shan Jiang; Yan Zhang; Wai Lam;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词
ontology; adaptive concept resolution;

机译：本体;自适应概念分辨率;

相似文献

外文文献
中文文献
专利

1. Adaptive Concept Resolution for document representation and its applications in text mining [J] . Lidong Bing, Shan Jiang, Wai Lam, Knowledge-Based Systems . 2015,第jana期

机译：用于文档表示的自适应概念解析及其在文本挖掘中的应用
2. A NOVEL MODEL FOR TEXT DOCUMENT REPRESENTATION: APPLICATION ON OPINION MINING DATASETS [J] . ASMAA MOUNTASSIR, HOUDA BENBRAHIM, ILHAM BERRADA Journal of computer science engineering and information technology research . 2014,第2期

机译：文本文档表示的新模型：在意见挖掘数据集上的应用
3. A NOVEL MODEL FOR TEXT DOCUMENT REPRESENTATION: APPLICATION ON OPINION MINING DATASETS [J] . ASMAA MOUNTASSIR, HOUDA BENBRAHIM, ILHAM BERRADA Journal of computer science engineering and information technology research . 2014,第2期

机译：文本文档表示的新模型：在意见挖掘数据集上的应用
4. Learning Ontology Resolution for Document Representation and its Applications in Text Mining [C] . Lidong Bing, Bai Sun, Shan Jiang, CIKM 10;ACM conference on information and knowledge management . 2011

机译：用于文档表示的学习本体解析及其在文本挖掘中的应用
5. Discovering latent topical phrases in document collections and networks with text components: Leveraging text mining and information network analysis for human oriented applications. [D] . Danilevsky, Marina Grigoryevna. 2014

机译：在文档集合和带有文本组件的网络中发现潜在的主题短语：利用面向人类的应用程序的文本挖掘和信息网络分析。
6. Ontology based text mining of gene-phenotype associations: application to candidate gene prediction [O] . Şenay Kafkas, Robert Hoehndorf 2019

机译：基于本体的基因表型关联文本挖掘：在候选基因预测中的应用
7. TM-SGTD: Text Mining Based Semantic Graph for Text Document Approach for Text Representation [O] . Ashish Pacharne, Pramod S Nair, Srinivasa Rao D 2017

机译：TM-SGTD：文本文档方法的文本挖掘语义图文本表示

Learning Ontology Resolution for Document Representation and its Applications in Text Mining

摘要

著录项

相似文献

相关主题

期刊订阅