Learning Ontology Resolution for Document Representation and its Applications in Text Mining

机译：用于文档表示的学习本体解析及其在文本挖掘中的应用

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

It is well known that synonymous and polysemous terms often bring in some noises when calculating the similarity between documents. Existing ontology-based document representation methods are static, hence, the chosen semantic concept set for representing a document has a fixed resolution and it is not adaptable to the characteristics of a document collection and the text mining problem in hand. We propose an Adaptive Concept Resolution (ACR) model to overcome this issue. ACR can learn a concept border from an ontology taking into consideration of the characteristics of a particular document collection. Then this border can provide a tailor-made semantic concept representation for a document coming from the same domain. Another advantage of ACR is that it is applicable in both classification task where the groups are given in the training document set, and clustering task where no group information is available. Furthermore, the result of this model is not sensitive to the model parameter. The experimental results show that ACR outperforms an existing static method significantly.

机译：众所周知，当计算文档之间的相似度时，同义词和多义词经常会带来一些干扰。现有的基于本体的文档表示方法是静态的，因此，所选择的用于表示文档的语义概念集具有固定的分辨率，并且不适合于文档集合的特征和现有的文本挖掘问题。我们提出了一种自适应概念解决方案（ACR）模型来克服此问题。考虑到特定文档集合的特征，ACR可以从本体学习概念边界。然后，该边界可以为来自相同域的文档提供量身定制的语义概念表示。 ACR的另一个优点是，它既适用于在培训文档集中指定了组的分类任务，又适用于没有可用组信息的聚类任务。此外，该模型的结果对模型参数不敏感。实验结果表明，ACR明显优于现有的静态方法。

著录项

来源
《CIKM 10;ACM conference on information and knowledge management》|2011年|p.1713-1716|共4页
会议地点
作者
Lidong Bing; Bai Sun; Shan Jiang; Yan Zhang; Wai Lam;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词
ontology; adaptive concept resolution;

机译：本体自适应概念解析;

相似文献

外文文献
中文文献
专利

1. Adaptive Concept Resolution for document representation and its applications in text mining [J] . Lidong Bing, Shan Jiang, Wai Lam, Knowledge-Based Systems . 2015,第jana期

机译：用于文档表示的自适应概念解析及其在文本挖掘中的应用
2. A NOVEL MODEL FOR TEXT DOCUMENT REPRESENTATION: APPLICATION ON OPINION MINING DATASETS [J] . ASMAA MOUNTASSIR, HOUDA BENBRAHIM, ILHAM BERRADA Journal of computer science engineering and information technology research . 2014,第2期

机译：文本文档表示的新模型：在意见挖掘数据集上的应用
3. A NOVEL MODEL FOR TEXT DOCUMENT REPRESENTATION: APPLICATION ON OPINION MINING DATASETS [J] . ASMAA MOUNTASSIR, HOUDA BENBRAHIM, ILHAM BERRADA Journal of computer science engineering and information technology research . 2014,第2期

机译：文本文档表示的新模型：在意见挖掘数据集上的应用
4. Learning Ontology Resolution for Document Representation and its Applications in Text Mining [C] . Lidong Bing, Bai Sun, Shan Jiang, ACM conference on information and knowledge management . 2010

机译：学习文档代表的本体决议及其在文本挖掘中的应用
5. Discovering latent topical phrases in document collections and networks with text components: Leveraging text mining and information network analysis for human oriented applications. [D] . Danilevsky, Marina Grigoryevna. 2014

机译：在文档集合和带有文本组件的网络中发现潜在的主题短语：利用面向人类的应用程序的文本挖掘和信息网络分析。
6. Ontology based text mining of gene-phenotype associations: application to candidate gene prediction [O] . Şenay Kafkas, Robert Hoehndorf 2019

机译：基于本体的基因表型关联文本挖掘：在候选基因预测中的应用
7. TM-SGTD: Text Mining Based Semantic Graph for Text Document Approach for Text Representation [O] . Ashish Pacharne, Pramod S Nair, Srinivasa Rao D 2017

机译：TM-SGTD：文本文档方法的文本挖掘语义图文本表示

Learning Ontology Resolution for Document Representation and its Applications in Text Mining

摘要

著录项

相似文献

相关主题

期刊订阅