On Efficient Content Based Information Retrieval Using SVM and Higher Order Correlation Analysis

机译：基于支持向量机和高阶相关分析的基于内容的高效信息检索

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Efficient retrieval of information with regards to its meaning and content is an important problem in data mining systems for the creation, management and querying of very large information databases existing in the World Wide Web. In this paper we deal with the main aspect of the problem of content based retrieval, namely, with the problem of document classification, outlining a novel improved and systematic approach to it's solution. We present a document classification system for non-domain specific content based on the learning and generalization capabilities mainly of SVM neural networks. The main contribution of this paper lies on the feature extraction methodology which, first, involves word semantic categories and not raw words as other rival approaches. As a consequence of coping with the problem of dimensionality reduction, the proposed approach introduces a novel higher order approach for document categorization feature extraction by considering word semantic categories higher order correlation analysis, both two and three dimensional, based on cooccurrence analysis. The suggested methodology compares favourably to widely accepted, raw word frequency based techniques in a collection of documents concerning the Dewey Decimal Classification (DDC) system. In these comparisons different Multilayer Perceptrons (MLP) algorithms as well as the Support Vector Machine (SVM), the LVQ and the conventional k-NN technique are involved. SVM models seem to outperform all other rival methods in this study.

机译：就其含义和内容而言，有效地检索信息是数据挖掘系统中创建，管理和查询存在于万维网中的超大型信息数据库的重要问题。在本文中，我们处理基于内容的检索问题的主要方面，即文档分类问题，概述了一种新颖的，系统的解决方案。我们主要基于SVM神经网络的学习和归纳能力，提出了针对非特定领域内容的文档分类系统。本文的主要贡献在于特征提取方法，该方法首先涉及单词语义类别，而不是其他竞争方法所涉及的原始单词。由于解决了降维问题，该方法引入了一种新的高阶方法，该方法通过考虑基于共现分析的词语义类别二维和三维高阶相关性分析，来进行文档分类特征提取。在涉及杜威十进制分类（DDC）系统的文档集中，建议的方法与广泛接受的基于原始单词频率的技术相比具有优势。在这些比较中，涉及了不同的多层感知器（MLP）算法以及支持向量机（SVM），LVQ和常规的k-NN技术。在这项研究中，SVM模型似乎胜过所有其他竞争方法。

著录项

来源
《ICONIP 2008;International conference on advances in neuro-information processing》|2009年|112-119|共8页
会议地点
作者
Dimitrios Alexios Karras;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词

相似文献

外文文献
中文文献
专利

1. SVM Based Navigation Patterns for Efficient Relevance Feedback in Content Based Image Retrieval [J] . Prof. Nishchol Mishra, Prof. Sanjeev Sharma, Vaishali Meshram International Journal of Computer Technology and Applications . 2012,第04期

机译：基于SVM的导航模式，用于基于内容的图像检索中的有效相关反馈
2. Efficient Approach for Content Based Image Retrieval Using Multiple SVM in YACBIR [J] . Lakhdar LAIB, Samy Ait-Aoudia Computer Science & Information Technology . 2016,第8期

机译：在YACBIR中使用多个SVM进行基于内容的图像检索的有效方法
3. A Multi-class SVM Based Content Based Image Retrieval System Using Hybrid Optimization Techniques [J] . Kishore Dannina, Rao Chanamallu Srinivasa Traitement du Signal . 2020,第2期

机译：基于多级SVM基于SVM的基于IMITE REVERIVAL系统，使用混合优化技术
4. On Efficient Content Based Information Retrieval Using SVM and Higher Order Correlation Analysis [C] . Dimitrios Alexios Karras International Conference on Neural Information Processing . 2009

机译：基于高效的基于内容的信息检索，使用SVM和高阶相关性分析
5. Efficient content-based retrieval of images using triangle-inequality-based algorithms [D] . Berman, Andrew P. 1999

机译：使用基于三角形不等式的算法基于内容的有效图像检索
6. Content based medical image retrieval based on new efficient local neighborhood wavelet feature descriptor [O] . Amita Shinde, Amol Rahulkar, Chetankumar Patil 2019

机译：基于新型高效局部邻域小波特征描述符的基于内容的医学图像检索
7. Efficient Approach for Content Based Image Retrieval Using Multiple SVM in YACBIR [O] . Lakhdar LAIB, Samy Ait-Aoudia 2016

机译：YaCBIR中基于内容的多sVm图像检索的有效方法

On Efficient Content Based Information Retrieval Using SVM and Higher Order Correlation Analysis

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅