KDX: an indexer for support vector machines

Navneet Panda; Chang E.Y.


Abstract

Support vector machines (SVMs) have been adopted by many data mining and information-retrieval applications for learning a mining or query concept, and then retrieving the "top-k" best matches to the concept. However, when the data set is large, naively scanning the entire data set to find the top matches is not scalable. In this work, we propose a kernel indexing strategy to substantially prune the search space and, thus, improve the performance of top-k queries. Our kernel indexer (KDX) takes advantage of the underlying geometric properties and quickly converges on an approximate set of top-k instances of interest. More importantly, once the kernel (e.g., Gaussian kernel) has been selected and the indexer has been constructed, the indexer can work with different kernel-parameter settings (e.g., γ and σ) without performance compromise. Through theoretical analysis and empirical studies on a wide variety of data sets, we demonstrate KDX to be very effective. An earlier version of this paper appeared in the 2005 SIAM International Conference on Data Mining. This version differs from the previous submission in providing a detailed cost analysis under different scenarios, specifically designed to meet the varying needs of accuracy, speed, and space requirements; developing an approach for insertion and deletion of instances; presenting the specific computations as well as the geometric properties used in performing the same; and providing detailed algorithms for each of the operations necessary to create and use the index structure.
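For context, the naive baseline that the abstract calls non-scalable can be sketched as a full scan: score every instance with the SVM decision function and keep the k highest scores. This is not the KDX algorithm, which the abstract does not describe in detail. It is a minimal illustration in which the function names, the toy model, and the Gaussian kernel form exp(-γ‖x − y‖²) are assumptions made for the example.

```python
import heapq
import math


def gaussian_kernel(x, y, gamma):
    """Gaussian kernel, assumed here to be exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def svm_score(x, support_vectors, coeffs, bias, gamma):
    """SVM decision value: sum_i coeffs[i] * K(sv_i, x) + bias."""
    return sum(
        c * gaussian_kernel(sv, x, gamma)
        for sv, c in zip(support_vectors, coeffs)
    ) + bias


def naive_top_k(data, k, support_vectors, coeffs, bias, gamma):
    """Full scan: score every instance, return indices of the k best.

    Cost grows with len(data) * len(support_vectors) per query, which is
    the linear scan an index structure is meant to avoid.
    """
    scored = (
        (svm_score(x, support_vectors, coeffs, bias, gamma), i)
        for i, x in enumerate(data)
    )
    return [i for _, i in heapq.nlargest(k, scored)]


# Hypothetical toy model: one positive and one negative support vector.
support_vectors = [(0.0, 0.0), (4.0, 4.0)]
coeffs = [1.0, -1.0]
data = [(0.1, 0.1), (3.9, 4.0), (1.0, 1.0), (2.0, 2.0), (0.0, 0.5)]

top2 = naive_top_k(data, 2, support_vectors, coeffs, bias=0.0, gamma=0.5)
```

Every query touches every instance, so the work per query is proportional to the data set size times the number of support vectors; pruning that search space is the problem the paper addresses.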


Bibliographic record

Source: IEEE Transactions on Knowledge and Data Engineering, 2006, no. 6, pp. 748-763 (16 pages)
Authors: Navneet Panda; Chang E.Y.
Original format: PDF
Language: English
Chinese Library Classification: Computing technology, computer technology
Keywords: data mining; database indexing; query formulation; support vector machines; KDX; SVM; geometric properties; information retrieval; kernel indexing strategy; kernel-parameter settings; indexing


Similar literature


1. Narrator’s Name Recognition with Support Vector Machine for Indexing Indonesian Hadith translations [J]. Fajar Achmad Yusup, Moch Arif Bijaksana, Arief Fatchul Huda. Procedia Computer Science, 2019, no. 11.

2. WATERBODIES EXTRACTION FROM LANDSAT8-OLI IMAGERY USING A WATER INDEXS-GUIED STOCHASTIC FULLY-CONNECTED CONDITIONAL RANDOM FIELD MODEL AND THE SUPPORT VECTOR MACHINE [J]. Wang X., Xu L. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 2018, no. 1.

3. Challenges in Content-Based Image Indexing of Cultural Heritage Collections: Support vector machine active learning with applications to text classification [J]. Picard David, Gosselin Philippe-Henri, Gaspard Marie-Claude. Signal Processing Magazine, IEEE, 2015, no. 4.

4. Indexing and Classifying Breathing Pattern for Male Runners Using Support Vector Machine [C]. Jessie R. Balbin, Julius T. Sese, John Zeus B. Baje. International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management, 2019.

5. Search for the Vector Boson Fusion Production of the Higgs Boson in the H → WW* → lν lν Channel using Support Vector Machines [D]. Wetter, Jeffrey Berwyn. 2015.

6. Relevance Vector Machine and Support Vector Machine Classifier Analysis of Scanning Laser Polarimetry Retinal Nerve Fiber Layer Measurements [O]. Christopher Bowd, Felipe A. Medeiros, Zuohua Zhang.

7. Pengklasifikasian Topik Hadits Terjemahan Bahasa Indonesia Menggunakan Latent Semantic Indexing dan Support Vector Machine [O]. Hafizh Fauzan, Adiwijaya Adiwijaya, Said Al-Faraby. 2018.
