首页> 外文会议>2015 IEEE International Congress on Big Data >Online Nonlinear Classification for High-Dimensional Data
【24h】

Online Nonlinear Classification for High-Dimensional Data

机译:高维数据的在线非线性分类

获取原文
获取原文并翻译 | 示例

摘要

We study online binary classification problem under the empirical zero-one loss function. We introduce a novel randomized classification algorithm based on highly dynamic hierarchical models that partition the feature space. Our approach jointly and sequentially learns the partitioning of the feature space, the optimal classifier among all doubly exponential number of classifiers defined by the tree, and the individual region classifiers in order to directly minimize the cumulative loss. Although we adapt the entire hierarchical model to minimize a global loss function, the computational complexity of the introduced algorithm scales linearly with the dimensionality of the feature space and the depth of the tree. Furthermore, our algorithm can be applied to any streaming data without requiring a training phase or prior information, hence processes data on-the-fly and then discards it, which makes the introduced algorithm significantly appealing for applications involving "big data". We evaluate the performance of the introduced algorithm over different real data sets.
机译:我们在经验零一损失函数下研究在线二进制分类问题。我们介绍了一种基于高度动态分层模型的新型随机分类算法,该模型对特征空间进行了划分。我们的方法联合并顺序地学习特征空间的划分,由树定义的所有双指数分类器中的最佳分类器以及各个区域分类器,以便直接最小化累积损失。尽管我们调整了整个层次模型以最小化全局损失函数,但引入的算法的计算复杂度随特征空间的维数和树的深度线性增长。此外,我们的算法可以应用于任何流数据,而无需训练阶段或先验信息,因此可以实时处理数据然后丢弃它,这使得引入的算法对于涉及“大数据”的应用非常有吸引力。我们评估在不同的实际数据集上引入算法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号