A Bayesian Classifier for Learning from Tensorial Data

机译：贝叶斯分类器用于张量数据学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Traditional machine learning methods characterize data observations by feature vectors, where an entry of a vector denotes a scalar feature value of a data instance. While this data representation facilitates the application of conventional machine learning algorithms, in many cases it is not the best way of extracting all useful information from the data observations. In this paper we relax the (often unstated) assumption of vectorizing features of data instances, and allow a more natural representation of the data in a tensor format. Tensors are multi-mode (aka multi-way) arrays, of whom vectors (i.e., one-mode tensors) and matrices (i.e., two-mode tensors) are special cases. We show that the tensor representation captures useful information that is difficult to provide in the conventional vector format. More importantly, to effectively utilize the rich information contained in tensors, we propose a novel semi-naive Bayesian tensor classification method (which we call Bat) that builds predictive models directly on data in tensor form (instead of on their vectorizations). We apply Bat to supervised learning problems, and perform comprehensive experiments on classifying text documents and graphs, which demonstrate (1) the advantage of the tensor representation over conventional feature-vectorization approaches, and (2) the superiority of the proposed Bat tensor classifier over other existing learners.

机译：传统的机器学习方法通过特征向量来表征数据观测，其中向量的条目表示数据实例的标量特征值。尽管此数据表示有助于常规机器学习算法的应用，但在许多情况下，这并不是从数据观察中提取所有有用信息的最佳方法。在本文中，我们放宽了对数据实例的特征进行矢量化的（通常是未声明的）假设，并允许以张量格式更自然地表示数据。张量是多模（aka多向）数组，其中向量（即一模张量）和矩阵（即二模张量）是特例。我们表明，张量表示法捕获了有用的信息，而这些信息很难以传统的矢量格式提供。更重要的是，为了有效利用张量中包含的丰富信息，我们提出了一种新颖的半朴素贝叶斯张量分类方法（我们称为Bat），该方法直接基于张量形式的数据（而不是其矢量化）建立预测模型。我们将Bat应用于有监督的学习问题，并对文本文档和图形进行分类的综合实验，证明了（1）张量表示相对于传统特征向量化方法的优势，以及（2）拟议的Bat张量分类器优于其他现有的学习者。

著录项

来源
《European conference on machine learning and knowledge discovery in databases》|2013年|483-498|共16页
会议地点
作者
Wei Liu; Jeffrey Chan; James Bailey; Christopher Leckie; Fang Chen; Kotagiri Ramamohanarao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Galaxy Merger Rates up to z?～?3 Using a Bayesian Deep Learning Model: A Major-merger Classifier Using IllustrisTNG Simulation Data [J] . Leonardo Ferreira, Christopher J. Conselice, Kenneth Duncan, The Astrophysical journal . 2020,第2期

机译：Galaxy合并率最多Z？〜？3使用贝叶斯深度学习模型：使用IllustrySng模拟数据的主要合并分类器
2. A classifier for multi-dimensional datasets based on Bayesian multiple kernel grouping learning [J] . Dong Fangli, Wang Xiaozhou Journal of statistical computation and simulation . 2019,第10a12期

机译：基于贝叶斯多核分组学习的多维数据集分类器
3. A classifier for multi-dimensional datasets based on Bayesian multiple kernel grouping learning [J] . Dong Fangli, Wang Xiaozhou Journal of statistical computation and simulation . 2019,第10a12期

机译：基于贝叶斯多核分组学习的多维数据集的分类器
4. A Bayesian Classifier for Learning from Tensorial Data [C] . Wei Liu, Jeffrey Chan, James Bailey, European Conference on Machine Learning and Knowledge Discovery in Databases . 2013

机译：贝叶斯分类器，用于学习统治数据
5. Inductive classifier learning from data: An extended Bayesian belief function approach. [D] . Ma, Yong. 1995

机译：从数据中归纳分类器学习：扩展的贝叶斯信念函数方法。
6. Discriminative Structure Learning of Bayesian Network Classifiers from Training Dataset and Testing Instance [O] . Limin Wang, Yang Liu, Musa Mammadov, 2019

机译：培训数据集和测试实例贝叶斯网络分类器的鉴别结构学习
7. Discriminative Structure Learning of Bayesian Network Classifiers from Training Dataset and Testing Instance [O] . Limin Wang, Yang Liu, Musa Mammadov, 2019

机译：培训数据集和测试实例贝叶斯网络分类器的鉴别结构学习

A Bayesian Classifier for Learning from Tensorial Data

摘要

著录项

相似文献

相关主题

期刊订阅