...
首页> 外文期刊>Soft computing: A fusion of foundations, methodologies and applications >Multi-view heterogeneous fusion and embedding for categorical attributes on mixed data
【24h】

Multi-view heterogeneous fusion and embedding for categorical attributes on mixed data

机译:多视图异构融合和混合数据上的分类属性嵌入

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Categorical attributes are ubiquitous in real-world collected data. However, such attributes lack a well-defined distance metric and cannot be directly manipulated per algebraic operations, so many data mining algorithms are unable to work directly on them. Learning an appropriate metric or an effective numerical embedding is very vital yet challenging, for categorical attributes with multi-view heterogeneous data characteristics. This paper proposes a novel multi-view heterogeneous fusion model (MVHF), which first captures basic coupling information for each view and then fuses these heterogeneous information from different views by multi-kernel metric learning, to measure the intrinsic distances between this type of categorical attributes; based on these measured distances, further, we use the manifold learning method to learn a high-quality numerical embedding for each categorical value. Experiments on 33 mixed data sets demonstrate that MVHF-enabled classification significantly enhances the performance, compared with state-of-the-art distance metrics or embedding competitors.
机译:分类属性在现实世界收集的数据中普遍存在。但是,这种属性缺乏明确定义的距离度量,并且不能直接操纵每个代数操作,因此许多数据挖掘算法无法直接工作。学习适当的公制或有效的数值嵌入非常重要但对于具有多视图异构数据特性的分类属性非常有挑战性。本文提出了一种新的多视图异构融合模型(MVHF),其首先捕获每个视图的基本耦合信息,然后通过多核度量学习来融合来自不同视图的这些异构信息,以测量这种类型的基本距离属性;基于这些测量的距离,我们使用歧管学习方法来学习每个分类值的高质量数值嵌入。 33个混合数据集的实验表明,与最先进的距离指标或嵌入竞争对手相比,启用MVHF的分类显着提高了性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号