首页> 外文期刊>NanoBioscience, IEEE Transactions on >MfeCNN: Mixture Feature Embedding Convolutional Neural Network for Data Mapping
【24h】

MfeCNN: Mixture Feature Embedding Convolutional Neural Network for Data Mapping

机译:MfeCNN:用于数据映射的混合特征嵌入卷积神经网络

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Data mapping plays an important role in data integration and exchanges among institutions and organizations with different data standards. However, traditional rule-based approaches and machine learning methods fail to achieve satisfactory results for the data mapping problem. In this paper, we propose a novel and sophisticated deep learning framework for data mapping called mixture feature embedding convolutional neural network (MfeCNN). The MfeCNN model converts the data mapping task to a multiple classification problem. In the model, we incorporated multimodal learning and multiview embedding into a CNN for mixture feature tensor generation and classification prediction. Multimodal features were extracted from various linguistic spaces with a medical natural language processing package. Then, powerful feature embeddings were learned by using the CNN. As many as 10 classes could be simultaneously classified by a softmax prediction layer based on multiview embedding. MfeCNN achieved the best results on unbalanced data (average F1 score, 82.4%) among the traditional state-of-the-art machine learning models and CNN without mixture feature embedding. Our model also outperformed a very deep CNN with 29 layers, which took free texts as inputs. The combination of mixture feature embedding and a deep neural network can achieve high accuracy for data mapping and multiple classification.
机译:数据映射在具有不同数据标准的机构和组织之间的数据集成和交换中起着重要作用。但是,传统的基于规则的方法和机器学习方法无法针对数据映射问题获得令人满意的结果。在本文中,我们提出了一种新颖且复杂的数据映射深度学习框架,称为混合特征嵌入卷积神经网络(MfeCNN)。 MfeCNN模型将数据映射任务转换为多重分类问题。在模型中,我们将多峰学习和多视图嵌入到CNN中,以进行混合特征张量生成和分类预测。使用医学自然语言处理软件包从各种语言空间中提取了多峰特征。然后,通过使用CNN学习了强大的功能嵌入。基于多视图嵌入的softmax预测层可以同时分类多达10个类别。在传统的最新机器学习模型和无混合特征嵌入的CNN中,MfeCNN在不平衡数据(平均F1分数,82.4%)上取得了最佳结果。我们的模型还优于具有29层的非常深的CNN,该CNN将自由文本作为输入。混合特征嵌入和深度神经网络的结合可以实现高精度的数据映射和多重分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号