首页> 外文期刊>工程(英文) >Wasserstein GAN-Based Small-Sample Augmentation for New-Generation Artificial Intelligence: A Case Study of Cancer-Staging Data in Biology
【24h】

Wasserstein GAN-Based Small-Sample Augmentation for New-Generation Artificial Intelligence: A Case Study of Cancer-Staging Data in Biology

机译:基于Wasserstein GAN的新一代人工智能小样本增强:生物学中癌症分期数据的案例研究

获取原文
获取原文并翻译 | 示例
       

摘要

It is essential to utilize deep-learning algorithms based on big data for the implementation of the new generation of artificial intelligence.Effective utilization of deep learning relies considerably on the number of labeled samples,which restricts the application of deep learning in an environment with a small sample size.In this paper,we propose an approach based on a generative adversarial network (GAN)combined with a deep neural network (DNN).First,the original samples were divided into a training set and a test set.The GAN was trained with the training set to generate synthetic sample data,which enlarged the training set.Next,the DNN classifier was trained with the synthetic samples.Finally,the classifier was tested with the test set,and the effectiveness of the approach for multi-classification with a small sample size was validated by the indicators.As an empirical case,the approach was then applied to identify the stages of cancers with a small labeled sample size.The experimental results verified that the proposed approach achieved a greater accuracy than traditional methods.This research was an attempt to transform the classical statistical machine-learning classification method based on original samples into a deep-learning classification method based on data augmentation.The use of this approach will contribute to an expansion of application scenarios for the new generation of artificial intelligence based on deep learning,and to an increase in application effectiveness.This research is also expected to contribute to the comprehensive promotion of new-generation artificial intelligence.
机译:为了实现新一代人工智能,必须利用基于大数据的深度学习算法。深度学习的有效利用在很大程度上依赖于标记样本的数量,这限制了深度学习在环境中的应用。本文提出了一种基于生成对抗网络(GAN)结合深度神经网络(DNN)的方法。首先,将原始样本分为训练集和测试集。使用训练集进行训练以生成合成样本数据,从而扩大训练集。接下来,使用合成样本对DNN分类器进行训练。最后,使用测试集对分类器进行测试,以及该方法进行多分类的有效性作为一个经验案例,该方法随后被用于识别标签样本量较小的癌症阶段。总体结果验证了该方法比传统方法具有更高的准确性。本研究试图将基于原始样本的经典统计机器学习分类方法转变为基于数据扩充的深度学习分类方法。该方法将有助于扩展基于深度学习的新一代人工智能的应用场景,并提高应用程序的有效性。该研究也有望为全面推广新一代人工智能做出贡献。

著录项

  • 来源
    《工程(英文)》 |2019年第001期|156-163|共8页
  • 作者单位

    College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China;

    School of Public Policy and Management, Tsinghua University, Beijing 100084, China;

    Center for Strategic Studies, Chinese Academy of Engineering, Beijing 100088, China;

    School of Public Policy and Management, Tsinghua University, Beijing 100084, China;

    College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China;

    School of Public Policy and Management, Tsinghua University, Beijing 100084, China;

    College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China;

    School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-19 04:25:11
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号