首页> 中文期刊> 《大数据挖掘与分析(英文)》 >Survey on Encoding Schemes for Genomic Data Representation and Feature Learning——From Signal Processing to Machine Learning

Survey on Encoding Schemes for Genomic Data Representation and Feature Learning——From Signal Processing to Machine Learning

         

摘要

Data-driven machine learning, especially deep learning technology, is becoming an important tool for handling big data issues in bioinformatics. In machine learning, DNA sequences are often converted to numerical values for data representation and feature learning in various applications. Similar conversion occurs in Genomic Signal Processing(GSP), where genome sequences are transformed into numerical sequences for signal extraction and recognition. This kind of conversion is also called encoding scheme. The diverse encoding schemes can greatly affect the performance of GSP applications and machine learning models. This paper aims to collect,analyze, discuss, and summarize the existing encoding schemes of genome sequence particularly in GSP as well as other genome analysis applications to provide a comprehensive reference for the genomic data representation and feature learning in machine learning.

著录项

  • 来源
    《大数据挖掘与分析(英文)》 |2018年第3期|191-210|共20页
  • 作者

    Ning Yu; Zhihua Li; Zeng Yu;

  • 作者单位

    1. Department of Computing Sciences;

    College at Brockport;

    State University of New York 2. Department of Computer Science and Technology at Jiangnan University 3. School of Information Science and Technology;

    Southwest Jiaotong University;

  • 原文格式 PDF
  • 正文语种 chi
  • 中图分类 人工智能理论;
  • 关键词

    机译:编码方案;数据表示;特征学习;深度学习;基因组信号处理;机器学习;基因组分析;
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号