首页> 外文期刊>BioSystems >Sequence classification with side effect machines evolved via ring optimization
【24h】

Sequence classification with side effect machines evolved via ring optimization

机译:通过环优化进化出具有副作用机器的序列分类

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The explosion of available sequence data necessitates the development of sophisticated machine learning tools with which to analyze them. This study introduces a sequence-learning technology called side effect machines. It also applies a model of evolution which simulates the evolution of a ring species to the training of the side effect machines. A comparison is done between side effect machines evolved in the ring structure and side effect machines evolved using a standard evolutionary algorithm based on tournament selection. At the core of the training of side effect machines is a nearest neighbor classifier. A parameter study was performed to investigate the impact of the division of training data into examples for nearest neighbor assessment and training cases. The parameter study demonstrates that parameter setting is important in the baseline runs but had little impact in the ring-optimization runs. The ring optimization technique was also found to exhibit improved and also more reliable training performance. Side effect machines are tested on two types of synthetic data, one based on GC-content and the other checking for the ability of side effect machines to recognize an embedded motif. Three types of biological data are used, a data set with different types of immune-system genes, a data set with normal and retro-virally derived human genomic sequence, and standard and nonstandard initiation regions from the cytochrome-oxidase subunit one in the mitochondrial genome.
机译:可用序列数据的爆炸式增长要求开发用于分析它们的复杂机器学习工具。本研究介绍了一种称为副作用机的序列学习技术。它还将模拟环物种进化的进化模型应用于副作用机器的训练。比较了在环形结构中进化的副作用机和使用基于比赛选择的标准进化算法进化的副作用机。副作用机器训练的核心是最近的邻居分类器。进行了参数研究,以调查将训练数据划分为最近邻居评估和训练案例的示例的影响。参数研究表明,参数设置在基线运行中很重要,但对环优化运行影响很小。还发现环优化技术表现出改进的并且也更可靠的训练性能。对副作用机器进行了两种类型的合成数据测试,一种基于GC含量,另一种检查副作用机器识别嵌入式主题的能力。使用三种类型的生物学数据,具有不同类型免疫系统基因的数据集,具有正常和逆转录病毒衍生的人类基因组序列的数据集,以及线粒体中细胞色素氧化酶亚基的标准和非标准起始区域基因组。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号