首页> 外文期刊>BioSystems >Prediction of replication sites in Saccharomyces cerevisiae genome using DNA segment properties: Multi-view ensemble learning (MEL) approach
【24h】

Prediction of replication sites in Saccharomyces cerevisiae genome using DNA segment properties: Multi-view ensemble learning (MEL) approach

机译:使用DNA段特性预测酿酒酵母酿酒酵母基因组的复制位点:多视图集合学习(MEL)方法

获取原文
获取原文并翻译 | 示例
           

摘要

Autonomous replication sequences (ARS) are essential for the replication of Saccharomyces cerevisiae genome. The content and context of ARS sites are distinct from other segments of the genome and these factors influence the conformation and thermodynamic profile of DNA that favor binding of the origin recognition complex proteins. Identification of ARS sites in the genome is a challenging task because of their organizational complexity and degeneracy present across the intergenic regions. We considered a few properties of DNA segments and divided them into multiple subsets (views) for computational prediction of ARS sequences. Our approach utilized these views for learning classification models in an ensemble manner and accordingly predictions were made. This approach maximized the prediction accuracy over the traditional way where all features are selected at once. Our study also revealed that major groove width and major groove depth are the most prominent properties that distinguished ARS from other segments of the genome. Our investigation also provides clue about the most suitable classifier for a given feature set, and this strategy may be useful for finding ARS in other closely related species. (C) 2017 Elsevier B.V. All rights reserved.
机译:自主复制序列(ARS)对于复制酿酒酵母基因组是必不可少的。 ARS位点的内容和背景不同于基因组的其他区段,这些因素影响了有利于起源识别复合蛋白结合的DNA的构象和热力学谱。由于其组织复杂性和在非基间地区存在的组织复杂性和退化性,鉴定基因组中的ARS位点是一个具有挑战性的任务。我们认为DNA段的几种性质,并将它们划分为多个子集(视图),以计算ARS序列的计算预测。我们的方法利用这些视图以集成的方式学习分类模型,并因此进行预测。这种方法通过传统方式最大化预测准确性,传统方式一次选择所有功能。我们的研究还透露,主要的沟槽宽度和主要沟槽深度是从基因组的其他区段区分ARS的最突出的特性。我们的调查还提供关于给定特征集最合适的分类器的线索,并且该策略对于在其他密切相关的物种中寻找ARS可能是有用的。 (c)2017 Elsevier B.v.保留所有权利。

著录项

  • 来源
    《BioSystems》 |2018年第2018期|共11页
  • 作者单位

    Jawaharlal Nehru Univ Sch Computat &

    Integrat Sci New Delhi 110067 India;

    Mahatma Gandhi Cent Univ Dept Comp Sci &

    Informat Technol Motihari 845401 Bihar India;

    Jawaharlal Nehru Univ Sch Computat &

    Integrat Sci New Delhi 110067 India;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 生物科学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号