...
首页> 外文期刊>Biophysical Chemistry: An International Journal Devoted to the Physical Chemistry of Biological Phenomena >Identifying DNase I hypersensitive sites using multi-features fusion and F-score features selection via Chou's 5-steps rule
【24h】

Identifying DNase I hypersensitive sites using multi-features fusion and F-score features selection via Chou's 5-steps rule

机译:使用多特色融合和F分数选择通过Chou的5步规则来识别DNASE I过敏站点

获取原文
获取原文并翻译 | 示例
           

摘要

DNase I hypersensitive sites (DHSS) are regarded as those regions of chromatin that are sensitive to cleavage by the DNase I enzyme. Identification of DNase I hypersensitive sites will provide useful insights for discovering DNA's functional elements from the non-coding sequences in the biomedical research. Because of the significance for DNase I hypersensitive sites, it is indispensable to develop an accurate, fast, robust, and high-throughput automated computational model. In this paper, we develop a model named iDHSs-MFF by combining multiple fusion features and F-score features selection approach. The multiple fusion features include three auto-correlation descriptors based on the dinucleotide property matrix and the trinucleotide property matrix (TPM), Pseudo-DPM and Pseudo-TPM. Evaluation by the jackknife cross-validation indicates that the selected features by F-score are effective in the identification of DNase I hypersensitive sites. Experimental results on two benchmark datasets demonstrate that the proposed model outperforms some highly related models. Systematic application of this computational approach will greatly facilitate the analysis of transcriptional regulatory elements. The datasets and Matlab source codes are freely available at: https://github.com/shengli0201/Datasets.
机译:DNase I过敏位点(DHSS)被认为是染色质的那些对DNase I酶裂解敏感的区域。 DNase I过敏位点的鉴定将提供从生物医学研究中的非编码序列发现DNA的功能元素的有用见解。由于DNASE I过敏位点的重要性,开发精确,快速,坚固,高吞吐量的自动化计算模型是必不可少的。在本文中,我们通过组合多个融合功能和F分具有选择方法,开发名为IDHSS-MFF的模型。多种融合特征包括基于二核苷酸性能基质和三核苷酸性能基质(TPM),假型DPM和假TPM的三个自相关描述符。通过杰克交叉验证的评估表明F-Scress的所选特征在识别DNase I度超敏位点方面是有效的。两个基准数据集上的实验结果表明,所提出的模型优于一些高度相关的模型。系统应用这种计算方法将极大地促进转录调节因素的分析。数据集和MATLAB源代码可自由获取:https://github.com/shengli0201/datasets。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号