首页> 外文会议>IEEE International Conference on Bioinformatics and Biomedicine Workshops >An integrated approach of sequence and text mining technology for the identification of transcription factor binding sites
【24h】

An integrated approach of sequence and text mining technology for the identification of transcription factor binding sites

机译:序列和文本挖掘技术的综合方法,用于鉴定转录因子结合位点

获取原文

摘要

The study of the complex mechanisms that regulated gene expression on the level of transcription is an important and challenging issue in post-genomic era. A crucial step is to identify transcription factor binding sites(TFBSs). However, the number of the known TFBSs is limited, and the accuracy of the state-of-the-art identification methods is still far from satisfactory. In this paper, a novel integrated method for mining transcription factor binding sites is presented, which combines the sequence data mining method with the text mining method. Therefore, the method can not only obtain the putative TFBSs from the sequence data sets, but also acquire the experimentally verified TFBSs from the literatures. To evaluate the performance of our method, several experiments have been tested on real data sets. The results show that our integrated method outperforms each of the algorithms alone, furthermore, exhibits superior accuracy than existing algorithms.
机译:调节基因表达对转录水平的复杂机制的研究是基因组时代的一个重要和挑战性问题。关键步骤是鉴定转录因子结合位点(TFBS)。然而,已知的TFBS的数量是有限的,并且最先进的识别方法的准确性仍然远非令人满意。本文介绍了一种新的挖掘转录因子结合位点的综合方法,其结合了序列数据挖掘方法与文本挖掘方法。因此,该方法不仅可以从序列数据集获得推定的TFBS,而且还获取来自文献的实验验证的TFBS。为了评估我们方法的性能,在真实数据集上测试了几个实验。结果表明,我们的集成方法越优于每个算法,而且还表现出比现有算法更高的精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号