Journal: Bioinformatics

A software program combining sequence motif searches with keywords for finding repeats containing DNA sequences



Abstract

Motivation: One of the most interesting features of genomes (in both coding and non-coding regions) is the presence of relatively short, tandemly repeated DNA sequences known as tandem repeats (TRs). We developed a new PC-based standalone analysis program that combines sequence motif searches with keywords such as organs, tissues, cell lines or development stages to find exact, inexact and compound TRs. Tandem Repeats Analyzer 1.5 (TRA) offers several advanced search parameters and options beyond those of other repeat finders: it accepts GenBank, FASTA and expressed sequence tag (EST) sequence files, and it can also analyze multiple files that each contain multiple sequences. Advanced user-defined parameters let researchers apply different search criteria to different motif lengths simultaneously. The outputs report statistical results for the user to evaluate. Discovering TRs in ESTs could be useful for gene mapping and association studies, and for finding TRs located in the coding regions of important genes that are expressed under particular conditions of environment, stress, organ, tissue and development stage.

Results: In this paper, we demonstrate applications of TRA using 175,899 EST sequences from three Arabidopsis species, downloaded from GenBank. The EST-SSR/EST ratios were found to be 43.1%, 15.3% and 2.34% in A. lyrata, A. thaliana and A. halleri, respectively. The analysis revealed that organs, tissues and development stages differed in both the number of repeats and the repeat composition. This indicates that the distribution of TRs among tissues or organs may not be random, in contrast to the untranscribed repeats found in genomes.
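To make the idea of per-motif-length search criteria combined with keyword filtering concrete, here is a minimal Python sketch. It is not TRA's implementation: the regular-expression approach, the threshold values in `MIN_REPEATS`, and the function names `find_exact_trs` and `tr_ratio` are all assumptions chosen for illustration, and only exact repeats are handled (inexact and compound TRs are out of scope).

```python
import re

# Hypothetical thresholds (not TRA's defaults): motif length -> minimum
# number of tandem copies required to report a repeat.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 4}


def find_exact_trs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, copies) for each exact tandem repeat in seq."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # A k-mer followed by at least n-1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are just a shorter unit repeated
            # (e.g. "ATAT" when k=4), so each length has its own criterion.
            if any(k % d == 0 and motif == motif[:d] * (k // d)
                   for d in range(1, k)):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return hits


def tr_ratio(records, keyword=None):
    """Fraction of records with at least one exact TR.

    records: iterable of (description, sequence) pairs.
    keyword: optional case-insensitive filter on the description,
             e.g. an organ or tissue name.
    """
    selected = [(d, s) for d, s in records
                if keyword is None or keyword.lower() in d.lower()]
    if not selected:
        return 0.0
    with_tr = sum(1 for _, s in selected if find_exact_trs(s))
    return with_tr / len(selected)
```

Calling `tr_ratio(records, keyword="leaf")` on a list of (description, sequence) pairs would give the share of leaf-annotated sequences that carry at least one repeat, which is the kind of per-tissue comparison the abstract describes.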
