首页> 美国卫生研究院文献>BMC Bioinformatics >Refined repetitive sequence searches utilizing a fast hash function and cross species information retrievals
【2h】

Refined repetitive sequence searches utilizing a fast hash function and cross species information retrievals

机译:利用快速哈希函数和跨物种信息检索进行精确的重复序列搜索

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

BackgroundSearching for small tandem/disperse repetitive DNA sequences streamlines many biomedical research processes. For instance, whole genomic array analysis in yeast has revealed 22 PHO-regulated genes. The promoter regions of all but one of them contain at least one of the two core Pho4p binding sites, CACGTG and CACGTT. In humans, microsatellites play a role in a number of rare neurodegenerative diseases such as spinocerebellar ataxia type 1 (SCA1). SCA1 is a hereditary neurodegenerative disease caused by an expanded CAG repeat in the coding sequence of the gene. In bacterial pathogens, microsatellites are proposed to regulate expression of some virulence factors. For example, bacteria commonly generate intra-strain diversity through phase variation which is strongly associated with virulence determinants. A recent analysis of the complete sequences of the Helicobacter pylori strains 26695 and J99 has identified 46 putative phase-variable genes among the two genomes through their association with homopolymeric tracts and dinucleotide repeats. Life scientists are increasingly interested in studying the function of small sequences of DNA. However, current search algorithms often generate thousands of matches – most of which are irrelevant to the researcher.
机译:背景技术搜索小的串联/分散重复DNA序列可简化许多生物医学研究过程。例如,酵母中的全基因组阵列分析揭示了22个PHO调控的基因。除一个以外,所有启动子区域均包含两个核心Pho4p结合位点CACGTG和CACGTT中的至少一个。在人类中,微卫星在许多罕见的神经退行性疾病中起作用,例如1型脊髓小脑共济失调(SCA1)。 SCA1是一种遗传性神经退行性疾病,由基因编码序列中的CAG重复序列扩增引起。在细菌病原体中,提出了微卫星来调节某些毒力因子的表达。例如,细菌通常通过与毒力决定因素强烈相关的相变来产生菌株内多样性。最近对幽门螺杆菌菌株26695和J99的完整序列进行了分析,发现这两个基因组中有46个推定的相变基因,原因是它们与均聚物链和二核苷酸重复序列相关。生命科学家对研究DNA小序列的功能越来越感兴趣。但是,当前的搜索算法通常会生成数千个匹配项-其中大多数与研究人员无关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号