首页> 外国专利> Profile searching in nucleic acid sequences using the fast fourier transformation

Profile searching in nucleic acid sequences using the fast fourier transformation

机译:使用快速傅里叶变换在核酸序列中进行谱图搜索

摘要

One embodiment of the present invention provides methods for detecting known blocks of functionally aligned protein sequences in a test nucleic acid sequence, e.g., in an uncharacterized EST. The method can include the following steps. A) Reverse translate the set of protein sequences to a set of functionally aligned nucleic acid sequences using codon-usage tables and create a profile from the set of functionally aligned nucleic acid sequences. B) Construct a first indicator function for the profile. The first indicator function corresponds to adenine. The first indicator function allows the value at a given position to be continuous between 0 and 1 as a function of the percentage presence of adenine at a particular position. C) Construct a second indicator function for the test nucleic acid sequence. The second indicator function also corresponds to adenine. D) Compute the Fourier transform of each of the indicator functions. E) Complex conjugate the Fourier transform of the second indicator function. F) Multiply the Fourier transform of the first indicator function and the complex conjugated Fourier transform of the second indicator function to obtain a Fourier transform of the number of matches of adenine bases. G) Repeat steps B-F above for guanine, thymine, and cytosine. H) Sum the Fourier transforms of the number of matches for each base, respectively, to obtain the total Fourier transform. I) Compute the inverse Fourier transform of the total Fourier transform to obtain a complex series. J) Take the real part of the series to determine the total number of base matches for the variety of possible lags of the profile relative to the test sequence. The method can then detect the presence of known blocks of functionally aligned protein sequences in a test nucleic acid sequence based on the total number of base matches for the variety of possible lags.
机译:本发明的一个实施方案提供了用于在测试核酸序列例如在未表征的EST中检测功能上比对的蛋白质序列的已知嵌段的方法。该方法可以包括以下步骤。 A)使用密码子使用表将蛋白质序列的集合反向翻译为功能上对齐的核酸序列的集合,并从该功能上对齐的核酸序列的集合创建概况。 B)为配置文件构造第一个指标函数。第一指示剂功能对应于腺嘌呤。第一指示器功能允许给定位置的值在0和1之间连续,这取决于特定位置腺嘌呤的存在百分比。 C)为测试核酸序列构建第二指示剂功能。第二指示剂功能也对应于腺嘌呤。 D)计算每个指标函数的傅立叶变换。 E)复共轭第二指标函数的傅立叶变换。 F)将第一指示符函数的傅立叶变换与第二指示符函数的复共轭傅立叶变换相乘,以获得腺嘌呤碱基的匹配数目的傅立叶变换。 G)对鸟嘌呤,胸腺嘧啶和胞嘧啶重复上述步骤B-F。 H)分别求和每个基数的匹配数的傅立叶变换,以获得总傅立叶变换。 I)计算总傅立叶变换的傅立叶逆变换以获得复杂的级数。 J)取序列的实数部分,以确定谱图相对于测试序列的各种可能滞后的碱基匹配总数。然后,该方法可以基于针对各种可能的滞后的碱基匹配的总数,检测测试核酸序列中功能性比对的蛋白质序列的已知区块的存在。

著录项

  • 公开/公告号AU783716B2

    专利类型

  • 公开/公告日2005-12-01

    原文格式PDF

  • 申请/专利权人 AVENTIS PHARMACEUTICALS INC.;

    申请/专利号AU20000051450

  • 发明设计人 WILLIAM NEWELL;

    申请日2000-05-19

  • 分类号G06F19/00;C12Q1/68;

  • 国家 AU

  • 入库时间 2022-08-21 21:34:06

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号