首页> 外国专利> Using binary array representations of sequences to eliminate redundant patterns in discovered patterns of symbols

Using binary array representations of sequences to eliminate redundant patterns in discovered patterns of symbols

机译:使用序列的二进制数组表示形式来消除发现的符号模式中的冗余模式

摘要

The present invention relates to computer-implemented methods for finding patterns in patterns in a set of k-sequences of symbols (where k≧2) and to a computer readable medium having instructions for controlling a computer system to perform the methods. Patterns of symbols common to each 2-tuple of sequences are identified. Each identified pattern of symbols is represented by a position index binary array (PIBA) which is a set of binary digits. The binary digit in each place in the array that corresponds to a location in the selected reference sequence of a symbol in the identified pattern has a first predetermined binary value. All of the other binary digits in the array have a second predetermined binary value. The position index binary array (PIBA) representations of patterns of each tuple at any order “n” may be combined with the PIBA pattern representations of all other tuples at that same order “n” or with the pattern representations in any selected m-tuple, where m may have any integer value from 2 to (n−1). The representations of the patterns in an n-tuple are only combined with pattern representations of another tuple that includes in its tuple identifier at least one sequence index greater than the sequence indices included in the tuple identifier of the n-tuple. To avoid redundancies involving pair-wise combinations of representations of patterns all of the sequence indices of the other tuple (other than the reference sequence index) must be different from those of the n-tuple.
机译:本发明涉及用于在一组k个符号序列(其中k≥2)中的模式中找到模式的计算机实现的方法,并且涉及一种具有用于控制计算机系统以执行该方法的指令的计算机可读介质。识别每个2元组序列共有的符号模式。每个标识的符号模式都由位置索引二进制数组(PIBA)表示,PIBA是一组二进制数字。阵列中每个位置中与所识别的图案中的符号的所选参考序列中的位置相对应的二进制数字具有第一预定二进制值。阵列中的所有其他二进制数字均具有第二预定二进制值。每个元组的任何顺序“ n”的模式的位置索引二进制数组(PIBA)表示可以与相同顺序“ n”的所有其他元组的PIBA模式表示或任何选定的m-tuple的模式表示结合,其中m可以具有2到(n-1)之间的任何整数。 n元组中的模式表示仅与另一个元组的模式表示组合,该另一个元组在其元组标识符中包括至少一个比n元组的元组标识符中包含的序列索引更大的序列索引。为了避免涉及模式表示的成对组合的冗余,其他元组的所有序列索引(参考序列索引除外)必须与n元组的序列索引不同。

著录项

  • 公开/公告号US2006253517A1

    专利类型

  • 公开/公告日2006-11-09

    原文格式PDF

  • 申请/专利权人 DAVID RUBEN ARGENTAR;

    申请/专利号US20060402664

  • 发明设计人 DAVID RUBEN ARGENTAR;

    申请日2006-04-12

  • 分类号G06F7/38;

  • 国家 US

  • 入库时间 2022-08-21 21:44:11

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号