首页> 外国专利> Identifying patterns of symbols in sequences of symbols using a binary array representation of the sequence

Identifying patterns of symbols in sequences of symbols using a binary array representation of the sequence

机译:使用序列的二进制数组表示来识别符号序列中的符号模式

摘要

The present invention relates to computer-implemented methods for finding patterns in patterns in a set of k-sequences of symbols (where k≧2) and to a computer readable medium having instructions for controlling a computer system to perform the methods. Patterns of symbols common to each 2-tuple of sequences are identified. Each identified pattern of symbols is represented by a position index binary array (PIBA) which is a set of binary digits. The binary digit in each place in the array that corresponds to a location in a selected reference sequence of a symbol in the identified pattern has a first predetermined binary value. All of the other binary digits in the array have a second predetermined binary value. The position index binary array (PIBA) representations of patterns of each tuple at any order “n” may be combined with the PIBA pattern representations of all other tuples at that same order “n” or with the pattern representations in any selected m-tuple, where m may have any integer value from 2 to (n−1). The patterns of the resulting tuple are identified from the position index binary arrays (PIBAs) produced by the intersection of the set of binary digits in each position index binary array (PIBA) in the n-tuple with the set of binary digits in each position index binary array (PIBA) in the other tuple. The intersections are accomplished logically, as by performing a logical AND operation in a bit-by-bit manner on the binary arrays. Using the places in the position index binary array (PIBA) produced by the intersections having the first predetermined binary value as a guide, the symbols in corresponding locations in the reference sequence are identified. These symbols comprise the symbols in the identified pattern in the resulting tuple.
机译:本发明涉及用于在一组k个符号序列(其中k≥2)中的模式中找到模式的计算机实现的方法,并且涉及一种具有用于控制计算机系统以执行该方法的指令的计算机可读介质。识别每个2元组序列共有的符号模式。每个标识的符号模式都由位置索引二进制数组(PIBA)表示,PIBA是一组二进制数字。阵列中每个位置中与所识别的图案中的符号的选定参考序列中的位置相对应的二进制数字具有第一预定二进制值。阵列中的所有其他二进制数字均具有第二预定二进制值。每个元组的任何顺序“ n”的模式的位置索引二进制数组(PIBA)表示可以与相同顺序“ n”的所有其他元组的PIBA模式表示或任何选定的m-tuple的模式表示结合,其中m可以具有2到(n-1)之间的任何整数。从n元组中每个位置索引二进制数组(PIBA)中的二进制数集与每个位置中的二进制数字集的相交产生的位置索引二进制数组(PIBA)识别结果元组的模式其他元组中的索引二进制数组(PIBA)。相交是通过逻辑方式完成的,例如通过对二进制数组逐位执行逻辑“与”运算。使用由具有第一预定二进制值的相交产生的位置索引二进制数组(PIBA)中的位置作为基准,识别参考序列中相应位置的符号。这些符号包括结果元组中已识别模式中的符号。

著录项

  • 公开/公告号US2006235845A1

    专利类型

  • 公开/公告日2006-10-19

    原文格式PDF

  • 申请/专利权人 DAVID RUBEN ARGENTAR;

    申请/专利号US20060402716

  • 发明设计人 DAVID RUBEN ARGENTAR;

    申请日2006-04-12

  • 分类号G06F17/30;

  • 国家 US

  • 入库时间 2022-08-21 21:47:35

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号