首页> 外国专利> PARALLEL-PROCESSING SYSTEMS AND METHODS FOR HIGHLY SCALABLE ANALYSIS OF BIOLOGICAL SEQUENCE DATA

PARALLEL-PROCESSING SYSTEMS AND METHODS FOR HIGHLY SCALABLE ANALYSIS OF BIOLOGICAL SEQUENCE DATA

机译:用于生物序列数据的高度可伸缩分析的并行处理系统和方法

摘要

An apparatus includes a memory configured to store a sequence. The sequence includes an estimation of a biological sequence. The sequence includes a set of elements. The apparatus also includes a set of hardware processors. Each hardware processor is configured to implement a segment processing module. The apparatus also includes an assignment module implemented in a hardware processor. The assignment module is configured to receive the sequence from the memory, and assign each element to at least one segment from a set of segments, including, when an element maps to at least a first segment and a second segment, assigning the element to both the first segment and the second segment. The segment processing module is configured to, for each segment from a set of segments specific to that hardware processor, and substantially simultaneous with the remaining hardware processors, remove at least a portion of duplicate elements in that segment to generate a deduplicated segment. The segment processing module is further configured to reorder the elements in the deduplicated segment to generate a realigned segment that has a reduced likelihood for alignment errors.
机译:一种设备,包括配置为存储序列的存储器。该序列包括生物学序列的估计。该序列包括一组元素。该装置还包括一组硬件处理器。每个硬件处理器被配置为实现段处理模块。该设备还包括在硬件处理器中实现的分配模块。分配模块被配置为从存储器接收序列,并且将每个元素分配给一组片段中的至少一个片段,包括,当元素映射到至少第一片段和第二片段时,将元素分配给两者。第一段和第二段。分段处理模块被配置为针对该硬件处理器专用的分段集合中的每个分段,并且基本上与其余硬件处理器同时,去除该分段中的重复元素的至少一部分以生成去重复的分段。片段处理模块还被配置为对去重片段中的元素进行重新排序以生成具有降低的对齐错误可能性的重新对齐的片段。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号