...
首页> 外文期刊>Journal of computational biology: A journal of computational molecular cell biology >Combined dynamic arrays for storing and searching semi-ordered tandem mass spectrometry data
【24h】

Combined dynamic arrays for storing and searching semi-ordered tandem mass spectrometry data

机译:组合动态数组,用于存储和搜索半序串联质谱数据

获取原文
获取原文并翻译 | 示例

摘要

When performing bioinformatics analysis on tandem mass spectrometry data, there is a computational need to efficiently store and sort these semi-ordered datasets. To solve this problem, a new data structure based on dynamic arrays was designed and implemented in an algorithm that parses semi-ordered data made by Mascot, a separate software program that matches peptide tandem mass spectra to protein sequences in a database. By accommodating the special features of these large datasets, the combined dynamic array (CDA) provides efficient searching and insertion operations. The operations on real datasets using this new data structure are hundreds times faster than operations using binary tree and red-black tree structures. The difference becomes more significant when the dataset size grows. This data structure may be useful for improving the speed of other related types of protein assembling software or other types of software that operate on datasets with similar semi-ordered features.
机译:对串联质谱数据进行生物信息学分析时,需要有效地存储和排序这些半有序数据集的计算需求。为解决此问题,设计并实现了一种基于动态数组的新数据结构,该算法可解析Mascot生成的半有序数据,该数据是将肽串联质谱与数据库中的蛋白质序列进行匹配的独立软件程序。通过适应这些大型数据集的特殊功能,组合动态数组(CDA)提供了有效的搜索和插入操作。使用这种新数据结构的实际数据集上的操作比使用二叉树和红黑树结构的操作快数百倍。当数据集大小增加时,差异变得更加明显。该数据结构对于提高其他相关类型的蛋白质组装软件或在具有类似半序特征的数据集上运行的其他类型的软件的速度可能有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号