EXMA: A Genomics Accelerator for Exact-Matching

机译：EXMA：一个关于精确匹配的基因组学加速器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Genomics is the foundation of precision medicine, global food security and virus surveillance. Exact-match is one of the most essential operations widely used in almost every step of genomics such as alignment, assembly, annotation, and compression. Modern genomics adopts Ferragina-Manzini Index (FMIndex) augmenting space-efficient Burrows-Wheeler transform (BWT) with additional data structures to permit ultra-fast exact-match operations. However, FM-Index is notorious for its poor spatial locality and random memory access pattern. Prior works create GPU-, FPGA-, ASIC- and even process-in-memory (PIM)based accelerators to boost FM-Index search throughput. Though they achieve the state-of-the-art FM-Index search throughput, the same as all prior conventional accelerators, FM-Index PIMs process only one DNA symbol after each DRAM row activation, thereby suffering from poor memory bandwidth utilization. In this paper, we propose a hardware accelerator, EXMA, to enhance FM-Index search throughput. We first create a novel EXMA table with a multi-task-learning (MTL)-based index to process multiple DNA symbols with each DRAM row activation. We then build an accelerator to search over an EXMA table. We propose 2-stage scheduling to increase the cache hit rate of our accelerator. We introduce dynamic page policy to improve the row buffer hit rate of DRAM main memory. We also present CHAIN compression to reduce the data structure size of EXMA tables. Compared to state-of-the-art FM-Index PIMs, EXMA improves search throughput by $4.9 imes$, and enhances search throughput per Watt by $4.8 imes$.

机译：基因组学是精密医学，全球粮食安全和病毒监测的基础。精确匹配是几乎所有基因组学中使用的最重要的操作之一，例如对齐，装配，注释和压缩。现代基因组学采用FerraGina-Manzini指数（FMIndex）增强空间挖掘机轮车变换（BWT），具有额外的数据结构，以允许超快速精确匹配的操作。但是，FM-Index对于其糟糕的空间局部地点和随机内存访问模式是臭名昭着的。先前作品创建基于GPU，FPGA，ASIC - 甚至进程内存（PIM）的加速器，以提高FM-Index搜索吞吐量。虽然它们实现了最先进的FM-Index搜索吞吐量，但与所有先前的传统加速器相同，但每个DRAM行激活后，FM-Index PIM在每个DRAM行激活之后仅处理一个DNA符号，从而遭受差的内存带宽利用率。在本文中，我们提出了一个硬件加速器EXMA，以增强FM-Index搜索吞吐量。我们首先创建一个具有多任务学习（MTL）的新的EXMA表，基于多任务学习（MTL）索引，以处理每个DRAM行激活的多个DNA符号。然后，我们建立一个加速器来搜索Exma表。我们提出了2阶段调度，以提高我们加速器的缓存命中率。我们介绍了动态页面策略以改善DRAM主内存的行缓冲区命中率。我们还呈现链压缩以减少EXMA表的数据结构大小。与最先进的FM-Index PIM相比，EXMA将搜索吞吐量提高了4.9美元，并增强了每瓦的搜索吞吐量$ 4.8 times $。

著录项

来源
《IEEE International Symposium on High Performance Computer Architecture》|2021年|399-411|共13页
会议地点
作者
Lei Jiang; Farzaneh Zokaee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Genomics; Random access memory; DNA; Transforms; Throughput; Data structures; Hardware;

机译：基因组学;随机存取记忆;DNA;变换;吞吐量;数据结构;硬件;

相似文献

外文文献
中文文献
专利

1. Illumina Accelerator Selects Genomics Startups for Third Funding Cycle - Clinical Lab Products [J] . Clinical Lab Products . 2015,第2015期

机译：Illumina Accelerator选择基因组学初创公司进行第三轮融资-临床实验室产品
2. Illumina Accelerator Selects Genomics Startups for Third Funding Cycle - Clinical Lab Products [J] . Clinical Lab Products . 2015,第2015期

机译：Illumina Accelerator选择基因组学初创公司进行第三轮融资-临床实验室产品
3. Genomic selection as a possible accelerator of traditional selection [J] . Smaragdov M. G. Russian journal of genetics . 2009,第6期

机译：基因组选择可能是传统选择的加速器
4. k-Core: Hardware Accelerator for k-Mer Generation and Counting used in Computational Genomics [C] . Simmi M Bose, Varsha S Lalapura, S Saravanan, International Conference on VLSI Design;International Conference on Embedded Systems . 2019

机译：k-Core：计算基因组学中使用的k-Mer生成和计数的硬件加速器
5. Neutron exposure from electrom linear accelerators and a proton accelerator: Measurements and simulations. [D] . Chen, Kuan Ling. 2011

机译：电线性加速器和质子加速器的中子暴露：测量和模拟。
6. Commissioning measurements for photon beam data on three TrueBeam linear accelerators and comparison with Trilogy and Clinac 2100 linear accelerators [O] . Gloria P. Beyer 2013

机译：调试三个TrueBeam线性加速器上的光子束数据并与Trilogy和Clinac 2100线性加速器进行比较
7. EXMA: A Genomics Accelerator for Exact-Matching [O] . Lei Jiang, Farzaneh Zokaee 2021

机译：EXMA：一个关于精确匹配的基因组学加速器

EXMA: A Genomics Accelerator for Exact-Matching

摘要

著录项

相似文献

相关主题

期刊订阅