CSE: Parallel Finite State Machines with Convergence Set Enumeration

机译：CSE：具有汇聚集枚举的平行有限状态机

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Finite State Machine (FSM) is known to be “embarrassingly sequential” because the next state depends on the current state and input symbol. Enumerative FSM breaks the data dependencies by cutting the input symbols into segments and processing all segments in parallel. With unknown starting state (except the first segment), each segment needs to calculate the state transitions, i.e., state state, for all states, each one is called an enumeration path. The current software and hardware implementations suffer from two drawbacks: 1) large amount of state state computation overhead for the enumeration paths; and 2) the optimizations are restricted by the need to correctly performing state state and only achieve limited improvements. This paper proposes CSE, a Convergence Set based Enumeration based parallel FSM. Unlike prior approaches, CSE is based on a novel computation primitive set(N) set(M), which maps N states to M states without giving the specific state state mappings (which state is mapped to which). The set(N) set(M) has two key properties: 1) if M is equal to 1, i.e., all N states are mapped to the same state, the state state for all the N states are computed; 2) using one-hot encoding, the hardware implementation cost of state state is the same as set(N) set(M). The convergence property ensures that M is always less than N. The key idea of CSE is to partition the original all S states into n state sets CS₁,CS₂,...,CS_n, i.e., convergence sets. Using set(N) set(M) to process each CS, if the states converge to a single state, then we have successfully computed the enumeration path for each state in CS; otherwise, we may need to re-execute the stage when the outcome of the previous stage falls in CS. CSE is realized by two techniques: convergence set prediction, which generates the convergence sets with random input based profiling that maximizes the probability of each CS z converging to one state; global re-execution algorithm, which ensures the correctness by re-executing the non-converging stages with known input state. Essentially, CSE reformulates the enumeration paths as setbased rather than singleton-based. We evaluate CSE with 13 benchmarks. It achieved on average 2.0x/2.4x and maximum 8.6x/2.7x speedup compared to Lookback Enumeration (LBE) and Parallel Automata Processor (PAP), respectively.

机译：已知有限状态机（FSM）是“令人尴尬的顺序”，因为下一个状态取决于当前状态和输入符号。枚举FSM通过将输入符号切割成段并并行处理所有段来打破数据依赖性。对于未知的起始状态（第一个段除外），每个段需要计算所有状态的状态转换，即状态状态，每个段都称为枚举路径。目前的软件和硬件实现遭受了两个缺点：1）枚举路径的大量状态计算开销; 2）优化受到正确执行州状态的必要性的限制，只能实现有限的改进。本文提出了基于Condring集的基于枚举的并行FSM的CSE。与现有方法不同，CSE基于新颖的计算原始集（n）集（n）设置（m），其将n个状态映射到m状态而不给出特定状态映射（映射到哪个状态）。 SET（N）集（M）具有两个关键属性：1）如果m等于1，则即，所有n个状态都被映射到相同的状态，计算所有n个状态的状态状态; 2）使用单热编码，状态状态的硬件实现成本与SET（N）集（M）相同。汇聚属性可确保M总是小于N. CSE的关键思想是将原始的所有状态分区为N状态集CS_{1 ，CS_{2 ，...，CS_{n ，即收敛组。使用set（n）设置（m）来处理每个cs，如果状态会聚到单个状态，则我们已成功计算CS中每个状态的枚举路径;否则，我们可能需要重新执行前一级的结果在CS中落入CS时。 CSE通过两种技术实现：收敛集预测，它产生具有基于随机输入的分析的收敛组，可以最大化每个CS Z会聚到一个状态的概率;全局重新执行算法，通过以已知输入状态重新执行非融合阶段来确保正确性。基本上，CSE将枚举路径重新重新格式化，而不是基于单例。我们评估CSE，有13个基准。与Lookbaces枚举（LBE）和并行自动机处理器（PAP）相比，平均为2.0倍/ 2.4x和最大8.6倍/ 2.7倍的加速。}}}

著录项

来源
《International Symposium on Microarchitecture》|2018年|xxiv 493 p. :|共13页
会议地点
作者
Youwei Zhuo; Jinglei Cheng; Qinyi Luo; Jidong Zhai; Yanzhi Wang; Zhongzhi Luan; Xuehai Qian;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP302-532;
关键词
Convergence; Automata; Hardware; Optimization; Software; Encoding; Prediction algorithms;

机译：融合;自动机;硬件;优化;软件;编码;预测算法;

相似文献

外文文献
中文文献
专利

1. Combining SIMD and Many/Multi-core Parallelism for Finite-state Machines with Enumerative Speculation [J] . PENG JIANG, YANG XIA, GAGAN AGRAWAL ACM Transactions on Parallel Computing . 2020,第3期

机译：将SIMD和许多/多核并行性与枚举炒作结合起来的有限状态机
2. Alternating direction implicit time integrations for finite difference acoustic wave propagation: Parallelization and convergence [J] . Computers & Fluids . 2020,第期

机译：有限差分声波传播的交替方向隐式时间集成：并行化和收敛
3. Fast Parallel All-Subgraph Enumeration Using Multicore Machines [J] . SaeedShahrivari, SaeedJalili Scientific programming . 2015,第4期

机译：使用多核计算机的快速并行全子枚举
4. CSE: Parallel Finite State Machines with Convergence Set Enumeration [C] . Youwei Zhuo, Jinglei Cheng, Qinyi Luo, Annual IEEE/ACM International Symposium on Microarchitecture . 2018

机译：CSE：具有收敛集枚举的并行有限状态机
5. Two cyclic arrangement problems in finite projective geometry: Parallelisms and two -intersection sets [D] . White, Clinton Thomas. 2002

机译：有限射影几何中的两个循环布置问题：平行度和两个交集
6. Parallelization of enumerating tree-like chemical compounds by breadth-first search order [O] . Morihiro Hayashida, Jira Jindalertudomdee, Yang Zhao, 2015

机译：通过广度优先搜索顺序对树状化合物进行枚举
7. No Recursively Enumerable Set is the Union of Finitely Many Immune Retraceable Sets [O] . K. I. Appel 1967

机译：没有递归令人令人令人难堪的集合是有限的许多免疫可回物套的联盟
8. On the Enumeration of Finite State Synchronous Sequential Machines [R] . Bottlik, I. P. 1970

机译：关于有限状态同步序列机的计数

CSE: Parallel Finite State Machines with Convergence Set Enumeration

摘要

著录项

相似文献

相关主题

期刊订阅