...
首页> 外文期刊>Algorithmica >Speeding Up HMM Decoding and Training by Exploiting Sequence Repetitions
【24h】

Speeding Up HMM Decoding and Training by Exploiting Sequence Repetitions

机译:通过利用序列重复来加快HMM解码和训练

获取原文
获取原文并翻译 | 示例
           

摘要

We present a method to speed up the dynamic program algorithms used for solving the HMM decoding and training problems for discrete time-independent HMMs. We discuss the application of our method to Viterbi’s decoding and training algorithms (IEEE Trans. Inform. Theory IT-13:260–269, 1967), as well as to the forward-backward and Baum-Welch (Inequalities 3:1–8, 1972) algorithms. Our approach is based on identifying repeated substrings in the observed input sequence. Initially, we show how to exploit repetitions of all sufficiently small substrings (this is similar to the Four Russians method). Then, we describe four algorithms based alternatively on run length encoding (RLE), Lempel-Ziv (LZ78) parsing, grammar-based compression (SLP), and byte pair encoding (BPE). Compared to Viterbi’s algorithm, we achieve speedups of Θ(log n) using the Four Russians method, W(fracrlogr)Omega(frac{r}{log r}) using RLE, W(fraclognk)Omega(frac{log n}{k}) using LZ78, W(fracrk)Omega(frac{r}{k}) using SLP, and Ω(r) using BPE, where k is the number of hidden states, n is the length of the observed sequence and r is its compression ratio (under each compression scheme). Our experimental results demonstrate that our new algorithms are indeed faster in practice. We also discuss a parallel implementation of our algorithms.
机译:我们提出了一种方法,用于加快用于解决HMM解码和离散时间独立HMM训练问题的动态程序算法。我们讨论了该方法在Viterbi的解码和训练算法(IEEE Trans。Inform。Theory IT-13:260–269,1967)以及向前和向后和Baum-Welch(不等式3:1–8)中的应用。 (1972年)算法。我们的方法基于识别观察到的输入序列中的重复子字符串。最初,我们展示了如何利用所有足够小的子字符串的重复(这类似于“四位俄罗斯人”方法)。然后,我们基于行程编码(RLE),Lempel-Ziv(LZ78)解析,基于语法的压缩(SLP)和字节对编码(BPE)来描述四种算法。与维特比算法相比,我们使用四俄语方法实现了Θ(log n)的加速,使用RLE,W(fraclognk)Omega(frac {log n} { k})使用LZ78,W(fracrk)Omega(frac {r} {k})使用SLP,Ω(r)使用BPE,其中k是隐藏状态的数量,n是观察到的序列的长度,r是其压缩率(在每种压缩方案下)。我们的实验结果表明,我们的新算法实际上在实践中速度更快。我们还将讨论算法的并行实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号