首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Aggregated Wasserstein Distance and State Registration for Hidden Markov Models
【24h】

Aggregated Wasserstein Distance and State Registration for Hidden Markov Models

机译:聚合的Wasserstein距离和国家注册隐藏马尔可夫模型

获取原文
获取原文并翻译 | 示例
           

摘要

We propose a framework, named Aggregated Wasserstein, for computing a dissimilarity measure or distance between two Hidden Markov Models with state conditional distributions being Gaussian. For such HMMs, the marginal distribution at any time position follows a Gaussian mixture distribution, a fact exploited to softly match, aka register, the states in two HMMs. We refer to such HMMs as HMM. The registration of states is inspired by the intrinsic relationship of optimal transport and the Wasserstein metric between distributions. Specifically, the components of the marginal GMMs are matched by solving an optimal transport problem where the cost between components is the Wasserstein metric for Gaussian distributions. The solution of the optimization problem is a fast approximation to the Wasserstein metric between two GMMs. The new Aggregated Wasserstein distance is a semi-metric and can be computed without generating Monte Carlo samples. It is invariant to relabeling or permutation of states. The distance is defined meaningfully even for two HMMs that are estimated from data of different dimensionality, a situation that can arise due to missing variables. This distance quantifies the dissimilarity of HMMs by measuring both the difference between the two marginal GMMs and that between the two transition matrices. Our new distance is tested on tasks of retrieval, classification, and t-SNE visualization of time series. Experiments on both synthetic and real data have demonstrated its advantages in terms of accuracy as well as efficiency in comparison with existing distances based on the Kullback-Leibler divergence.
机译:我们提出了一个名为聚合的Wasserstein的框架,用于计算两个隐藏马尔可夫模型之间的不同性度量或距离,其具有高斯的状态条件分布。对于这种HMMS,任何时间位置的边缘分布都遵循高斯混合分布,这是一个易于匹配的AKA寄存器,两种HMMS中的州的事实。我们将这些HMMS称为嗯。各国的登记受到最佳运输和分布之间的韦斯斯坦度量的内在关系的启发。具体地,通过求解组件之间的成本是高斯分布的最佳运输问题,匹配边缘GMM的组件。优化问题的解决方案是两个GMM之间的Wassersein度量的快速近似。新的聚合Wasserstein距离是半标率,可以在不生成蒙特卡罗样本的情况下计算。它不变于抢购或排列状态。即使对于从不同维度的数据估计的两个HMM,距离也有意义地定义,这是由于缺失变量导致的情况。该距离通过测量两个边缘GMM之间的差异和两个转换矩阵之间的差异来量化HMM的不相似性。我们在时间序列的检索,分类和T-SNE可视化的任务上测试了我们的新距离。合成和实际数据的实验已经在准确性和基于Kullback-Leibler发散的现有距离比较​​的效率方面证明了其优点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号