
Reinterpreting CTC training as iterative fitting


Abstract

The connectionist temporal classification (CTC) enables end-to-end sequence learning by maximizing the probability of correctly recognizing sequences during training. The outputs of a CTC-trained model tend to form a series of spikes separated by strongly predicted blanks, known as the spiky problem. To figure out its cause, we reinterpret the CTC training process as an iterative fitting task based on a frame-wise cross-entropy loss. This offers an intuitive way to compare the target probabilities with the model outputs at each iteration, and explains how the model outputs gradually turn spiky. Inspired by this, we put forward two ways to modify CTC training. The experiments demonstrate that our method solves the spiky problem well and, moreover, leads to faster convergence across various training settings. Besides this, the reinterpretation of CTC, as a brand-new perspective, may be potentially useful in other situations. The code is publicly available at https://github.com/hzli-ucas/caffe/tree/ctc. (C) 2020 Elsevier Ltd. All rights reserved.
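The per-frame target probabilities in the iterative-fitting view come from the standard CTC forward-backward state posteriors: at each training step, the CTC gradient matches that of a frame-wise cross-entropy against these targets. The following is a minimal numpy sketch of that computation on a toy example; `ctc_targets`, the variable names, and the toy inputs are our own illustration under this interpretation, not the authors' released code.

```python
import numpy as np

def ctc_targets(probs, label, blank=0):
    """Per-frame CTC target distributions via forward-backward.

    probs: (T, K) softmax outputs; label: list of non-blank symbol ids.
    Returns a (T, K) matrix q whose rows are the target distributions
    that frame-wise cross-entropy would fit at this iteration.
    """
    T, K = probs.shape
    ext = [blank]
    for c in label:
        ext += [c, blank]                 # extended label with interleaved blanks
    S = len(ext)

    # Forward pass: alpha[t, s] includes the emission at frame t.
    alpha = np.zeros((T, S))
    alpha[0, 0] = probs[0, ext[0]]
    if S > 1:
        alpha[0, 1] = probs[0, ext[1]]
    for t in range(1, T):
        for s in range(S):
            a = alpha[t - 1, s]
            if s > 0:
                a += alpha[t - 1, s - 1]
            if s > 1 and ext[s] != blank and ext[s] != ext[s - 2]:
                a += alpha[t - 1, s - 2]  # skip transition over a blank
            alpha[t, s] = a * probs[t, ext[s]]

    # Backward pass: beta[t, s] excludes the emission at frame t.
    beta = np.zeros((T, S))
    beta[T - 1, S - 1] = 1.0
    if S > 1:
        beta[T - 1, S - 2] = 1.0
    for t in range(T - 2, -1, -1):
        for s in range(S):
            b = beta[t + 1, s] * probs[t + 1, ext[s]]
            if s + 1 < S:
                b += beta[t + 1, s + 1] * probs[t + 1, ext[s + 1]]
            if s + 2 < S and ext[s + 2] != blank and ext[s + 2] != ext[s]:
                b += beta[t + 1, s + 2] * probs[t + 1, ext[s + 2]]
            beta[t, s] = b

    p_seq = alpha[T - 1, S - 1] + alpha[T - 1, S - 2]  # p(label | x)
    gamma = alpha * beta / p_seq                       # state posteriors

    # Collapse state posteriors into a distribution over the vocabulary.
    q = np.zeros((T, K))
    for s, c in enumerate(ext):
        q[:, c] += gamma[:, s]
    return q

# Toy example: 4 frames, vocabulary {blank=0, 'a'=1, 'b'=2}, label "ab".
rng = np.random.default_rng(0)
logits = rng.normal(size=(4, 3))
probs = np.exp(logits) / np.exp(logits).sum(axis=1, keepdims=True)
q = ctc_targets(probs, [1, 2])
```

Each row of `q` is a valid distribution over the vocabulary; as training progresses, these targets concentrate mass on blanks between short symbol spikes, which is how the fitting view exposes the spiky behavior.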
