A Discriminative Model of Stochastic Edit Distance in the Form of a Conditional Transducer

机译：条件换能器形式的随机编辑距离的辨别模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Many real-world applications such as spell-checking or DNA analysis use the Levenshtein edit-distance to compute similarities between strings. In practice, the costs of the primitive edit operations (insertion, deletion and substitution of symbols) are generally hand-tuned. In this paper, we propose an algorithm to learn these costs. The underlying model is a probabilitic transducer, computed by using grammatical inference techniques, that allows us to learn both the structure and the probabilities of the model. Beyond the fact that the learned transducers are neither deterministic nor stochastic in the standard terminology, they are conditional, thus independent from the distributions of the input strings. Finally, we show through experiments that our method allows us to design cost functions that depend on the string context where the edit operations are used. In other words, we get kinds of context-sensitive edit distances.

机译：许多现实世界的应用，如法术检查或DNA分析使用Levenshtein编辑距离来计算字符串之间的相似之处。在实践中，通常可以手动调整原始编辑操作（插入，删除和符号）的成本。在本文中，我们提出了一种学习这些成本的算法。底层模型是概率换能器，通过使用语法推理技术计算，允许我们学习模型的结构和概率。除了在标准术语中，学习的传感器既不确定性也不是统计的，它们是有条件的，因此独立于输入字符串的分布。最后，我们通过实验显示我们的方法允许我们设计成本函数，这取决于使用编辑操作的字符串上下文。换句话说，我们获得各种上下文敏感的编辑距离。

著录项

来源
《International Colloquium on Grammatical Inference》|2006年||共13页
会议地点
作者
Marc Bernard; Jean-Christophe Janodet; Marc Sebban;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词
edit distance; stochastic transducers; discriminative models; grammatical inference;

机译：编辑距离;随机传感器;歧视模型;语法推理;

相似文献

外文文献
中文文献
专利

1. Eulerian acceleration statistics as a discriminator between Lagrangian stochastic models in uniform shear flow [J] . Sawford BL., Yeung PK. Physics of fluids . 2000,第8期

机译：欧拉加速度统计量作为均匀拉流中拉格朗日随机模型的判别器
2. On a Transform Method for the Efficient Computation of Conditional V@R (and V@R) with Application to Loss Models with Jumps and Stochastic Volatility [J] . Ramponi Alessandro Methodology and computing in applied probability . 2016,第2期

机译：有效计算条件V @ R（和V @ R）的变换方法及其在具有跳跃和随机波动率的损失模型中的应用
3. INFERENCE FOR ADAPTIVE TIME SERIES MODELS: STOCHASTIC VOLATILITY AND CONDITIONALLY GAUSSIAN STATE SPACE FORM [J] . Charles S. Bos, Neil Shephard Econometric Reviews . 2006,第2a3期

机译：自适应时间序列模型的推论：随机波动率和条件高斯状态空间形式
4. A Discriminative Model of Stochastic Edit Distance in the Form of a Conditional Transducer [C] . Marc Bernard, Jean-Christophe Janodet, Marc Sebban Grammatical Inference: Algorithms and Applications; Lecture Notes in Artificial Intelligence; 4201 . 2006

机译：有条件换能器形式的随机编辑距离判别模型
5. Effects of Prompts Requiring Simple and Conditional Discriminative Control in the Acquisition of Conditional Discriminations [D] . Braga-Kenyon, Paula 2012

机译：提示要求简单和有条件的区分控制在有条件的歧视取得中的作用
6. Efficiency of Health Care Production in Low-Resource Settings: A Monte-Carlo Simulation to Compare the Performance of Data Envelopment Analysis, Stochastic Distance Functions, and an Ensemble Model [O] . Laura Di Giorgio, Abraham D. Flaxman, Mark W. Moses, 2011

机译：资源匮乏地区的医疗保健生产效率：蒙特卡洛模拟，用于比较数据包络分析，随机距离函数和集成模型的性能
7. A Discriminative Model of Stochastic Edit Distance in the form of a Conditional Transducer [O] . Bernard, Marc, Janodet, Jean-Christophe, Sebban, Marc 2006

机译：有条件换能器形式的随机编辑距离判别模型
8. Conditional Random Field for Discriminatively-Trained Finite-State String Edit Distance [R] . McCallum, A. , Bellare, K. , Pereira, F. 2005

机译：判别训练有限状态字符串编辑距离的条件随机场

A Discriminative Model of Stochastic Edit Distance in the Form of a Conditional Transducer

摘要

著录项

相似文献

相关主题

期刊订阅