首页> 外文会议>Conference on Computational Natural Language Learning >An Expectation Maximisation Algorithm for Automated Cognate Detection
【24h】

An Expectation Maximisation Algorithm for Automated Cognate Detection

机译:一种预期最大化算法的自动化证实检测

获取原文

摘要

In historical linguistics, cognate detection is the task of determining whether sets of words have common etymological roots. Inspired by the comparative method used by human linguists, we develop a system for automated cognate detection that frames the task as an inference problem for a general statistical model consisting of observed data (potentially cognate pairs of words), latent variables (the cog-nacy status of pairs) and unknown global parameters (which sounds correspond between languages). We then give a specific instance of such a model along with an expectation-maximisation algorithm to infer its parameters. We evaluate our system on a dataset of 8140 cognate sets, finding its performance of our method to be comparable to the state of the art. We additionally carry out qualitative analysis demonstrating various advantages it has over existing systems. We also suggest several ways our work could be extended within the general theoretical framework we propose.
机译:在历史语言学中,同源检测是确定一组词是否具有常见的导出根系的任务。灵感来自人类语言学家使用的比较方法,我们开发了一种自动证实检测系统,该系统将任务绘制为由观察到的数据(潜在同源对单词),潜在变量(Cog-Nacy)组成的一般统计模型的推理问题。对状态和未知的全局参数(语言之间的声音)。然后,我们提供这种模型的特定实例以及期望最大化算法来推断其参数。我们在8140个同源集的数据集中评估我们的系统,找到了我们对艺术技术的方法的性能。我们还开展了定性分析,证明了它拥有现有系统的各种优势。我们还建议我们的工作可以在我们提出的普通理论框架内扩展我们的工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号