...
首页> 外文期刊>Natural language engineering >MaltOptimizer: Fast and effective parser optimization
【24h】

MaltOptimizer: Fast and effective parser optimization

机译:MaltOptimizer:快速有效的解析器优化

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Statistical parsers often require careful parameter tuning and feature selection. This is a nontrivial task for application developers who are not interested in parsing for its own sake, and it can be time-consuming even for experienced researchers. In this paper we present MaltOptimizer, a tool developed to automatically explore parameters and features for MaltParser, a transition-based dependency parsing system that can be used to train parser's given treebank data. MaltParser provides a wide range of parameters for optimization, including nine different parsing algorithms, an expressive feature specification language that can be used to define arbitrarily rich feature models, and two machine learning libraries, each with their own parameters. MaltOptimizer is an interactive system that performs parser optimization in three stages. First, it performs an analysis of the training set in order to select a suitable starting point for optimization. Second, it selects the best parsing algorithm and tunes the parameters of this algorithm. Finally, it performs feature selection and tunes machine learning parameters. Experiments on a wide range of data sets show that MaltOptimizer quickly produces models that consistently outperform default settings and often approach the accuracy achieved through careful manual optimization.
机译:统计解析器通常需要仔细的参数调整和功能选择。对于不希望自己解析的应用程序开发人员而言,这是一项艰巨的任务,即使对于有经验的研究人员而言,这也可能是耗时的。在本文中,我们介绍了MaltOptimizer,该工具开发用于自动探索MaltParser的参数和功能,MaltParser是一种基于过渡的依存关系分析系统,可用于训练解析器的给定树库数据。 MaltParser提供了广泛的优化参数,包括九种不同的解析算法,一种可用于定义任意丰富特征模型的表达性特征说明语言,以及两个各自具有自己参数的机器学习库。 MaltOptimizer是一个交互式系统,它分三个阶段执行解析器优化。首先,它对训练集进行分析,以选择合适的起点进行优化。其次,它选择最佳的解析算法并调整该算法的参数。最后,它执行特征选择并调整机器学习参数。在各种数据集上进行的实验表明,MaltOptimizer可以快速生成始终优于默认设置的模型,并且通常接近通过仔细的手动优化获得的精度。

著录项

  • 来源
    《Natural language engineering》 |2016年第2期|187-213|共27页
  • 作者单位

    Natural Language Processing Group, Pompeu Fabra University, Tanger 122-140, 08018 Barcelona, Spain;

    Department of Linguistics and Philology, Uppsala University, Box 635, 75126 Uppsala, Sweden;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号