首页> 外文会议>International conference on computational linguistics;COLING 2010 >Very High Accuracy and Fast Dependency Parsing is not a Contradiction
【24h】

Very High Accuracy and Fast Dependency Parsing is not a Contradiction

机译:极高的准确性和快速的依赖关系解析不是矛盾

获取原文

摘要

In addition to a high accuracy, short parsing and training times are the most important properties of a parser. However, parsing and training times are still relatively long. To determine why, we analyzed the time usage of a dependency parser. We illustrate that the mapping of the features onto their weights in the support vector machine is the major factor in time complexity. To resolve this problem, we implemented the passive-aggressive perceptron algorithm as a Hash Kernel. The Hash Kernel substantially improves the parsing times and takes into account the features of negative examples built during the training. This has lead to a higher accuracy. We could further increase the parsing and training speed with a parallel feature extraction and a parallel parsing algorithm. We are convinced that the Hash Kernel and the parallelization can be applied successful to other NLP applications as well such as transition based dependency parsers, phrase structrue parsers, and machine translation.
机译:除了高精度外,较短的解析和训练时间也是解析器的最重要属性。但是,解析和训练时间仍然相对较长。为了确定原因,我们分析了依赖解析器的时间使用情况。我们说明了在支持向量机中将特征映射到其权重是时间复杂度的主要因素。为了解决此问题,我们将被动攻击型感知器算法实现为哈希内核。哈希内核大大提高了解析时间,并考虑了训练过程中建立的否定示例的功能。这导致更高的精度。我们可以通过并行特征提取和并行解析算法来进一步提高解析和训练速度。我们相信,哈希内核和并行化可以成功应用于其他NLP应用程序,例如基于过渡的依赖项解析器,短语结构解析器和机器翻译。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号