【24h】

Exploiting Chunk-level Features to Improve Phrase Chunking

机译:利用块级功能来改善短语分块

获取原文

摘要

Most existing systems solved the phrase chunking task with the sequence labeling approaches, in which the chunk candidates cannot be treated as a whole during parsing process so that the chunk-level features cannot be exploited in a natural way. In this paper, we formulate phrase chunking as a joint segmentation and labeling task. We propose an efficient dynamic programming algorithm with pruning for decoding, which allows the direct use of the features describing the internal characteristics of chunk and the features capturing the correlations between adjacent chunks. A relaxed, online maximum margin training algorithm is used for learning. Within this framework, we explored a variety of effective feature representations for Chinese phrase chunking. The experimental results show that the use of chunk-level features can lead to significant performance improvement, and that our approach achieves state-of-the-art performance. In particular, our approach is much better at recognizing long and complicated phrases.
机译:大多数现有系统通过序列标记方法解决了短语任务,其中在解析过程中不能将块候选群体视为整体,使得块级别特征不能以自然的方式利用。在本文中,我们将短语分组为联合分段和标签任务。我们提出了一种高效的动态编程算法,该算法进行解码,这允许直接使用描述块的内部特性的特征和捕获相邻块之间的相关性。轻松的在线最大边距训练算法用于学习。在此框架内,我们探索了各种有效的中文章节特征表示。实验结果表明,使用块级功能会导致显着的性能改进,我们的方法实现了最先进的性能。特别是,我们的方法在识别漫长而复杂的短语方面更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号