首页> 外文会议>European Conference on Machine Learning and Knowledge Discovery in Databases;ECML PKDD 2008 >A Joint Segmenting and Labeling Approach for Chinese Lexical Analysis
【24h】

A Joint Segmenting and Labeling Approach for Chinese Lexical Analysis

机译:汉语词法分析的联合分割与标注方法

获取原文

摘要

This paper introduces an approach which jointly performs a cascade of segmentation and labeling subtasks for Chinese lexical analysis, including word segmentation, named entity recognition and part-of-speech tagging. Unlike the traditional pipeline manner, the cascaded subtasks are conducted in a single step simultaneously, therefore error propagation could be avoided and the information could be shared among multi-level subtasks. In this approach, Weighted Finite State Transducers (WFSTs) are adopted. Within the unified framework of WFSTs, the models for each subtask are represented and then combined into a single one. Thereby, through one-pass decoding the joint optimal outputs for multi-level processes will be reached. The experimental results show the effectiveness of the presented joint processing approach, which significantly outperforms the traditional method in pipeline style.
机译:本文介绍了一种方法,该方法可以共同执行级联的分段和标签子任务以进行中文词汇分析,包括分词,命名实体识别和词性标注。与传统的流水线方式不同,级联的子任务是同时执行的,因此可以避免错误传播,并且可以在多级子任务之间共享信息。在这种方法中,采用了加权有限状态传感器(WFST)。在WFST的统一框架内,表示每个子任务的模型,然后将其组合为一个模型。从而,通过单遍解码,将达到用于多级处理的联合最优输出。实验结果表明了所提出的联合处理方法的有效性,该方法在管道样式上显着优于传统方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号