Iterative Annotation Transformation with Predict-Self Reestimation for Chinese Word Segmentation

机译：预测-自估计的迭代注释变换在中文分词中的应用

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we first describe the technology of automatic annotation transformation, which is based on the annotation adaptation algorithm (Jiang et al., 2009). It can automatically transform a human-annotated corpus from one annotation guideline to another. We then propose two optimization strategies, iterative training and predict-self reestimation, to further improve the accuracy of annotation guideline transformation. Experiments on Chinese word segmentation show that, the iterative training strategy together with predict-self reestimation brings significant improvement over the simple annotation transformation baseline, and leads to classifiers with significantly higher accuracy and several times faster processing than annotation adaptation does. On the Penn Chinese Treebank 5.0, it achieves an F-measure of 98.43%, significantly outperforms previous works although using a single classifier with only local features.

机译：在本文中，我们首先描述了自动注释转换技术，基于注释适应算法（江等，2009）。它可以自动将人类注释的语料库从一个注释指南转换为另一个注释指南。然后，我们提出了两种优化策略，迭代培训和预测 - 自我保证，以进一步提高注释指南转型的准确性。汉字分割实验表明，与预测自我评估的迭代培训策略与简单的注释转换基线带来了显着的改进，并导致分类器具有明显更高的准确性和比注释适应更快的处理更快的处理。在Penn Chinese TreeBank 5.0上，它达到了98.43％的F-Measure，显着优于以前的作品，尽管使用单个分类器，仅具有本地特征。

著录项

来源
《Conference on empirical methods in natural language processing;Conference on computational natural language learning》|2012年|412-420|共9页
会议地点
作者
Wenbin Jiang; Fandong Meng; Qun Liu; Yajuan Lue;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Image Annotation by a Hierarchical and Iterative Combination of Recognition and Segmentation [J] . Morales-Gonzalez Annette, Garcia-Reyes Edel, Enrique Sucar Luis International Journal of Pattern Recognition and Artificial Intelligence . 2018,第1期

机译：通过识别和分割的层次和迭代组合进行图像注释
2. A Semi-Automatic Annotation Method of Effect Clue Words for Chinese Patents Based on Co-Training [J] . Na Deng, Chunzhi Wang, Mingwu Zhang, International Journal of Data Warehousing and Mining . 2018,第4期

机译：基于共同训练的中国专利效应CLUE词的半自动注释方法
3. Annotation and Classification of Three-Character Chinese Synthetic Words [J] . Jia Lu, Masayuki Asahara, Yuji Matsumoto International journal of computer processing of languages . 2008,第2期

机译：三字符汉语合成词的注释与分类
4. Iterative Annotation Transformation with Predict-Self Reestimation for Chinese Word Segmentation [C] . Wenbin Jiang, Fandong Meng, Qun Liu, Conference on empirical methods in natural language processing . 2012

机译：迭代注释转换与中文字分割的预测自我重新定位
5. The Prosody and Morphology of Elastic Words in Chinese: Annotations and Analyses [D] . Dong, Yan. 2015

机译：中文弹性词的韵律与形貌：注释与分析
6. Iterative Mesh Transformation for 3D Segmentation of Livers with Cancers in CT Images [O] . Difei Lu, Yin Wu, Gordon Harris, -1

机译：迭代网格变换用于CT图像中癌症的3D肝分割
7. Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation [O] . Ning Ding, Dingkun Long, Guangwei Xu, 2020

机译：耦合跨域跨域词分割的远程注释和对抗训练

Iterative Annotation Transformation with Predict-Self Reestimation for Chinese Word Segmentation

摘要

著录项

相似文献

相关主题

期刊订阅