CORRECTING WORD SEGMENTATION AND PART-OF-SPEECH TAGGING ERRORS FOR CHINESE NAMED ENTITY RECOGNITION

Abstract

In exploring Chinese named entity recognition for a specific domain, the authors found that errors introduced during word segmentation and part-of-speech (POS) tagging obstructed improvements in recognition performance. To further improve recall and precision, the authors propose an error correction approach for Chinese named entity recognition. The error correction component adopts transformation-based machine learning because it is well suited to fixing Chinese word segmentation and POS tagging errors and produces effective correction rules automatically. The named entity recognition component uses finite-state cascades that are constructed automatically from POS rules with semantic constraints. A prototype system, CNERS (Chinese Named Entity Recognition System), has been implemented. The experimental results show that recognition performance for most named entities has improved significantly. The system is also fast and reliable.
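
The error correction idea in the abstract can be illustrated with a minimal Python sketch of transformation-based, error-driven rule selection. This is not the CNERS implementation: the rule shapes (`RetagRule`, `MergeRule`), the tag names, the span-based error count, and the toy sentence are all assumptions made for illustration. The sketch greedily picks, from a fixed candidate list, the rule that most reduces disagreement with a hand-corrected reference, and stops when no rule helps.

```python
from dataclasses import dataclass
from typing import List, Set, Tuple

Token = Tuple[str, str]  # (word, POS tag); the tag set here is hypothetical


@dataclass(frozen=True)
class RetagRule:
    """Hypothetical rule: change from_tag to to_tag after a token tagged prev_tag."""
    from_tag: str
    to_tag: str
    prev_tag: str

    def apply(self, tokens: List[Token]) -> List[Token]:
        out = list(tokens)
        # Left-to-right pass; earlier changes are visible to later positions.
        for i in range(1, len(out)):
            word, tag = out[i]
            if tag == self.from_tag and out[i - 1][1] == self.prev_tag:
                out[i] = (word, self.to_tag)
        return out


@dataclass(frozen=True)
class MergeRule:
    """Hypothetical rule: join two adjacent tokens (fixes over-segmentation)."""
    left_tag: str
    right_tag: str
    merged_tag: str

    def apply(self, tokens: List[Token]) -> List[Token]:
        out: List[Token] = []
        i = 0
        while i < len(tokens):
            if (i + 1 < len(tokens)
                    and tokens[i][1] == self.left_tag
                    and tokens[i + 1][1] == self.right_tag):
                out.append((tokens[i][0] + tokens[i + 1][0], self.merged_tag))
                i += 2
            else:
                out.append(tokens[i])
                i += 1
        return out


def spans(tokens: List[Token]) -> Set[Tuple[int, int, str]]:
    """Represent a tagged segmentation as (start, end, tag) character spans."""
    result, pos = set(), 0
    for word, tag in tokens:
        result.add((pos, pos + len(word), tag))
        pos += len(word)
    return result


def errors(tokens: List[Token], gold: List[Token]) -> int:
    """Count reference spans that the current analysis fails to reproduce."""
    return len(spans(gold) - spans(tokens))


def learn(tokens: List[Token], gold: List[Token], candidates: list):
    """Greedily select rules until none reduces the error count further."""
    learned, current = [], list(tokens)
    while True:
        best = min(candidates, key=lambda r: errors(r.apply(current), gold),
                   default=None)
        if best is None or errors(best.apply(current), gold) >= errors(current, gold):
            return learned, current
        current = best.apply(current)
        learned.append(best)


# Toy data (invented): the segmenter split a team name into two tokens.
system_output = [("上海", "ns"), ("申", "v"), ("花", "n"), ("队", "n")]
gold = [("上海", "ns"), ("申花", "nz"), ("队", "n")]
candidates = [MergeRule("v", "n", "nz"), RetagRule("n", "v", "ns")]

learned, corrected = learn(system_output, gold, candidates)
print(learned)
print(corrected)
```

In a fuller setting the candidate rules would be generated from templates over a training corpus rather than listed by hand, and the learned rule sequence would be applied to new segmenter and tagger output before entity recognition.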

Bibliographic record

Source
5th International Workshop on the Internet Challenge: Technology and Applications, Oct 8-9, 2002, Berlin, Germany | 2002 | pp. 29-36 | 8 pages
Conference location: Berlin (DE)
Authors
Tianfang Yao; Wei Ding; Gregor Erbach
Author affiliation
Computational Linguistics Department, Saarland University, D-66041 Saarbruecken, Germany
Original format: PDF
Language of text: eng
Chinese Library Classification: Automation technology, computer technology
Keywords
information extraction; named entity recognition; machine learning; finite-state cascades
