首页> 外文会议>5th International Workshop on the Internet Challenge: Technology and Applications Oct 8-9, 2002 Berlin, Germany >CORRECTING WORD SEGMENTATION AND PART-OF-SPEECH TAGGING ERRORS FOR CHINESE NAMED ENTITY RECOGNITION
【24h】

CORRECTING WORD SEGMENTATION AND PART-OF-SPEECH TAGGING ERRORS FOR CHINESE NAMED ENTITY RECOGNITION

机译:纠正中文命名实体识别中的单词分词和词性标记错误

获取原文
获取原文并翻译 | 示例

摘要

In the exploration of Chinese named entity recognition for a specific domain, the authors found that the errors caused during word segmentation and part-of-speech (POS) tagging have obstructed the improvement of the recognition performance. In order to further enhance recognition recall and precision, the authors propose an error correction approach for Chinese named entity recognition. In the error correction component, transformation-based machine learning is adopted because it is suitable to fix Chinese word segmentation and POS tagging errors and produce effective correcting rules automatically. The Chinese named entity recognition component utilizes Finite-State Cascades which are automatically constructed by POS rules with semantic constraints. A prototype system, CNERS (Chinese Named Entity Recognition System), has been implemented. The experimental result shows that the recognition performance of most named entities have significantly been improved. On the other hand, the system is also fast and reliable.
机译:在对特定领域的中文命名实体识别的探索中,作者发现,在分词和词性(POS)标记过程中引起的错误阻碍了识别性能的提高。为了进一步提高识别的查全率和准确性,作者提出了一种用于中文命名实体识别的纠错方法。在纠错组件中,采用基于变换的机器学习,因为它适合修复中文分词和POS标签错误并自动生成有效的纠正规则。中文命名实体识别组件利用有限状态级联,该级联由具有语义约束的POS规则自动构建。已经实现了原型系统CNERS(中文命名实体识别系统)。实验结果表明,大多数命名实体的识别性能已得到显着提高。另一方面,该系统也是快速而可靠的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号