Conference: International Conference on Asian Language Processing

Quality Assurance for Segmentation and Tagging of Chinese Novels in the Ming and Qing Dynasties

Abstract

This paper presents a word segmentation and named entity tagging project that annotates Chinese novels of the Ming and Qing dynasties. Computer-aided tools assist the annotation. The paper focuses on the quality assurance measures used to ensure precision and consistency. The specification for word segmentation and named entity tagging is formulated from the standards for modern Chinese segmentation commonly used in Mainland China and Taiwan, together with an analysis of the differences between Chinese classics and modern Chinese. The specification is established through iterative refinement. This refinement process offers valuable insights into the quality control of computer-aided processing of Chinese literary works of the Ming and Qing dynasties, and it can be applied to works from even earlier periods. The finalized corpus, built through computer-aided annotation with manual review in accordance with the specification, can be used for research in literature, linguistics, information technology, and the teaching of Chinese.
