首页> 外文会议>Chinese lexical semantics workshop >The Construction Scheme of a Graded Spoken Interaction Corpus for Mandarin Chinese
【24h】

The Construction Scheme of a Graded Spoken Interaction Corpus for Mandarin Chinese

机译:汉语分级口语互动语料库的建设方案

获取原文

摘要

This paper introduces the construction scheme of a graded spoken interaction corpus for Mandarin Chinese. Material selection and collection principals, corpus annotation and assistant software development are explained. This paper also points out the important and difficult issues in the construction process. The corpus proposed in this paper consists of 1 million words (transcribed from 1.5 TB data), and it is graded and tagged with interaction annotation. This corpus can provide naturally occurring interactions with transcriptions and annotations for researchers, by which the quantitative analysis of SI can be realized. In addition, exemplars grading according to Conversation Analysis (CA) is also provided in the corpus for the reference of other researchers.
机译:本文介绍了普通话分级口语互动语料库的构建方案。解释了材料选择和收集原理,语料库注释和辅助软件开发。本文还指出了施工过程中的重要难题。本文提出的语料库由100万个单词组成(从1.5 TB数据中转录而来),并对其进行了分级和标记,并带有交互注释。该语料库可以为研究人员提供与转录和注解自然发生的相互作用,从而可以实现对SI的定量分析。此外,语料库中还提供了根据会话分析(CA)进行的示例评分,供其他研究人员参考。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号