首页> 外文会议>European Conference on Speech Communication and Technology >Construction of an Advanced In-Car Spoken Dialogue Corpus and its Characteristic Analysis
【24h】

Construction of an Advanced In-Car Spoken Dialogue Corpus and its Characteristic Analysis

机译:建设先进的车载口语对话语料库及其特征分析

获取原文

摘要

This paper describes an advanced spoken language corpus which has been constructed by enhancing an in-car speech database. The corpus has the following characteristic features: (1) Advanced tag: Not only linguistic phenomena tags but also advanced discourse tags such as sentential structures, and utterance intentions, have been provided for the transcribed texts. (2) Large-scale: The sentential structures and the intentions are currently provided for 45,053 phrases and 35,421 utterance units, respectively. (3) Multi-layer: The corpus consists of different levels of spoken language data such as speech signals, transcribed texts, sentential structures, intentional markers and dialogue structures, moreover, they are related with each other. It allows a very wide variety of analysis of spontaneous spoken dialogue to utilize the multi-layered corpus. This paper also reports the result of investigation of the corpus, especially, for-cusing on the relations between the syntactic style and the intentional style of spoken utterances.
机译:本文介绍了一种通过增强车载语音数据库构建的先进口语语言语言。语料库具有以下特征特征:(1)高级标签:不仅是语言现象标签,而且还为转录文本提供了先进的话语标签,如句子结构和话语意图。 (2)大规模:目前分别为45,053短语和35,421个话语单位提供了句子结构和意图。 (3)多层:语料库包括不同级别的口语语言数据,如语音信号,转录文本,句子结构,故意标记和对话结构,而且它们彼此相关。它允许对自发口头对话进行各种各样的分析来利用多层语料库。本文还报告了对语料库调查的结果,特别是对句法风格与口语话语的故意风格之间的关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号