首页> 外文会议>9th International conference on language resources and evaluation >Towards Automatic Transformation between Different Transcription Conventions: Prediction of Intonation Markers from Linguistic and Acoustic Features
【24h】

Towards Automatic Transformation between Different Transcription Conventions: Prediction of Intonation Markers from Linguistic and Acoustic Features

机译:在不同转录约会之间的自动转换:语言和声学特征的语调标记预测

获取原文

摘要

Because of the tremendous effort required for recording and transcription, large-scale spoken language corpora have been hardly developed in Japanese, with a notable exception of the Corpus of Spontaneous Japanese (CSJ). Various research groups have individually developed conversation corpora in Japanese, but these corpora are transcribed by different conventions and have few annotations in common, and some of them lack fundamental annotations, which are prerequisites for conversation research. To solve this situation by sharing existing conversation corpora that cover diverse styles and settings, we have tried to automatically transform a transcription made by one convention into that made by another convention. Using a conversation corpus transcribed in both the Conversation-Analysis-style (CA-style) and CSJ-style, we analyzed the correspondence between CA's 'intonation markers' and CSJ's 'tone labels,' and constructed a statistical model that converts tone labels into intonation markers with reference to linguistic and acoustic features of the speech. The result showed that there is considerable variance in intonation marking even between trained transcribers. The model predicted with 85% accuracy the presence of the intonation markers, and classified the types of the markers with 72% accuracy.
机译:由于录制和转录所需的巨大努力,大规模的语言语言在日语中几乎没有开发,具有显着的例外的自发日本语料库(CSJ)。各种研究小组在日语中单独开发对话集团,但这些Corpora被不同的公约转录,很少有共同的注释,其中一些缺乏基本的诠释,这是对话研究的先决条件。为了通过分享涵盖不同风格和设置的现有对话语料来解决这种情况,我们试图自动转换一份公约的转录,以其他公约作出的。使用在对话 - 分析风格(CA-STYLE)和CSJ式中转录的对话语言,我们分析了CA的“语调标记”和CSJ的“音调标签”之间的对应关系,并构建了一种将音标标签转换为的统计模型语调标记参考语言语言和声学特征。结果表明,即使在训练的转录器之间也存在狭窄标记的相当差异。该模型预测了85%的准确性,语调标记的存在,并分类了72%的准确度的标记类型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号