Conference: 2012 IEEE Region 10 Conference: Sustainable Development through Humanitarian Technology

Developing a children's Filipino speech corpus for application in automatic detection of reading miscues and disfluencies

Abstract

Recognizing the potential of current speech processing technology to improve children's literacy, researchers have in the past few years devoted their efforts to developing reading miscue detectors (RMDs) and automated reading tutors (ARTs). A primary challenge in developing speech technologies for children, however, may be the unavailability of a dedicated children's speech corpus that can be used for system design and testing. In the past few years, children's speech corpora have been developed for languages such as English, Dutch, Mandarin Chinese, Italian, German and Swedish. Because Filipino has features and an orthography that are distinct from those of other languages, this study focuses on the development of a children's Filipino speech corpus (CFSC). In this paper, we present the CFSC design, reading text, data collection procedure and speech transcription method. We also perform an initial analysis of the reading miscues and disfluencies found in the CFSC. The results of the miscue analysis suggest possible ways of modeling the reading miscues and possible methods for detecting them, among them acoustic model likelihood calculation and analysis of duration-based prosodic features. The CFSC presented in this study will be used for the development of an RMD and an ART for Filipino.
