首页> 外文会议>2016 Conference of The Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Technique >Structural analysis of Hindi phonetics and a method for extraction of phonetically rich sentences from a very large Hindi text corpus
【24h】

Structural analysis of Hindi phonetics and a method for extraction of phonetically rich sentences from a very large Hindi text corpus

机译:印地语语音的结构分析和从非常大的印地语文本语料库中提取语音丰富的句子的方法

获取原文
获取原文并翻译 | 示例

摘要

Automatic speech recognition (ASR) and Text to speech (TTS) are two prominent area of research in human computer interaction nowadays. A set of phonetically rich sentences is in a matter of importance in order to develop these two interactive modules of HCI. Essentially, the set of phonetically rich sentences has to cover all possible phone units distributed uniformly. Selecting such a set from a big corpus with maintaining phonetic characteristic based similarity is still a challenging problem. The major objective of this paper is to devise a criteria in order to select a set of sentences encompassing all phonetic aspects of a corpus with size as minimum as possible. First, this paper presents a statistical analysis of Hindi phonetics by observing the structural characteristics. Further a two stage algorithm is proposed to extract phonetically rich sentences with a high variety of triphones from the EMILLE Hindi corpus. The algorithm consists of a distance measuring criteria to select a sentence in order to improve the triphone distribution. Moreover, a special preprocessing method is proposed to score each triphone in terms of inverse probability in order to fasten the algorithm. The results show that the approach efficiently build uniformly distributed phonetically-rich corpus with optimum number of sentences.
机译:自动语音识别(ASR)和文本语音转换(TTS)是当今人机交互研究的两个重要领域。为了开发HCI的这两个交互模块,一组语音丰富的句子非常重要。本质上,这组语音丰富的句子必须覆盖所有可能均匀分布的电话单元。从大型语料库中选择这样的集合并保持基于语音特征的相似性仍然是一个难题。本文的主要目的是设计一种标准,以选择一组涵盖语料库所有语音方面的句子,并且其大小应尽可能小。首先,本文通过观察结构特征来对印地语语音进行统计分析。此外,提出了一种两阶段算法,用于从EMILLE印地语语料库中提取具有多种三音素的丰富语音的句子。该算法包括一个测距标准以选择一个句子,以改善三音素的分布。此外,为了固定算法,提出了一种特殊的预处理方法来对每个三音手机进行逆概率评分。结果表明,该方法有效地构建了具有最佳句子数的,均匀分布的语音丰富的语料库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号