Data mining Mandarin tone contour shapes

Abstract

In spontaneous speech, Mandarin tones that belong to the same tone category may exhibit many different contour shapes. We explore the use of data mining and NLP techniques for understanding the variability of tones in a large corpus of Mandarin newscast speech. First, we adapt a graph-based approach to characterize the clusters (fuzzy types) of tone contour shapes observed in each tone n-gram category. Second, we show correlations between these realized contour shape types and a bag of automatically extracted linguistic features. We discuss the implications of the current study within the context of phonological and information theory.
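The abstract does not say which graph-based method the authors adapt, so the following is only a minimal sketch of the general idea, under stated assumptions: each tone contour is a short, equal-length f0 track; contours are centered and scaled so that only shape matters; an edge links two contours whose Euclidean distance falls below a hypothetical `threshold`; and connected components of that graph stand in for contour shape types. The function names, the threshold, and the toy data are illustrative, and connected components yield hard clusters rather than the fuzzy types the abstract mentions.

```python
from itertools import combinations
from math import sqrt


def normalize(contour):
    """Center and scale one f0 contour so only its shape remains."""
    mean = sum(contour) / len(contour)
    centered = [x - mean for x in contour]
    norm = sqrt(sum(x * x for x in centered))
    return [x / norm for x in centered] if norm > 0 else centered


def distance(a, b):
    """Euclidean distance between two equal-length contours."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cluster_contours(contours, threshold=0.5):
    """Link contours closer than `threshold`; return connected components.

    Assumption: this is a stand-in for the paper's unspecified method.
    """
    shapes = [normalize(c) for c in contours]
    neighbors = {i: set() for i in range(len(shapes))}
    for i, j in combinations(range(len(shapes)), 2):
        if distance(shapes[i], shapes[j]) < threshold:
            neighbors[i].add(j)
            neighbors[j].add(i)

    clusters, seen = [], set()
    for start in neighbors:
        if start in seen:
            continue
        # Depth-first walk collects every contour reachable from `start`.
        stack, component = [start], []
        seen.add(start)
        while stack:
            node = stack.pop()
            component.append(node)
            for nxt in neighbors[node] - seen:
                seen.add(nxt)
                stack.append(nxt)
        clusters.append(sorted(component))
    return clusters


# Toy f0 tracks in Hz, made up for illustration: two rising, two falling.
toy = [
    [100, 110, 120, 130],
    [200, 215, 230, 245],
    [130, 120, 110, 100],
    [250, 230, 215, 200],
]
clusters = cluster_contours(toy)
```

On this toy input the two rising contours land in one component and the two falling contours in another, regardless of their pitch range, because normalization removes both offset and scale. A soft clustering over the same graph would be closer to the "fuzzy types" wording, but that choice is not recoverable from the abstract.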
