Discovering salient prosodic cues and their interactions for automatic story segmentation in Mandarin broadcast news

Lei Xie

Abstract

This paper investigates speech prosody for automatic story segmentation in Mandarin broadcast news. Prosodic cues that have been used effectively in English story segmentation deserve re-investigation, since the lexical tones of Mandarin may complicate the expression of pitch declination and reset. Our data-oriented study shows that speaker-normalized pitch features cannot clearly discriminate story boundaries from utterance boundaries, because these features vary widely across the different Mandarin syllable tones. We therefore propose speaker- and tone-normalized pitch features, which provide a clear separation between utterance and story boundaries. Our study also shows that speaker-normalized pause duration is quite effective at separating story boundaries from utterance boundaries, while speaker-normalized speech energy and syllable duration are not effective. Experiments using decision trees for story boundary detection reinforce the difference between English and Chinese, i.e., speaker- and tone-normalized pitch features should be preferred in Mandarin story segmentation. We show that combining different prosodic cues can achieve a very high F-measure of 93.04% owing to the complementarity between pause, pitch and energy. Analysis of the decision tree uncovered five major heuristics that show how speakers jointly use pause duration and pitch to separate speech into stories.
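The abstract does not give the formula behind the speaker- and tone-normalized pitch features. As a rough illustration only, one plausible reading is a z-score computed within each (speaker, tone) group, so that a syllable's pitch is compared only with other syllables of the same tone from the same speaker. The function name `normalize_pitch`, the tuple layout, and the choice of z-scoring are assumptions made for this sketch, not the paper's method.

```python
from collections import defaultdict
from statistics import mean, pstdev


def normalize_pitch(samples):
    """Z-score pitch values within each (speaker, tone) group.

    `samples` is a list of (speaker, tone, f0_hz) tuples. This layout is
    hypothetical and chosen for the sketch; it is not taken from the paper.
    Returns one normalized value per input sample, in input order.
    """
    # Collect raw pitch values for every (speaker, tone) pair.
    groups = defaultdict(list)
    for speaker, tone, f0 in samples:
        groups[(speaker, tone)].append(f0)

    # Mean and population standard deviation for each group.
    stats = {key: (mean(vals), pstdev(vals)) for key, vals in groups.items()}

    normalized = []
    for speaker, tone, f0 in samples:
        mu, sigma = stats[(speaker, tone)]
        # A group with no spread carries no information; map it to 0.0.
        normalized.append((f0 - mu) / sigma if sigma > 0 else 0.0)
    return normalized


if __name__ == "__main__":
    demo = [
        ("spk1", 1, 200.0), ("spk1", 1, 220.0),  # high-level tone
        ("spk1", 4, 150.0), ("spk1", 4, 170.0),  # falling tone
    ]
    print(normalize_pitch(demo))
```

In this toy input the two tones have different raw pitch ranges, yet after grouping by tone both map onto the same normalized scale, which is the kind of tone-induced variation the abstract says speaker normalization alone leaves in place.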

Bibliographic details

Source: Multimedia Systems, 2008, Issue 4, pp. 237-253 (17 pages)
Author: Lei Xie
Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
Original format: PDF
Language: English
Keywords: multimedia retrieval; spoken document retrieval; speech prosody; story segmentation; topic segmentation
Added to database: 2022-08-18 02:06:54

Similar literature

1. Integrating Prosodic and Lexical Cues for Automatic Topic Segmentation [J]. Gokhan Tur, Dilek Hakkani-Tur, Andreas Stolcke. Computational Linguistics, 2001, No. 1.
2. Automatic salient object segmentation using saliency map and color segmentation [J]. HAN Sung-ho, JUNG Gye-dong, LEE Sangh-yuk. Journal of Central South University (English Edition), 2013, No. 9.
3. Classification Program and Story Boundaries Segmentation in TV News Broadcast Videos via Deep Convolutional Neural Network [J]. Mounira Hmayda, Ridha Ejbali, Mourad Zaied. Journal of computer sciences, 2020, No. 5.
4. A Subword Normalized Cut Approach to Automatic Story Segmentation of Chinese Broadcast News [C]. Jin Zhang, Lei Xie, Wei Feng. Information Retrieval Technology, 2009.
5. Saliency Cut: an Automatic Approach for Video Object Segmentation Based on Saliency Energy Minimization [D]. Wang, Yilin. 2013.
6. The Influence of Different Prosodic Cues on Word Segmentation [O]. Theresa Matzinger, Nikolaus Ritt, W. Tecumseh Fitch. 2021.
7. Combined Use of Speaker- and Tone-Normalized Pitch Reset with Pause Duration for Automatic Story Segmentation in Mandarin Broadcast News [O]. Lei Xie, Chuan Liu, Helen Meng. 2009.
8. Integrating Prosodic and Lexical Cues for Automatic Topic Segmentation [R]. Tur, G., Stolcke, A., Hakkani-Tur, D. 2001.
