A PRELIMINARY STUDY OF PROSODY-BASED DETECTION OF QUESTIONS IN ARABIC SPEECH MONOLOGUES

Omair Khan; Wasfi G. Al-Khatib; Lahouari Cheded

Abstract

Prosodic features have been widely used in many speech-related applications, including speaker and word recognition, emotion and accent identification, topic and sentence segmentation, and text-to-speech synthesis. Languages other than Arabic have received most of the attention in this regard. The application of prosodic features investigated here is the identification of question sentences in Arabic monologue lectures. To the best of our knowledge, this is the first attempt at question detection in spoken lectures in any language. To this end, we developed a small corpus of 1028 utterances extracted from 15 Arabic spoken lectures. We approach the problem by first segmenting the continuous speech (recorded lectures) into sentences using both intensity and duration features. Prosodic features are then extracted from each sentence and used as input to four different classifiers, which label each sentence as either a question or a non-question. Our results suggest that questions are cued by more than one type of prosodic feature in spontaneous Arabic speech. We classified questions with an accuracy of 77.43%. A feature-specific analysis further reveals that energy and fundamental frequency (F0) features are mainly responsible for discriminating between question and non-question sentences. In terms of classification, we found that a Bayes Network performs better than support vector machines, multi-layer perceptron neural networks, or decision trees on our dataset. Removing correlated features through Correlation-based Feature Selection produced more efficient and more accurate results than the complete feature set.
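
The segmentation step described in the abstract (splitting continuous speech into sentences using intensity and duration) could look roughly like the sketch below. This is only an illustration under stated assumptions, not the authors' implementation: the function name `segment_by_pauses`, the frame length, the 40 dB silence threshold, and the 0.3 s minimum pause are all hypothetical values chosen for the example, and the input is assumed to be a frame-level intensity contour already computed by some other tool.

```python
from typing import List, Tuple


def segment_by_pauses(
    intensity_db: List[float],
    frame_s: float = 0.01,      # assumed frame step in seconds
    silence_db: float = 40.0,   # assumed intensity threshold for silence
    min_pause_s: float = 0.3,   # assumed minimum pause duration for a boundary
) -> List[Tuple[float, float]]:
    """Split a frame-level intensity contour into speech segments.

    A boundary is placed wherever intensity stays below `silence_db`
    for at least `min_pause_s` seconds; shorter dips stay inside the
    current segment. Returns (start_s, end_s) pairs in seconds.
    """
    min_pause_frames = max(1, round(min_pause_s / frame_s))
    segments: List[Tuple[float, float]] = []
    start = None        # index of the first frame of the open segment
    last_voiced = None  # index of the most recent frame above threshold

    for i, level in enumerate(intensity_db):
        if level >= silence_db:
            if start is None:
                start = i
            last_voiced = i
        elif start is not None and i - last_voiced >= min_pause_frames:
            # The pause is long enough: close the segment at the last loud frame.
            segments.append((start * frame_s, (last_voiced + 1) * frame_s))
            start = None

    # Close a segment that is still open when the signal ends.
    if start is not None:
        segments.append((start * frame_s, (last_voiced + 1) * frame_s))
    return segments


if __name__ == "__main__":
    # Toy contour: speech, a long pause, speech with a short dip inside it.
    contour = [60.0] * 5 + [20.0] * 3 + [60.0] * 4 + [20.0] * 1 + [60.0] * 2
    for seg_start, seg_end in segment_by_pauses(contour, frame_s=0.1):
        print(f"{seg_start:.1f}s to {seg_end:.1f}s")
```

In the toy contour, the three-frame pause splits the signal while the single-frame dip does not, which is the role the duration criterion plays alongside intensity. Prosodic features (for example energy and F0 statistics) would then be computed per segment before classification.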

Bibliographic details

Source
The Arabian Journal for Science and Engineering, 2010, Issue 2C, pp. 167-181 (15 pages)
Authors
Omair Khan; Wasfi G. Al-Khatib; Lahouari Cheded
Author affiliations

King Fahd University of Petroleum & Minerals, Dhahran, Saudi Arabia 31261;

King Fahd University of Petroleum & Minerals, Dhahran, Saudi Arabia 31261;

King Fahd University of Petroleum & Minerals, Dhahran, Saudi Arabia 31261;

Indexed in: Science Citation Index (SCI), USA
Original format: PDF
Language of text: English
Keywords
question detection; prosodic analysis; audio monologues; arabic lectures; learning algorithms;

