VIDEO AND AUDIO BASED DETECTION OF FILLED HESITATION PAUSES IN CLASSROOM LECTURES

机译：基于视频和音频的课堂演讲中犹豫不决暂停的检测

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper we study the detection of hesitation filled pauses in oral presentations of university lectures taught in the Greek language and recorded using a tablet PC via a specialized software. We suggest a hierarchical approach fusing video data with audio data for increasing the precision rate in our detection system. The detection method works at frame level rather than the usual segmental level for more accurate synchronization of audio and video data after removing the detected hesitations. Audio characteristics are modeled using Gaussian Mixture Models while the stationarity of the recorded video is taken into account. This efficient video and audio combination yields higher precision and recall rates comparing with other works in the literature. On a dataset of approximately 7 hours the precision rate is 99.6% while the recall rate is 84.7% when audio and video data are taken into account.

机译：在本文中，我们研究了在希腊语授课的大学演讲的口头演示中检测到的犹豫填充停顿的情况，并使用平板电脑通过专用软件进行记录。我们建议采用一种分层方法，将视频数据与音频数据融合在一起，以提高检测系统的准确率。该检测方法在帧级别而不是通常的分段级别上工作，以便在消除检测到的犹豫之后更准确地同步音频和视频数据。使用高斯混合模型对音频特性进行建模，同时考虑录制视频的平稳性。与文献中的其他作品相比，这种有效的视频和音频组合产生了更高的精度和召回率。在大约7个小时的数据集上，考虑到音频和视频数据，查准率是99.6％，召回率是84.7％。

著录项

来源
《European signal processing conference;EUSIPCO 2009》|2010年|p.834-838|共5页
会议地点
作者
Vassilis Tsiaras; Costas Panagiotakis; Yannis Stylianou;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词

相似文献

外文文献
中文文献
专利

1. Audio-video based character recognition for handwritten mathematical content in classroom videos [J] . Smita Vemulapalli, Monson Hayes Integrated Computer-Aided Engineering . 2014,第3期

机译：基于音频视频的字符识别，用于教室视频中的手写数学内容
2. The Flipped Classroom: A Comparison of Student Performance Using Instructional Videos and Podcasts versus the Lecture-Based Model of Instruction [J] . Retta Guy, Gerald Marquis Journal of issues in informing science & information technology . 2016,第期

机译：翻转课堂：使用教学视频和播客的学生表现与基于授课的教学模型的比较
3. Flipped Classroom: A Comparison Of Student Performance Using Instructional Videos And Podcasts Versus The Lecture-Based Model Of Instruction [J] . Retta Guy, Gerald Marquis Issues in Informing Science and Information Technology . 2016,第Suppa2期

机译：翻转课堂：使用教学视频和播客的学生表现与基于讲座的教学模型的比较
4. Video and audio based detection of filled hesitation pauses in classroom lectures [C] . Tsiaras Vassilis, Panagiotakis Costas, Stylianou Yannis European Signal Processing Conference . 2009

机译：基于视频和音频的教室讲座中的犹豫暂停检测
5. The efficient classroom: How team-based learning and lecture video acceleration affect the learning efficiency and effectiveness of a first-year engineering course. [D] . Jacobson, Benjamin Paul. 2015

机译：高效的课堂：基于团队的学习和演讲视频加速如何影响一年级工程课程的学习效率和有效性。
6. Filled Pause Refinement Based on the Pronunciation Probability for Lecture Speech [O] . Yan-Hua Long, Hong Ye -1

机译：基于语音讲话概率的填充式暂停细化
7. Video And Audio Based Detection of Filled Hesitation Pauses in Classroom Lectures [O] . Panagiotakis Costas, Stylianou Yiannis, Tsiaras Vassilis 2009

机译：基于视频和音频的课堂演讲中充满犹豫的暂停的检测

VIDEO AND AUDIO BASED DETECTION OF FILLED HESITATION PAUSES IN CLASSROOM LECTURES

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅