Journal: IEEE Transactions on Audio, Speech, and Language Processing

Low-Latency Real-Time Meeting Recognition and Understanding Using Distant Microphones and Omni-Directional Camera

Abstract

This paper presents our real-time meeting analyzer for monitoring conversations in an ongoing group meeting. The goal of the system is to recognize automatically “who is speaking what” in an online manner for meeting assistance. Our system continuously captures the utterances and face poses of each speaker using a microphone array and an omni-directional camera positioned at the center of the meeting table. Through a series of advanced audio processing operations, an overlapping speech signal is enhanced and the components are separated into individual speaker's channels. Then the utterances are sequentially transcribed by our speech recognizer with low latency. In parallel with speech recognition, the activity of each participant (e.g., speaking, laughing, watching someone) and the circumstances of the meeting (e.g., topic, activeness, casualness) are detected and displayed on a browser together with the transcripts. In this paper, we describe our techniques and our attempt to achieve the low-latency monitoring of meetings, and we show our experimental results for real-time meeting transcription.
