首页> 外文会议>International Conference on Intelligent Sustainable Systems >Deep Learning based Identification of Primary Speaker in Voice-Controlled Devices
【24h】

Deep Learning based Identification of Primary Speaker in Voice-Controlled Devices

机译:语音控制设备中初级扬声器的深度学习识别

获取原文

摘要

At present, so many low budget recording devices available, recording lectures, meetings, conferences and events have become a very easy option for everyone but also at the same time these devices lead to unclear speeches. An advanced methodology of identifying the primary speaker is introduced to record noisy audio file given in any regional language. The speaker turns and speaker lengths are examples of features which provide greater insight in the detection of primary speakers. This also provides the transcript of the primary speaker audio chunk to the end-user. Speaker diarization is the process of identifying various chunks in given audio belonging to different homogenous speakers where the count of speakers is unknown. This process is a mixture of segmentation and clustering. Speech segmentation detects the speaker change points followed by grouping them based on the speaker. Thus, Speaker Diarization is the most important step in the Identification of the primary speaker.
机译:目前,可提供许多低预算记录设备,录制讲座,会议,会议和事件已成为每个人的一个非常简单的选择,而且还同时这些设备导致语音不明确。介绍了识别初级扬声器的先进方法,以记录以任何区域语言给出的噪声音频文件。扬声器转弯和扬声器长度是在初级扬声器的检测方面提供更大的洞察力的特征示例。这也为最终用户提供了主扬声器音频块的转录程序。扬声器日期是识别给定音频的各种块的过程,属于不同的同质扬声器,其中扬声器的计数未知。该过程是分段和聚类的混合物。语音分割检测扬声器改变点,然后基于扬声器对它们进行分组。因此,扬声器日期是初级扬声器识别中最重要的一步。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号