Deep Learning based Identification of Primary Speaker in Voice-Controlled Devices

Abstract

With so many low-budget recording devices now available, recording lectures, meetings, conferences and events has become easy for everyone, but these devices often produce unclear speech. A methodology is introduced for identifying the primary speaker in a noisy audio file recorded in any regional language. Speaker turns and speaker lengths are examples of features that provide greater insight into the detection of primary speakers. The method also provides the end user with a transcript of the primary speaker's audio chunks. Speaker diarization is the process of identifying the chunks of a given audio file that belong to different homogeneous speakers when the number of speakers is unknown. The process combines segmentation and clustering: speech segmentation detects the speaker change points, and the resulting segments are then grouped by speaker. Speaker diarization is thus the most important step in identifying the primary speaker.
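As a rough illustration of how speaker turns and speaker lengths could feed a primary-speaker decision, the sketch below assumes diarization has already produced labeled segments. The `Segment` type, the function names, and the rule "longest total speaking time, with turn count as a tie-breaker" are assumptions made for this example; the abstract does not state the authors' actual decision rule.

```python
from collections import defaultdict
from typing import NamedTuple, Optional


class Segment(NamedTuple):
    """One diarized chunk (hypothetical structure): speaker label and times in seconds."""
    speaker: str
    start: float
    end: float


def speaker_stats(segments):
    """Return (total speaking time, number of turns) per speaker.

    Consecutive segments from the same speaker count as a single turn.
    """
    total = defaultdict(float)
    turns = defaultdict(int)
    previous = None
    for seg in sorted(segments, key=lambda s: s.start):
        total[seg.speaker] += seg.end - seg.start
        if seg.speaker != previous:
            turns[seg.speaker] += 1
            previous = seg.speaker
    return dict(total), dict(turns)


def primary_speaker(segments) -> Optional[str]:
    """Pick the speaker with the longest total speech; break ties by turn count.

    This ranking rule is an assumption for illustration only.
    """
    total, turns = speaker_stats(segments)
    if not total:
        return None
    return max(total, key=lambda spk: (total[spk], turns[spk]))


if __name__ == "__main__":
    demo = [
        Segment("A", 0.0, 5.0),
        Segment("B", 5.0, 7.0),
        Segment("A", 7.0, 12.0),
        Segment("A", 12.0, 13.0),
        Segment("C", 13.0, 14.0),
    ]
    print(speaker_stats(demo))
    print(primary_speaker(demo))
```

In a full pipeline, only the segments of the selected speaker would then be passed to a speech recognizer to produce the transcript.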

Bibliographic details

Source
International Conference on Intelligent Sustainable Systems | 2020 | pp. 297-301 | 5 pages
Authors
Kavya Khatter; Daksha Singhal; Jayashree R.
Original format: PDF
Keywords
Speech processing; Signal processing; Feature extraction; Conferences; Acoustics; Mathematical model; Artificial intelligence

Similar literature

1. Deep Learning Based NLOS Identification With Commodity WLAN Devices [J]. Choi Jeong-Sik, Lee Woong-Hee, Lee Jae-Hyun. Fortschritte der Physik, 2018, No. 4.
2. SpeakerBeam: A New Deep Learning Technology for Extracting Speech of a Target Speaker Based on the Speaker's Voice Characteristics [J]. Marc Delcroix, Katerina Zmolikova, Keisuke Kinoshita. NTT Technical Review, 2018, No. 11.
3. Electromagnetic radiation-based IC device identification and verification using deep learning [J]. Hong-xin Zhang, Jia Liu, Jun Xu. Eurasip Journal on Wireless Communications and Networking, 2020, No. 1.
4. Evaluation of the deep nonlinear metric learning based speaker identification on the large scale of voiceprint corpus [C]. Feng Yong, Cai Xinyuan, Ji Ruifang. International Symposium on Chinese Spoken Language Processing, 2016.
5. Speaker Identification: Time-Frequency Analysis With Deep Learning [D]. Chen, Hui. 2018.
6. Deep learning-based smart speaker to confirm surgical sites for cataract surgeries: A pilot study [O]. Tae Keun Yoo, Ein Oh, Hong Kyu Kim. 2020.
7. Deep Learning Based NLOS Identification with Commodity WLAN Devices [O]. Choi, Jeong-Sik, Lee, Woong-Hee, Lee, Jae-Hyun. 2017.
