首页> 外文会议>IEEE International Conference on Internet of Things and Intelligence System >Speaker Recognition For Digital Forensic Audio Analysis Using Learning Vector Quantization Method

【24h】

Speaker Recognition For Digital Forensic Audio Analysis Using Learning Vector Quantization Method

机译：学习向量量化方法的数字法医音频分析中的说话人识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Presently, Biometric features are often used to identify suspects in law enforcement processes. One of these biometric features is Speaker Recognition. Speaker recognition is used to discriminate people by their voice. In this study, the problem that can be solved is how to classify audio sample that exist on the evidence with the voice of the suspect.In this final project is made a application's prototype that can be used to classify and in that case will be done speaker recognition technique (Speaker Recognition) to be able to classify the speaker's voice in the evidence and the voice of the suspect. The stages used to compare the sound is by extracting the sound features using the Mel-frequency Cepstral Coefficients (MFCC) method and using the Learning Vector Quantization Neural Network (JST-LVQ) method as the classification method of the voice extraction result.By using LVQ, the accuracy in recognition the speaker's voice is pretty good. The use of LVQ method produces best accuracy at 73,33% to recognize the speaker that with the same sentence, and 46,67% for different sentence. So the results obtained in accordance with the expected.

机译：当前，生物特征识别功能通常用于识别执法过程中的嫌疑人。这些生物特征之一是说话者识别。说话者识别用于通过语音区分人。在这项研究中，可以解决的问题是如何使用犯罪嫌疑人的声音对证据中存在的音频样本进行分类。在此最终项目中，将制作一个可用于分类的应用程序原型，在这种情况下将完成分类说话人识别技术（Speaker Recognition），能够将说话人的声音分为证据和嫌疑人的声音。用于比较声音的阶段是通过使用梅尔频率倒谱系数（MFCC）方法和使用学习矢量量化神经网络（JST-LVQ）方法作为声音提取结果的分类方法来提取声音特征。 LVQ，识别说话人声音的准确性非常好。 LVQ方法的使用产生的最佳准确度为73.33％，可以识别出具有相同句子的说话者，而对于不同句子，则可以达到46.67％。因此，结果符合预期。

著录项

来源
《IEEE International Conference on Internet of Things and Intelligence System 》|2018年|221-226|共6页
会议地点 Bali(ID)
作者
Danny Bastian Manurung; Burhanuddin Dirgantoro; Casi Setianingsih;
展开▼
作者单位

Telkom University Indonesia;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Speaker recognition; Mel frequency cepstral coefficient; Feature extraction; Testing; Vector quantization; Filter banks;

机译：说话人识别；梅尔频率倒谱系数；特征提取;测试；矢量量化；筛选银行;

相似文献

外文文献
中文文献
专利

1. SPEAKER RECOGNITION USING AUDIO SPECTRUM PROJECTION AND VECTOR QUANTIZATION [J] . Bikram Kar, Avishek Dey International journal of simulation: systems, science and technology . 2018 ,第4aaPagea1期

机译：使用音频频谱投影和矢量量化扬声器识别
2. Comparison of performance of five common classifiers represented as boundary methods: Euclidean Distance to Centroids, Linear Discriminant Analysis, Quadratic Discriminant Analysis, Learning Vector Quantization and Support Vector Machines, ... [J] . Sarah J. Dixon, Richard G. Brereton Chemometrics and Intelligent Laboratory Systems . 2009 ,第1期

机译：比较以边界法表示的五个常见分类器的性能：欧氏距质心的距离，线性判别分析，二次判别分析，学习向量量化和支持向量机，...
3. Time and spectral analysis methods with machine learning for the authentication of digital audio recordings [J] . KoryckiR. Forensic science international . 2013 ,第1a3期

机译：机器学习的时间和频谱分析方法，用于数字录音的认证
4. Speaker Recognition For Digital Forensic Audio Analysis Using Learning Vector Quantization Method [C] . Danny Bastian Manurung, Burhanuddin Dirgantoro, Casi Setianingsih International Conference on Internet of Things and Intelligence System . 2018

机译：使用学习矢量量化方法进行数字法医学分析的扬声器识别
5. Multimodal Sensing and Data Processing for Speaker and Emotion Recognition Using Deep Learning Models with Audio, Video and Biomedical Sensors [D] . Abtahi, Farnaz. 2018

机译：使用具有音频，视频和生物医学传感器的深度学习模型，对说话人和情感识别进行多模式传感和数据处理
6. Using an Optimized Learning Vector Quantization- (LVQ-) Based Neural Network in Accounting Fraud Recognition [O] . Yuan Zheng, Xiaolan Ye, Ting Wu 2021

机译：在会计欺诈识别中使用基于优化的学习矢量量化 - （LVQ-）神经网络
7. Analysis of Robust Soft Learning Vector Quantization and an application to Facial Expression Recognition [O] . de Vries Gert-Jan, Biehl Michael 2009

机译：鲁棒软学习矢量量化分析及其在面部表情识别中的应用
8. Noise Robust I-Vector Extractor Using Vector Taylor Series For Speaker Recognition. [R] . Lei, Y., Burget, L., Scheffer, N. 2013

机译：使用矢量泰勒级数进行说话人识别的噪声鲁棒I-向量提取器。

Speaker Recognition For Digital Forensic Audio Analysis Using Learning Vector Quantization Method

摘要

著录项

相似文献

相关主题

期刊订阅