首页> 外文会议>IEEE International Conference on Internet of Things and Intelligence System >Speaker Recognition For Digital Forensic Audio Analysis Using Learning Vector Quantization Method
【24h】

Speaker Recognition For Digital Forensic Audio Analysis Using Learning Vector Quantization Method

机译:学习向量量化方法的数字法医音频分析中的说话人识别

获取原文

摘要

Presently, Biometric features are often used to identify suspects in law enforcement processes. One of these biometric features is Speaker Recognition. Speaker recognition is used to discriminate people by their voice. In this study, the problem that can be solved is how to classify audio sample that exist on the evidence with the voice of the suspect.In this final project is made a application's prototype that can be used to classify and in that case will be done speaker recognition technique (Speaker Recognition) to be able to classify the speaker's voice in the evidence and the voice of the suspect. The stages used to compare the sound is by extracting the sound features using the Mel-frequency Cepstral Coefficients (MFCC) method and using the Learning Vector Quantization Neural Network (JST-LVQ) method as the classification method of the voice extraction result.By using LVQ, the accuracy in recognition the speaker's voice is pretty good. The use of LVQ method produces best accuracy at 73,33% to recognize the speaker that with the same sentence, and 46,67% for different sentence. So the results obtained in accordance with the expected.
机译:当前,生物特征识别功能通常用于识别执法过程中的嫌疑人。这些生物特征之一是说话者识别。说话者识别用于通过语音区分人。在这项研究中,可以解决的问题是如何使用犯罪嫌疑人的声音对证据中存在的音频样本进行分类。在此最终项目中,将制作一个可用于分类的应用程序原型,在这种情况下将完成分类说话人识别技术(Speaker Recognition),能够将说话人的声音分为证据和嫌疑人的声音。用于比较声音的阶段是通过使用梅尔频率倒谱系数(MFCC)方法和使用学习矢量量化神经网络(JST-LVQ)方法作为声音提取结果的分类方法来提取声音特征。 LVQ,识别说话人声音的准确性非常好。 LVQ方法的使用产生的最佳准确度为73.33%,可以识别出具有相同句子的说话者,而对于不同句子,则可以达到46.67%。因此,结果符合预期。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号