Conference: International Conference on Problems of Infocommunications. Science and Technology

Investigation of Informativeness and Stability of Mel-Frequency Cepstral Coefficients Estimates Based on Voice Signal Phase Data of Authentication System User



Abstract

This paper considers the problem of improving the reliability of access to resources delivered over infocommunication networks. The first barrier in ensuring reliable access is a high-quality user authentication system. Preference is currently given to access systems based on a user's biometric characteristics. Initially, priority was given to static biometric characteristics (face image, fingerprint pattern, and iris). These features did not meet the expectations of developers and users because they are easy to forge. Developers now focus mainly on dynamic (behavioral) biometric features and, above all, on voice authentication systems. Voice authentication systems have several significant advantages, including simplicity, compactness, and low cost. It is also important that the user's passphrase can be quickly changed and extended during voice authentication. However, the quality indicators of voice authentication systems, like those of all biometric access systems, do not meet increasing requirements. In voice authentication, the amplitude-frequency spectrum of the recorded voice signal is analyzed. Researchers focus mainly on using estimates of the pitch frequency and its associated formant frequencies, cepstral coefficients, mel-frequency cepstral coefficients, and linear prediction coefficients as a user template. Some attention is also paid to decision-making procedures based on Gaussian mixture models, support vector machines, hidden Markov models, or artificial neural networks. This work proposes supplementing the analysis of the amplitude-frequency spectrum with a study of phase data, which currently receives less attention in voice authentication.
The paper presents results on estimates of pitch frequency and mel-frequency cepstral coefficients based on the amplitude and phase information of the voice signal. The purpose of the work is to analyze the informativeness of the phase data of a voice signal and to study the stability of estimates of the user's template, primarily mel-frequency cepstral coefficients calculated from the phase data. The studies show high informativeness and stability of the investigated estimates, which underlines the importance of voice-signal phase information for improving the quality characteristics of voice authentication systems.
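The abstract does not say how the phase-based coefficients are computed, so the following is only a minimal sketch under stated assumptions, not the authors' method. It computes conventional mel-frequency cepstral coefficients from the power spectrum of one frame and, alongside them, an analogous set of coefficients from the unwrapped phase spectrum. The function names, the frame length, the filter count, and the choice to mel-weight the unwrapped phase without a logarithm are all hypothetical illustration choices.

```python
import numpy as np


def hz_to_mel(f):
    # Common HTK-style mel mapping.
    return 2595.0 * np.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular mel filters, shape (n_filters, n_fft // 2 + 1)."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0),
                             n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):      # rising edge of the triangle
            fbank[i - 1, k] = (k - left) / (center - left)
        for k in range(center, right):     # falling edge of the triangle
            fbank[i - 1, k] = (right - k) / (right - center)
    return fbank


def dct2(x, n_coeffs):
    """Unnormalized DCT-II of a 1-D vector, keeping the first n_coeffs terms."""
    n = len(x)
    k = np.arange(n_coeffs)[:, None]
    m = np.arange(n)[None, :]
    return np.cos(np.pi * k * (2 * m + 1) / (2 * n)) @ x


def frame_coefficients(frame, sample_rate, n_filters=20, n_coeffs=12, n_fft=512):
    """Return (amplitude-based MFCC, phase-based coefficients) for one frame."""
    windowed = frame * np.hamming(len(frame))
    spectrum = np.fft.rfft(windowed, n=n_fft)
    fbank = mel_filterbank(n_filters, n_fft, sample_rate)

    # Amplitude branch: conventional MFCC from the log mel power spectrum.
    power = np.abs(spectrum) ** 2
    amp_mfcc = dct2(np.log(fbank @ power + 1e-10), n_coeffs)

    # Phase branch (assumption): mel-weighted unwrapped phase, no logarithm
    # because phase values can be negative.
    phase = np.unwrap(np.angle(spectrum))
    phase_coeffs = dct2(fbank @ phase, n_coeffs)
    return amp_mfcc, phase_coeffs


if __name__ == "__main__":
    # Synthetic 25 ms frame at 16 kHz standing in for a voiced speech frame.
    t = np.arange(400) / 16000.0
    frame = np.sin(2 * np.pi * 200 * t) + 0.5 * np.sin(2 * np.pi * 1200 * t)
    amp, ph = frame_coefficients(frame, 16000)
    print(amp.shape, ph.shape)
```

In a stability study of the kind the abstract describes, one would presumably compute such coefficients over many frames and recordings of the same passphrase and compare their spread; that evaluation step is not sketched here because the abstract gives no detail on it.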
