首页> 外文学位 >Advanced signal processing techniques for speaker recognition and communications.
【24h】

Advanced signal processing techniques for speaker recognition and communications.

机译:用于说话人识别和通信的高级信号处理技术。

获取原文
获取原文并翻译 | 示例

摘要

Advanced signal processing techniques can help us well analyze signals of interests and perform proper operations on signals of interests for many useful applications.;In this dissertation, we aim at developing signal processing techniques for speaker recognition (e.g. feature extraction, classifier design) and for communications (e.g. filtering, modulation, beamforming). In the first part, we focus on speaker recognition. For gender identification, we proposed a pitch-based system with a two-stage classifier to ensure accurate identification and low complexity. Our pitch extraction algorithm is able to produce robust pitch estimations. The proposed system is speech language/content independent, microphone independent, and robust against noisy recording conditions. For large population speaker identification under noisy conditions, we proposed a fuzzy-clustering-based decision tree approach. Our approach aims at partitioning the whole population into subgroups in a hierarchical way. We only apply mel-frequency cepstral coefficients (MFCC) + Gaussian mixture model (GMM) to the leaf node which has a very small population size and hence MFCC+GMM is effective. To achieve a low probability of classification error, we adopted fuzzy clustering in constructing the decision tree, i.e., a speaker may belong to multiple nodes at each level of the tree. We derived six features (including pitch and five vocal source characteristics) and constructed a six-level decision tree. Compared to MFCC+GMM, our proposed approach achieves much higher accuracy with much less complexity.;In the second part, we study signal processing for communications. To address limitations of orthogonal frequency-division multiplexing (OFDM), we proposed a multi-carrier transceiver based on frequency-shift filter. Compared with OFDM, the proposed transceiver is much less sensitive to carrier frequency offset and has a lower peak-to-average ratio; moreover, the proposed transceiver has the advantage of being able to mitigate strong co-channel interference and strong narrowband interference. To improve the anti-jamming capability of a space-time block coding system over fading channels, we proposed to use Capon's beamforming to extract the intended signal while suppressing jamming signals coming from directions different from the intended signal. We evaluate the anti-jamming performance and the system cost with different numbers of array elements under different types of jamming signals.
机译:先进的信号处理技术可以帮助我们很好地分析感兴趣的信号并在感兴趣的信号上执行适当的操作,以用于许多有用的应用。本文旨在开发用于说话人识别的信号处理技术(例如特征提取,分类器设计)以及用于通信(例如,滤波,调制,波束成形)。在第一部分中,我们专注于说话人识别。对于性别识别,我们提出了一种基于音高的系统,该系统具有两级分类器,以确保准确的识别和低复杂度。我们的音高提取算法能够产生可靠的音高估计。所提出的系统是与语音语言/内容无关,与麦克风无关,并且在嘈杂的录音条件下具有鲁棒性。对于嘈杂条件下的大型说话人识别,我们提出了一种基于模糊聚类的决策树方法。我们的方法旨在以分层的方式将整个人群划分为子组。我们仅将mel-frequency倒谱系数(MFCC)+高斯混合模型(GMM)应用于人口规模很小的叶节点,因此MFCC + GMM是有效的。为了降低分类错误的可能性,我们在构建决策树时采用了模糊聚类,即说话人可能在树的每个级别上属于多个节点。我们推导了六个特征(包括音高和五个声源特征),并构建了一个六级决策树。与MFCC + GMM相比,我们提出的方法以更高的精度实现了更高的准确度。第二部分,我们研究了通信的信号处理。为了解决正交频分复用(OFDM)的局限性,我们提出了一种基于频移滤波器的多载波收发器。与OFDM相比,拟议的收发器对载波频率偏移的敏感度低得多,并且具有较低的峰均比。此外,所提出的收发器具有能够减轻强烈的同信道干扰和强烈的窄带干扰的优点。为了提高空时分组编码系统在衰落信道上的抗干扰能力,我们建议使用Capon波束成形来提取目标信号,同时抑制来自与目标信号不同方向的干扰信号。我们在不同类型的干扰信号下,使用不同数量的阵列元件评估抗干扰性能和系统成本。

著录项

  • 作者

    Hu, Yakun.;

  • 作者单位

    University of Florida.;

  • 授予单位 University of Florida.;
  • 学科 Engineering Electronics and Electrical.;Engineering Computer.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 208 p.
  • 总页数 208
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号