首页> 外文学位 >Shallow and deep learning for audio and natural language processing.
【24h】

Shallow and deep learning for audio and natural language processing.

机译:浅层和深度学习,用于音频和自然语言处理。

获取原文
获取原文并翻译 | 示例

摘要

Many machine learning algorithms can be viewed as optimization problems that seek the optimum hypothesis in a hypothesis space. To model the complex dependencies in real-world artificial intelligence tasks, machine learning algorithms are required to have high expressive power (high degrees of freedom or richness of a family of functions) and a large amount of training data. Deep learning models and kernel machines are regarded as models with high expressive power through the composition of multiple layers of nonlinearities and through nonlinearly mapping data to a high-dimensional space, respectively.;While the majority of deep learning work is focused on pure classification problems given input data, there are many other challenging Artificial Intelligence (AI) problems beyond classification tasks. In real-world applications, there are cases where we have structured relationships between and among input data and output targets, which have not been fully taken into account in deep learning models. On the other hand, though kernel machines involve convex optimization and have strong theoretical grounding in tractable optimization techniques, for large-scale applications, kernel machines often suffer from significant memory requirements and computational expense. Resolving the computational limitation and thereby enhancing the expressibility of kernel machines are important for large-scale real-world applications.;Learning models based on deep learning and kernel machines for audio and natural language processing tasks are developed in this dissertation. In particular, we address the challenges for deep learning with structured relationships among data and the computational limitations of large-scale kernel machines. A general framework is proposed to consider the relationship among output predictions and enforce constraints between a mixture input and output predictions for monaural source separation tasks. To model the structured relationships among inputs, the deep structured semantic models are introduced for an information retrieval task. Queries and documents are modeled as inputs to the deep learning models and the relevance is measured through the similarity at the output layer. A discriminative objective function is proposed to exploit the similarity and dissimilarity between queries and web documents. To address the scalability and efficiency of large-scale kernel machines, using deep architectures, ensemble models, and a scalable parallel solver are investigated to further scale-up kernel machines approximated by randomized feature maps. The proposed techniques are shown to match the expressive power of deep neural network based models in spoken language understanding and speech recognition tasks.
机译:许多机器学习算法可以看作是在假设空间中寻求最佳假设的优化问题。为了对现实世界中的人工智能任务中的复杂依赖性进行建模,要求机器学习算法具有较高的表达能力(较高的自由度或一系列功能的丰富性)和大量的训练数据。深度学习模型和内核机器被认为是具有高表达能力的模型,它们分别是由多层非线性构成以及通过将数据非线性映射到高维空间来实现的;尽管大多数深度学习工作都集中在纯分类问题上给定输入数据,除分类任务外,还有许多其他具有挑战性的人工智能(AI)问题。在实际应用中,有些情况下我们已经在输入数据和输出目标之间建立了结构化的关系,而在深度学习模型中并未充分考虑这些关系。另一方面,尽管内核计算机涉及凸优化,并且在易处理的优化技术中具有扎实的理论基础,但对于大规模应用程序,内核计算机通常会遭受大量内存需求和计算开销。解决计算上的限制,从而提高内核机器的可表达性对于大规模的现实应用具有重要意义。特别是,我们通过数据之间的结构化关系和大规模内核机器的计算限制来应对深度学习的挑战。提出了一个通用框架来考虑输出预测之间的关系,并为单声道源分离任务强制执行混合输入和输出预测之间的约束。为了对输入之间的结构化关系建模,引入了用于信息检索任务的深层结构化语义模型。将查询和文档建模为深度学习模型的输入,并通过输出层的相似性来衡量相关性。提出了一种判别目标函数来利用查询和Web文档之间的相似性和相异性。为了解决大规模内核计算机的可伸缩性和效率问题,使用深度体系结构,集成模型和可伸缩并行求解器进行了研究,以进一步扩展由随机特征图近似的内核计算机。所提出的技术显示出可与基于深度神经网络的模型在口语理解和语音识别任务中的表达能力相匹配。

著录项

  • 作者

    Huang, Po-Sen.;

  • 作者单位

    University of Illinois at Urbana-Champaign.;

  • 授予单位 University of Illinois at Urbana-Champaign.;
  • 学科 Computer science.;Artificial intelligence.;Electrical engineering.
  • 学位 Ph.D.
  • 年度 2015
  • 页码 111 p.
  • 总页数 111
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号