Shallow and deep learning for audio and natural language processing.

机译：浅层和深度学习，用于音频和自然语言处理。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Many machine learning algorithms can be viewed as optimization problems that seek the optimum hypothesis in a hypothesis space. To model the complex dependencies in real-world artificial intelligence tasks, machine learning algorithms are required to have high expressive power (high degrees of freedom or richness of a family of functions) and a large amount of training data. Deep learning models and kernel machines are regarded as models with high expressive power through the composition of multiple layers of nonlinearities and through nonlinearly mapping data to a high-dimensional space, respectively.;While the majority of deep learning work is focused on pure classification problems given input data, there are many other challenging Artificial Intelligence (AI) problems beyond classification tasks. In real-world applications, there are cases where we have structured relationships between and among input data and output targets, which have not been fully taken into account in deep learning models. On the other hand, though kernel machines involve convex optimization and have strong theoretical grounding in tractable optimization techniques, for large-scale applications, kernel machines often suffer from significant memory requirements and computational expense. Resolving the computational limitation and thereby enhancing the expressibility of kernel machines are important for large-scale real-world applications.;Learning models based on deep learning and kernel machines for audio and natural language processing tasks are developed in this dissertation. In particular, we address the challenges for deep learning with structured relationships among data and the computational limitations of large-scale kernel machines. A general framework is proposed to consider the relationship among output predictions and enforce constraints between a mixture input and output predictions for monaural source separation tasks. To model the structured relationships among inputs, the deep structured semantic models are introduced for an information retrieval task. Queries and documents are modeled as inputs to the deep learning models and the relevance is measured through the similarity at the output layer. A discriminative objective function is proposed to exploit the similarity and dissimilarity between queries and web documents. To address the scalability and efficiency of large-scale kernel machines, using deep architectures, ensemble models, and a scalable parallel solver are investigated to further scale-up kernel machines approximated by randomized feature maps. The proposed techniques are shown to match the expressive power of deep neural network based models in spoken language understanding and speech recognition tasks.

机译：许多机器学习算法可以看作是在假设空间中寻求最佳假设的优化问题。为了对现实世界中的人工智能任务中的复杂依赖性进行建模，要求机器学习算法具有较高的表达能力（较高的自由度或一系列功能的丰富性）和大量的训练数据。深度学习模型和内核机器被认为是具有高表达能力的模型，它们分别是由多层非线性构成以及通过将数据非线性映射到高维空间来实现的;尽管大多数深度学习工作都集中在纯分类问题上给定输入数据，除分类任务外，还有许多其他具有挑战性的人工智能（AI）问题。在实际应用中，有些情况下我们已经在输入数据和输出目标之间建立了结构化的关系，而在深度学习模型中并未充分考虑这些关系。另一方面，尽管内核计算机涉及凸优化，并且在易处理的优化技术中具有扎实的理论基础，但对于大规模应用程序，内核计算机通常会遭受大量内存需求和计算开销。解决计算上的限制，从而提高内核机器的可表达性对于大规模的现实应用具有重要意义。特别是，我们通过数据之间的结构化关系和大规模内核机器的计算限制来应对深度学习的挑战。提出了一个通用框架来考虑输出预测之间的关系，并为单声道源分离任务强制执行混合输入和输出预测之间的约束。为了对输入之间的结构化关系建模，引入了用于信息检索任务的深层结构化语义模型。将查询和文档建模为深度学习模型的输入，并通过输出层的相似性来衡量相关性。提出了一种判别目标函数来利用查询和Web文档之间的相似性和相异性。为了解决大规模内核计算机的可伸缩性和效率问题，使用深度体系结构，集成模型和可伸缩并行求解器进行了研究，以进一步扩展由随机特征图近似的内核计算机。所提出的技术显示出可与基于深度神经网络的模型在口语理解和语音识别任务中的表达能力相匹配。

著录项

作者
Huang, Po-Sen.;
展开▼
作者单位

University of Illinois at Urbana-Champaign.;

展开▼
授予单位 University of Illinois at Urbana-Champaign.;
学科 Computer science.;Artificial intelligence.;Electrical engineering.
学位 Ph.D.
年度 2015
页码 111 p.
总页数 111
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Deep Learning Based Part-of-Speech Tagging for Malayalam Twitter Data (Special Issue: Deep Learning Techniques for Natural Language Processing) [J] . S.Kumar, M. AnandKumar, K.P.Soman Journal of Intelligent Systems . 2019,第3期

机译：基于深入学习的Malayalam Twitter数据的语音标记（特殊问题：自然语言处理的深度学习技巧）
2. Chu-Ren Huang, Nicoletta Calzolari, Aldo Gangemi, Alessandro Lend, Alessandro Oltramari, and Laurent Prevot (eds.): Ontology and the Lexicon: a natural language processing perspective. (Studies in Natural Language Processing.) [J] . Andrea C. Schalley Language Resources and Evaluation . 2012,第1期

机译：Huang Chu-Ren，Nicoletta Calzolari，Aldo Gangemi，Alessandro Lend，Alessandro Oltramari和Laurent Prevot（ed。）：本体论和词汇：自然语言处理的观点。（自然语言处理研究）。
3. Feature Extraction and Analysis of Natural Language Processing for Deep Learning English Language [J] . Wang Dongyang, Su Junli, Yu Hongbin Quality Control, Transactions . 2020,第期

机译：深度学习英语语言自然语言处理的特征提取与分析
4. ASHAN ('Alphabets and Scripts by Hand and Audio, Naturally') - A Tool for Learning Scripts and Languages [C] . Joseph M.T., Kulkarni M., Lohokare A., 2011 Second International Conference on Intelligent Systems, Modelling and Simulation . 2011

机译：ASHAN（“自然而然地通过手工和音频编写的字母和脚本”）-一种学习脚本和语言的工具
5. Deep learning approaches to problems in speech recognition, computational chemistry, and natural language text processing. [D] . Dahl, George Edward. 2015

机译：深度学习解决语音识别，计算化学和自然语言文本处理中的问题。
6. Controlled Vocabularies Indexing and Medical Language Processing. Medical Language Processing: Database Capture of Natural Language Echocardiographic Reports: A Unified Medical Language System Approach [O] . K. Canfield, B. Bray, S. Huff, 1989

机译：受控词汇表索引编制和医学语言处理。医学语言处理：自然语言超声心动图报告的数据库捕获：统一医学语言系统方法
7. A corpus-based induction learning approach to natural language processing. [O] . 1996

机译：基于语料库的自然语言处理归纳学习方法。

Shallow and deep learning for audio and natural language processing.

摘要

著录项

相似文献

相关主题

期刊订阅