首页> 外文期刊>Philosophy & technology >Why Can Computers Understand Natural Language?·The Structuralist Image of Language Behind Word Embeddings
【24h】

Why Can Computers Understand Natural Language?·The Structuralist Image of Language Behind Word Embeddings

机译:为什么计算机可以了解自然语言?·语言嵌入词背后的语言的形象

获取原文
           

摘要

The present paper intends to draw the conception of language implied in the technique of word embeddings that supported the recent development of deep neural network models in computational linguistics. After a preliminary presentation of the basic functioning of elementary artificial neural networks, we introduce the motivations and capabilities of word embeddings through one of its pioneering models, word2vec. To assess the remarkable results of the latter, we inspect the nature of its underlying mechanisms, which have been characterized as the implicit factorization of a word-context matrix. We then discuss the ordinary association of the “distributional hypothesis” with a “use theory of meaning,” often justifying the theoretical basis of word embeddings, and contrast them to the theory of meaning stemming from those mechanisms through the lens of matrix models (such as vector space models and distributional semantic models). Finally, we trace back the principles of their possible consistency through Harris’s original distributionalism up to the structuralist conception of language of Saussure and Hjelmslev. Other than giving access to the technical literature and state of the art in the field of natural language processing to non-specialist readers, the paper seeks to reveal the conceptual and philosophical stakes involved in the recent application of new neural network techniques to the computational treatment of language.
机译:本文打算绘制在嵌入式技术中暗示的语言的概念,支持最近在计算语言学中获得深度神经网络模型的最新发展。在初步呈现基本人工神经网络的基本运作之后,我们通过其开创式模型,Word2VEC介绍了Word Embeddings的动机和能力。为了评估后者的显着结果,我们检查其潜在机制的性质,这些机制被称为单词上下文矩阵的隐式分解。然后,我们讨论了“分布假设”的普通协会,并与“使用意义理论”,经常证明Word Embeddings的理论基础,并将它们与矩阵模型镜头源于这些机制的意义的理论对比(这样作为矢量空间模型和分布语义模型)。最后,我们通过哈里斯的原始分配主义追溯了他们可能的一致性的原则,这取决于索斯尔和Hjelmslev语言的结构主义概念。除了在对非专业读者的自然语言处理领域提供技术文学和最先进的技术,旨在揭示历史最近应用新神经网络技术对计算治疗的概念和哲学赌注语言。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号