Why Can Computers Understand Natural Language?·The Structuralist Image of Language Behind Word Embeddings

Juan Luis Gastaldi

首页> 外文期刊>Philosophy & technology >Why Can Computers Understand Natural Language?·The Structuralist Image of Language Behind Word Embeddings

【24h】

Why Can Computers Understand Natural Language?·The Structuralist Image of Language Behind Word Embeddings

机译：为什么计算机可以了解自然语言？·语言嵌入词背后的语言的形象

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The present paper intends to draw the conception of language implied in the technique of word embeddings that supported the recent development of deep neural network models in computational linguistics. After a preliminary presentation of the basic functioning of elementary artificial neural networks, we introduce the motivations and capabilities of word embeddings through one of its pioneering models, word2vec. To assess the remarkable results of the latter, we inspect the nature of its underlying mechanisms, which have been characterized as the implicit factorization of a word-context matrix. We then discuss the ordinary association of the “distributional hypothesis” with a “use theory of meaning,” often justifying the theoretical basis of word embeddings, and contrast them to the theory of meaning stemming from those mechanisms through the lens of matrix models (such as vector space models and distributional semantic models). Finally, we trace back the principles of their possible consistency through Harris’s original distributionalism up to the structuralist conception of language of Saussure and Hjelmslev. Other than giving access to the technical literature and state of the art in the field of natural language processing to non-specialist readers, the paper seeks to reveal the conceptual and philosophical stakes involved in the recent application of new neural network techniques to the computational treatment of language.

机译：本文打算绘制在嵌入式技术中暗示的语言的概念，支持最近在计算语言学中获得深度神经网络模型的最新发展。在初步呈现基本人工神经网络的基本运作之后，我们通过其开创式模型，Word2VEC介绍了Word Embeddings的动机和能力。为了评估后者的显着结果，我们检查其潜在机制的性质，这些机制被称为单词上下文矩阵的隐式分解。然后，我们讨论了“分布假设”的普通协会，并与“使用意义理论”，经常证明Word Embeddings的理论基础，并将它们与矩阵模型镜头源于这些机制的意义的理论对比（这样作为矢量空间模型和分布语义模型）。最后，我们通过哈里斯的原始分配主义追溯了他们可能的一致性的原则，这取决于索斯尔和Hjelmslev语言的结构主义概念。除了在对非专业读者的自然语言处理领域提供技术文学和最先进的技术，旨在揭示历史最近应用新神经网络技术对计算治疗的概念和哲学赌注语言。

著录项

来源
《Philosophy & technology》 |2021年第1期|149-214|共66页
作者
Juan Luis Gastaldi;
展开▼
作者单位

ETH Zürich (HPM D-GESS Zürich Switzerland;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Word embeddings; Natural language processing; word2vec; Neural networks; Philosophy of language; Matrix models; Distributional hypothesis; Structuralism;

机译：Word Embeddings;自然语言处理;Word2VEC;神经网络;语言哲学;矩阵模型;分布假设;结构主义;

相似文献

外文文献
中文文献
专利

1. Bridging Semantic Gaps between Natural Languages and APIs with Word Embedding [J] . Li Xiaochen, Jiang He, Kamei Yasutaka, IEEE Transactions on Software Engineering . 2020,第10期

机译：桥接自然语言和API之间的语义间隙，用词嵌入
2. Phraseological Computer-Aided Translation of Natural Language Texts into Other Natural Languages [J] . G. G. Belonogov, A. A. Khoroshilov, A. A. Khoroshilov Automatic Documentation and Mathematical Linguistics . 2010,第5期

机译：短语计算机辅助将自然语言文本翻译成其他自然语言
3. Convolution-deconvolution word embedding: An end-to-end multi-prototype fusion embedding method for natural language processing [J] . Shuang Kai, Zhang Zhixuan, Loo Jonathan, Information Fusion . 2020,第期

机译：卷积 - 解卷积词嵌入：用于自然语言处理的端到端多型融合嵌入方法
4. Comparing Fifty Natural Languages and Twelve Genetic Languages Using Word Embedding Language Divergence (WELD) as a Quantitative Measure of Language Distance [C] . Ehsaneddin Asgari, Mohammad R.K. Mofrad Workshop on multilingual and cross-lingual methods in NLP 2016 . 2016

机译：使用词嵌入语言差异（WELD）作为语言距离的定量度量，比较五十种自然语言和十二种遗传语言
5. Understanding Word Embedding Stability Across Languages and Applications [D] . Wendlandt Burdick, Laura Anne. 2020

机译：了解跨语言和应用程序的Word嵌入稳定性
6. A Comparison of Word Embeddings for the Biomedical Natural Language Processing [O] . Yanshan Wang, Sijia Liu, Naveed Afzal, -1

机译：生物医学自然语言处理中的词嵌入比较
7. Comparing Fifty Natural Languages and Twelve Genetic Languages Using Word Embedding Language Divergence (WELD) as a Quantitative Measure of Language Distance [O] . Asgari, Ehsaneddin, Mofrad, Mohammad R. K. 2016

机译：用50种语言比较50种自然语言和12种基因语言词语嵌入语言发散（WELD）作为一种定量测量语言距离

Why Can Computers Understand Natural Language?·The Structuralist Image of Language Behind Word Embeddings

摘要

著录项

相似文献

相关主题

期刊订阅