AAAI Conference on Artificial Intelligence

A Generalized Language Model in Tensor Space


Abstract

In the literature, tensors have been used effectively to capture context information in language models. However, existing methods usually adopt relatively low-order tensors, which have limited expressive power for modeling language. Developing a higher-order tensor representation is challenging, both in deriving an effective solution and in showing its generality. In this paper, we propose a language model named Tensor Space Language Model (TSLM), utilizing tensor networks and tensor decomposition. In TSLM, we build a high-dimensional semantic space constructed by the tensor product of word vectors. Theoretically, we prove that such a tensor representation is a generalization of the n-gram language model. We further show that this high-order tensor representation can be decomposed into a recursive calculation of conditional probabilities for language modeling. Experimental results on the Penn Tree Bank (PTB) dataset and the WikiText benchmark demonstrate the effectiveness of TSLM.
