首页> 外文会议>International conference on quantum interaction >Orthogonality and Orthography: Introducing Measured Distance into Semantic Space
【24h】

Orthogonality and Orthography: Introducing Measured Distance into Semantic Space

机译:正交与正字法:将测得的距离引入语义空间

获取原文

摘要

This paper explores a new technique for encoding structured information into a semantic model, for the construction of vector representations of words and sentences. As an illustrative application, we use this technique to compose robust representations of words based on sequences of letters, that are tolerant to changes such as transposition, insertion and deletion of characters. Since these vectors are generated from the written form or orthography of a word, we call them 'orthographic vectors'. The representation of discrete letters in a continuous vector space is an interesting example of a Generalized Quantum model, and the process of generating semantic vectors for letters in a word is mathematically similar to the derivation of orbital angular momentum in quantum mechanics. The importance (and sometimes, the violation) of orthogonality is discussed in both mathematical settings. This work is grounded in psychological literature on word representation and recognition, and is also motivated by potential technological applications such as genre-appropriate spelling correction. The mathematical method, examples and experiments, and the implementation and availability of the technique in the Semantic Vectors package are also discussed.
机译:本文探索了一种将结构化信息编码为语义模型的新技术,用于构建单词和句子的矢量表示。作为说明性应用程序,我们使用此技术基于字母序列来组成单词的鲁棒表示,这些字母序列可以忍受诸如字符的转置,插入和删除之类的变化。由于这些向量是根据单词的书面形式或正字法生成的,因此我们将其称为“正字向量”。连续向量空间中离散字母的表示是广义量子模型的一个有趣示例,并且单词中字母的语义矢量生成过程在数学上类似于量子力学中轨道角动量的推导。在两种数学设置中都讨论了正交性的重要性(有时甚至是违反性)。这项工作的基础是心理学文献中有关单词表示和识别的知识,并且还受到潜在技术应用(例如体裁适当的拼写校正)的推动。还讨论了语义矢量包中的数学方法,示例和实验,以及该技术的实现和可用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号