首页> 外文会议>Complex Adaptive Systems >Finding Semantic Equivalence of Text Using Random Index Vectors
【24h】

Finding Semantic Equivalence of Text Using Random Index Vectors

机译:使用随机索引向量查找文本的语义等价

获取原文

摘要

The challenges of machine semantic understanding have not yet been satisfactorily solved by automated methods. In our approach, the semantics and syntax of words, phrases and documents are represented by deep semantic vectors that capture both the structure and semantic meaning of the language. Our experiment reproduces the experiment done by Patwardhan and Pedersen 2006, but uses random index vectors for the words, glosses and tweets. Our model first determines random index vectors from glosses and definitions for words from WordNet. From these foundational semantic vectors, random index vectors that represent phrases, sentences or tweets are determined. Our set of algorithms relies on high-dimensional distributed representations, and their effectiveness and versatility derive from the unintuitive properties of such representations: from the mathematical properties of high-dimensional spaces. High-dimensional vector representations have been used successfully in modeling human cognition, such as memory and learning. Our semantic vectors are high-dimensional and capture the meaning of a language expression, such as a word, phrase, query, news article, story or a message. A key benefit of ou r method is that the dimensionality of the vectors remains constant as we add data; this also allows good generalization to rarely seen words, which "borrow strength" from their more frequent neighbors.
机译:机器语义理解的挑战尚未通过自动化方法令人满意地解决。在我们的方法中,单词,短语和文档的语义和语法由深度语义向量表示,捕获语言的结构和语义含义。我们的实验再现由Patardhan和Pedersen 2006完成的实验,但使用随机索引矢量来单词,光泽和推文。我们的模型首先从Wordnet中确定来自光泽和定义的随机索引向量。从这些基本语义向量中,确定代表短语,句子或推文的随机索引矢量。我们的一组算法依赖于高维分布式表示,它们的有效性和多功能性来自这种表示的不合适属性:来自高维空间的数学特性。高维向量表示已成功用于建模人类认知,例如记忆和学习。我们的语义向量是高维的,捕获语言表达的含义,例如单词,短语,查询,新闻文章,故事或消息。 OU R方法的一个关键益处是,在添加数据时,矢量的维度仍然是恒定的;这也允许良好的概括很少看到单词,从他们更频繁的邻居那里“借用力量”。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号