Vector space models (VSMs) are mathematically well-defined frameworks that have been widely used in distributional approaches to semantics. In VSMs, high-dimensional vectors represent linguistic entities. In an application, the similarity of vectors, and thus of the entities they represent, is computed by a distance formula. The high dimensionality of the vectors, however, is a barrier to the performance of methods that employ VSMs. Consequently, a dimensionality reduction technique is employed to alleviate this problem. This paper introduces a novel technique called Random Manhattan Indexing (RMI) for the construction of ℓ_1-normed VSMs at reduced dimensionality. RMI combines the construction of a VSM and dimensionality reduction into an incremental, and thus scalable, two-step procedure. To this end, RMI employs sparse Cauchy random projections. We further introduce Random Manhattan Integer Indexing (RMII), a computationally enhanced version of RMI. As shown in the reported experiments, RMI and RMII can be used reliably to estimate the ℓ_1 distances between vectors in a vector space of low dimensionality.
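The core idea behind Cauchy random projections can be sketched as follows. Because the Cauchy distribution is 1-stable, each coordinate of the projected difference R(x − y) is Cauchy-distributed with scale ‖x − y‖_1, so the ℓ_1 distance can be recovered via a median estimator. The sketch below is a simplified, dense, non-incremental illustration of this principle, not the paper's RMI algorithm (which uses sparse projections and incremental construction); all names and parameters are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

def cauchy_projection(dim, k, rng):
    """A k x dim projection matrix with i.i.d. standard Cauchy entries.

    Cauchy is 1-stable: each coordinate of R @ (x - y) follows a Cauchy
    distribution with scale ||x - y||_1, which is what makes l1 distance
    recoverable from the low-dimensional projections.
    """
    return rng.standard_cauchy((k, dim))

def estimate_l1(px, py):
    """Median-based estimate of ||x - y||_1 from projected vectors.

    The median of |Cauchy(0, s)| equals s, so the sample median of the
    absolute projected differences estimates the original l1 distance.
    """
    return np.median(np.abs(px - py))

dim, k = 10_000, 2_000            # original vs. reduced dimensionality
x = rng.random(dim)
y = rng.random(dim)

R = cauchy_projection(dim, k, rng)
true_d1 = np.abs(x - y).sum()     # exact l1 distance in the original space
est_d1 = estimate_l1(R @ x, R @ y)
print(true_d1, est_d1)
```

The median (rather than the mean) is essential here: the Cauchy distribution has no finite mean, so averaging the projected differences would not converge, while the sample median is a consistent estimator of the scale parameter.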