首页> 外文会议>International Conference on Agents and Artificial Intelligence >Study of the Parallel Techniques for Dimensionality Reduction and Its Impact on Performance of the Text Processing Algorithms
【24h】

Study of the Parallel Techniques for Dimensionality Reduction and Its Impact on Performance of the Text Processing Algorithms

机译:对维度减少的并行技术及其对文本处理算法性能的影响研究

获取原文

摘要

The presented algorithms employ the Vector Space Model (VSM) and its enhancements such as TFIDF (Term Frequency Inverse Document Frequency). Vector space model suffers from curse of dimensionality. Therefore various dimensionality reduction algorithms are utilized. This paper deals with two of the most common ones i.e. Latent Semantic Indexing (LSI) and Random Projection (RP). It turns out that the size of a document corpus has a substantial impact on the processing time. Thus the authors introduce GPU based on acceleration of these techniques. A dedicated test set-up was created and a series of experiments were conducted which revealed important properties of the algorithms and their accuracy. They show that the random projection outperforms LSI in terms of computing speed at the expanse of results quality.
机译:呈现的算法采用矢量空间模型(VSM)及其增强功能,例如TFIDF(术语频率逆文档频率)。 矢量空间模型遭受维度诅咒。 因此,利用了各种维度减少算法。 本文涉及两个最常见的i.e.潜在语义索引(LSI)和随机投影(RP)。 事实证明,文档语料库的大小对处理时间具有大量影响。 因此,作者基于这些技术的加速来引入GPU。 创建了专用的测试设置,进行了一系列实验,揭示了算法的重要属性及其准确性。 他们表明,随机投影在曝光的结果质量的计算速度方面优于LSI。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号