首页> 外文期刊>Procedia Computer Science >Comparative study of word embedding methods in topic segmentation
【24h】

Comparative study of word embedding methods in topic segmentation

机译:主题分割词嵌入方法的比较研究

获取原文
       

摘要

The vector representations of words are very useful in different natural language processing tasks in order to capture the semantic meaning of words. In this context, the three known methods are: LSA, Word2Vec and GloVe. In this paper, these methods will be investigated in the field of topic segmentation for both languages Arabic and English. Moreover, Word2Vec is studied in depth by using different models and approximation algorithms. As results, we found out that LSA, Word2Vec and GloVe depend on the used language. However, Word2Vec presents the best word vector representation yet it depends on the choice of model.
机译:单词的矢量表示在不同的自然语言处理任务中非常有用,以捕获单词的语义含义。在这种情况下,三种已知方法是:LSA,Word2VEC和手套。在本文中,将在Arabic和英语的语言主题分段领域进行这些方法。此外,通过使用不同的模型和近似算法深入研究Word2Vec。结果,我们发现LSA,Word2Vec和手套依赖于习惯的语言。但是,Word2VEC呈现了最佳的单词矢量表示,但这取决于模型的选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号