首页> 外文会议>Australasian joint conference on artificial intelligence >Stability of Word Embeddings Using Word2Vec
【24h】

Stability of Word Embeddings Using Word2Vec

机译:使用Word2Vec的单词嵌入的稳定性

获取原文

摘要

The word2vec model has been previously shown to be successful in creating numerical representations of words (word embeddings) that capture the semantic and syntactic meanings of words. This study examines the issue of model stability in terms of how consistent these representations are given a specific corpus and set of model parameters. Specifically, the study considers the impact of word embedding dimension size and frequency of words on stability. Stability is measured by comparing the neighborhood of words in the word vector space model. Our results demonstrate that the dimension size of word embeddings has a significant effect on the consistency of the model. In addition, the effect of the frequency of the target words on stability is identified. An approach to mitigate the effects of word frequency on stability is proposed.
机译:先前已证明word2vec模型可成功创建单词的数字表示形式(单词嵌入),以捕获单词的语义和句法含义。这项研究从给定的特定语料和一组模型参数如何一致地检验模型稳定性的问题。具体来说,该研究考虑了单词嵌入维数大小和单词频率对稳定性的影响。通过在单词向量空间模型中比较单词的邻域来衡量稳定性。我们的结果表明,单词嵌入的维数大小对模型的一致性有重要影响。另外,确定了目标词的频率对稳定性的影响。提出了一种减轻词频对稳定性的影响的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号