Conference: International Conference on Intelligent Text Processing and Computational Linguistics

New Word Analogy Corpus for Exploring Embeddings of Czech Words

Abstract

Word embedding methods have proven very useful in many NLP (natural language processing) tasks. Word embeddings of English words and phrases have been studied extensively, but other languages have received little attention. Our goal in this paper is to explore the behavior of state-of-the-art word embedding methods on Czech, a language characterized by very rich morphology. We introduce a new corpus for the word analogy task that inspects syntactic, morphosyntactic, and semantic properties of Czech words and phrases. We experiment with the Word2Vec and GloVe algorithms and discuss the results on this corpus. The corpus is available to the research community.
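
The abstract does not spell out how an analogy question is scored, so the following is only a minimal sketch of the commonly used vector-offset approach: for a question "a is to b as c is to ?", pick the vocabulary word whose vector is closest (by cosine similarity) to b - a + c, excluding the three question words. The three-dimensional vectors, the tiny Czech vocabulary, and the function names `solve_analogy` and `analogy_accuracy` are all hypothetical and hand-made for illustration; they are not taken from the corpus or from trained Word2Vec or GloVe models.

```python
import math

# Hand-made toy vectors (assumption, NOT real embeddings).
# Dimensions are loosely [male, female, royal].
TOY_VECTORS = {
    "muž": [1.0, 0.0, 0.0],       # man
    "žena": [0.0, 1.0, 0.0],      # woman
    "král": [1.0, 0.0, 1.0],      # king
    "královna": [0.0, 1.0, 1.0],  # queen
    "hrad": [0.0, 0.0, 1.0],      # castle
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def solve_analogy(vectors, a, b, c):
    """Answer 'a is to b as c is to ?' with the vector-offset method."""
    # Target point: vec(b) - vec(a) + vec(c).
    target = [vb - va + vc
              for va, vb, vc in zip(vectors[a], vectors[b], vectors[c])]
    # The three question words are excluded from the candidate answers.
    candidates = (w for w in vectors if w not in (a, b, c))
    return max(candidates, key=lambda w: cosine(vectors[w], target))


def analogy_accuracy(vectors, questions):
    """Fraction of (a, b, c, expected) questions answered correctly."""
    correct = sum(solve_analogy(vectors, a, b, c) == expected
                  for a, b, c, expected in questions)
    return correct / len(questions)
```

With real embeddings the same procedure would run over every question in an analogy category, and accuracy would typically be reported per category; for a morphologically rich language such as Czech, the candidate set includes many inflected forms of each lemma.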
