
Obtaining Better Word Representations via Language Transfer

Abstract

Vector space word representations have recently achieved considerable success in improving performance across various NLP tasks. However, existing word embedding learning methods rely only on monolingual corpora. Inspired by transfer learning, we propose a novel method for obtaining word embeddings via language transfer. Under this method, in order to obtain word embeddings for one language (the target language), we instead train models on a corpus of a different language (the source language), and then use the resulting source-language word embeddings to represent target-language words. We evaluate the word embeddings obtained by the proposed method on word similarity tasks across several benchmark datasets, and the results show that our method is surprisingly effective, outperforming competitive baselines by a large margin. A further benefit of our method is that the process of collecting a new corpus can be skipped.
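
The abstract does not specify how source-language embeddings are assigned to target-language words, so the sketch below illustrates only one plausible reading: embeddings are trained on a source-language corpus and target-language words are represented through a bilingual dictionary lookup, with word similarity scored by cosine similarity and Spearman correlation. The corpus, dictionary entries, and similarity pairs are toy placeholders rather than material from the paper, and gensim 4.x plus scipy are assumed.

# Hedged sketch of the language-transfer idea, under the assumptions above.
import numpy as np
from gensim.models import Word2Vec
from scipy.stats import spearmanr

# Toy source-language (e.g. English) corpus: one tokenized sentence per list.
source_corpus = [
    ["the", "cat", "sat", "on", "the", "mat"],
    ["the", "dog", "sat", "on", "the", "rug"],
    ["a", "cat", "and", "a", "dog", "are", "animals"],
]

# Train word embeddings on the source language only.
model = Word2Vec(source_corpus, vector_size=50, window=3, min_count=1, epochs=50, seed=0)

# Hypothetical bilingual dictionary (assumption, not from the paper):
# target-language word -> source-language word.
target_to_source = {"chat": "cat", "chien": "dog", "tapis": "mat"}

def target_vector(word):
    # Represent a target-language word by its source-language embedding.
    return model.wv[target_to_source[word]]

def cosine(u, v):
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

# Toy word-similarity evaluation: (target-language word pair, human similarity score).
benchmark = [
    (("chat", "chien"), 7.5),
    (("chat", "tapis"), 1.5),
    (("chien", "tapis"), 2.0),
]
predicted = [cosine(target_vector(a), target_vector(b)) for (a, b), _ in benchmark]
gold = [score for _, score in benchmark]
rho, _ = spearmanr(predicted, gold)
print("Spearman correlation on the toy benchmark:", rho)

In practice such an evaluation would use standard word-similarity benchmarks (e.g. WordSim-353), reporting the Spearman correlation between model similarities and human ratings, in line with the evaluation the abstract describes.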
