Learning Phrase Representations Based on Word and Character Embeddings

机译：基于单词和字符嵌入的短语学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Most phrase embedding methods consider a phrase as a basic term and learn embeddings according to phrases' external contexts, ignoring the internal structures of words and characters. There are some languages such as Chinese, a phrase is usually composed of several words or characters and contains rich internal information. The semantic meaning of a phrase is also related to the meanings of its composing words or characters. Therefore, we take Chinese for example, and propose a joint words and characters embedding model for learning phrase representation. In order to disambiguate the word and character and address the issue of non-compositional phrases, we present multiple-prototype word and character embeddings and an effective phrase selection method. We evaluate the effectiveness of the proposed model on phrase similarities computation and analogical reasoning. The empirical result shows that our model outperforms other baseline methods which ignore internal word and character information.

机译：大多数短语嵌入方法都将短语作为基本术语，并根据短语的外部上下文学习嵌入，而忽略了单词和字符的内部结构。有一些语言，例如中文，一个短语通常由几个单词或字符组成，并包含丰富的内部信息。短语的语义含义还与组成单词或字符的含义有关。因此，我们以汉语为例，提出了一种联合的词和字符嵌入模型来学习短语表示。为了消除单词和字符的歧义并解决非组合短语的问题，我们提出了多原型单词和字符的嵌入以及一种有效的短语选择方法。我们评估该模型在短语相似度计算和类比推理中的有效性。实证结果表明，我们的模型优于其他忽略内部单词和字符信息的基线方法。

著录项

来源
《International conference on neural information processing》|2016年|547-554|共8页
会议地点
作者
Jiangping Huang; Donghong Ji; Shuxin Yao; Wenzhi Huang; Bo Chen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Embedding; Phrase representation; Semantic composition; Analogical reasoning;

机译：嵌入;短语表示;语义组成;类比推理;

相似文献

外文文献
中文文献
专利

1. Linkit: a CALL system for learning Chinese characters, words, and phrases [J] . Chris Shei, Hsun-Ping Hsieh Computer assisted language learning . 2012,第4期

机译：Linkit：用于学习汉字，单词和短语的CALL系统
2. Story embedding: Learning distributed representations of stories based on character networks [J] . O-Joun Lee, Jason J. Jung Artificial intelligence . 2020,第Apra期

机译：故事嵌入：基于角色网络学习故事的分布式表示
3. Story embedding: Learning distributed representations of stories based on character networks [J] . Journal of Virological Methods . 2020,第期

机译：故事嵌入：基于角色网络学习分布式的故事表示
4. Learning Phrase Representations Based on Word and Character Embeddings [C] . Jiangping Huang, Donghong Ji, Shuxin Yao, International conference on neural information processing . 2016

机译：基于Word和字符嵌入的学习短语表示
5. Hierarchical character recognition and its use in handwritten word/phrase recognition [D] . Park, Jaehwa 2000

机译：分层字符识别及其在手写单词/短语识别中的应用
6. Representing Multiword Chemical Terms through Phrase-Level Preprocessingand Word Embedding [O] . Liyuan Huang, Chen Ling 2019

机译：通过短语级预处理表示多词化学术语和词嵌入
7. Beyond Word Embeddings: Learning Entity and Concept Representations from Large Scale Knowledge Bases [O] . Shalaby, Walid, Zadrozny, Wlodek, Jin, Hongxia 2017

机译：超越Word嵌入：学习实体和概念表示大规模知识库

Learning Phrase Representations Based on Word and Character Embeddings

摘要

著录项

相似文献

相关主题

期刊订阅