首页> 外文期刊>ACM transactions on Asian and low-resource language information processing >Enhanced Double-Carrier Word Embedding via Phonetics and Writing
【24h】

Enhanced Double-Carrier Word Embedding via Phonetics and Writing

机译:增强的双载波单词嵌入通过语音和写作

获取原文
获取原文并翻译 | 示例
       

摘要

Word embeddings, which map words into a unified vector space, capture rich semantic information. From a linguistic point of view, words have two carriers, speech and writing. Yet the most recent word embedding models focus on only the writing carrier and ignore the role of the speech carrier in semantic expressions. However, in the development of language, speech appears before writing and plays an important role in the development of writing. For phonetic language systems, the written forms are secondary symbols of spoken ones. Based on this idea, we carried out our work and proposed double-carrier word embedding (DCWE). We used DCWE to conduct a simulation of the generation order of speech and writing. We trained written embedding based on phonetic embedding. The final word embedding fuses writing and phonetic embedding. To illustrate that our model can be applied to most languages, we selected Chinese, English, and Spanish as examples and evaluated these models through word similarity and text classification experiments.
机译:Word Embeddings,将哪个映射到统一的矢量空间,捕获丰富的语义信息。从语言角度来看,单词有两个运营商,言语和写作。然而,最近的嵌入模型只关注写作载波并忽略语音载波在语义表达式中的角色。然而,在语言的发展中,在写作之前出现演讲并在写作的发展中发挥重要作用。对于语音语言系统,书面形式是所说的次要符号。基于这个想法,我们进行了我们的工作并提出了双载波嵌入(DCWE)。我们使用DCWe进行言语和写作的发电顺序的模拟。我们根据语音嵌入培训书面嵌入。最后一词嵌入了保险丝写作和语音嵌入。为了说明我们的模型可以应用于大多数语言,我们选择中文,英文和西班牙语作为示例,并通过字相似性和文本分类实验评估这些模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号