Bilingual Word Embeddings for Bilingual Terminology Extraction from Specialized Comparable Corpora

机译：从专门的可比语料库中提取双语术语的双语词嵌入

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Bilingual lexicon extraction from comparable corpora is constrained by the small amount of available data when dealing with specialized domains. This aspect penalizes the performance of distributional-based approaches, which is closely related to the reliability of word's cooccurrence counts extracted from comparable corpora. A solution to avoid this limitation is to associate external resources with the comparable corpus. Since bilingual word embeddings have recently shown efficient models for learning bilingual distributed representation of words, we explore different word embedding models and show how a general-domain comparable corpus can enrich a specialized comparable corpus via neural networks.

机译：当处理特殊领域时，可比较语料库的双语词典提取受到少量可用数据的限制。这方面不利于基于分布的方法的性能，这与从可比语料库中提取的单词共现计数的可靠性密切相关。避免此限制的解决方案是将外部资源与可比较的语料库关联。由于双语单词嵌入最近显示了用于学习单词的双语分布式表示的有效模型，因此我们探索了不同的单词嵌入模型，并展示了通用域可比语料库如何通过神经网络丰富特定的可比语料库。

著录项

来源
《International joint conference on natural language processing》|2017年|685-693|共9页
会议地点
作者
Amir Hazem; Emmanuel Morin;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Exploiting unbalanced specialized comparable corpora for bilingual lexicon extraction [J] . EMMANUEL MORIN, AMIR HAZEM Natural language engineering . 2016,第pta4期

机译：利用不平衡的专业可比语料库提取双语词典
2. Low-frequency words in bilingual corpora a step towards automatic extraction of bilingual word pairs [J] . Keita Tsuji, Fuyuki Yoshikane, Kyo Kageura 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2000,第200期

机译：双语语料库中的低频单词迈向自动提取双语单词对的一步
3. Low-frequency words in bilingual corpora a step towards automatic extraction of bilingual word pairs [J] . Keita Tsuji, Fuyuki Yoshikane, Kyo Kageura 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2000,第200期

机译：双语语料库中的低频单词迈向自动提取双语单词对的一步
4. Bilingual Word Embeddings for Bilingual Terminology Extraction from Specialized Comparable Corpora [C] . Amir Hazem, Emmanuel Morin International joint conference on natural language processing . 2017

机译：双语单词嵌入专业的可比语料库的双语术语提取
5. Parallel Sentence Detection in Comparable Corpora with Bilingual Word Embeddings for Low-Resource Languages [D] . Cadigan, John. 2018

机译：与低资源语言的双语单词嵌入式的同类语料中的并行句子检测
6. Bilingual term alignment from comparable corpora in English discharge summary and Chinese discharge summary [O] . Yan Xu, Luoxin Chen, Junsheng Wei, 2015

机译：可比语料库中英语出院摘要和中文出院摘要的双语术语对齐
7. Flow Network Models for Word Alignment and Terminology Extraction from Bilingual Corpora [O] . 2008

机译：双语语料库中词对齐和术语提取的流网络模型

Bilingual Word Embeddings for Bilingual Terminology Extraction from Specialized Comparable Corpora

摘要

著录项

相似文献

相关主题

期刊订阅