Lithuanian-Latvian-Lithuanian Parallel Corpus

机译：立陶宛 - 拉脱维亚立陶宛平行语料库

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The goal of the paper is to present different problems related to the building of Parallel Corpus for two small languages, namely, Latvian and Lithuanian. The Lithuanian-Latvian-Lithuania Parallel Corpus (LILA) will contain 8 million running words; will be bidirectional, aligned on the sentence level. The problems include identifying, acquiring, preparing, and aligning parallel texts.

机译：本文的目标是呈现与两种小语言的并行语料库相关的不同问题，即拉脱维亚和立陶宛语。立陶宛 - 拉脱维亚立陶宛并行语料库（LILA）将包含800万次跑步单词;将是双向的，对齐在句子级别。问题包括识别，获取，准备和对齐并行文本。

著录项

来源
《International Conference on Human Language Technologies》|2012年||共5页
会议地点
作者
Andrius UTKA; Kristine LEVANE-PETROVA; Agne BIELINSKIEN; Jolanta KOVALEVSKAITE; Erika RIMKUTE; Daira VEVERE;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP312-53;
关键词
Lithuanian; Latvian; Parallel corpus;

机译：立陶宛语;拉脱维亚;并行语料库;

相似文献

外文文献
中文文献
专利

1. Exploring the sawa corpus: collection and deployment of a parallel corpus English-Swahili [J] . Guy De Pauw, Peter Waiganjo Wagacha, Gilles-Maurice de Schryver Language Resources and Evaluation . 2011,第3期

机译：探索锯齿语料库：英语-斯瓦希里语平行语料库的收集和部署
2. Extreme parallels: a corpus-driven analysis of ISIS and far-right discourse [J] . Louisa Buckingham, Nusiebah Alali Kotuitui: New Zealand Journal of Social Sciences Online . 2020,第2期

机译：极端旁边：ISIS和右右话语的语料库驱动分析
3. ParaMed: a parallel corpus for English–Chinese translation in the biomedical domain [J] . Liu Boxiang, Huang Liang BMC Medical Informatics and Decision Making . 2021,第1期

机译：Paramed：生物医学域中的英汉翻译并行语料库
4. Lithuanian-Latvian-Lithuanian Parallel Corpus [C] . Andrius UTKA, KristTne LEVANE-PETROVA, Agne BIELINSKIENE, Human language technologies : The baltic perspective . 2012

机译：立陶宛-拉脱维亚-立陶宛平行语料库
5. Analyse comparative de l'equivalence terminologique en corpus parallele et en corpus comparable: Application au domaine du changement climatique. [D] . Le Serrec, Annaich. 2012

机译：平行语料库和可比语料库中术语等效性的比较分析：在气候变化领域中的应用。
6. ECCParaCorp: a cross-lingual parallel corpus towards cancer education dissemination and application [O] . Hetong Ma, Feihong Yang, Jiansong Ren, 2020

机译：ECCParaCorp：针对癌症教育传播和应用的跨语言平行语料库
7. Dutch Parallel Corpus: A Balanced Copyright-Cleared Parallel Corpus [O] . Lieve Macken, Orphée De, Clercq Hans Paulussen 2016

机译：荷兰平行语料库：平衡版权清除平行语料库

Lithuanian-Latvian-Lithuanian Parallel Corpus

摘要

著录项

相似文献

相关主题

期刊订阅