Experiments in Cross-Language Morphological Annotation Transfer

Abstract

Annotated corpora are valuable resources for NLP, but they are often costly to create. We introduce a method for transferring annotation from a morphologically annotated corpus of a source language to a target language. Our approach assumes only that an unannotated text corpus exists for the target language and that a simple textbook describing the basic morphological properties of that language is available. Our paper describes experiments with Polish, Czech, and Russian; however, the method is not tied in any way to these languages. In all the experiments we use the TnT tagger, a second-order Markov model. Our approach assumes that the information acquired about one language can be used for processing a related language. We have found that even breathtakingly naive approaches (such as approximating the Russian transitions by Czech and/or Polish ones, and approximating the Russian emissions by manually or automatically derived Czech cognates) can lead to a significant improvement of the tagger's performance.
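The idea of borrowing transitions from a related language can be illustrated with a small sketch. This is not the authors' implementation and it does not use TnT; it is a toy second-order (trigram) Markov tagger written under several assumptions. The function names `train_transitions` and `viterbi`, the tag set, and the transliterated example words are all hypothetical. Emissions are simplified to a lexicon constraint (each known word may only receive the tags listed for it, with uniform weight), which stands in for the cognate-based emission estimates mentioned in the abstract.

```python
import math
from collections import defaultdict


def train_transitions(tag_sequences, smoothing=0.1):
    """Estimate add-k smoothed trigram transitions P(t3 | t1, t2)
    from tag sequences of a source-language corpus."""
    tri = defaultdict(float)
    bi = defaultdict(float)
    tags = set()
    for sent in tag_sequences:
        seq = ["<s>", "<s>"] + list(sent) + ["</s>"]
        tags.update(sent)
        for a, b, c in zip(seq, seq[1:], seq[2:]):
            tri[(a, b, c)] += 1
            bi[(a, b)] += 1
    n_outcomes = len(tags) + 1  # every tag plus the end marker

    def logp(a, b, c):
        num = tri.get((a, b, c), 0.0) + smoothing
        den = bi.get((a, b), 0.0) + smoothing * n_outcomes
        return math.log(num / den)

    return logp, sorted(tags)


def viterbi(words, logp, tags, lexicon):
    """Tag target-language words using source-language transitions.
    Assumption: `lexicon` maps a target word to its admissible tags;
    unknown words may take any tag. Emissions are treated as uniform."""
    # A state is the pair (previous tag, current tag).
    best = {("<s>", "<s>"): (0.0, [])}
    for word in words:
        candidates = lexicon.get(word, tags)
        new_best = {}
        for (a, b), (score, path) in best.items():
            for c in candidates:
                s = score + logp(a, b, c)
                if (b, c) not in new_best or s > new_best[(b, c)][0]:
                    new_best[(b, c)] = (s, path + [c])
        best = new_best
    # Add the transition into the end marker before picking the winner.
    (_, (_, path)) = max(
        best.items(),
        key=lambda kv: kv[1][0] + logp(kv[0][0], kv[0][1], "</s>"),
    )
    return path


# Hypothetical source-language tag sequences (standing in for Czech).
source_tags = [
    ["DET", "NOUN", "VERB"],
    ["DET", "ADJ", "NOUN", "VERB"],
    ["NOUN", "VERB"],
]
# Hypothetical target-language lexicon (standing in for Russian).
target_lexicon = {"sobaka": ["NOUN"], "spit": ["VERB", "NOUN"]}

logp, tags = train_transitions(source_tags)
print(viterbi(["sobaka", "spit"], logp, tags, target_lexicon))
# ['NOUN', 'VERB']
```

In this toy setting the ambiguous second word is resolved purely by the borrowed transitions, since the sequence NOUN VERB followed by a sentence end is far more frequent in the source data than NOUN NOUN.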

Bibliographic details

Source
International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2006); February 19-25, 2006; Mexico City (MX) | 2006 | pp. 41-50 | 10 pages
Conference location: Mexico City (MX)
Authors
Anna Feldman; Jirka Hana; Chris Brew
Author affiliation
Ohio State University, Department of Linguistics, Columbus, OH 43210-1298, USA
Original format: PDF
Language: English
Chinese Library Classification: Programming languages, algorithmic languages
