Word-Formation Network for Czech

Abstract

In the present paper, we describe the development of the lexical network DeriNet, which captures core word-formation relations over a set of around 266 thousand Czech lexemes. The network is currently limited to derivational relations because derivation is the most frequent and most productive word-formation process in Czech. This limitation is reflected in the architecture of the network: each lexeme may be linked to just a single base word; composition as well as combined processes (composition with derivation) are thus not included. After a brief summary of theoretical descriptions of Czech derivation and of the state of the art in NLP approaches to Czech derivation, we discuss the linguistic background of the network and introduce its formal structure and the semi-automatic annotation procedure. The network was initialized with a set of lexemes whose existence was supported by corpus evidence. Derivational links were created using three sources of information: links delivered by a tool for morphological analysis, links based on an automatically discovered set of derivation rules, and links based on a grammar-based set of rules. Finally, we propose some research topics that could profit from the existence of such a lexical network.
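
The single-base-word constraint described in the abstract means the network can be pictured as a forest: every lexeme has at most one parent. The following is a minimal Python sketch of that idea only. The class `DerivationNetwork`, its method names, and the example lemmas are illustrative assumptions and are not taken from the DeriNet data or tooling.

```python
from __future__ import annotations


class DerivationNetwork:
    """Hypothetical forest of lexemes: each lexeme has at most one base word."""

    def __init__(self) -> None:
        # Maps each lemma to its base lemma, or None if it has no base.
        self.base_of: dict[str, str | None] = {}

    def add_lexeme(self, lemma: str) -> None:
        # Register a lexeme without overwriting an existing link.
        self.base_of.setdefault(lemma, None)

    def link(self, derived: str, base: str) -> None:
        # Add a derivational link, enforcing the single-base constraint.
        self.add_lexeme(derived)
        self.add_lexeme(base)
        if self.base_of[derived] is not None:
            raise ValueError(f"{derived!r} already has a base word")
        if derived in self.chain(base):
            raise ValueError("link would create a cycle")
        self.base_of[derived] = base

    def chain(self, lemma: str) -> list[str]:
        # Follow base links from the lemma up to the root of its tree.
        path = [lemma]
        while (base := self.base_of[path[-1]]) is not None:
            path.append(base)
        return path

    def derivatives(self, lemma: str) -> list[str]:
        # Lexemes derived directly from the given lemma.
        return sorted(d for d, b in self.base_of.items() if b == lemma)


# Illustrative lemmas (assumed for this sketch, not read from the resource).
net = DerivationNetwork()
net.link("učitel", "učit")
net.link("učitelka", "učitel")
net.link("učitelský", "učitel")
print(net.chain("učitelka"))
print(net.derivatives("učitel"))
```

Because a lexeme stores only one base, a compound with two motivating words cannot be represented in this sketch, which mirrors the exclusion of composition stated in the abstract.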

Bibliographic details

Source
9th International Conference on Language Resources and Evaluation | 2014 | pp. 2561-2567 | 7 pages
Authors
Magda Sevcikova; Zdenek Zabokrtsky
Original format
PDF
Keywords
word-formation; derivation; derivational morphology; lexical network
