Morphosyntactic Analysis of the CHILDES and TalkBank Corpora

机译：童话职业分析童话博士集团

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes the construction and usage of the MOR and GRASP programs for part of speech tagging and syntactic dependency analysis of the corpora in the CHILDES and TalkBank databases. We have written MOR grammars for 11 languages and GRASP analyses for three. For English data, the MOR tagger reaches 98% accuracy on adult corpora and 97% accuracy on child language corpora. The paper discusses the construction of MOR lexicons with an emphasis on compounds and special conversational forms. The shape of rules for controlling allomorphy and morpheme concatenation are discussed. The analysis of bilingual corpora is illustrated in the context of the Cantonese-English bilingual corpora. Methods for preparing data for MOR analysis and for developing MOR grammars are discussed. We believe that recent computational work using this system is leading to significant advances in child language acquisition theory and theories of grammar identification more generally.

机译：本文介绍了MOR和掌握程序的构建和使用，是童话标记的一部分语音标记和童话数据库中的语法依赖性分析。我们为11种语言编写了Mor语法，并掌握了三个分析。对于英语数据，Moragger对成人语料库的准确性为98％，对儿童语言集团的准确性有97％。本文讨论了Mor Lexicons的建设，重点是化合物和特殊的对话形式。讨论了控制血管和语素级联的规则的形状。在粤语 - 英语双语语料库的背景下说明了双语语料库的分析。讨论了对MOR分析和发展MOR语法的制备数据的方法。我们认为，最近使用该系统的计算工作导致儿童语言采集理论和语法识别理论的重要进展。

著录项

来源
《LREC-2012》|2012年||共6页
会议地点
作者
Brian MacWhinney;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 41.11083;
关键词
morphosyntax; part of speech tagging; spoken language corpora; child language; morphology; dependency grammar;

机译：morphosyntax;言语标签的一部分;语言语言;儿童语言;形态;依赖语法;
入库时间 2022-08-20 22:06:07

相似文献

外文文献
中文文献
专利

1. Large Corpora and Historical Syntax: Consequences for the Study of Morphosyntactic Diffusion in the History of Spanish [J] . álvaro S. Octavio de Toledo y Huerta Frontiers in Psychology . 2019,第a期

机译：大公司和历史言论：西班牙历史上的形态学扩散研究的后果
2. Large Corpora and Historical Syntax: Consequences for the Study of Morphosyntactic Diffusion in the History of Spanish [J] . Octavio de Toledo y Huerta lvaro S. Frontiers in Psychology . 2019,第2期

机译：大公司和历史语法：西班牙历史上的形态学扩散研究的后果
3. Korpusomat – a Tool for Creating Searchable Morphosyntactically Tagged Corpora [J] . Kiera? Witold, ?ukasz Kobylinski, Maciej Ogrodniczuk Computational Methods in Science and Technologygy . 2018,第1期

机译：Korpusomat –用于创建可搜索形态标记的语料库的工具
4. Morphosyntactic Analysis of the CHILDES and TalkBank Corpora [C] . Brian MacWhinney International conference on language resources and evaluation . 2012

机译：CHILDES和TalkBank语料库的形态句法分析
5. Morphosyntactic Neural Analysis for Generalized Lexical Normalization. [D] . Leeman-Munk, Samuel Paul. 2016

机译：广义词法归一化的形态语法神经分析。
6. Morphosyntactic annotation of CHILDES transcripts [O] . KENJI SAGAE, ERIC DAVIS, ALON LAVIE, -1

机译：CHILDEs成绩单的形态句法注释
7. Enriching CHILDES for Morphosyntactic Analysis [O] . MacWhinney, Brian 2009

机译：浓缩智利进行形态学分析

Morphosyntactic Analysis of the CHILDES and TalkBank Corpora

摘要

著录项

相似文献

相关主题

期刊订阅