Building Corpus-based Frequency Lemma Lists

David Lindemann; I?aki San Vicente

首页> 外文期刊>Procedia - Social and Behavioral Sciences >Building Corpus-based Frequency Lemma Lists

【24h】

Building Corpus-based Frequency Lemma Lists

机译：建立基于语料库的频率引语列表

获取原文

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper presents a simple methodology to create corpus-based frequency lemma lists, applied to the case of the Basque language. Since the first work on the matter in 1982, the amount of text written in Basque and language resources related to this language has grown exponentially. Based on state-of-the-art Basque corpora and current NLP technology, we develop a frequency lemma list for standard Basque. Our aim is twofold: On the one hand, to propose a primary Basque lemma list for a bilingual dictionary that is currently being worked on at UPV/EHU, and on the other, to contrast existing Basque dictionary lemma lists with frequency data, in order to evaluate the adequacy of our proposal and to compare lemma lists with each other.

机译：本文提出了一种简单的方法来创建基于语料库的频率引理列表，并将其应用于巴斯克语的情况。自1982年就此问题进行首次工作以来，以巴斯克语撰写的文字和与该语言有关的语言资源的数量呈指数增长。基于最新的巴斯克语料库和当前的NLP技术，我们为标准巴斯克语制定了频率引理列表。我们的目标是双重的：一方面，为目前在UPV / EHU上工作的双语词典提出主要的巴斯克引理列表，另一方面，将现有的巴斯克词典引理列表与频率数据进行对比，以便评估我们建议的适当性，并比较引理列表。

著录项

来源
《Procedia - Social and Behavioral Sciences》 |2015年第2期|共12页
作者
David Lindemann; I?aki San Vicente;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类社会科学现状及发展;
关键词

相似文献

外文文献
中文文献
专利

1. Zipfian interpretation of textbook vocabulary lists: comments on Xiao et al.'s Corpus-based research on English word recognition rates in primary school and word selection strategy [J] . Hu Qiong, Yue Ming Journal of Turbulence . 2017,第7期

机译：ZIPFIAN对教科书词汇表的解释：关于Xiao等人的评论。基于语料库的基于语料库的英语词汇识别率研究和单词选择策略
2. Zipfian interpretation of textbook vocabulary lists: comments on Xiao et al.'s Corpus-based research on English word recognition rates in primary school and word selection strategy [J] . Qiong HU, Ming YUE 浙江大学学报（英文版）（C辑：计算机与电子） . 2017,第007期

机译：Zipfian对教科书词汇表的解释：评评Xiao等基于语料库的小学英语单词识别率和选词策略
3. Corpus-based vocabulary lists for language learners for nine languages [J] . Adam Kilgarriff, Frieda Charalabopoulou, Maria Gavrilidou, Language Resources and Evaluation . 2014,第1期

机译：针对九种语言的语言学习者的基于语料库的词汇表
4. Using Corpus-Based Analysis to Explore the Academic Word Lists (AWL) Emerged in English Manuscript of SBMPTN Tests [C] . Windy Ardianita Sari Linguistics Conference of Universitas Airlangga . 2018

机译：使用基于语料库的分析来探索SBMPTN测试中英文手稿的学术单词列表（AWL）
5. Building High-frequency Word Lists for the Semantic Domain of ?āINA (‘land’) Using a Raw Corpus of Spoken ?ōlelo Hawai?i [D] . Brockway, Catherine Elizabeth Lee. 2021

机译：使用原始语料库的语义域构建高频词列表？lelo hawai？我
6. A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking [O] . Caleb Rascon 2021

机译：基于语料库的波束形成技术和基于相位频率掩蔽的评估
7. Building Corpus-based Frequency Lemma Lists [O] . Lindemann David, Vicente Iñaki San 2015

机译：建立基于语料库的频率引语列表
8. Building Energy Performance Standards. Phase IV. Code Parameter List and Code Format Listing [R] . Kroner, W. M. 1980

机译：建筑能效标准。第四阶段。代码参数列表和代码格式列表

Building Corpus-based Frequency Lemma Lists

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅