Tagging a Norwegian Dialect Corpus

机译：标记挪威方言语料库

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes an evaluation of five data-driven Part-of-Speech (PoS) taggers for spoken Norwegian. The taggers all rely on different machine learning mechanisms: decision trees, hidden Markov models (HMMs), conditional random fields (CRFs), long-short term memory networks (LSTMs), and convolutional neural networks (CNNs). We go into some of the challenges posed by the task of tagging spoken, as opposed to written, language, and in particular a wide range of dialects as is found in the recordings of the LIA (Language Infrastructure made Accessible) project. The results show that the taggers based on either conditional random fields or neural networks perform much better than the rest, with the LSTM tagger getting the highest score.

机译：本文介绍了针对挪威语的五个数据驱动的词性（PoS）标记器的评估。标记者都依赖于不同的机器学习机制：决策树，隐马尔可夫模型（HMM），条件随机字段（CRF），长期短期记忆网络（LSTM）和卷积神经网络（CNN）。我们将讨论口语（而不是书面），语言（尤其是各种方言）的标记任务所带来的一些挑战，正如LIA（可访问语言基础结构）项目的录音中所发现的那样。结果表明，基于条件随机场或神经网络的标记器的性能要优于其余的，其中LSTM标记器的得分最高。

著录项

来源
《Nordic conference of computational Linguistics》|2019年|350-355|共6页
会议地点 Turku(FI)
作者
Andre Kasen; Kristin Hagen; Anders Nøklestad; Joel Priestley;
展开▼
作者单位

Department of Informatics University of Oslo;

The Text Laboratory University of Oslo;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
入库时间 2022-08-26 14:42:12

相似文献

外文文献
中文文献
专利

1. The English Dialects App: The creation of a crowdsourced dialect corpus [J] . Adrian Leemann, Marie-José Kolly, David Britain Ampersand . 2018,第3期

机译：英语方言应用程序：创建众包方言语料库
2. Representing variation in a spoken corpus of an endangered dialect: the case of Torlak [J] . Vukovic Teodora Language Resources and Evaluation . 2021,第3期

机译：代表濒临灭绝的方言语料库的变化：Torlak的情况
3. Issues of Dialectal Saudi Twitter Corpus [J] . Alruily Meshrif The international arab journal of information technology . 2020,第3期

机译：方言沙特推特语料库的问题
4. Tagging a Norwegian Dialect Corpus [C] . Andre Kasen, Kristin Hagen, Anders N?klestad, Nordic conference of computational Linguistics . 2019

机译：标记挪威语方言语料库
5. A generational and social study of the changing urban dialect of Bergen, Norway. [D] . Brown, William Robert, Jr. 2003

机译：挪威卑尔根不断变化的城市方言的世代相传和社会研究。
6. The Nationwide Speech Project: A new corpus of American English dialects [O] . Cynthia G. Clopper, David B. Pisoni -1

机译：全国语音项目：美国英语方言的新语料库
7. Clearing the Transcription Hurdle in Dialect Corpus Building: The Corpus of Southern Dutch Dialects as Case Study [O] . Anne-Sophie Ghyselen, Anne Breitbarth, Melissa Farasyn, 2020

机译：在方言语料库中清除转录障碍：南方荷兰语方言的语料库作为案例研究
8. Construction of a Phonotactic Dialect Corpus using Semiautomatic Annotation [R] . Schwartz, R., Shen, W., Campbell, J., 2007

机译：用半自动注释构建一个语音方言语料库

Tagging a Norwegian Dialect Corpus

摘要

著录项

相似文献

相关主题

期刊订阅