Nordic Conference of Computational Linguistics

Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations



Abstract

In this paper we introduce a new natural language processing dataset and benchmark for predicting prosodic prominence from written text. To our knowledge this will be the largest publicly available dataset with prosodic labels. We describe the dataset construction and the resulting benchmark dataset in detail and train a number of different models ranging from feature-based classifiers to neural network systems for the prediction of discretized prosodic prominence. We show that pre-trained contextualized word representations from BERT outperform the other models even with less than 10% of the training data. Finally we discuss the dataset in light of the results and point to future research and plans for further improving both the dataset and methods of predicting prosodic prominence from text. The dataset and the code for the models are publicly available.
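The abstract frames the task as predicting a discretized prominence label for each word of written text, with pre-trained contextualized representations from BERT as the strongest model. Below is a minimal sketch, not the authors' released code, of how such a BERT-based token classifier can be set up with the Hugging Face transformers library. The checkpoint name, the number of prominence classes, and the example sentence are assumptions for illustration; the model head would still need to be fine-tuned on the prosodically labeled dataset before its predictions are meaningful.

```python
# Sketch (assumptions: bert-base-uncased checkpoint, 3 prominence classes) of
# treating prosodic prominence prediction as token-level classification.
import torch
from transformers import BertTokenizerFast, BertForTokenClassification

NUM_PROMINENCE_CLASSES = 3  # assumption: e.g. non-prominent / prominent / highly prominent

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertForTokenClassification.from_pretrained(
    "bert-base-uncased", num_labels=NUM_PROMINENCE_CLASSES
)

sentence = "and it was filled with a pitying tenderness".split()
# Tokenize pre-split words so word pieces can be mapped back to whole words.
enc = tokenizer(sentence, is_split_into_words=True, return_tensors="pt")

with torch.no_grad():
    logits = model(**enc).logits  # shape: (1, num_wordpieces, num_labels)

# Use the prediction of the first word piece of each word as that word's label.
pred = logits.argmax(-1)[0].tolist()
word_ids = enc.word_ids(0)
seen = set()
for piece_idx, wid in enumerate(word_ids):
    if wid is None or wid in seen:
        continue
    seen.add(wid)
    print(sentence[wid], pred[piece_idx])
```

Fine-tuning this classification head on the benchmark's word-level prominence labels, rather than running the randomly initialized head as shown here, is what the abstract's comparison against feature-based and other neural baselines refers to.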

