Using Eye-tracking Data to Predict the Readability of Brazilian Portuguese Sentences in Single-task, Multi-task and Sequential Transfer Learning Approaches

Abstract

Sentence complexity assessment is a relatively new task in Natural Language Processing. One of its aims is to highlight which sentences in a text are more complex, to support the simplification of content for a target audience (e.g., children, cognitively impaired users, non-native speakers and low-literacy readers (Scarton and Specia, 2018)). The task is evaluated on datasets of aligned sentence pairs, each containing a complex and a simple version of the same sentence. For Brazilian Portuguese, the task was addressed by Leal et al. (2018), who set up the first dataset for evaluating it in this language and reached 87.8% accuracy with linguistic features. The present work advances these results using models inspired by González-Garduño and Søgaard (2018), which hold the state of the art for English, with multi-task learning and eye-tracking measures. First-Pass Duration, Total Regression Duration and Total Fixation Duration were used at two stages: first to select a subset of linguistic features, and then as an auxiliary task in the multi-task and sequential learning models. The best model proposed here sets a new state of the art for Portuguese with 97.5% accuracy, an increase of almost 10 points over the best previous result. The work also proposes improvements to the public dataset, based on an analysis of the errors of the best model.
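The abstract states that the task is evaluated on aligned pairs of complex and simple sentences. One plausible reading, sketched below, is a pairwise accuracy: the share of pairs in which a model scores the complex version above the simple one. This is an assumption for illustration, not the authors' evaluation code. The names `pairwise_accuracy` and `token_count` are hypothetical, the scorer is a toy stand-in for a trained model, and the example sentences are invented rather than taken from the dataset.

```python
from typing import Callable, Iterable, Tuple


def pairwise_accuracy(
    pairs: Iterable[Tuple[str, str]],
    score: Callable[[str], float],
) -> float:
    """Fraction of (complex, simple) pairs where the complex sentence
    receives the strictly higher complexity score."""
    pairs = list(pairs)
    if not pairs:
        raise ValueError("no sentence pairs given")
    correct = sum(
        1 for complex_s, simple_s in pairs if score(complex_s) > score(simple_s)
    )
    return correct / len(pairs)


def token_count(sentence: str) -> float:
    # Toy stand-in for a trained model (assumption): more
    # whitespace-separated tokens means "more complex".
    return float(len(sentence.split()))


# Invented illustrative pairs, ordered as (complex, simple).
example_pairs = [
    ("O menino, que estava cansado, dormiu cedo ontem.", "O menino dormiu cedo."),
    ("A chuva forte impediu a realização do jogo.", "Choveu muito. Não houve jogo."),
]

print(pairwise_accuracy(example_pairs, token_count))
```

Any scoring function with the same signature can be passed in place of `token_count`, for example the output of a single-task or multi-task classifier.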

Bibliographic record

Source
International Conference on Computational Linguistics | 2020 | pp. 5821-5831 | 11 pages
Authors
Sidney Evaldo Leal; Joao Marcos Munguba Vieira; Erica dos Santos Rodrigues; Elisangela Nogueira Teixeira; Sandra Maria Aluisio
Original format: PDF
Added to database: 2022-08-26 13:58:20

Similar documents

1. Multi-Task Learning for Analyzing and Sorting Large Databases of Sequential Data [J]. Ni K., Paisley J., Carin L., IEEE Transactions on Signal Processing. 2008, no. 8
2. Inductive Transfer of Knowledge: Application of Multi-Task Learning and Feature Net Approaches to Model Tissue-Air Partition Coefficients [J]. Varnek A., Gaudin C., Marcou G., Journal of Chemical Information and Modeling. 2009, no. 1
3. Dataset-aware multi-task learning approaches for biomedical named entity recognition [J]. Zuo Mei, Zhang Yang, Bioinformatics. 2020, no. 15
4. A Nontrivial Sentence Corpus for the Task of Sentence Readability Assessment in Portuguese [C]. Sidney Evaldo Leal, Magali Sanches Duran, Sandra Maria Aluisio, International Conference on Computational Linguistics. 2018
5. From Fully-Supervised, Single-Task to Scarcely-Supervised, Multi-Task Deep Learning for Medical Image Analysis [D]. Imran, Abdullah-Al-Zubaer. 2020
6. Correction: A Predictive Framework for Integrating Disparate Genomic Data Types Using Sample-Specific Gene Set Enrichment Analysis and Multi-Task Learning [O]. Brian D. Bennett, Qing Xiong, Sayan Mukherjee
7. Transformation of discriminative single-task classification into generative multi-task classification in machine learning context [O]. Liu Han, Cocea Mihaela, Mohasseb Alaa. 2017
