Spanish Diacritic Error Detection and Restoration: A Survey

Abstract

In this paper we address the problem of diacritic error detection and restoration: the task of identifying and correcting missing accents in text. In particular, we evaluate the performance of a simple part-of-speech tagger-based technique, comparing it to other established methods for error detection/restoration: unigram frequency, decision lists, discriminative classifiers, a machine-translation-based method, and grapheme-based approaches. In languages such as Spanish (the focus here), diacritics play a key role in disambiguation, and results show that a straightforward modification to an n-gram tagger can achieve good performance in diacritic error identification without resorting to any specialized machinery. Our method should be applicable to any language where diacritics distribute comparably and perform similar roles of disambiguation.
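As a rough illustration of the unigram-frequency baseline named in the abstract, the sketch below maps each diacritic-stripped word to the accented form seen most often in training text. It is not the authors' implementation: the function names (`strip_diacritics`, `train`, `restore`) and the toy corpus are hypothetical, and it assumes lowercase, pre-tokenized input.

```python
import unicodedata
from collections import Counter, defaultdict


def strip_diacritics(word: str) -> str:
    """Drop all combining marks (simplification: the tilde of ñ is stripped too)."""
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def train(tokens):
    """Map each stripped form to its most frequent accented form in the corpus."""
    counts = defaultdict(Counter)
    for tok in tokens:
        counts[strip_diacritics(tok)][tok] += 1
    return {key: forms.most_common(1)[0][0] for key, forms in counts.items()}


def restore(tokens, model):
    """Replace each token by the most frequent form; unknown tokens pass through."""
    return [model.get(strip_diacritics(tok), tok) for tok in tokens]


# Toy training corpus (hypothetical), already lowercased and tokenized.
corpus = "el libro está en la mesa y el niño dice que él no sabe más".split()
model = train(corpus)

restored = restore("el nino dice que el no sabe mas".split(), model)
print(restored)
# ['el', 'niño', 'dice', 'que', 'el', 'no', 'sabe', 'más']
```

The second `el` should have been restored to `él`, but a unigram model always picks the more frequent form regardless of context. This is the kind of ambiguity that context-sensitive methods such as the tagger-based technique are meant to resolve.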

Bibliographic details

Source: Language and Technology Conference, 2016, pp. 290-303 (14 pages)
Authors: Mans Hulden; Jerid Francom
Original format: PDF

