Spanish Diacritic Error Detection and Restoration - A Survey

Abstract

In this paper we address the problem of diacritic error detection and restoration: the task of identifying and correcting missing accents in text. In particular, we evaluate the performance of a simple technique based on a part-of-speech tagger, comparing it to other established methods for error detection and restoration: unigram frequency, decision lists, discriminative classifiers, a machine-translation-based method, and grapheme-based approaches. In languages such as Spanish (the focus here), diacritics play a key role in disambiguation, and our results show that a straightforward modification to an n-gram tagger achieves good performance in diacritic error identification without resorting to any specialized machinery. Our method should be applicable to any language in which diacritics are distributed comparably and play a similar disambiguating role.
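To make the task concrete, the following is a minimal sketch of the unigram-frequency baseline named in the abstract, not the authors' implementation. The function names (`strip_diacritics`, `train_unigram`, `restore`) and the toy corpus are hypothetical and chosen only for illustration; the sketch assumes lowercase, pre-tokenized input.

```python
from collections import Counter, defaultdict

# Map accented Spanish vowels to their bare forms; ñ is left alone because
# it is a separate letter rather than an accent mark.
_STRIP = str.maketrans("áéíóúüÁÉÍÓÚÜ", "aeiouuAEIOUU")


def strip_diacritics(word):
    """Return the word with acute accents and diaereses removed."""
    return word.translate(_STRIP)


def train_unigram(tokens):
    """For each stripped form, remember its most frequent accented variant."""
    counts = defaultdict(Counter)
    for tok in tokens:
        counts[strip_diacritics(tok)][tok] += 1
    return {bare: variants.most_common(1)[0][0]
            for bare, variants in counts.items()}


def restore(tokens, model):
    """Replace each token by the most frequent variant seen in training."""
    return [model.get(strip_diacritics(tok), tok) for tok in tokens]


# Toy training data, invented for illustration only.
corpus = "él dijo que el libro es más caro que el mío".split()
model = train_unigram(corpus)
restored = restore("el libro es mas caro".split(), model)
print(restored)
```

Because this baseline ignores context, it always picks the same variant for an ambiguous pair such as "el" and "él", which is the kind of case that context-sensitive methods like the tagger-based technique are meant to handle.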

Bibliographic details

Source
Language and Technology Conference | 2016 | 422p | 14 pages
Authors
Mans Hulden; Jerid Francom
Original format: PDF
Chinese Library Classification: TP18-53

