首页> 外文会议>Workshop on multiword expressions: from theory to application. >Visually and Phonologically Similar Characters in Incorrect Simplified Chinese Words
【24h】

Visually and Phonologically Similar Characters in Incorrect Simplified Chinese Words

机译:错误和不正确的简体字在视觉和语音上相似的字符

获取原文
获取原文并翻译 | 示例

摘要

Visually and phonologically similar cha- racters are major contributing factors for errors in Chinese text.By defining ap- propriate similarity measures that consid- er extended Cangjie codes,we can identi- fy visually similar characters within a fraction of a second.Relying on the pro- nunciation information noted for individ- ual characters in Chinese lexicons,we can compute a list of characters that are phonologically similar to a given charac- ter.We collected 621 incorrect Chinese words reported on the Internet,and ana- lyzed the causes of these errors.83%of these errors were related to phonological similarity,and 48%of them were related to visual similarity between the involved characters.Generating the lists of phono- logically and visually similar characters, our programs were able to contain more than 90%of the incorrect characters in the reported errors.
机译:视觉和语音上相似的字符是造成中文文本错误的主要因素。通过定义适当的相似度度量(包括扩展的仓jie代码),我们可以在不到一秒钟的时间内识别出视觉相似的字符。针对汉语词典中各个字符的发音信息,我们可以计算出一个与给定字符在语音上相似的字符列表。我们收集了互联网上报告的621个错误的中文单词,并分析了造成这种情况的原因这些错误中有83%与语音相似性有关,其中48%与所涉及字符之间的视觉相似性有关。生成语音和视觉相似字符列表,我们的程序能够包含90多个报告的错误中%不正确的字符。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号