Journal: Literary & Linguistic Computing

Phonetic-based Sindhi spellchecker system using a hybrid model


Abstract

This article presents a novel hybrid-model architecture for a Sindhi spellchecker system, which had not been developed prior to this work. The compound textual forms and glyphs of the Sindhi language present a substantial challenge for developing such a system and for generating a suggestion list of similar words for a misspelled word. To implement such a system, phonetic-based Sindhi language rules and patterns must be taken into account to increase accuracy and efficiency. This research proposes a simple and efficient hybrid system that combines three algorithms: the Edit Distance algorithm, which measures the similarity between two Sindhi strings, and the phonetic-based SoundEx and ShapeEx algorithms, which were developed for pattern or glyph matching. Together they generate an accurate and efficient suggestion list for incorrect or misspelled Sindhi words. The article presents a table of phonetically similar-sounding Sindhi characters grouped together, along with another table of character groups based on similar glyphs or shapes. The system has been successfully integrated into a previously developed Sindhi word processor application. The Sindhi word segmentation methodology and algorithms required for the spellchecker have already been published and so are not discussed in detail in this article.
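The combination described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the character tables or say how the three algorithms are combined, so the sketch makes its own assumptions. The groups below are toy Latin-letter stand-ins for the paper's Sindhi sound and shape tables, the group-code encoding is a simplification of SoundEx-style coding, and the ranking rule (group-aware distance first, plain edit distance as a tie-breaker) is hypothetical. The names `make_encoder`, `encode`, and `suggest` are illustrative only.

```python
from typing import Dict, Iterable, List


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance, computed one row at a time."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def make_encoder(groups: Iterable[str]) -> Dict[str, str]:
    """Map every character of a group to one shared private-use code point."""
    table: Dict[str, str] = {}
    for code, group in enumerate(groups):
        for ch in group:
            table[ch] = chr(0xE000 + code)
    return table


def encode(word: str, table: Dict[str, str]) -> str:
    """Replace grouped characters by their group code; keep the rest as-is."""
    return "".join(table.get(ch, ch) for ch in word)


def suggest(word: str, lexicon: Iterable[str],
            sound: Dict[str, str], shape: Dict[str, str],
            limit: int = 5) -> List[str]:
    """Rank lexicon words for a misspelling (hypothetical ranking rule)."""
    def score(candidate: str):
        by_sound = edit_distance(encode(word, sound), encode(candidate, sound))
        by_shape = edit_distance(encode(word, shape), encode(candidate, shape))
        # Best group-aware distance first, raw edit distance breaks ties,
        # and the word itself keeps the ordering deterministic.
        return (min(by_sound, by_shape), edit_distance(word, candidate), candidate)
    return sorted(lexicon, key=score)[:limit]


# Toy stand-ins for the paper's tables (assumed, not taken from the paper).
SOUND_GROUPS = ["ckq", "sz", "fv"]   # characters that sound alike
SHAPE_GROUPS = ["il1", "o0", "mn"]   # characters that look alike
LEXICON = ["cat", "kit", "mat", "zoo"]

sound_table = make_encoder(SOUND_GROUPS)
shape_table = make_encoder(SHAPE_GROUPS)
print(suggest("kat", LEXICON, sound_table, shape_table, limit=3))
```

With these toy groups, "cat" ranks first for the input "kat" because "k" and "c" share a sound group, so the two words encode to the same string. A real system would replace the groups with the Sindhi tables from the article.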