...
首页> 外文期刊>Indian Journal of Science and Technology >Effect of Statistical POS Tagger on Syntactic Analysis of Punjabi Sentences
【24h】

Effect of Statistical POS Tagger on Syntactic Analysis of Punjabi Sentences

机译:统计POS标注对旁遮普句句法分析的影响。

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Objectives: In this research article, author has explored the effect of statistics based part of speech tagger on the syntactic analysis of Punjabi sentences. Methods/Statistical Analysis: To study the effect of statistical POS tagger on the syntactic analysis of Punjabi sentence, author performed two experiments; first a rule based POS tagger is used for syntactic analysis and second this rule based POS tagger is replaced with HMM based statistical POS tagger. An annotated corpus of 20,000 words has been used to train the HMM based POS tagger. Findings: The system is tested on three types of errors; first subject/object and verb agreement error second noun and modifier agreement (in attributed form) error and third modifier and noun agreement error. On using HMM based POS tagger, the system shows a precision of 80.67 for subject/object and verb agreement error whereas on using rule based POS tagger the system shows a precision of 72.81. Similarly for noun and modifier agreement (in attributed form) error, author claims a precision of 82.45 on using HMM based tagger whereas on using rule based tagger, the precision is 76.00. And in case of modifier and noun agreement error, a precision of 97.56 is claimed by the author by using HMM based tagger which was 95.45 when rule based POS tagger is used. Application/Improvements: The result indicates that the grammar checker performs better when rule based POS tagger is replaced with statistics based POS tagger.
机译:目的:在这篇研究文章中,作者探讨了基于统计的语音标记器对旁遮普句子句法分析的影响。方法/统计分析:为了研究统计POS标记器对旁遮普句子句法分析的影响,作者进行了两个实验;首先,将基于规则的POS标记用于语法分析,其次,将基于规则的POS标记替换为基于HMM的统计POS标记。已使用20,000个单词的带注释语料库来训练基于HMM的POS标记器。结果:对系统进行了三种类型的错误测试:第一个主语/宾语和动词一致错误;第二个名词和修饰语一致(属性形式)错误;第三个修饰语和名词一致错误。使用基于HMM的POS标记器时,系统显示主语/宾语和动词一致性错误的精度为80.67,而使用基于规则的POS标记器时,系统显示精度为72.81。类似地,对于名词和修饰符一致性(属性形式)错误,作者声称使用基于HMM的标记器的精度为82.45,而使用基于规则的标记器的精度为76.00。并且在出现修饰语和名词一致性错误的情况下,作者使用基于HMM的标记器声称精度为97.56,而在使用基于规则的POS标记器时,精度为95.45。应用程序/改进:结果表明,当将基于规则的POS标记器替换为基于统计信息的POS标记器时,语法检查器的性能会更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号