首页> 外文会议>International conference on natural language processing >Manipuri Chunking: An Incremental Model with POS and RMWE
【24h】

Manipuri Chunking: An Incremental Model with POS and RMWE

机译:Manipuri分块:使用POS和RMWE的增量模型

获取原文

摘要

This paper records the work of Manipuri Chunking by using the commonly use tool of Support Vector Machine (SVM). Manipur being a very highly agglutinative language have to be careful in selecting the features for running the SVM. An experiment is being performed with 35,000 words to check whether the POS tagged and the Reduplicated Multiword Expression (RMWE) can improve the Chunk identification. With a linguistic expert the Newspaper corpus is maintained to the Gold standard. The experimental system is designed as an incremental model with notation and identification in each stage. The Chunks are identified with a list of selected features. The chunks are very much related with the Part of Speech (POS) thus in the second stage POS tagging is done with the identified Chunks as one of the features which is again followed by the chunking. The third stage is with the RMWE. An experiment is again conducted with a list of carefully selected features for the SVM in order to find the Chunk with POS and RMWE as other features. Comparisons and evaluations are performed in each phase and the final output is drawn with completely tagged chunk Manipuri text. The experiment also identifies the POS tagging with a Recall (R) of 71.97%, Precision (P) of 87.16% and F-measure (F) of 78.84%. Apart from POS it also identifies the RMWE with a Recall (R) of 89.39%, Precision (P) of 98.33% and F-measure (F) of 93.65%. The system shows a final chunking with a Recall (R) of 70.45%, Precision (P) of 86.11% and F-measure (F) of 77.50%.
机译:本文使用支持向量机(SVM)的常用工具记录了Manipuri Chunking的工作。 Manipur是一种高度凝集的语言,在选择运行SVM的功能时必须小心。正在使用35,000个单词进行实验,以检查POS标记和Reduplicated Multiword Expression(RMWE)是否可以改善Chunk识别。有了语言专家,报纸语料库就保持了金标准。实验系统被设计为一个增量模型,在每个阶段都有符号和标识。块以选定特征的列表进行标识。这些块与语音部分(POS)密切相关,因此在第二阶段,使用识别出的块作为特征之一完成POS标记,然后再次进行块化。第三阶段是RMWE。再次对SVM精心选择的功能列表进行实验,以找到带有POS和RMWE作为其他功能的块。在每个阶段执行比较和评估,并使用完全标记的块Manipuri文本绘制最终输出。实验还确定了POS标签,其召回率(R)为71.97%,精度(P)为87.16%,F量度(F)为78.84%。除了POS以外,它还以RM.89.39%的召回率(R),98.33%的精确度(P)和93.65%的F量度(F)标识RMWE。系统显示最终分块,召回率(R)为70.45%,精度(P)为86.11%,F度量(F)为77.50%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号