首页> 外文会议>International conference on applications of natural language to information systems >Part-of-Speech Tagger for Konkani-English Code-Mixed Social Media Text
【24h】

Part-of-Speech Tagger for Konkani-English Code-Mixed Social Media Text

机译:Konkani-英语代码混合社交媒体文本的词性标注器

获取原文

摘要

In this paper, we propose efficient and less resource-intensive strategies for Konkani-English code-mixed social media text, which witnesses several challenges as compared to tagging general normal text. Part-of-Speech Tagging is a primary and an important step for many Natural Language Processing Applications. This paper reports work on annotating code-mixed Konkani-English data collected from social media site Facebook, which consists of more than four thousands posts from Facebook and developed automatic Part-of-Speech Taggers for this corpus. Part-of-Speech tagging is considered as a classification problem and we use different classifiers such as CRFs, SVM with different combinations of features.
机译:在本文中,我们提出了一种针对Konkani-英语代码混合社交媒体文本的有效且资源占用较少的策略,与标记普通普通文本相比,该策略见证了若干挑战。词性标记是许多自然语言处理应用程序的主要且重要的步骤。本文报道了对从社交媒体网站Facebook收集的代码混合的Konkani-English数据进行注释的工作,该数据包含来自Facebook的四千多个帖子,并为此语料库开发了自动词性标注工具。词性标记被认为是分类问题,我们使用具有不同特征组合的不同分类器,例如CRF,SVM。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号