Part-of-Speech Tagger for Konkani-English Code-Mixed Social Media Text

机译：Konkani-英语代码混合社交媒体文本的词性标注器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose efficient and less resource-intensive strategies for Konkani-English code-mixed social media text, which witnesses several challenges as compared to tagging general normal text. Part-of-Speech Tagging is a primary and an important step for many Natural Language Processing Applications. This paper reports work on annotating code-mixed Konkani-English data collected from social media site Facebook, which consists of more than four thousands posts from Facebook and developed automatic Part-of-Speech Taggers for this corpus. Part-of-Speech tagging is considered as a classification problem and we use different classifiers such as CRFs, SVM with different combinations of features.

机译：在本文中，我们提出了一种针对Konkani-英语代码混合社交媒体文本的有效且资源占用较少的策略，与标记普通普通文本相比，该策略见证了若干挑战。词性标记是许多自然语言处理应用程序的主要且重要的步骤。本文报道了对从社交媒体网站Facebook收集的代码混合的Konkani-English数据进行注释的工作，该数据包含来自Facebook的四千多个帖子，并为此语料库开发了自动词性标注工具。词性标记被认为是分类问题，我们使用具有不同特征组合的不同分类器，例如CRF，SVM。

著录项

来源
《International conference on applications of natural language to information systems》|2018年|303-307|共5页
会议地点
作者
Akshata Phadte; Radhiya Arsekar;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Code-mixing; Social media text; Part-of-Speech tagging;

机译：代码混合;社交媒体文字;词性标记;

相似文献

外文文献
中文文献
专利

1. Language identification framework in code-mixed social media text based on quantum LSTM - the word belongs to which language? [J] . Modern Physics Letters, B. Condensed Matter Physics, Statistical Physics, Applied Physics . 2020,第6期

机译：基于量子LSTM的代码混合社交媒体文本中语言识别框架 - 这个词属于哪种语言？
2. An effective cybernated word embedding system for analysis and language identification in code-mixed social media text [J] . Shekhar Shashi, Sharma Dilip Kumar, Sufyan Beg M.M. International journal of knowledge-based and intelligent engineering systems . 2019,第3期

机译：一个有效的电子化词嵌入系统，用于在代码混合的社交媒体文本中进行分析和语言识别
3. Crime rate detection using social media of different crime locations and Twitter part-of-speech tagger with Brown clustering [J] . Thanh Vo, Sharma Rohit, Kumar Raghvendra, Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2020,第4Pta2期

机译：犯罪率检测不同犯罪地点的社交媒体和Twitter分段与棕色聚类的言语标签
4. Part-of-Speech Tagger for Konkani-English Code-Mixed Social Media Text [C] . Akshata Phadte, Radhiya Arsekar International Conference on Applications of Natural Language to Information Systems . 2018

机译：konkani-English-English-Code-Medio Media Medio文本的一部分语音标记器
5. IITagger: Tagging Wall Street Journal text with part-of-speech information [D] . Kim, Yeongkwun 1996

机译：IITagger：使用词性信息标记“华尔街日报”文本
6. A fine-grained Chinese word segmentation and part-of-speech tagging corpus for clinical text [O] . Ying Xiong, Zhongmin Wang, Dehuan Jiang, 2019

机译：用于临床文本的细粒度中文分词和词性标注语料库
7. Part-of-speech Tagging of Code-Mixed Social Media Text [O] . Souvick Ghosh, Satanu Ghosh, Dipankar Das 2016

机译：代码混合的社交媒体文本的词性标记

Part-of-Speech Tagger for Konkani-English Code-Mixed Social Media Text

摘要

著录项

相似文献

相关主题

期刊订阅