Part-of-Speech Tagging for Code-Mixed English-Hindi Twitter and Facebook Chat Messages

机译：混合代码的英语-印地语Twitter和Facebook聊天消息的词性标记

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The paper reports work on collecting and annotating code-mixed English-Hindi social media text (Twitter and Facebook messages), and experiments on automatic tagging of these corpora, using both a coarse-grained and a fine-grained part-of-speech tag set. We compare the performance of a combination of language specific taggers to that of applying four machine learning algorithms to the task (Conditional Random Fields, Sequential Minimal Optimization, Naive Bayes and Random Forests), using a range of different features based on word context and word-internal information.

机译：该论文报告了收集和注释混合代码的英语-印地语社交媒体文本（Twitter和Facebook消息）的工作，以及使用粗粒度和细粒度词性标签对这些语料库进行自动标记的实验。放。我们比较了特定语言标记器的组合性能与将四种机器学习算法应用于任务（有条件的随机字段，顺序最小优化，朴素贝叶斯和随机森林）的性能，并使用了基于单词上下文和单词的一系列不同功能-内部信息。

著录项

来源
《International conference on recent advances in natural language processing》|2015年|239-248|共10页
会议地点
作者
Anupam Jamatia; Bjoern Gambaeck; Amitava Das;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Deep Learning Based Sentiment Analysis in a Code-Mixed English-Hindi and English-Bengali Social Media Corpus [J] . Jamatia Anupam, Swamy Steve Durairaj, Gamback Bjorn, International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms . 2020,第5期

机译：基于码混合英语 - 印度和英语 - 孟加拉社交媒体语料库的深度学习情感分析
2. Crime rate detection using social media of different crime locations and Twitter part-of-speech tagger with Brown clustering [J] . Thanh Vo, Sharma Rohit, Kumar Raghvendra, Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2020,第4Pta2期

机译：犯罪率检测不同犯罪地点的社交媒体和Twitter分段与棕色聚类的言语标签
3. Deep Learning Based Part-of-Speech Tagging for Malayalam Twitter Data (Special Issue: Deep Learning Techniques for Natural Language Processing) [J] . S.Kumar, M. AnandKumar, K.P.Soman Journal of Intelligent Systems . 2019,第3期

机译：基于深入学习的Malayalam Twitter数据的语音标记（特殊问题：自然语言处理的深度学习技巧）
4. Part-of-Speech Tagging for Code-Mixed English-Hindi Twitter and Facebook Chat Messages [C] . Anupam Jamatia, Bjoern Gambaeck, Amitava Das International conference on recent advances in natural language processing . 2015

机译：代码混合英语 - 印度推特和Facebook聊天消息的词性标记
5. Racial Identities on Social Media: Projecting Racial Identities on Facebook, Instagram, and Twitter [D] . Brinkman, Nolan. 2018

机译：社交媒体上的种族身份：在Facebook，Instagram和Twitter上投射种族身份
6. Correction: Design and rationale of the Cardiovascular Health and Text Messaging (CHAT) Study and the CHAT-Diabetes Mellitus (CHAT-DM) Study: two randomised controlled trials of text messaging to improve secondary prevention for coronary heart disease and diabetes [O] . 2018

机译：更正：心血管健康和短信（CHAT）研究和CHAT-糖尿病（CHAT-DM）研究的设计和原理：两项短信以改善冠心病和糖尿病的二级预防的随机对照试验
7. Part-of-Speech Tagging for Code-Mixed English-Hindi Twitter and Facebook Chat Messages [O] . Jamatia Anupam, Gambäck Björn, Das Amitava 2016

机译：混合代码的英语-印地语Twitter和Facebook聊天消息的词性标记
8. Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments [R] . Gimpel, K., Schneider, N., O'Connor, B., 2010

机译：Twitter的词性标注：注释，功能和实验

Part-of-Speech Tagging for Code-Mixed English-Hindi Twitter and Facebook Chat Messages

摘要

著录项

相似文献

相关主题

期刊订阅