首页> 外文会议>International workshop on semantic evaluation;Annual conference of the North American Chapter of the Association for Computational Linguistics: human language technologies >CENNLP at SemEval-2018 Task 2: Enhanced Distributed Representation of Text using Target Classes for Emoji Prediction Representation
【24h】

CENNLP at SemEval-2018 Task 2: Enhanced Distributed Representation of Text using Target Classes for Emoji Prediction Representation

机译:CENNLP在SemEval-2018上的任务2:使用目标类增强表情符号预测表示形式的文本的分布式表示

获取原文

摘要

Emoji is one of the "fastest growing language " in pop-culture, especially in social media and it is very unlikely for its usage to decrease. These are generally used to bring an extra level of meaning to the texts, posted on social media platforms. Providing such an added info. gives more insights to the plain text, arising to hidden interpretation within the text. This paper explains our analysis on Task 2," Multilingual Emoji Prediction" sharedtask conducted by Semeval-2018. In the task, a predicted emoji based on a piece of Twitter text are labelled under 20 different classes (most commonly used emojis) where these classes are learnt and further predicted are made for unseen Twitter text. In this work, we have experimented and analysed emojis predicted based on Twitter text, as a classification problem where the entailing emoji is considered as a label for every individual text data. We have implemented this using distributed representation of text through fastText. Also, we have made an effort to demonstrate how fastText framework can be useful in case of emoji prediction. This task is divided into two subtasks, they are based on daiaset presented in two different languages English and Spanish.
机译:表情符号是流行文化中“增长最快的语言”之一,尤其是在社交媒体中,它的使用量不太可能减少。这些通常用于为发布在社交媒体平台上的文本带来额外的含义。提供这样的附加信息。对纯文本有更多的见解,这是由于文本内隐藏的解释引起的。本文介绍了我们对由Semeval-2018执行的任务2“多语言表情符号预测”共享任务的分析。在此任务中,基于20条不同类别(最常用的表情符号)标记了基于Twitter文本的预测表情符号,在这些类别中可以学习这些类别并针对看不见的Twitter文本进行进一步预测。在这项工作中,我们已经对基于Twitter文本预测的表情符号进行了实验和分析,这是一个分类问题,其中,需要将表情符号作为每个单独文本数据的标签。我们已经通过fastText使用文本的分布式表示实现了这一点。此外,我们已努力证明在表情符号预测的情况下fastText框架如何有用。此任务分为两个子任务,它们基于以两种不同语言(英语和西班牙语)显示的daiaset。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号