首页> 外文会议>International AAAI Conference on Web and Social Media >EmojiNet: An Open Service and API for Emoji Sense Discovery
【24h】

EmojiNet: An Open Service and API for Emoji Sense Discovery

机译:Emojinet:开放服务和Emoji Sense Discovery的API

获取原文

摘要

This paper presents the release of EmojiNet, the largest machine-readable emoji sense inventory that links Unicode emoji representations to their English meanings extracted from the Web. EmojiNet is a dataset consisting of: (i) 12,904 sense labels over 2,389 emoji, which were extracted from the web and linked to machine-readable sense definitions seen in BabelNet; (ii) context words associated with each emoji sense, which are inferred through word embedding models trained over Google News corpus and a Twitter message corpus for each emoji sense definition; and (iii) recognizing discrepancies in the presentation of emoji on different platforms, specification of the most likely platform-based emoji sense for a selected set of emoji. The dataset is hosted as an open service with a REST API and is available at http://emojinet.knoesis.org/. The development of this dataset, evaluation of its quality, and its applications including emoji sense disambiguation and emoji sense similarity are discussed.
机译:本文提出了Emojinet的发布,最大的机器可读表情符号索取库存,即将Unicode Emoji表示与从Web中提取的英语意义链接。 EMOJINET是一个由以下组成的数据集:(i)12,904索取标签超过2,389 emoji,从网上提取并与Babelnet中看到的机器可读意义定义相关联; (ii)与每个表情符号感相关的上下文单词通过在Google News语料库上培训的单词嵌入模型和每个表情符号索引定义的Twitter消息语料库推断出来的单词; (iii)识别在不同平台上表达Emoji的差异,规范了所选表情符号的最可能的基于平台的表情符号。 DataSet托管为具有REST API的开放式服务,可在http://emojinet.kneoesis.org/提供。讨论了这一数据集的发展,评估其质量及其包括表情留声歧义和表情符号感知相似的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号