Semi-automatic construction of a named entity dictionary for entity-based sentiment analysis in social media

Song Yeongkil; Jeong Seokwon; Kim Harksoo

首页> 外文期刊>Multimedia Tools and Applications >Semi-automatic construction of a named entity dictionary for entity-based sentiment analysis in social media

【24h】

Semi-automatic construction of a named entity dictionary for entity-based sentiment analysis in social media

机译：半自动构建命名实体字典，用于社交媒体中基于实体的情感分析

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

To understand the user experience in social media or to facilitate the design of human-centric services by social media, users' opinions about specific entities in text messages should be captured. A fine-grained named entity recognizer (NER) is an essential module for identifying opinion targets in text messages, and a named-entity (NE) dictionary is a major resource that affects the performance of an NER. However, it is not easy to construct an NE dictionary manually, because human annotation is time-consuming and labor-intensive. To reduce construction time and labor, we propose a semi-automatic system to construct an NE dictionary from the free online resource, Wikipedia. The proposed system constructs a pseudo-document for each Wikipedia NE by using an active-learning technique. It then classifies Wikipedia entries into NE classes based on similarities between the entries and pseudo-documents located in a vector space. In experiments, the proposed system classified 92.3 % of Wikipedia entries into 29 NE classes. It showed a high performance, with a macro-averaging F1-measure of 0.872 and micro-averaging F1-measure of 0.935.

机译：为了了解社交媒体中的用户体验或通过社交媒体促进以人为本的服务设计，应捕获用户对文本消息中特定实体的意见。细粒度的命名实体识别器（NER）是用于识别文本消息中的意见目标的基本模块，而命名实体（NE）词典是影响NER性能的主要资源。但是，人工构建NE词典并不容易，因为人工注释既费时又费力。为了减少施工时间和劳力，我们提出了一种半自动系统，可以从免费的在线资源Wikipedia构建NE词典。所提出的系统通过使用主动学习技术为每个Wikipedia NE构造一个伪文档。然后，根据条目和位于矢量空间中的伪文档之间的相似性，将Wikipedia条目分类为NE类。在实验中，提出的系统将92.3％的Wikipedia条目分类为29个NE类。它表现出很高的性能，宏平均F1测度为0.872，微观平均F1测度为0.935。

著录项

来源
《Multimedia Tools and Applications》 |2017年第9期|11319-11329|共11页
作者
Song Yeongkil; Jeong Seokwon; Kim Harksoo;
展开▼
作者单位

Kangwon Natl Univ, Coll IT, Program Comp & Commun Engn, 1 Gangwondaehak Gil, Seoul 121742, Gangwon Do, South Korea;

Kangwon Natl Univ, Coll IT, Program Comp & Commun Engn, 1 Gangwondaehak Gil, Seoul 121742, Gangwon Do, South Korea;

Kangwon Natl Univ, Coll IT, Program Comp & Commun Engn, 1 Gangwondaehak Gil, Seoul 121742, Gangwon Do, South Korea;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Named entity dictionary; Active learning; Information retrieval; Vector space model;

机译：命名实体词典;主动学习;信息检索;向量空间模型;
入库时间 2022-08-17 13:04:15

相似文献

外文文献
中文文献
专利

1. "Systems, Methods and Devices for Generating an Adjective Sentiment Dictionary for Social Media Sentiment Analysis" in Patent Application Approval Process [J] . Robotics and Machine Learning . 2012,第44期

机译：专利申请批准过程中的“用于生成用于社交媒体情感分析的形容词情感词典的系统，方法和设备”
2. Sentiment analysis based on a social media customised dictionary [J] . Milene Dias Almeida, Vinicius Mothé Maia, Roberto Tommasetti, MethodsX . 2021,第a期

机译：基于社交媒体定制字典的情感分析
3. Empirical Evaluation of Leveraging Named Entities for Arabic Sentiment Analysis [J] . Mulki Hala, Haddad Hatem, Gridach Mourad, The international arab journal of information technology . 2020,第2期

机译：用阿拉伯语情绪分析利用命名实体的实证评价
4. Semi-automatic Construction of a Named Entity Dictionary Based on Active Learning [C] . Yeongkil Song, Harksoo Kim FTRA international conference on computer science and its applications . 2015

机译：基于主动学习的命名实体字典半自动构建
5. Parallelization of Entity-Based Models in Computational Social Science: A Hardware Perspective [D] . Brearcliffe, Dale K. 2017

机译：计算社会科学中基于实体的模型的并行化：硬件视角
6. Sentiment analysis based on a social media customised dictionary [O] . Milene Dias Almeida, Vinicius Mothé Maia, Roberto Tommasetti, 2021

机译：基于社交媒体定制字典的情感分析
7. Corpus Creation and Analysis for Named Entity Recognition in Telugu-English Code-Mixed Social Media Data [O] . Vamshi Krishna Srirangam, Appidi Abhinav Reddy, Vinay Singh, 2019

机译：Telugu-English-English-Mixed社交媒体数据中命名实体识别的语料库创建和分析

Semi-automatic construction of a named entity dictionary for entity-based sentiment analysis in social media

摘要

著录项

相似文献

相关主题

期刊订阅