Journal: 《中国文献情报(英文刊)》 (Chinese Journal of Library and Information Science)

Ontology modeling of semantics in social media: Public issue knowledge base (PIKB) of the Weibo

Abstract

Purpose: This study aims to construct an ontology that models the semantics of social media streams, in particular trending topics and public issues.

Design/methodology/approach: Our knowledge base included 10 public events and 10 topics from Weibo, collected through keyword search and a crawler program. We used a semi-automatic approach to model and annotate the semantics in social media, adapting a multi-layered ontology from previous research to refine the design. We then used named entity recognition (NER) to extract entities and instantiate the ontology. Relationships were extracted based on co-occurrence measures. Finally, we manually conducted a post-filtering evaluation and edited the extracted entities and relationships.

Findings: An initial assessment demonstrated that our multi-layered ontology supports various types of queries and analyses in the public issue knowledge base (PIKB), which can serve as an effective tool to query, understand, and trace public issues.

Research limitations: The manual involvement does not scale, so the approach cannot yet support sustainable development of the knowledge base. Because the extracted relationships are based entirely on the co-occurrence of entities, richer semantic relationships, such as how much the key players have been involved, could not be fully reflected. In addition, a user evaluation is necessary for further assessment of the ontology.

Practical implications: The PIKB can be used by regular Web users and policy makers to query, understand, and make sense of public events and topics. The methodology and the reusable ontology model are useful for institutions interested in making use of social media data.

Originality/value: In this study, a multi-layered ontology is applied to model the evolving semantics of public events and trending topics in social media. The semi-automatic approach makes it possible to extract entities and relationships from large amounts of unstructured short texts of user-generated content (UGC) from social media.
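To make the co-occurrence step mentioned in the methodology more concrete, the following is a minimal sketch in Python. It is an illustration only, not the authors' implementation: the abstract does not specify the co-occurrence measure, so this sketch assumes a simple count of posts in which two entities appear together, and the names `cooccurrence_counts`, `extract_relationships`, the `min_count` threshold, and the sample entities are all hypothetical. It also assumes that NER has already produced a list of entities for each post.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(posts):
    """Count how often each unordered pair of entities appears in the same post."""
    counts = Counter()
    for entities in posts:
        # Deduplicate within a post and sort so (a, b) and (b, a) share one key.
        for pair in combinations(sorted(set(entities)), 2):
            counts[pair] += 1
    return counts


def extract_relationships(posts, min_count=2):
    """Keep entity pairs that co-occur in at least min_count posts.

    min_count is an assumed threshold, not a value taken from the study.
    """
    return {
        pair: n
        for pair, n in cooccurrence_counts(posts).items()
        if n >= min_count
    }


# Hypothetical input: each inner list holds the entities NER found in one post.
posts = [
    ["Agency A", "City B", "Person C"],
    ["Agency A", "City B"],
    ["Person C", "City B"],
    ["Agency A"],
]

relationships = extract_relationships(posts, min_count=2)
for (left, right), n in sorted(relationships.items()):
    print(f"{left} -- {right}: co-occurs in {n} posts")
```

A count of this kind only records that two entities were mentioned together. It says nothing about the nature or strength of their involvement, which is the limitation the abstract itself points out, and it is why a manual post-filtering pass remains part of the described workflow.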
