Conference: OnTheMove International Federated Conference

A Big Linked Data Toolkit for Social Media Analysis and Visualization Based on W3C Web Components

Abstract

Social media generates massive amounts of data at a very fast pace. Objective information, such as news, and subjective content, such as opinions and emotions, are intertwined and readily available. This data is appealing from both a research and a commercial point of view, for applications such as public polling or marketing. A complete understanding requires a combined view of information from different sources, which is usually enriched (e.g. with sentiment analysis) and visualized in a dashboard. In this work, we present a toolkit that tackles these issues on different levels: (1) to extract heterogeneous information, it provides independent data extractors and web scrapers; (2) data processing is done with independent semantic analysis services that are easily deployed; (3) a configurable Big Data orchestrator controls the execution of extraction and processing tasks; (4) the end result is presented in a sensible and interactive format with a modular visualization framework based on Web Components that connects to different sources such as SPARQL and Elasticsearch endpoints. Data workflows can be defined by connecting different extractors and analysis services. The different elements of this toolkit interoperate through a Linked Data-principled approach and a set of common ontologies. To illustrate the usefulness of this toolkit, this work describes several use cases in which the toolkit has been successfully applied.
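The idea of defining a data workflow by connecting extractors and analysis services can be illustrated with a minimal sketch. It is not the toolkit's actual API: the function names (`toy_extractor`, `toy_sentiment`, `run_workflow`), the `urn:example:` identifiers, and the `ex:polarity` property are all hypothetical, and the keyword-based polarity rule merely stands in for a real semantic analysis service. Records are plain dictionaries shaped like JSON-LD, so that each step can read and enrich the output of the previous one.

```python
from typing import Callable, Dict, Iterable, Iterator, List

Record = Dict[str, object]
Extractor = Callable[[], Iterable[Record]]
Analyzer = Callable[[Record], Record]


def toy_extractor() -> Iterator[Record]:
    """Hypothetical extractor: yields posts as JSON-LD-like dictionaries."""
    posts = ["I love this phone", "Terrible service today"]
    for i, text in enumerate(posts):
        yield {
            "@id": f"urn:example:post:{i}",          # made-up identifier
            "@type": "schema:SocialMediaPosting",    # assumed vocabulary
            "schema:text": text,
        }


def toy_sentiment(record: Record) -> Record:
    """Hypothetical analysis service: adds a naive keyword-based polarity."""
    words = set(str(record["schema:text"]).lower().split())
    if words & {"love", "great", "good"}:
        label = "positive"
    elif words & {"terrible", "bad", "hate"}:
        label = "negative"
    else:
        label = "neutral"
    enriched = dict(record)            # copy, so the input record is untouched
    enriched["ex:polarity"] = label    # made-up property name
    return enriched


def run_workflow(extractor: Extractor, analyzers: List[Analyzer]) -> List[Record]:
    """Connect one extractor to a chain of analyzers and collect the results."""
    results: List[Record] = []
    for record in extractor():
        for analyze in analyzers:
            record = analyze(record)
        results.append(record)
    return results


results = run_workflow(toy_extractor, [toy_sentiment])
for r in results:
    print(r["@id"], r["ex:polarity"])
```

Because every step consumes and produces the same record shape, extractors and analyzers can be swapped or chained without changing `run_workflow`, which is the property a shared set of ontologies is meant to provide across independent services.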
