首页> 外文会议>International Conference on Geographical Information Systems Theory, Applications and Management >DISTL: Distributed In-Memory Spatio-Temporal Event-based Storyline Categorization Platform in Social Media
【24h】

DISTL: Distributed In-Memory Spatio-Temporal Event-based Storyline Categorization Platform in Social Media

机译:DISTL:在社交媒体中分布在内存时空临时事件的故事情节分类平台

获取原文

摘要

Event analysis in social media is challenging due to endless amount of information generated daily. While current research has put a strong focus on detecting events, there is no clear guidance on how those storylines should be processed such that they would make sense to a human analyst. In this paper, we present DISTL, an event processing platform which takes as input a set of storylines (a sequence of entities and their relation-ships) and processes them as follows: (1) uses different algorithms (LDA, SVM, information gain, rule sets) to identify events with different themes and allocates storylines to them; and (2) combines the events with location and time to narrow down to the ones that are meaningful in a specific scenario. The output comprises sets of events in different categories. DISTL uses in-memory distributed processing that scales to high data volumes and categorizes generated storylines in near real-time. It uses Big Data tools, such as Hadoop and Spark, which have shown to be highly efficient in handling millions of tweets concurrently.
机译:由于日常生成的无限信息,社交媒体中的事件分析挑战。虽然目前的研究强烈关注检测事件,但没有明确指导应该如何处理这些故事列表,以便他们对人类分析师有意义。在本文中,我们呈现DISTL,一个事件处理平台,它用作输入一组故事情节(一系列实体和它们的关系),并如下处理:(1)使用不同的算法(LDA,SVM,信息增益,规则集)以识别具有不同主题的事件并将故事列表分配给它们; (2)将具有位置和时间的事件结合到缩小到在特定场景中有意义的那些。该输出包括不同类别中的一组事件。 DISTL使用内存的分布式处理,该处理可扩展到高数据卷,并在近实时分类生成的故事情节。它使用大数据工具,例如Hadoop和Spark,这表明在同时处理数百万推文方面非常有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号