Journal of Intelligent Information Systems

Unified domain-specific language for collecting and processing data of social media

Abstract

Data provided by social media are becoming increasingly important analysis material for social scientists, market analysts, and other stakeholders. This diversity of interests has led to a variety of crawling techniques and programming solutions. Nevertheless, these solutions lack the flexibility to satisfy the requirements of different users and individual crawling scenarios, which can range from a simple query to a complex workflow that contains multiple steps and requires data to be collected from different networks. To address this problem, our paper proposes an approach based on a purpose-built domain-specific language (DSL) and an architecture for a distributed crawling system. The DSL is declarative: the user describes the data needed rather than how to collect it, and the language is based on an ontological model of social networks and on the essential crawling techniques. Thus, the crawling system can collect data from different online social networks within complex workflows, while exploiting various crawling methods implemented in a distributed computing environment.
