Conference: International Conference on Big Data, Small Data, Linked Data and Open Data

A Novel Methodology to Identify and Collect Data from Relevant Blogs Leveraging Multiple Social Media Platforms and Cyber Forensics


Abstract

Blogs play a vital role in the retrieval of real-time information, offering users a place to gain insights into events and to find communities with similar interests. However, identifying blogs that contain the honest, unbiased opinions of individuals, as opposed to biased or agenda-driven coverage, is quite a challenge. In addition, blogs are notorious for being dynamic in structure, since their owners can redesign them whenever they want. This changing structure makes blogs computationally expensive for researchers and Web crawlers to process. In this paper, we propose a methodology to help identify relevant blogs for specific events. We provide data statistics for a few real-world events where our methodology successfully identified relevant blogs and helped us study the information discourse. We then discuss the strengths and weaknesses of this methodology and highlight the best approach to crawling blogs.
