首页> 外国专利> Method for analyzing web space data

Method for analyzing web space data

机译:网站空间数据分析方法

摘要

Method for analyzing data from the web characterized in that it comprises the steps of choosing a predetermined topic (S), said topic (S) identified by at least one keyword collecting data, or Web resources, from the Web that mention said predetermined topic (S) at successive instants t, two successive instants t being separated by an interval of time of predetermined length d counting the number W(S) of said Web resources that mention said predetermined topic (S) at each instant t generating a time-series of consecutive measures of the number of said Web resources, said time-series representing said number W(S) of Web resources as a function of time splitting said time-series into a plurality of consecutive time windows T of predetermined length z, with z≥d, in such a way that each time window T comprises at least one web resource among said web resources applying a correlation level quantifying technique to said plurality of time windows for quantifying, for at least one time window among said time windows, the level of correlations Lc existing in the Web resources W(S) of a same time window T estimating, for each time window T, the average number WM(S) of said Web resources W(S) that mention said topic (S) computing, for each time window T, a trend index by combining said average number of said Web resources WM(S) with said level of correlations Lc repeating said computing step of said trend index for all said time windows generating a sequence of trend indexes which show how opinions that the society has on a topic S changed over time.
机译:用于分析来自网络的数据的方法,其特征在于,该方法包括以下步骤:选择预定主题(S),该​​主题(S)由至少一个从Web上提及该预定主题的关键词收集数据或Web资源来标识( S)在连续的时刻t处,两个连续的时刻t被间隔一段预定长度d的时间间隔,该间隔计算在每个时刻t提及所述预定主题(S)的所述Web资源的数量W(S),从而产生一个时间序列Web资源的数量的连续度量,表示时间序列表示Web资源的数量W(S)作为时间的函数,将时间序列划分为多个预定长度z的连续时间窗口T,其中z ≥d,以使得每个时间窗口T包括所述网络资源中的至少一个网络资源,对所述多个时间窗口应用相关度量化技术以对至少一个时间窗口进行量化在所述时间窗口中,存在于相同时间窗口T的Web资源W(S)中的相关度Lc针对每个时间窗口T估计所述Web资源W(S)的平均数WM(S)。所述主题(S)通过将所述Web资源WM(S)的所述平均数量与所述相关性水平Lc组合来针对每个时间窗口T计算趋势指标,对于所有所述时间窗口重复所述趋势指标的所述计算步骤,从而生成一个趋势指数序列,显示社会对主题S的看法随时间变化。

著录项

  • 公开/公告号EP2339522A1

    专利类型

  • 公开/公告日2011-06-29

    原文格式PDF

  • 申请/专利权人 SCUOLA NORMALE SUPERIORE DI PISA;

    申请/专利号EP20100015516

  • 发明设计人 MONTANGERO SIMONE;FURINI MARCO;

    申请日2010-12-10

  • 分类号G06Q30/00;

  • 国家 EP

  • 入库时间 2022-08-21 17:54:07

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号