Conference: ISPRS/IGU/ICA Joint Workshop on Borderlands Modelling and Understanding for Global Sustainability

USING WEB CRAWLER TECHNOLOGY FOR TEXT ANALYSIS OF GEO-EVENTS: A CASE STUDY OF THE HUANGYAN ISLAND INCIDENT

Abstract

As social networking and network socialisation have brought more text information and social relationships into daily life, the question of whether big data can be fully used to study the phenomena and disciplines of the natural sciences has prompted many specialists and scholars to innovate in their research. Although politics has been integrally involved in hyperlinked word issues since the 1990s, and the automatic assembly of different geospatial webs and distributed geospatial information systems using service chaining has recently been explored and built, the collection of information on geo-events and the visualisation of the resulting data have always faced the bottleneck of traditional manual analysis, owing to the sensitivity, complexity, relativity, timeliness and unexpectedness of political events. Based on the Heritrix framework and the analysis of web-based text, this study examines the word frequency, sentiment tendency and dissemination path of the Huangyan Island incident by combining web crawler technology with text analysis methods. The results indicate that the tag cloud, frequency map, attitude pie chart, individual mention ratios and dissemination flow graph derived from the collected and processed data highlight not only the subject and theme vocabularies of related topics but also certain issues and problems behind them. Being able to express the time-space relationships of text information and to disseminate information about geo-events, text analysis of network information based on focused web crawler technology can serve as a tool for understanding the formation and diffusion of web-based public opinion in political events.
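
The word-frequency step named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the abstract says crawling was built on Heritrix, whereas the snippet below skips crawling entirely and counts terms in a few placeholder strings. The function name `word_frequencies`, the stop-word list and the sample texts are all assumptions made for illustration only.

```python
import re
from collections import Counter

# Hypothetical stop-word list; a real analysis would use a fuller one.
STOP_WORDS = {"the", "a", "an", "of", "and", "in", "on", "to", "is", "are", "for"}


def word_frequencies(documents, top_n=5):
    """Count lowercase word tokens across documents, skipping stop words."""
    counts = Counter()
    for text in documents:
        # Keep runs of ASCII letters only; other scripts need a proper tokenizer.
        tokens = re.findall(r"[a-z]+", text.lower())
        counts.update(t for t in tokens if t not in STOP_WORDS)
    return counts.most_common(top_n)


# Placeholder snippets standing in for crawled page text (not real data).
sample_pages = [
    "The island dispute is in the news",
    "News of the island dispute spreads on forums",
    "Forums discuss the dispute",
]

top_terms = word_frequencies(sample_pages, top_n=3)
print(top_terms)
```

The resulting (term, count) pairs are the kind of input a tag cloud or frequency map would be drawn from. Sentiment tendency and dissemination paths would need further steps that this sketch does not attempt.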
