签到位置数据的密度峰值快速搜索与聚类方法

刘萌; 邬群勇; 邱端昇; 孙梅; 张强

首页> 中文期刊>测绘学报 >签到位置数据的密度峰值快速搜索与聚类方法

签到位置数据的密度峰值快速搜索与聚类方法

开具论文收录证明 >>

期刊封面封底目录下载 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Check-in data obtained from Location-based Social Network (LBSN) is a sort of crowd geographic data which will reveal daily activities of urban residents.Different check-in behaviors with the same check-in location will produce the phenomenon of location duplication because of location candidate function in LBSN system.The current density-based spatial clustering algorithms have the following problems: ①difficulty to find density peak point.②clustering error caused by check-in point objects with duplicate positions.In order to solve these problems, we proposed a fast search density peaks and clustering method for check-in data, based on clustering by fast search and find of density peaks (CFSFDP).Firstly, position repetition frequency was introduced and calculated to illustrate the number of the check-in position duplications data.Secondly, a new type of point feature was constructed by adding position repetition frequency of the original check-in position data, which was used as study object to search density peaks.At last, clustering algorithm based on density peak point was constructed in which density connectivity was taken into account to ensure the continuity and integrity of density clusters.Taking check-in data obtained from Sina Microblog as an example, an experiment was designed and implemented.The results demonstrates:①Clustering method can effectively avoid the problem that the outlier location object with high repeatability is chosen as the peak and clustering, and has excellent spatial adaptability as well when comparing with check-in data from other area.②Extracted density peak points can not only be used to represent the center of the hot zone, but also reflect the concentration trend of the hot zone, which can help to explore the dynamic change of the hot zone.%位置签到数据蕴含了城市居民活动变化.由于客户端位置候选问题,不同的签到行为以同一候选位置签到时会产生位置重复现象.针对现有密度聚类方法在签到数据聚类上存在的问题,以快速搜索和查找密度峰值聚类算法(CFSFDP)为基础,提出了签到位置数据的密度峰值快速搜索与聚类方法.首先,引入位置重复频率来表达签到位置重复,然后,对原始签到位置数据点统计位置重复频率并重新设计数据结构,以新的空间点要素为研究对象寻找密度峰值点;最后,构建了峰值点密度簇聚类算法,在点要素集聚类过程中考虑密度连通性来保证峰值密度簇的连续与完整.试验表明,所提出的聚类方法有效避免了重复度较高的离群位置对象选为峰值并聚类的情况,并具有良好的空间适应性.所提取的密度峰值点不仅可以用来表示热区的中心,还能够反映热区的集中趋势,进而可以帮助探索热区的动态变化情况.

著录项

来源
《测绘学报》|2017年第4期|516-525|共10页
作者
刘萌; 邬群勇; 邱端昇; 孙梅; 张强;
展开▼
作者单位

福州大学福建省空间信息工程研究中心空间数据挖掘与信息共享教育部重点实验室,福建福州 350002;

福州大学福建省空间信息工程研究中心空间数据挖掘与信息共享教育部重点实验室,福建福州 350002;

福州大学福建省空间信息工程研究中心空间数据挖掘与信息共享教育部重点实验室,福建福州 350002;

福州大学福建省空间信息工程研究中心空间数据挖掘与信息共享教育部重点实验室,福建福州 350002;

福州大学福建省空间信息工程研究中心空间数据挖掘与信息共享教育部重点实验室,福建福州 350002;

展开▼
原文格式 PDF
正文语种 chi
中图分类测绘数据库与信息系统;
关键词
签到位置数据; 活动热区; 空间聚类; 密度峰值聚类; 位置重复频率;
入库时间 2022-08-18 01:48:30

相似文献

中文文献
外文文献
专利

1. 复杂高维数据的密度峰值快速搜索聚类算法 [J] . 陈俊芬 ,张明 ,赵佳成 . 计算机科学 . 2020,第003期
2. 基于位置签到数据的城市空间分析与可视化 [J] . 陈冉 ,王海起 ,孟斌 . 地理信息世界 . 2017,第003期
3. 基于位置签到数据的城市分层地标提取 [J] . 王明 ,胡庆武 ,李清泉 . 计算机学报 . 2016,第002期
4. 一种综合位置签到数据和景点分级信息的旅游景点评价方法 [J] . 李响 ,张晶 ,江南 . 测绘科学技术学报 . 2015,第004期
5. 基于微博位置签到数据的POI更新方法 [J] . 曹劲舟 ,武红宇 . 地理空间信息 . 2013,第002期
6. 基于快速搜索密度峰值聚类的智能故障诊断 [C] . WEI Zexian ,韦泽贤 ,WANG Yanxue . 2016年全国设备监测诊断与维护学术会议、第十五届全国设备故障诊断学术会议、第十七届全国设备监测与诊断学术会议、2016年全国设备诊断工程会议 . 2016
7. 基于快速搜索和发现的密度峰值聚类算法研究与应用 [A] . 吕闯 . 2019

签到位置数据的密度峰值快速搜索与聚类方法

摘要

著录项

相似文献

相关主题

期刊订阅