首页> 外文会议>International Conference on Data and Software Engineering >Public facilities recommendation system based on structured and unstructured data extraction from multi-channel data sources
【24h】

Public facilities recommendation system based on structured and unstructured data extraction from multi-channel data sources

机译:基于从多渠道数据源中抽取结构化和非结构化数据的公共设施推荐系统

获取原文

摘要

Nowadays social media data has grown very rapidly by producing a huge amount and variety of data everyday. Those data can be analyzed and processed to deliver useful information especially for public needs. However, most of the data available in social media are unstructured. This paper proposes a recommendation system for public facilities by utilizing both structured and unstructured data gathered from multi-channel data sources. The system uses single-criteria rating, multi-criteria-rating, and text data as the inputs. The challenge is how to handle data variety such that any kind of data from any channel can be integrated. The second challenge is how to extract location-related data from the raw data. There are four data channels used in the system. Three of them are social media channels, i.e. Twitter, Instagram, and Foursquare, while the other is internal data channel built as a part of the system itself. The system deals with three categories of public facility, i.e. park, hospital, and mosque. The whole system consists of two sub systems, i.e. the extractor system including the rating input module and the recommendation system. The recommendation system is implemented as end-user mobile application such that the users are able to use it anytime and anywhere. The system successfully integrate data from different social media channels and in different format to provide users with useful information concerning public facilities in the form of recommendation (rating) and popularity of the facilities. The experiment has shown that above 90% of the data collected from the social media contains location-related information that is useful for further processing. The system has been tested using usability test, and it obtained an average users score 3.9 on a scale of 1 to 5.
机译:如今,社交媒体数据通过每天产生大量和各种各样的数据而迅速增长。可以对这些数据进行分析和处理,以提供有用的信息,尤其是满足公共需求的信息。但是,社交媒体中可用的大多数数据都是非结构化的。本文利用从多渠道数据源收集的结构化和非结构化数据,提出了一种公共设施推荐系统。系统使用单标准分级,多标准分级和文本数据作为输入。面临的挑战是如何处理各种数据,以便可以集成来自任何渠道的任何类型的数据。第二个挑战是如何从原始数据中提取与位置相关的数据。系统中使用了四个数据通道。其中三个是社交媒体渠道,即Twitter,Instagram和Foursquare,另一个是内部数据渠道,它是系统本身的一部分。该系统处理三类公共设施,即公园,医院和清真寺。整个系统由两个子系统组成,即包含评级输入模块和推荐系统的提取器系统。推荐系统被实现为最终用户移动应用程序,以便用户能够随时随地使用它。该系统成功集成了来自不同社交媒体渠道和格式的数据,以推荐(评级)和设施受欢迎程度的形式向用户提供有关公共设施的有用信息。实验表明,从社交媒体收集的数据中有90%以上包含与位置有关的信息,这些信息可用于进一步处理。该系统已使用可用性测试进行了测试,并获得了1-5分的平均用户得分3.9。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号