首页> 外文期刊>SIGKDD explorations >Unsupervised Feature Selection for Linked Social Media Data
【24h】

Unsupervised Feature Selection for Linked Social Media Data

机译:链接的社交媒体数据的非监督特征选择

获取原文
获取原文并翻译 | 示例
           

摘要

The prevalent use of social media produces mountains of un- labeled, high-dimensional data. Feature selection has been shown effective in dealing with high-dimensional data for efficient data mining. Feature selection for unlabeled data remains a challenging task due to the absence of label information by which the feature relevance can be assessed. The unique characteristics of social media data further complicate the already challenging problem of unsupervised feature selection, (e.g., part of social media data is linked, which makes invalid the independent and identically distributed assumption), bringing about new challenges to traditional unsupervised feature selection algorithms. In this paper, we study the differences between social media data and traditional attribute-value data, investigate if the relations revealed in linked data can be used to help select relevant features, and propose a novel unsupervised feature selection framework, LUFS, for linked social media data. We perform experiments with real-world social media datasets to evaluate the effectiveness of the proposed framework and probe the working of its key components.
机译:社交媒体的普遍使用会产生大量未标记的高维数据。事实表明,特征选择可有效处理高维数据以进行有效的数据挖掘。由于没有标签信息可用来评估功能相关性,因此未标签数据的功能选择仍然是一项艰巨的任务。社交媒体数据的独特特征进一步加剧了已经具有挑战性的无监督特征选择问题(例如,部分社交媒体数据被链接,这使得独立且均匀分布的假设无效),给传统的无监督特征选择算法带来了新挑战。在本文中,我们研究了社交媒体数据与传统属性值数据之间的差异,调查链接数据中揭示的关系是否可用于帮助选择相关特征,并为链接社交网络提出了一种新颖的无监督特征选择框架LUFS媒体数据。我们使用现实世界的社交媒体数据集进行实验,以评估所提出框架的有效性并探究其关键组件的工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号