首页> 外文会议>IEEE International Congress on Big Data >Graph-Based Data Relevance Estimation for Large Storage Systems
【24h】

Graph-Based Data Relevance Estimation for Large Storage Systems

机译:基于图的大存储系统的数据相关估计

获取原文

摘要

In storage systems, the relevance of files to users can be taken into account to determine storage control policies to reduce cost, while retaining high reliability and performance. The relevance of a file can be estimated by applying supervised learning and using the metadata as features. However, supervised learning requires many training samples to achieve an acceptable estimation accuracy. In this paper, we propose a novel graph-based learning system for the relevance estimation of files using a small training set. First, files are grouped into different file-sets based on the available metadata. Then a parameterized similarity metric among files is introduced for each file-set using the knowledge of the metadata. Finally, message passing over a bipartite graph is applied for relevance estimation. The proposed system is tested on various datasets and compared with logistic regression.
机译:在存储系统中,可以考虑文件与用户的相关性以确定要降低成本的存储控制策略,同时保留高可靠性和性能。可以通过应用监督学习和使用元数据作为特征来估计文件的相关性。然而,监督学习需要许多培训样本来实现可接受的估计准确性。在本文中,我们提出了一种基于图形的学习系统,用于使用小型训练集的文件相关性估计。首先,根据可用元数据将文件分组为不同的文件集。然后使用元数据的知识为每个文件集引入文件中的参数化相似度度量。最后,应用通过双面图的消息用于相关性估计。所提出的系统在各种数据集上进行测试,并与Logistic回归进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号