首页> 外文期刊>IEEE transactions on dependable and secure computing >Towards Private and Scalable Cross-Media Retrieval
【24h】

Towards Private and Scalable Cross-Media Retrieval

机译:走向私人和可扩展的交叉媒体检索

获取原文
获取原文并翻译 | 示例
           

摘要

Cross-media retrieval (CMR) is an attractive networked application where a server responds to queries with retrieval results of different modalities. Different from traditional information retrieval, CMR relies on a more enriched set of machine learning techniques to produce semantic models projecting multimodal data into a common space. A larger training dataset usually gives more accurate models, leading to a better retrieval result. Despite very promising with potential underpinnings in network analytics and multimedia applications, applying CMR in such contexts also faces severe privacy challenges, due to the fact that various data scattering among multiple parties may be sensitive and not allowed to be shared publicly. Studies jointly considering cross-media analytics, privacy protection, collaborative learning, and distributed networking contexts, are relatively sparse. In this work, we propose the first practical system for privacy-preserving cross-media retrieval by utilizing trusted processors. Our scheme enables secure aggregation of the data from distinct parties, and secure canonical correlation analysis (CCA) over collaborated data to obtain semantic models. Verification mechanisms are designed to defend against active attacks from a malicious adversary. Furthermore, to deal with large data sets, we provide a set of optimization methods to accomodate to limited trusted memory and improve the efficiency of training process in CMR. We consider issues such as data block splitting to manage memory overhead, ordering of operations as well as parameters reuse and release to simplify I/O, and parallel computation to speed up dual operations. Our experiments over both synthetic and real datasets show that our solution is very efficient in practice, outperforms the existing solutions, and performs comparably with the original CMR system.
机译:跨媒体检索(CMR)是一个有吸引力的联网应用程序,服务器响应具有不同模态的检索结果的查询。与传统信息检索不同,CMR依赖于更丰富的机器学习技术,以生产将多峰数据投影成公共空间的语义模型。更大的训练数据集通常提供更准确的模型,从而导致更好的检索结果。尽管网络分析和多媒体应用中的潜在支撑性非常有前途,但在这种情况下申请CMR也面临着严重的隐私挑战,因为多方之间的各种数据散射可能是敏感的,并且不允许公开共享。参与跨媒体分析,隐私保护,协作学习和分布式网络上下文的研究相对稀少。在这项工作中,我们提出了通过利用可信处理器来保护隐私保留跨媒检索的第一个实际系统。我们的方案使得能够在不同方面的数据确立聚合,并在合作数据中获得规范相关分析(CCA)以获得语义模型。验证机制旨在防止来自恶意对手的积极攻击。此外,为了处理大数据集,我们提供了一组优化方法,以便于限制受信任内存,提高CMR中的培训过程的效率。我们考虑数据块拆分等问题来管理内存开销,操作排序以及参数重用和释放以简化I / O,并并行计算以加速双重操作。我们对合成和实际数据集的实验表明,我们的解决方案在实践中非常有效,优于现有的解决方案,并与原始CMR系统相对操作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号