Secure Sequence Similarity Search on Encrypted Genomic Data


Abstract

Genomic data is being produced rapidly by both individuals and enterprises, and it needs to be outsourced from local machines to the cloud for better flexibility. Outsourcing also relieves data owners of local storage management. However, data owners must encrypt sensitive data before outsourcing it, to protect privacy and security in the cloud. Because genomic data is huge in volume, executing researchers' queries securely and efficiently is challenging. In this paper, we present a prefix-tree-based indexing algorithm that supports similar-sequence search queries, with Hamming distance as the similarity measure. The proposed method adopts the semi-honest adversary model for the cloud server. Encryption guarantees the security of the shared data, while the overall computation remains fast and scalable enough for real-life biomedical applications. We evaluated the efficiency of the proposed model on a database of single-nucleotide polymorphism (SNP) sequences. Experimental results show that a query with Hamming distance k = 2 on a database of 10,000 records, each containing 500 nucleotides, takes approximately 4 minutes.
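To make the indexing idea concrete, here is a minimal plaintext sketch of a prefix tree queried under a Hamming-distance budget k. It is an illustration only: the abstract does not describe the encryption layer or the exact tree layout, so nothing here is encrypted, and the names `TrieNode`, `build_trie`, and `hamming_search` as well as the toy sequences are assumptions, not the authors' implementation. All sequences are assumed to have the same length as the query.

```python
from dataclasses import dataclass, field


@dataclass
class TrieNode:
    children: dict = field(default_factory=dict)    # nucleotide -> TrieNode
    record_ids: list = field(default_factory=list)  # ids of sequences ending here


def build_trie(sequences):
    """Insert equal-length sequences into a prefix tree, sharing common prefixes."""
    root = TrieNode()
    for record_id, seq in enumerate(sequences):
        node = root
        for base in seq:
            node = node.children.setdefault(base, TrieNode())
        node.record_ids.append(record_id)
    return root


def hamming_search(root, query, k):
    """Return sorted (record_id, distance) pairs with Hamming distance <= k."""
    matches = []
    stack = [(root, 0, 0)]  # (node, depth, mismatches so far)
    while stack:
        node, depth, mismatches = stack.pop()
        if depth == len(query):
            matches.extend((rid, mismatches) for rid in node.record_ids)
            continue
        for base, child in node.children.items():
            cost = mismatches + (base != query[depth])
            if cost <= k:  # prune branches that already exceed the budget
                stack.append((child, depth + 1, cost))
    return sorted(matches)


# Hypothetical toy database; real SNP records would be far longer.
database = ["ACGTAC", "ACGTTT", "ACCTAC", "TTTTTT"]
index = build_trie(database)
print(hamming_search(index, "ACGTAC", 2))  # [(0, 0), (1, 2), (2, 1)]
```

The pruning step is what the tree buys in this sketch: a whole subtree is skipped as soon as its shared prefix has more than k mismatches with the query, so records with a common prefix are compared once rather than record by record.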

Bibliographic record

Source: 2017, pp. 205-213 (9 pages)
Conference location: Philadelphia (US)
Authors: Md Safiur Rahman Mahdi; Mohammad Zahidul Hasan; Noman Mohammed
Original format: PDF
Language: English
Keywords: Bioinformatics; Genomics; Cryptography; Data privacy; Servers; Hamming distance; Cloud computing


Similar literature


1. FSDS: A practical and fully secure document similarity search over encrypted data with lightweight client [J]. Tosun Tolun, Savas Erkay. Journal of information security and applications, 2021, issue Juna.
2. Secure semantic expansion based search over encrypted cloud data supporting similarity ranking [J]. Zhihua Xia, Yanling Zhu, Xingming Sun. Journal of Cloud Computing: Advances, Systems and Applications, 2014, issue 1.
3. Automated protein sequence database classification. I. Integration of compositional similarity search, local similarity search, and multiple sequence alignment [J]. Jerome Gracy... Bioinformatics, 1998, issue 2.
4. Secure Sequence Similarity Search on Encrypted Genomic Data [C]. Md Safiur Rahman Mahdi, Mohammad Zahidul Hasan, Noman Mohammed. IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies, 2017.
5. Secure Semantic Search over Encrypted Big Data in the Cloud [D]. Woodworth, Jason W. 2017.
6. SCOTCH: Secure Counting Of encrypTed genomiC data using a Hybrid approach [O]. Wang Chenghong, Yichen Jiang, Noman Mohammed. 2017.
7. Secure semantic expansion based search over encrypted cloud data supporting similarity ranking [O]. Zhihua Xia, Yanling Zhu, Xingming Sun. 2014.
