Efficient Similarity Query on Uncertain Set-Valued Data

陈珂; 洪银杰; 陈刚

Abstract

Similarity search over sets under possible-world semantics differs from the traditional techniques for sets of certain (deterministic) data, both semantically and computationally. Because the items of a set are uncertain, i.e., each item appears in the set only with a certain probability, traditional set-processing techniques no longer apply. This paper proposes a formula for measuring the expected similarity of sets based on possible-world semantics. Under this measure, a pair of sets (X, Y) is called a similar set pair if its expected similarity is larger than a given threshold τ ∈ (0,1). A straightforward algorithm for computing the expected similarity of uncertain sets over possible worlds has exponential complexity. This paper presents algorithms that compute the expected similarity by dynamic programming; their complexity is polynomial, and they greatly reduce computation time. Experimental results demonstrate the usability and high performance of queries based on these algorithms.
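The abstract does not reproduce the paper's similarity formula or its algorithms, so the following is only a minimal sketch of the general idea under stated assumptions: Jaccard similarity as the measure, independent per-item membership probabilities, and a similarity of 0 when both sets are empty. All function and variable names here are hypothetical. The sketch contrasts exhaustive possible-world enumeration (exponential) with a dynamic program over the joint distribution of intersection and union sizes (polynomial).

```python
from itertools import product

EMPTY_SIM = 0.0  # assumption: similarity of two empty sets is defined as 0


def jaccard(x, y):
    """Jaccard similarity of two ordinary (certain) sets."""
    union = x | y
    return len(x & y) / len(union) if union else EMPTY_SIM


def expected_jaccard_naive(px, py):
    """Enumerate every possible world; exponential in the number of items.

    px and py map each item to its (assumed independent) probability of
    appearing in the uncertain set X or Y respectively.
    """
    items = sorted(set(px) | set(py))
    n = len(items)
    total = 0.0
    for bits in product((False, True), repeat=2 * n):
        in_x, in_y = bits[:n], bits[n:]
        prob = 1.0
        for i, item in enumerate(items):
            p, q = px.get(item, 0.0), py.get(item, 0.0)
            prob *= (p if in_x[i] else 1.0 - p) * (q if in_y[i] else 1.0 - q)
        x = {it for it, present in zip(items, in_x) if present}
        y = {it for it, present in zip(items, in_y) if present}
        total += prob * jaccard(x, y)
    return total


def expected_jaccard_dp(px, py):
    """Dynamic programming over (|X ∩ Y|, |X ∪ Y|); polynomial time."""
    items = sorted(set(px) | set(py))
    # dist[(a, b)] = probability that, over the items processed so far,
    # the intersection has size a and the union has size b.
    dist = {(0, 0): 1.0}
    for item in items:
        p, q = px.get(item, 0.0), py.get(item, 0.0)
        both = p * q                            # item lands in X and in Y
        one = p * (1.0 - q) + (1.0 - p) * q     # item lands in exactly one set
        neither = (1.0 - p) * (1.0 - q)         # item lands in neither set
        nxt = {}
        for (a, b), pr in dist.items():
            for da, db, w in ((1, 1, both), (0, 1, one), (0, 0, neither)):
                if w > 0.0:
                    key = (a + da, b + db)
                    nxt[key] = nxt.get(key, 0.0) + pr * w
        dist = nxt
    return sum(pr * (a / b if b else EMPTY_SIM) for (a, b), pr in dist.items())


def is_similar_pair(px, py, tau):
    """A pair is similar if its expected similarity exceeds the threshold tau."""
    return expected_jaccard_dp(px, py) > tau
```

With n distinct items, the table holds at most O(n²) states and each item updates every state once, so this sketch runs in O(n³) time, whereas the enumeration visits 4ⁿ worlds. The two functions should agree up to floating-point error on small inputs, which makes the naive version useful as a correctness check.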

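Bibliographic record

Source
《软件学报》 (Journal of Software) | 2012, No. 6 | pp. 1588-1601 | 14 pages
Authors
陈珂; 洪银杰; 陈刚
Affiliation
Department of Computer Science and Technology, Zhejiang University, Hangzhou, Zhejiang 310027 (listed for each of the three authors)
Original format: PDF
Language: Chinese
Chinese Library Classification: Program design; software engineering
Keywords
similarity query; expected similarity; dynamic programming; uncertain set-valued data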

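Similar literature

1. A cosine similarity query method for uncertain text data [J]. 朱命冬, 徐立新, 申德荣. 计算机科学与探索. 2018, No. 1
2. ProBench: a benchmark dataset for evaluating process similarity query algorithms [J]. 曹斌, 王佳星, 安卫士. 计算机集成制造系统. 2017, No. 5
3. An efficient top-k query algorithm for continuous uncertain XML data [J]. 张晓琳, 郑春红, 刘立新. 计算机工程与科学. 2014, No. 6
4. An efficient parallel skyline query processing method for uncertain data streams [J]. 赵越, 王意洁, 王媛. 计算机研究与发展. 2013, No. z2
5. An efficient top-k query algorithm for uncertain data streams [J]. 卢印举, 单国全. 科学技术与工程. 2013, No. 18
6. k-Skyline queries on uncertain datasets [C]. 第二十五届中国数据库学术会议 (NDBC2008). 2008
7. Lineage management and similarity query of uncertain data [A]. 高明. 2011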
