...
【24h】

Efficient fuzzy ranking queries in uncertain databases

机译:不确定数据库中的有效模糊排名查询

获取原文
获取原文并翻译 | 示例
           

摘要

Recently, uncertain data have received dramatic attention along with technical advances on geographical tracking, sensor network and RFID etc. Also, ranking queries over uncertain data has become a research focus of uncertain data management. With dramatically growing applications of fuzzy set theory, lots of queries involving fuzzy conditions appear nowadays. These fuzzy conditions are widely applied for querying over uncertain data. For instance, in the weather monitoring system, weather data are inherent uncertainty due to some measurement errors. Weather data depicting heavy rain are desired, where "heavy" is ambiguous in the fuzzy query. However, fuzzy queries cannot ensure returning expected results from uncertain databases. In this paper, we study a novel kind of ranking queries, Fuzzy Ranking queries (FRanking queries) which extend the traditional notion of ranking queries. FRanking queries are able to handle fuzzy queries submitted by users and return k results which are the most likely to satisfy fuzzy queries in uncertain databases. Due to fuzzy query conditions, the ranks of tuples cannot be evaluated by existing ranking functions. We propose Fuzzy Ranking Function to calculate tuples' ranks in uncertain databases for both attribute-level and tuple-level uncertainty models. Our ranking function take both the uncertainty and fuzzy semantics into account. FRanking queries are formally defined based on Fuzzy Ranking Function. In the processing of answering FRanking queries, we present a pruning method which safely prunes unnecessary tuples to reduce the search space. To further improve the efficiency, we design an efficient algorithm, namely Incremental Membership Algorithm (IMA) which efficiently answers FRanking queries by evaluating the ranks of incremental tuples under each threshold for the fuzzy set. We demonstrate the effectiveness and efficiency of our methods through the theoretical analysis and experiments with synthetic and real datasets.
机译:近年来,随着地理跟踪,传感器网络和RFID等技术的进步,不确定数据受到了极大的关注。此外,对不确定数据进行排名查询已成为不确定数据管理的研究重点。随着模糊集理论的迅速增长,如今出现了许多涉及模糊条件的查询。这些模糊条件被广泛应用于不确定数据的查询。例如,在天气监视系统中,由于某些测量误差,天气数据是固有的不确定性。需要描述大雨的天气数据,其中在模糊查询中“大”是模棱两可的。但是,模糊查询不能确保从不确定的数据库返回预期结果。在本文中,我们研究了一种新颖的排名查询,即模糊排名查询(FRanking查询),该查询扩展了传统的排名查询概念。侧面查询能够处理用户提交的模糊查询,并返回k个结果,这些结果最有可能满足不确定数据库中的模糊查询。由于模糊查询条件,元组的排名无法通过现有的排名函数进行评估。对于属性级和元组级不确定性模型,我们提出了模糊排序函数来计算不确定数据库中元组的等级。我们的排名函数同时考虑了不确定性和模糊语义。框架查询是基于模糊排名函数正式定义的。在处理FRanking查询的过程中,我们提出了一种修剪方法,该方法可以安全地修剪不必要的元组以减少搜索空间。为了进一步提高效率,我们设计了一种有效的算法,即增量成员资格算法(IMA),该算法通过评估模糊集的每个阈值下的增量元组的秩来有效地回答FRanking查询。我们通过理论分析和综合和真实数据集的实验来证明我们方法的有效性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号