首页> 外文期刊>Pattern recognition letters >Data granulation by the principles of uncertainty
【24h】

Data granulation by the principles of uncertainty

机译:通过不确定性原理进行数据细化

获取原文
获取原文并翻译 | 示例
           

摘要

Researches in granular modeling produced a variety of mathematical models, such as intervals (higher-order) fuzzy sets, rough sets, and shadowed sets, which are all suitable to characterize the so-called information granules. Modeling of the input data uncertainty is recognized as a crucial aspect in information granulation. Moreover, the uncertainty is a well-studied concept in many mathematical settings, such as those of probability theory, fuzzy set theory, and possibility theory. This fact suggests that an appropriate quantification of the uncertainty expressed by the information granule model could be used to define an invariant property, to be exploited in practical situations of information granulation. In this perspective, we postulate that a procedure of information granulation is effective if the uncertainty conveyed by the synthesized information granule is in a monotonically increasing relation with the uncertainty of the input data. In this paper, we present a data granulation framework that elaborates over the principles of uncertainty introduced by Klir. Being the uncertainty a mesoscopic descriptor of systems and data, it is possible to apply such principles regardless of the input data type and the specific mathematical setting adopted for the information granules. The proposed framework is conceived (i) to offer a guideline for the synthesis of information granules and (ii) to build a groundwork to compare and quantitatively judge over different data granulation procedures. To provide a suitable case study, we introduce a new data granulation technique based on the minimum sum of distances, which is designed to generate type-2 fuzzy sets. The automatic membership function elicitation is completely based on the dissimilarity values of the input data, which makes this approach widely applicable. We analyze the procedure by performing different experiments on two distinct data types: feature vectors and labeled graphs. Results show that the uncertainty of the input data is suitably conveyed by the generated type-2 fuzzy set models. (C) 2015 Elsevier B.V. All rights reserved.
机译:颗粒建模的研究产生了各种数学模型,例如区间(高阶)模糊集,粗糙集和阴影集,它们都适合表征所谓的信息颗粒。输入数据不确定性的建模被认为是信息粒化的关键方面。此外,不确定性是许多数学环境中经过充分研究的概念,例如概率论,模糊集理论和可能性理论。这一事实表明,可以使用信息粒度模型表达的不确定性的适当量化来定义不变性,以便在信息粒度化的实际情况下加以利用。从这个角度出发,我们假设,如果合成信息颗粒传达的不确定性与输入数据的不确定性呈单调递增关系,则信息粒化程序是有效的。在本文中,我们提出了一个数据细化框架,详细阐述了Klir引入的不确定性原理。由于不确定性是系统和数据的介观描述符,因此可以应用这样的原理,而与输入数据类型和信息颗粒采用的特定数学设置无关。拟议的框架的构想是(i)为信息颗粒的合成提供指南,以及(ii)建立基础,以比较和定量判断不同的数据制粒程序。为了提供合适的案例研究,我们引入了一种基于最小距离总和的新数据细化技术,该技术旨在生成2型模糊集。自动隶属函数导出完全基于输入数据的相异性值,这使得此方法可广泛应用。我们通过对两种不同的数据类型执行不同的实验来分析程序:特征向量和标记图。结果表明,输入数据的不确定性可以通过生成的2型模糊集模型适当地传达。 (C)2015 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号