首页> 外文学位 >Data-driven identification of key variables: A fuzzy set approach.
【24h】

Data-driven identification of key variables: A fuzzy set approach.

机译:数据驱动的关键变量识别:模糊集方法。

获取原文
获取原文并翻译 | 示例

摘要

In this dissertation, we investigate a problem raised from a real-world application, surface mount manufacturing. The problem can be abstracted as a general problem: to identify key variables that contribute to a partition of a given data set. We have developed two algorithms that can be applied to dealing with this problem. Both algorithms are based on fuzzy sets, fuzzy measures, fuzzy integrals, and evolutionary strategies.;The second algorithm is based on the idea that each data point can be considered as an evaluation function of an object with respect to several features. Fuzzy measures are used to weight different features, and fuzzy integrals are used to define partitions of data points. An evolutionary strategy is again used to identify the optimal fuzzy measure under which values of fuzzy integral of data points define a partition which is as close as possible to a given partition. Both algorithms are tested on the benchmark data, the Iris data set.;A by-product of our investigation is a method for constructing fuzzy measures from a given data set by solving fuzzy relation equations. Moreover, we have also developed a theoretically justified method for approximate solutions of fuzzy relation equations.;The first algorithm is based on the idea that by employing different Mahalanobis metrics, one can weight variables differently. It is called an evolutionary fuzzy c-means algorithm. The algorithm involves a search for an optimal Mahalanobis metric under which the fuzzy c-means algorithm derives a fuzzy partition that is as close as possible to a given partition.
机译:在本文中,我们研究了实际应用中提出的问题,即表面贴装制造。该问题可以抽象为一个普遍的问题:识别有助于划分给定数据集的关键变量。我们已经开发了两种可用于处理此问题的算法。两种算法都基于模糊集,模糊测度,模糊积分和演化策略。第二种算法基于这样的思想,即每个数据点都可以被视为对象相对于多个特征的评估函数。模糊度量用于加权不同的特征,模糊积分用于定义数据点的分区。进化策略再次用于识别最佳模糊测度,在该最佳模糊测度下,数据点的模糊积分的值定义了一个分区,该分区尽可能地接近给定的分区。两种算法都在基准数据Iris数据集上进行了测试。我们研究的副产品是一种通过求解模糊关系方程从给定数据集构造模糊测度的方法。此外,我们还为模糊关系方程的近似解开发了一种理论上合理的方法。第一种算法基于以下思想:通过采用不同的Mahalanobis度量,人们可以对变量进行加权。它称为进化模糊c均值算法。该算法涉及对最佳Mahalanobis度量的搜索,在该度量下,模糊c均值算法可得出与给定分区尽可能接近的模糊分区。

著录项

  • 作者

    Yuan, Bo.;

  • 作者单位

    State University of New York at Binghamton.;

  • 授予单位 State University of New York at Binghamton.;
  • 学科 Statistics.;Artificial Intelligence.;Engineering System Science.
  • 学位 Ph.D.
  • 年度 1996
  • 页码 150 p.
  • 总页数 150
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 水产、渔业;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号