Parallel Computation of Rough Set Approximations in Information Systems with Missing Decision Data

Thinh Cao; Koichi Yamada; Muneyuki Unehara; Izumi Suzuki; Do Van Nguyen

首页> 外文期刊>Computers >Parallel Computation of Rough Set Approximations in Information Systems with Missing Decision Data

【24h】

Parallel Computation of Rough Set Approximations in Information Systems with Missing Decision Data

机译：缺少决策数据的信息系统中粗糙集近似的并行计算

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The paper discusses the use of parallel computation to obtain rough set approximations from large-scale information systems where missing data exist in both condition and decision attributes. To date, many studies have focused on missing condition data, but very few have accounted for missing decision data, especially in enlarging datasets. One of the approaches for dealing with missing data in condition attributes is named twofold rough approximations . The paper aims to extend the approach to deal with missing data in the decision attribute. In addition, computing twofold rough approximations is very intensive, thus the approach is not suitable when input datasets are large. We propose parallel algorithms to compute twofold rough approximations in large-scale datasets. Our method is based on MapReduce, a distributed programming model for processing large-scale data. We introduce the original sequential algorithm first and then the parallel version is introduced. Comparison between the two approaches through experiments shows that our proposed parallel algorithms are suitable for and perform efficiently on large-scale datasets that have missing data in condition and decision attributes.

机译：本文讨论了使用并行计算从大型信息系统中获得粗糙集近似值的情况，条件和决策属性中都缺少数据。迄今为止，许多研究都集中在缺少条件数据上，但是很少有研究说明缺少决策数据，尤其是在扩大数据集方面。处理条件属性中的缺失数据的一种方法称为双重粗糙近似。本文旨在扩展处理决策属性中缺失数据的方法。另外，计算双重粗略近似值非常费力，因此，当输入数据集很大时，此方法不适合。我们提出了并行算法来计算大规模数据集中的两倍粗近似。我们的方法基于MapReduce，这是一种用于处理大规模数据的分布式编程模型。我们首先介绍原始的顺序算法，然后介绍并行版本。通过实验对两种方法进行比较，结果表明，我们提出的并行算法适用于条件和决策属性中缺少数据的大规模数据集，并能有效地对其执行。

著录项

来源
《Computers》 |2018年第3期|共21页
作者
Thinh Cao; Koichi Yamada; Muneyuki Unehara; Izumi Suzuki; Do Van Nguyen;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类数学;
关键词
rough setrough approximationmapreducetwofold rough approximationmissing decision datamissing condition data;

机译：粗略设定粗略逼近映射减少两倍粗略近似缺少决策数据缺少条件数据;

相似文献

外文文献
中文文献
专利

1. Fast algorithms for computing rough approximations in set-valued decision systems while updating criteria values [J] . Luo Chuan, Li Tianrui, Chen Hongmei, Information Sciences: An International Journal . 2015,第Null期

机译：快速算法，可在更新标准值时计算集值决策系统中的近似值
2. An assessment method for the impact of missing data in the rough set-based decision fusion [J] . Han Shan, Jin Xiaoning, Li Jianxun Intelligent data analysis . 2016,第6期

机译：基于粗糙集的决策融合中缺失数据影响的评估方法
3. Decision-theoretic Rough Sets-based Three-way Approximations of Interval-valued Fuzzy Sets [J] . Lang Guangming, Yang Tian Fundamenta Informaticae . 2015,第1a4期

机译：基于决策理论粗糙集的区间值模糊集的三向逼近
4. Rough Sets Approximations in Data Tables Containing Missing Values [C] . Michinori Nakata, Hiroshi Sakai IEEE International Conference on Fuzzy Systems . 2008

机译：包含缺失值的数据表中的粗糙集近似值
5. Fuzzy Rough Set Approximations in Large Scale Information Systems. [D] . Asfoor, Hasan. 2015

机译：大规模信息系统中的模糊粗糙集近似。
6. EFFICIENT HAPLOTYPE INFERENCE FROM PEDIGREES WITH MISSING DATA USING LINEAR SYSTEMS WITH DISJOINT-SET DATA STRUCTURES [O] . Xin Li, Jing Li -1

机译：使用带有离散集数据结构的线性系统从缺少数据的谱系获得有效的单型推断
7. Rough Sets Computations to Impute Missing Data [O] . Nelwamondo, Fulufhelo Vincent, Marwala, Tshilidzi 2007

机译：用于计算缺失数据的粗糙集计算

Parallel Computation of Rough Set Approximations in Information Systems with Missing Decision Data

摘要

著录项

相似文献

相关主题

期刊订阅