首页> 外文期刊>International Journal of Computational Intelligence and Applications >Neural Skyline Filtering for Imbalance Features Classification
【24h】

Neural Skyline Filtering for Imbalance Features Classification

机译:无制的神经天际线滤波功能分类

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In the current digitalized era, large datasets play a vital role in features extractions, information processing, knowledge mining and management. Sometimes, existing mining approaches are not sufficient to handle large volume of datasets. Biological data processing also suffers for the same issue. In the present work, a classification process is carried out on large volume of exons and introns from a set of raw data. The proposed work is designed into two parts as pre-processing and mapping-based classification. For pre-processing, three filtering techniques have been used. However, these traditional filtering techniques face difficulties for large datasets due to the long required time during large data processing as well as the large required memory size. In this regard, a mapping-based neural skyline filtering approach is designed. Randomized algorithm performed the mapping for large volume of datasets based on objective function. The objective function determines the randomized size of the datasets according to the homogeneity. Around 200 million DNA base pairs have been used for experimental analysis. Experimental result shows that mapping centric filtering outperforms other filtering techniques during large data processing.
机译:在当前的数字化时代,大型数据集在特征提取,信息处理,知识挖掘和管理中发挥着重要作用。有时,现有的采矿方法不足以处理大量数据集。生物数据处理也遭受了同样的问题。在本作本作中,分类过程是在一组原料数据中大量的外显子和内含子进行。该拟议的工作被设计为两部分作为基于预处理和基于映射的分类。为了预处理,已经使用了三种过滤技术。然而,由于大数据处理期间的长期所需时间以及大所需的内存大小,这些传统的过滤技术面临大型数据集的困难。在这方面,设计了一种基于映射的神经天际线过滤方法。随机算法基于客观函数执行大量数据集的映射。目标函数根据均匀性确定数据集的随机大小。大约2亿DNA碱基对已经用于实验分析。实验结果表明,在大数据处理期间,映射中心滤波优于其他过滤技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号