首页> 外文期刊>Neural Networks, IEEE Transactions on >R-POPTVR: A Novel Reinforcement-Based POPTVR Fuzzy Neural Network for Pattern Classification
【24h】

R-POPTVR: A Novel Reinforcement-Based POPTVR Fuzzy Neural Network for Pattern Classification

机译:R-POPTVR:一种基于增强的新型POPTVR模糊神经网络,用于模式分类

获取原文
获取原文并翻译 | 示例
       

摘要

In general, a fuzzy neural network (FNN) is characterized by its learning algorithm and its linguistic knowledge representation. However, it does not necessarily interact with its environment when the training data is assumed to be an accurate description of the environment under consideration. In interactive problems, it would be more appropriate for an agent to learn from its own experience through interactions with the environment, i.e., reinforcement learning. In this paper, three clustering algorithms are developed based on the reinforcement learning paradigm. This allows a more accurate description of the clusters as the clustering process is influenced by the reinforcement signal. They are the REINFORCE clustering technique I (RCT-I), the REINFORCE clustering technique II (RCT-II), and the episodic REINFORCE clustering technique (ERCT). The integrations of the RCT-I, the RCT-II, and the ERCT within the pseudo-outer product truth value restriction (POPTVR), which is a fuzzy neural network integrated with the truth restriction value (TVR) inference scheme in its five layered feedforward neural network, form the RPOPTVR-I, the RPOPTVR-II, and the ERPOPTVR, respectively. The Iris, Phoneme, and Spiral data sets are used for benchmarking. For both Iris and Phoneme data, the RPOPTVR is able to yield better classification results which are higher than the original POPTVR and the modified POPTVR over the three test trials. For the Spiral data set, the RPOPTVR-II is able to outperform the others by at least a margin of 5.8% over multiple test trials. The three reinforcement-based clustering techniques applied to the POPTVR network are able to exhibit the trial-and-error search characteristic that yields higher qualitative performance.
机译:通常,模糊神经网络(FNN)的特征在于其学习算法和语言知识表示。但是,当训练数据被认为是所考虑环境的准确描述时,它不一定与环境相互作用。在交互问题中,代理人通过与环境的交互(即强化学习)从自己的经验中学习将更为合适。在本文中,基于强化学习范式开发了三种聚类算法。由于聚类过程受增强信号的影响,因此可以更准确地描述聚类。它们是REINFORCE聚类技术I(RCT-I),REINFORCE聚类技术II(RCT-II)和情节性REINFORCE聚类技术(ERCT)。 RCT-I,RCT-II和ERCT在伪外部产品真值限制(POPTVR)中的集成,这是一个模糊神经网络,在其五层中集成了真值限制(TVR)推理方案前馈神经网络分别形成RPOPTVR-I,RPOPTVR-II和ERPOPTVR。 Iris,Phoneme和Spiral数据集用于基准测试。对于Iris和Phoneme数据,在三个测试试验中,RPOPTVR能够产生更好的分类结果,高于原始的POPTVR和修改的POPTVR。对于Spiral数据集,在多次测试中,RPOPTVR-II的性能至少比其他同类产品高出5.8%。应用于POPTVR网络的三种基于增强的聚类技术能够表现出反复试验的搜索特性,从而产生更高的定性性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号