In order to solve the problem of controlling dissolved oxygen (DO) concentration and nitrate concentration of wasterwater treatment process (WWTP),a multi critic indices dual heuristic dynamic programming (MDHP) policy is proposed.The approximating precision can be improved through lowering the complexity between the relationship of the critic network's outputs and inputs in this scheme.Echo state networks (ESNs) are adopted to approximate the critic indices and the optimal control policy.Online learning method of the controller is investigated.Experimental results indicate that the MDHP scheme has some advantages over single critic index DHP (SDHP) and PID in control performance.%针对污水处理过程(Wastewater treatment process,WWTP)溶解氧(Dissolved oxygen,DO)及硝态氮浓度控制问题,提出了一种多评价指标的DHP (Dual heuristic dynamic programming)控制策略.该策略能够降低评价指标的复杂性,提高评价网络的逼近精度.采用回声状态网络(Echo state networks,ESNs)实现评价函数及控制策略的逼近,研究了控制器的在线学习算法.实验表明,该策略在控制性能上优于单评价指标的DHP策略及常规PID控制策略.
展开▼