首页> 外国专利> AUTOMATED OPTIMIZATION OF A MASS POLICY COLLECTIVELY PERFORMED FOR OBJECTS IN TWO OR MORE STATES AND A DIRECT POLICY PERFORMED IN EACH STATE

AUTOMATED OPTIMIZATION OF A MASS POLICY COLLECTIVELY PERFORMED FOR OBJECTS IN TWO OR MORE STATES AND A DIRECT POLICY PERFORMED IN EACH STATE

机译:对两个或多个状态下的对象和每个状态下的对象直接执行的集体策略的自动优化

摘要

An information processing apparatus that optimizes a policy in a transition model in which the number of targeted objects in each state transits according to the policy includes a cost constraint acquisition unit configured to acquire a cost constraint that constrains a total cost of the policy; a mass policy setting unit configured to set the number of objects targeted by a mass policy in each state, based on the predefined number of objects to belong to each state and a reach rate at which the mass policy reaches to an object, with respect to the mass policy collectively executed for the object in two or more states; and a processing unit configured to assume the reach rate of the mass policy as a variable of an optimization and maximize an objective function based on a total reward in a whole period while satisfying the cost constraint.
机译:在转变模型中优化策略的信息处理设备包括:成本约束获取单元,其被构造为获取约束策略的总成本的成本约束;其中,在过渡模型中,每个状态的目标对象的数量根据该策略进行转换。质量策略设置单元,其被配置为基于属于每个状态的对象的预定数量和质量策略到达对象的到达率,来设置每个状态中质量策略所针对的对象的数量。在两个或多个状态下针对该对象共同执行的群众政策;处理单元,其被设定为在满足成本约束的同时,将整体政策的到达率作为优化变量,并基于整个期间的总报酬最大化目标函数。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号