We propose Normative Dec-POMDPs, a model of collective decision making in the presence of complex norms, with violations of norms classified according to their relative severity. We extend the PBPG algorithm in order to solve Normative Dec-POMDPs and propose a heuristic that improves its scalability without affecting the policy quality.
展开▼