Exploring the Effects of Conditioning Independent Q-Learners on the Sufficient Statistic for Dec-POMDPs

Alexander Mandersloot, Frans A. Oliehoek, and Aleksander Czechowski. Exploring the Effects of Conditioning Independent Q-Learners on the Sufficient Statistic for Dec-POMDPs. In Proceedings of the 32nd Benelux Conference on Artificial Intelligence (BNAIC) and the 29th Belgian-Dutch Conference on Machine Learning (Benelearn), November 2020.





