Publications• Sorted by Date • Classified by Publication Type • Classified by Research Category • When Do Off-Policy and On-Policy Policy Gradient Methods Align?Davide Mambelli, Stephan Bongers, Onno Zoeter, Matthijs T. J. Spaan, and Frans A. Oliehoek. When Do Off-Policy and On-Policy Policy Gradient Methods Align?. arXiv e-prints, pp. arXiv:2402.12034, February 2024. DownloadAbstract(unavailable) BibTeX Entry@ARTICLE{Mambelli24arxiv,
author = {{Mambelli}, Davide and {Bongers}, Stephan and {Zoeter}, Onno and {Spaan}, Matthijs T.~J. and {Oliehoek}, Frans A.},
title = {When Do Off-Policy and On-Policy Policy Gradient Methods Align?},
journal = {arXiv e-prints},
year = 2024,
month = feb,
eid = {arXiv:2402.12034},
pages = {arXiv:2402.12034},
doi = {10.48550/arXiv.2402.12034},
archivePrefix = {arXiv},
eprint = {2402.12034},
primaryClass = {stat.ML},
keywords = {nonrefereed, arxiv},
}
Generated by
bib2html.pl
(written by Patrick Riley) on
Thu Nov 06, 2025 10:14:50 UTC |