Approximate policy iteration using regularised Bellman residuals minimisation | Publicación