Dual REPS: A Generalization of Relative Entropy Policy Search Exploiting Bad Experiences