A scalable Deep Q-Learning approach for hot stamping process under dynamic control environment | Publication