Regularizing transformers with deep probabilistic layers | Publicación