Published article "Neural networks made easy (Part 63): Unsupervised Pretraining for Decision Transformer (PDT)".
We continue to discuss the family of Decision Transformer methods. In the previous article, we saw that training the transformer at the core of these methods is a rather complex task that requires a large labeled dataset. In this article, we look at an algorithm that uses unlabeled trajectories for preliminary model training.