Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner
Paper • 2604.05112 • Published
This is a multi-domain action model via in-context reinforcement learning capable of adaptation to unseen dynamics.
@article{polubarov2026vintixiidecisionpretrained, author={Andrei Polubarov and Lyubaykin Nikita and Alexander Derevyagin and Artyom Grishin and Igor Saprygin and Aleksandr Serkov and Mark Averchenko and Daniil Tikhonov and Maksim Zhdanov and Alexander Nikulin and Ilya Zisman and Albina Klepach and Alexey Zemtsov and Vladislav Kurenkov}, title={Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner}, journal={arXiv}, volume={2604.05112}, year={2026}, }