HAVEN: Hierarchical diffusion and value-based trajectory selection for offline safe reinforcement learning
Author:
Erlie Wang,He Diao,Xianglin Chen,Jingkui Zhang,Xiaofeng Chai,Qiang Qi,Ping Zhang
Publication:
Neurocomputing
© 2026 Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.