PyTorch implementation of Constrained Policy Optimization
-
Updated
Oct 19, 2021 - Python
PyTorch implementation of Constrained Policy Optimization
RARE-3D: Reinforcement Learning–based Adaptive Path Selection for Efficient Point-Cloud Restoration
Implmentation of Trust Region Policy Optimization (TRPO) from scratch. Tested on standard mujoco-based gymnasium environments. Extended to a harder task of Quadrupedal Locomotion with the help of gait priors.
A repository for easy understanding of codes in Deep Reinforcement Learning
The pytorch implemetation of trpo
Add a description, image, and links to the trpo-pytorch topic page so that developers can more easily learn about it.
To associate your repository with the trpo-pytorch topic, visit your repo's landing page and select "manage topics."