# PPO-PyTorch
This repository provides a Minimal PyTorch implementation of Proximal Policy Optimization (PPO) with clipped objective
for OpenAI gym environments. It is primarily intended for beginners in Reinforcement Learning for understanding the PPO
algorithm. It can still be used for complex environments but may require some hyperparameter-tuning or changes in the code.
Modified from https://github.com/tangyudi/Ai-Learn
## Usage
- To train a new network : run `PPO_continuous.py`
- To train a new network : run `PPO.py`
- To train a test network : run `test_continuous.py`
- To train a test network : run `test.py`
## Dependencies
Trained and tested on:
```
gym==0.19.0
pyglet==1.5.27
box2d box2d-kengz
gym[box2d]
torch==2.0.1+cu117
```
If you still have problems, you can check `requirement.txt`.
## References
- VMPO [paper](https://arxiv.org/abs/1909.12238)
- [OpenAI Spinning up](https://spinningup.openai.com/en/latest/)
没有合适的资源?快使用搜索试试~ 我知道了~
资源推荐
资源详情
资源评论
收起资源包目录
ppo-pyorch-master.zip (11个子文件)
ppo-pyorch-master
gif
PPO_BipedalWalker-v2.gif 3.37MB
PPO_LunarLander-v2.gif 1.16MB
PPO_LunarLander-v2.pth 43KB
PPO_continuous.py 8KB
PPO.py 10KB
test_continuous.py 2KB
preTrained
PPO_LunarLander-v2.pth 43KB
PPO_continuous_BipedalWalker-v2.pth 31KB
requirements.txt 780B
test.py 2KB
README.md 944B
共 11 条
- 1
资源评论
Older司机渣渣威
- 粉丝: 6
- 资源: 202
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功