<p align="center">
<a href="https://www.youtube.com/watch?v=pieI7rOXELI&list=PLXO45tsB95cIplu-fLMpUEEZTwrDNh6Ba" target="_blank">
<img width="60%" src="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/blob/master/RL_cover.jpg" style="max-width:100%;">
</a>
</p>
<br>
# Reinforcement Learning Methods and Tutorials
These reinforcement learning tutorials cover everything from basic RL algorithms to advanced algorithms developed in recent years.
**If you speak Chinese, visit [莫烦 Python](https://morvanzhou.github.io/tutorials/) or my [YouTube channel](https://www.youtube.com/channel/UCdyjiB5H8Pu7aDTNVXTTpcg) for more.**
**In response to many requests, these tutorials are also available in English in this playlist:** ([https://www.youtube.com/playlist?list=PLXO45tsB95cIplu-fLMpUEEZTwrDNh6Ba](https://www.youtube.com/playlist?list=PLXO45tsB95cIplu-fLMpUEEZTwrDNh6Ba))
# Table of Contents
* Tutorials
    * [Simple entry example](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/1_command_line_reinforcement_learning)
    * [Q-learning](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/2_Q_Learning_maze)
    * [Sarsa](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/3_Sarsa_maze)
    * [Sarsa(lambda)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/4_Sarsa_lambda_maze)
    * [Deep Q Network (DQN)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5_Deep_Q_Network)
    * [Using OpenAI Gym](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/6_OpenAI_gym)
    * [Double DQN](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.1_Double_DQN)
    * [DQN with Prioritized Experience Replay](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.2_Prioritized_Replay_DQN)
    * [Dueling DQN](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.3_Dueling_DQN)
    * [Policy Gradients](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/7_Policy_gradient_softmax)
    * [Actor-Critic](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/8_Actor_Critic_Advantage)
    * [Deep Deterministic Policy Gradient (DDPG)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/9_Deep_Deterministic_Policy_Gradient_DDPG)
    * [A3C](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/10_A3C)
    * [Dyna-Q](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/11_Dyna_Q)
    * [Proximal Policy Optimization (PPO)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/12_Proximal_Policy_Optimization)
    * [Curiosity Model](/contents/Curiosity_Model), [Random Network Distillation (RND)](/contents/Curiosity_Model/Random_Network_Distillation.py)
* [Some of my experiments](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments)
    * [2D Car](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments/2D_car)
    * [Robot arm](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments/Robot_arm)
    * [BipedalWalker](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments/Solve_BipedalWalker)
    * [LunarLander](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments/Solve_LunarLander)
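
For a rough idea of what the early tabular tutorials in the list above build up to, here is a minimal Q-learning sketch. It is not the code from this repository: the 1-D corridor environment, the function name `train_q_table`, and all hyperparameter values are assumptions chosen only for illustration.

```python
import random


def train_q_table(n_states=6, episodes=200, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor.

    The agent starts in state 0 and the episode ends when it reaches the
    right-most state, which pays a reward of 1. All other steps pay 0.
    """
    rng = random.Random(seed)
    moves = (-1, +1)  # action 0 = step left, action 1 = step right
    goal = n_states - 1
    q = [[0.0, 0.0] for _ in range(n_states)]

    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy choice; ties between the two actions are broken randomly.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1

            # Walls at both ends: the agent cannot leave the corridor.
            next_state = min(max(state + moves[action], 0), goal)
            reward = 1.0 if next_state == goal else 0.0

            # Q-learning update: bootstrap from the best next action
            # (the terminal state contributes no future value).
            best_next = 0.0 if next_state == goal else max(q[next_state])
            q[state][action] += alpha * (reward + gamma * best_next - q[state][action])
            state = next_state
    return q


q_table = train_q_table()
greedy_policy = ["left" if left > right else "right" for left, right in q_table[:-1]]
print(greedy_policy)
```

After training, the greedy policy read off the table should point toward the reward in every non-terminal state. The tutorials themselves use richer environments (a maze, then OpenAI Gym tasks) and replace the table with a neural network from DQN onward.
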
# Some RL Networks
### [Deep Q Network](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5_Deep_Q_Network)
<a href="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5_Deep_Q_Network">
<img class="course-image" src="https://morvanzhou.github.io/static/results/reinforcement-learning/4-3-2.png">
</a>
### [Double DQN](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.1_Double_DQN)
<a href="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.1_Double_DQN">
<img class="course-image" src="https://morvanzhou.github.io/static/results/reinforcement-learning/4-5-3.png">
</a>
### [Dueling DQN](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.3_Dueling_DQN)
<a href="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.3_Dueling_DQN">
<img class="course-image" src="https://morvanzhou.github.io/static/results/reinforcement-learning/4-7-4.png">
</a>
### [Actor Critic](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/8_Actor_Critic_Advantage)
<a href="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/8_Actor_Critic_Advantage">
<img class="course-image" src="https://morvanzhou.github.io/static/results/reinforcement-learning/6-1-1.png">
</a>
### [Deep Deterministic Policy Gradient](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/9_Deep_Deterministic_Policy_Gradient_DDPG)
<a href="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/9_Deep_Deterministic_Policy_Gradient_DDPG">
<img class="course-image" src="https://morvanzhou.github.io/static/results/reinforcement-learning/6-2-2.png">
</a>
### [A3C](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/10_A3C)
<a href="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/10_A3C">
<img class="course-image" src="https://morvanzhou.github.io/static/results/reinforcement-learning/6-3-2.png">
</a>
### [Proximal Policy Optimization (PPO)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/12_Proximal_Policy_Optimization)
<a href="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/12_Proximal_Policy_Optimization">
<img class="course-image" src="https://morvanzhou.github.io/static/results/reinforcement-learning/6-4-3.png">
</a>
### [Curiosity Model](/contents/Curiosity_Model)
<a href="/contents/Curiosity_Model">
<img class="course-image" src="/contents/Curiosity_Model/Curiosity.png">
</a>
# Donation
*If these tutorials help you, please consider donating to support the creation of better tutorials. Any contribution is greatly appreciated!*
<div >
<a href="https://www.paypal.com/cgi-bin/webscr?cmd=_donations&business=morvanzhou%40gmail%2ecom&lc=C2&item_name=MorvanPython&currency_code=AUD&bn=PP%2dDonationsBF%3abtn_donateCC_LG%2egif%3aNonHosted">
<img style="border-radius: 20px; box-shadow: 0px 0px 10px 1px #888888;"
src="https://www.paypalobjects.com/webstatic/en_US/i/btn/png/silver-pill-paypal-44px.png"
alt="Paypal"
height="auto" ></a>
</div>
<div>
<a href="https://www.patreon.com/morvan">
<img src="https://morvanzhou.github.io/static/img/support/patreon.jpg"
alt="Patreon"
height=120></a>
</div>