最全强化学习路径规划Reinforcement-learning-with-tensorflow-master.zip

共118个文件

py：74个

pyc：23个

md：2个

强化学习

python

路径规划

需积分: 34 130 浏览量 2019-10-01 13:37:20 上传评论 26 收藏 11.6MB ZIP 举报

资源推荐

资源详情

资源评论

收起资源包目录

最全强化学习路径规划Reinforcement-learning-with-tensorflow-master.zip （118个子文件）

DDPG.py.baiduyun.uploading.cfg 3KB

treasure_on_right.py.baiduyun.uploading.cfg 3KB

checkpoint 69B

params.data-00000-of-00001 117KB

params.data-00000-of-00001 39KB

强化学习a-c.docx 338KB

params.index 1KB

1559560308(1).jpg 6KB

1559560336(1).jpg 4KB

launch.json 477B

LICENCE 1KB

README.md 786B

README.md 7KB

events.out.tfevents.1490801027.Morvan 1.07MB

rl.mp4 1.64MB

rl1.mp4 214KB

Reinforcement_learning_An_introduction (1).pdf 9.57MB

Curiosity.png 123KB

img.png 42KB

DDPG.py 16KB

DuelingDQNPrioritizedReplay.py 13KB

RL_brain.py 12KB

DDPG.py 10KB

A3C_rnn.py 10KB

DDPG.py 10KB

A3C.py 9KB

A3C_RNN.py 9KB

A3C_distributed_tf.py 9KB

discrete_DPPO.py 9KB

A3C.py 8KB

arm_env.py 8KB

RL_brain.py 8KB

DPPO.py 8KB

A3C_continuous_action.py 8KB

DPPO.py 8KB

car_env.py 8KB

A3C_discrete_action.py 8KB

env.py 7KB

RL_brain.py 7KB

DQN_modified.py 6KB

simply_PPO.py 6KB

Curiosity.py 6KB

AC_continue_Pendulum.py 6KB

Random_Network_Distillation.py 6KB

env.py 6KB

AC_CartPole.py 6KB

DDPG_update.py 6KB

DDPG_update2.py 6KB

env.py 5KB

rl.py 5KB

RL_brain.py 4KB

maze_env.py 4KB

treasure_on_right.py 3KB

RL_brain.py 3KB

run_Pendulum.py 2KB

run_MountainCar.py 2KB

collision.py 2KB

run_MountainCar.py 2KB

env.py 2KB

RL_brain.py 2KB

run_CartPole.py 2KB

run_LunarLander.py 2KB

run_this.py 2KB

run_this.py 1KB

main.py 1KB

run_CartPole.py 1KB

run_this.py 1KB

main.py 1KB

run_MountainCar.py 1KB

run_this.py 1KB

main.py 1KB

main.py 850B

main.py 835B

main.py 801B

rl.py 226B

env.py 175B

arm_env.cpython-36.pyc 8KB

env.cpython-35.pyc 6KB

env.cpython-36.pyc 5KB

env.pyc 5KB

共 118 条

<p align="center"> <a href="https://www.youtube.com/watch?v=pieI7rOXELI&list=PLXO45tsB95cIplu-fLMpUEEZTwrDNh6Ba" target="_blank"> <img width="60%" src="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/blob/master/RL_cover.jpg" style="max-width:100%;"> </a> </p> <br> # Reinforcement Learning Methods and Tutorials In these tutorials for reinforcement learning, it covers from the basic RL algorithms to advanced algorithms developed recent years. **If you speak Chinese, visit [莫烦 Python](https://morvanzhou.github.io/tutorials/) or my [Youtube channel](https://www.youtube.com/channel/UCdyjiB5H8Pu7aDTNVXTTpcg) for more.** **As many requests about making these tutorials available in English, please find them in this playlist:** ([https://www.youtube.com/playlist?list=PLXO45tsB95cIplu-fLMpUEEZTwrDNh6Ba](https://www.youtube.com/playlist?list=PLXO45tsB95cIplu-fLMpUEEZTwrDNh6Ba)) # Table of Contents * Tutorials * [Simple entry example](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/1_command_line_reinforcement_learning) * [Q-learning](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/2_Q_Learning_maze) * [Sarsa](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/3_Sarsa_maze) * [Sarsa(lambda)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/4_Sarsa_lambda_maze) * [Deep Q Network (DQN)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5_Deep_Q_Network) * [Using OpenAI Gym](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/6_OpenAI_gym) * [Double DQN](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.1_Double_DQN) * [DQN with Prioitized Experience Replay](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.2_Prioritized_Replay_DQN) * [Dueling DQN](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.3_Dueling_DQN) * [Policy Gradients](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/7_Policy_gradient_softmax) * [Actor-Critic](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/8_Actor_Critic_Advantage) * [Deep Deterministic Policy Gradient (DDPG)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/9_Deep_Deterministic_Policy_Gradient_DDPG) * [A3C](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/10_A3C) * [Dyna-Q](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/11_Dyna_Q) * [Proximal Policy Optimization (PPO)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/12_Proximal_Policy_Optimization) * [Curiosity Model](/contents/Curiosity_Model), [Random Network Distillation (RND)](/contents/Curiosity_Model/Random_Network_Distillation.py) * [Some of my experiments](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments) * [2D Car](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments/2D_car) * [Robot arm](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments/Robot_arm) * [BipedalWalker](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments/Solve_BipedalWalker) * [LunarLander](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/experiments/Solve_LunarLander) # Some RL Networks ### [Deep Q Network](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5_Deep_Q_Network) <a href="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5_Deep_Q_Network"> <img class="course-image" src="https://morvanzhou.github.io/static/results/reinforcement-learning/4-3-2.png"> </a> ### [Double DQN](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.1_Double_DQN) <a href="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.1_Double_DQN"> <img class="course-image" src="https://morvanzhou.github.io/static/results/reinforcement-learning/4-5-3.png"> </a> ### [Dueling DQN](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.3_Dueling_DQN) <a href="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/5.3_Dueling_DQN"> <img class="course-image" src="https://morvanzhou.github.io/static/results/reinforcement-learning/4-7-4.png"> </a> ### [Actor Critic](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/8_Actor_Critic_Advantage) <a href="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/8_Actor_Critic_Advantage"> <img class="course-image" src="https://morvanzhou.github.io/static/results/reinforcement-learning/6-1-1.png"> </a> ### [Deep Deterministic Policy Gradient](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/9_Deep_Deterministic_Policy_Gradient_DDPG) <a href="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/9_Deep_Deterministic_Policy_Gradient_DDPG"> <img class="course-image" src="https://morvanzhou.github.io/static/results/reinforcement-learning/6-2-2.png"> </a> ### [A3C](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/10_A3C) <a href="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/10_A3C"> <img class="course-image" src="https://morvanzhou.github.io/static/results/reinforcement-learning/6-3-2.png"> </a> ### [Proximal Policy Optimization (PPO)](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/12_Proximal_Policy_Optimization) <a href="https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow/tree/master/contents/12_Proximal_Policy_Optimization"> <img class="course-image" src="https://morvanzhou.github.io/static/results/reinforcement-learning/6-4-3.png"> </a> ### [Curiosity Model](/contents/Curiosity_Model) <a href="/contents/Curiosity_Model"> <img class="course-image" src="/contents/Curiosity_Model/Curiosity.png"> </a> # Donation *If this does help you, please consider donating to support me for better tutorials. Any contribution is greatly appreciated!* <div > <a href="https://www.paypal.com/cgi-bin/webscr?cmd=_donations&business=morvanzhou%40gmail%2ecom&lc=C2&item_name=MorvanPython&currency_code=AUD&bn=PP%2dDonationsBF%3abtn_donateCC_LG%2egif%3aNonHosted"> <img style="border-radius: 20px; box-shadow: 0px 0px 10px 1px #888888;" src="https://www.paypalobjects.com/webstatic/en_US/i/btn/png/silver-pill-paypal-44px.png" alt="Paypal" height="auto" ></a> </div> <div> <a href="https://www.patreon.com/morvan"> <img src="https://morvanzhou.github.io/static/img/support/patreon.jpg" alt="Patreon" height=120></a> </div>

评论收藏

内容反馈