# Gym-Duckietown
[![Build Status](https://circleci.com/gh/duckietown/gym-duckietown/tree/master.svg?style=shield)](https://circleci.com/gh/duckietown/gym-duckietown/tree/master) [![Docker Hub](https://img.shields.io/docker/pulls/duckietown/gym-duckietown.svg)](https://hub.docker.com/r/duckietown/gym-duckietown)
[Duckietown](http://duckietown.org/) self-driving car simulator environments for OpenAI Gym.
Please use this BibTeX entry if you want to cite this repository in your publications:
```
@misc{gym_duckietown,
  author = {Chevalier-Boisvert, Maxime and Golemo, Florian and Cao, Yanjun and Mehta, Bhairav and Paull, Liam},
  title = {Duckietown Environments for OpenAI Gym},
  year = {2018},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/duckietown/gym-duckietown}},
}
```
This simulator was created as part of work done at [Mila](https://mila.quebec/).
<p align="center">
<img src="media/simplesim_free.png" width="300px"><br>
</p>
<h2 align="center">
Welcome to <b>Duckietown</b>!
</h2>
## Table of Contents
1. [Gym-Duckietown](#Gym-Duckietown)
    1. [Introduction](#Introduction)
    2. [Installation](#Installation)
        1. [Installation Using Conda (Alternative Method)](#Installation-Using-Conda-Alternative-Method)
    3. [Usage](#Usage)
        1. [Testing](#Testing)
        2. [Learning](#Learning)
    4. [Design](#Design)
        1. [Map File Format](#Map-File-Format)
        2. [Observations](#Observations)
        3. [Actions](#Actions)
        4. [Reward Function](#Reward-Function)
    5. [Troubleshooting](#Troubleshooting)
        1. [ImportError: Library "GLU" not found](#ImportError-Library-GLU-not-found)
        2. [NoSuchDisplayException: Cannot connect to "None"](#NoSuchDisplayException-Cannot-connect-to-None)
        3. [Running headless](#Running-headless)
        4. [Running headless and training in a cloud based environment (AWS)](#Running-headless-and-training-in-a-cloud-based-environment-AWS)
        5. [Poor performance, low frame rate](#Poor-performance-low-frame-rate)
        6. [RL training doesn't converge](#RL-training-doesnt-converge)
        7. [Unknown encoder 'libx264' when using gym.wrappers.Monitor](#Unknown-encoder-libx264-when-using-gymwrappersMonitor) (thanks @na018 for contributing this!)
## Introduction
Gym-Duckietown is a simulator for the [Duckietown](https://duckietown.org) Universe, written in pure Python/OpenGL (Pyglet). It places your agent, a Duckiebot, inside an instance of a Duckietown: a loop of roads with turns, intersections, obstacles, Duckie pedestrians, and other Duckiebots. It can be a pretty hectic place!
Gym-Duckietown is fast, open, and highly customizable. What started as a lane-following simulator has evolved into a fully functioning autonomous driving simulator that you can use to train and test machine learning, reinforcement learning, imitation learning, or even classical robotics algorithms. It offers a wide range of tasks, from simple lane-following to full city navigation with dynamic obstacles, and it ships with features, wrappers, and tools that can help you bring your algorithms to the real robot, including [domain randomization](https://blog.openai.com/spam-detection-in-the-physical-world/), accurate camera distortion, and differential-drive physics (and, most importantly, realistic waddling).
<p align="center">
<img src="media/finalmain.gif"><br>
</p>
There are multiple registered gym environments, each corresponding to a different [map file](https://github.com/duckietown/gym-duckietown/tree/master/gym_duckietown/maps):
- `Duckietown-straight_road-v0`
- `Duckietown-4way-v0`
- `Duckietown-udem1-v0`
- `Duckietown-small_loop-v0`
- `Duckietown-small_loop_cw-v0`
- `Duckietown-zigzag_dists-v0`
- `Duckietown-loop_obstacles-v0` (static obstacles in the road)
- `Duckietown-loop_pedestrians-v0` (moving obstacles in the road)
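Any of these can be created through the standard Gym API. The snippet below is a minimal sketch: it assumes that importing `gym_duckietown` registers the `Duckietown-*` IDs with Gym (the usual side-effect pattern for Gym extension packages) and uses the classic four-tuple `step` interface:
```
import gym
import gym_duckietown  # noqa: F401 -- importing registers the Duckietown-* environments

env = gym.make("Duckietown-udem1-v0")
obs = env.reset()

for _ in range(500):
    action = env.action_space.sample()  # random action; substitute your policy here
    obs, reward, done, info = env.step(action)
    env.render()
    if done:
        obs = env.reset()

env.close()
```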
The `MultiMap-v0` environment is a [wrapper](https://github.com/duckietown/gym-duckietown/blob/master/gym_duckietown/envs/multimap_env.py) around the simulator that automatically cycles through all available [map files](https://github.com/duckietown/gym-duckietown/tree/master/gym_duckietown/maps). This makes it possible to train on many different maps at the same time, with the idea that exposure to a variety of scenarios yields a more robust policy.
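Using it is the same one-liner as any other environment (again assuming `MultiMap-v0` is registered on import):
```
import gym
import gym_duckietown  # noqa: F401

env = gym.make("MultiMap-v0")  # cycles through all bundled maps across episodes
```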
`gym-duckietown` is an _accompanying_ simulator for real Duckiebots, which lets you run your code on the real robot. We provide a domain randomization API that can help you transfer policies trained in simulation to the real world. Without a domain transfer method, your learned models will likely overfit to various aspects of the simulator that don't carry over to the real world; when you deploy, you and your Duckiebot will be running around in circles trying to figure out what's going on.
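If you need finer control than the registered environment IDs provide, for example to toggle domain randomization or camera distortion explicitly, you can construct the simulator class directly. The keyword arguments below are a sketch based on the `Simulator` constructor and may differ between versions; check `gym_duckietown/simulator.py` for the authoritative list:
```
from gym_duckietown.simulator import Simulator

# Sketch: build the simulator directly with domain randomization enabled.
# Argument names are assumptions and may vary across versions.
env = Simulator(
    seed=123,            # seed for the random start pose
    map_name="loop_empty",
    domain_rand=True,    # randomize lighting, textures, camera parameters, etc.
    distortion=True,     # apply the fish-eye camera distortion model
    camera_width=640,
    camera_height=480,
)
```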
<p align="center">
<img src="media/domainrand-sim.gif" width="300px" height="200px" ><img src="media/needfordr.gif" width="300px" height="200px" ><br>
</p>
The `Duckiebot-v0` environment is meant to connect to software running on
a real Duckiebot and remotely control the robot. It is a tool to test that policies
trained in simulation can transfer to the real robot. If you want to
control your robot remotely with the `Duckiebot-v0` environment, you will need to
install the software found in the [duck-remote-iface](https://github.com/maximecb/duck-remote-iface)
repository on your Duckiebot.
<p align="center">
<img src="media/duckiebot_1.png" width="300px"><br>
Duckiebot-v0
</p>
## Installation
Requirements:
- Python 3.6+
- OpenAI gym
- NumPy
- Pyglet
- PyYAML
- cloudpickle
- PyTorch or TensorFlow (to use the scripts in `learning/`)
You can install all the dependencies except PyTorch with `pip3`:
```
git clone https://github.com/duckietown/gym-duckietown.git
cd gym-duckietown
pip3 install -e .
```
### Installation Using Conda (Alternative Method)
Alternatively, you can install all the dependencies, including PyTorch, using Conda as follows. For those trying to use this package on Mila machines, this is the way to go:
```
git clone https://github.com/duckietown/gym-duckietown.git
cd gym-duckietown
conda env create -f environment.yaml
```
Please note that if you use Conda to install this package instead of pip, you will need to activate your Conda environment and add the package to your Python path before you can use it:
```
source activate gym-duckietown
export PYTHONPATH="${PYTHONPATH}:`pwd`"
```
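A quick sanity check that the environment is set up correctly is to import the package from the activated environment:
```
python3 -c "import gym_duckietown"
```
If this command exits without errors, the package is visible on your Python path.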
## Usage
### Testing
There is a simple UI application which allows you to control the simulation or real robot manually. The `manual_control.py` application will launch the Gym environment, display camera images, and send actions (keyboard commands) back to the simulator or robot. You can specify which environment to load with the `--env-name` argument (and which map file with `--map-name`):
```
./manual_control.py --env-name Duckietown-udem1-v0
```
There is also a script to run automated tests (`run_tests.py`) and a script to gather performance metrics (`benchmark.py`).
### Learning
`gym-duckietown` provides starter code for both reinforcement learning (PyTorch only) and imitation learning (both TensorFlow and PyTorch). In the following section, we describe how to get started, as well as some tips on improving both agents.
Within the `learning/` subdirectory, you will find `imitation/{tensorflow|pytorch}` and `reinforcement/pytorch`. To use either, change into the `learning/` directory and call the scripts from there (this lets us share utility functions across all three baselines without forcing users to install both TensorFlow and PyTorch).
**PyTorch Reinforcement Learning** can be run using:
```
python -m reinforcement.pytorch.train_reinforcement
```
whereas both **TensorFlow and PyTorch Imitation Learning** can be run using:
```
python -m imitation.{tensorflow|pytorch}.train_imitation
```
We use `argparse`