# livelybot_rl_control
This repository provides a reinforcement learning environment for training HighTorque's legged robots with NVIDIA Isaac Gym. livelybot_rl_control also integrates a sim-to-sim framework from Isaac Gym to Mujoco, allowing users to verify trained policies in a different physics simulator and thereby check their robustness and generalization.
## Installation
1. Create a new Python 3.8 virtual environment with `conda create -n myenv python=3.8`.
2. For the best performance, we recommend NVIDIA driver version 525 (`sudo apt install nvidia-driver-525`). The minimum supported driver version is 515; if you cannot install version 525, make sure your system has at least version 515.
3. Install PyTorch 1.13 with CUDA 11.7:
- `conda install pytorch==1.13.1 torchvision==0.14.1 torchaudio==0.13.1 pytorch-cuda=11.7 -c pytorch -c nvidia`
4. Install NumPy 1.23 with `conda install numpy=1.23`.
5. Install Isaac Gym:
- Download and install Isaac Gym Preview 4 from https://developer.nvidia.com/isaac-gym.
- `cd isaacgym/python && pip install -e .`
- Run an example with `cd examples && python 1080_balls_of_solitude.py`.
- Consult `isaacgym/docs/index.html` for troubleshooting.
6. Install livelybot_rl_control:
- Clone this repository.
- `cd livelybot_rl_control && pip install -e .`
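To confirm the stack is wired up correctly before training, a quick check like the following can help (a minimal sketch, not part of the repository; note that `isaacgym` must be imported before `torch`, otherwise Isaac Gym raises an import error):
```python
# Minimal install check (hypothetical snippet).
import isaacgym  # noqa: F401 -- must be imported before torch
import torch

print(f"PyTorch {torch.__version__}, CUDA available: {torch.cuda.is_available()}")
if torch.cuda.is_available():
    print(f"GPU: {torch.cuda.get_device_name(0)}")
```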
## Usage Guide
#### Examples
```bash
# Launching PPO Policy Training for 'v1' Across 4096 Environments
# This command initiates the PPO algorithm-based training for the humanoid task.
python scripts/train.py --task=pai_ppo --run_name v1 --headless --num_envs 4096
# Evaluating the Trained PPO Policy 'v1'
# This command loads the 'v1' policy for performance assessment in its environment.
# Additionally, it automatically exports a JIT model, suitable for deployment purposes.
python scripts/play.py --task=pai_ppo --run_name v1
# Implementing Simulation-to-Simulation Model Transformation
# This command facilitates a sim-to-sim transformation using the exported 'v1' policy.
python scripts/sim2sim.py --load_model /path/to/logs/Pai_ppo/exported/policies/policy_1.pt
# Run our trained policy
python scripts/sim2sim.py --load_model /path/to/logs/Pai_ppo/exported/policies/policy_example.pt
```
#### 1. Default Tasks
- **pai_ppo**
  - Purpose: baseline PPO policy with multi-frame low-level control
  - Observation Space: $(47 \times H)$ dimensions, where $H$ is the number of stacked frames, $[O_{t-H}, \dots, O_t]$
  - Privileged Information: $73$ dimensions
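The multi-frame input can be pictured as a rolling window over per-step observations. Below is a minimal sketch of that stacking using a deque-based buffer (the actual buffering lives in the environment code; every name here is illustrative):
```python
from collections import deque

import numpy as np

H = 15        # number of stacked frames (illustrative value)
OBS_DIM = 47  # per-frame observation dimension

# Rolling history, zero-initialized so the policy input is well-defined at t = 0.
history = deque([np.zeros(OBS_DIM, dtype=np.float32) for _ in range(H)], maxlen=H)

def stacked_observation(obs_t: np.ndarray) -> np.ndarray:
    """Append the newest frame and return the flat (47 * H,) input [O_{t-H}, ..., O_t]."""
    history.append(obs_t)
    return np.concatenate(history)
```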
#### 2. PPO Policy
- **Training Command**: For training the PPO policy, execute:
```bash
python humanoid/scripts/train.py --task=pai_ppo --load_run log_file_path --run_name run_name
```
- **Running a Trained Policy**: To deploy a trained PPO policy, use:
```bash
python humanoid/scripts/play.py --task=pai_ppo --load_run log_file_path --run_name run_name
```
- By default, the latest model of the last run in the experiment folder is loaded. A different run or model checkpoint can be selected by adjusting `load_run` and `checkpoint` in the training config, as in the sketch below.
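Those two fields typically sit in the runner section of the training config; the sketch below follows the legged_gym convention, and the exact names may differ slightly in this repository:
```python
class PaiCfgPPO:          # hypothetical excerpt of the training config
    class runner:
        load_run = -1     # -1 = most recent run, or a run-folder name like "<date_time>_v1"
        checkpoint = -1   # -1 = latest saved model, or an iteration number such as 3000
```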
#### 3. Sim-to-sim
- **Mujoco-based Sim2Sim Deployment**: Utilize Mujoco for executing simulation-to-simulation (sim2sim) deployments with the command below:
```
python scripts/sim2sim.py --load_model /path/to/export/model.pt
```
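At its core, such a script loads the exported JIT policy and closes the control loop against the Mujoco model. The following stripped-down sketch shows the shape of that loop; the observation layout, PD gains, and paths are assumptions, and `scripts/sim2sim.py` remains the authoritative implementation:
```python
import mujoco
import numpy as np
import torch

model = mujoco.MjModel.from_xml_path("resources/robots/pai_12dof/mjcf/pai_12dof.xml")
data = mujoco.MjData(model)
policy = torch.jit.load("/path/to/logs/Pai_ppo/exported/policies/policy_1.pt")

NUM_JOINTS = 12  # pai_12dof has 12 actuated joints

def build_observation(data):
    # Placeholder: the real script assembles the stacked 47*H policy input
    # (commands, base angular velocity, joint states, previous actions, ...).
    return np.concatenate([data.qpos[-NUM_JOINTS:], data.qvel[-NUM_JOINTS:]]).astype(np.float32)

def pd_control(target_q, data, kp=20.0, kd=0.5):
    # Simplified PD torque law: tau = kp * (q_des - q) - kd * qd.
    q, qd = data.qpos[-NUM_JOINTS:], data.qvel[-NUM_JOINTS:]
    return kp * (target_q - q) - kd * qd

for _ in range(1000):
    obs = torch.from_numpy(build_observation(data)).unsqueeze(0)
    with torch.no_grad():
        action = policy(obs).squeeze(0).numpy()
    data.ctrl[:] = pd_control(action, data)
    mujoco.mj_step(model, data)
```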
#### 4. Parameters
- **CPU and GPU Usage**: To run simulations on the CPU, set both `--sim_device=cpu` and `--rl_device=cpu`. For GPU operation, set `--sim_device=cuda:{0,1,2...}` and `--rl_device=cuda:{0,1,2...}` accordingly. Note that `CUDA_VISIBLE_DEVICES` is not respected; the `--sim_device` and `--rl_device` settings must point to the same device.
- **Headless Operation**: Include `--headless` for operations without rendering.
- **Rendering Control**: Press 'v' to toggle rendering during training.
- **Policy Location**: Trained policies are saved in `humanoid/logs/<experiment_name>/<date_time>_<run_name>/model_<iteration>.pt`.
#### 5. Command-Line Arguments
For RL training, please refer to `humanoid/utils/helpers.py#L161`.
For the sim-to-sim process, please refer to `humanoid/scripts/sim2sim.py#L169`.
## Code Structure
1. Every environment hinges on an `env` file (`legged_robot.py`) and a `configuration` file (`legged_robot_config.py`). The latter houses two classes: `LeggedRobotCfg` (encompassing all environmental parameters) and `LeggedRobotCfgPPO` (denoting all training parameters).
2. Both `env` and `config` classes use inheritance.
3. Each non-zero reward scale specified in `cfg` adds a reward function of the corresponding name to the total reward (see the sketch after this list).
4. Tasks must be registered with `task_registry.register(name, EnvClass, EnvConfig, TrainConfig)`. Registration may occur within `envs/__init__.py`, or outside of this repository.
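Point 3 follows the legged_gym convention: every non-zero scale in the reward config is paired with an environment method named `_reward_<scale_name>`, and the weighted terms are summed each step. A minimal sketch of that pattern (names are illustrative, not this repository's exact API):
```python
def prepare_reward_functions(env, scales: dict):
    """Pair each non-zero reward scale with the env method of the same name."""
    active = {name: s for name, s in scales.items() if s != 0}
    return {name: getattr(env, f"_reward_{name}") for name in active}, active

def compute_total_reward(reward_fns: dict, scales: dict) -> float:
    # e.g. scales = {"tracking_lin_vel": 1.0} pairs with env._reward_tracking_lin_vel().
    return sum(fn() * scales[name] for name, fn in reward_fns.items())
```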
## Add a new environment
The base environment `legged_robot` constructs a rough-terrain locomotion task. Its configuration specifies neither a robot asset (URDF/MJCF) nor any reward scales.
1. If you need to add a new environment, create a new folder in the `envs/` directory with a configuration file named `<your_env>_config.py`. The new configuration should inherit from an existing environment configuration (see the sketch after this list).
2. If proposing a new robot:
- Insert the corresponding assets in the `resources/` folder.
- In the `cfg` file, set the path to the asset, and define the body names, default joint positions, and PD gains. Specify the desired `train_cfg` and the environment's name (Python class).
- In the `train_cfg`, set the `experiment_name` and `run_name`.
3. If needed, create your environment in `<your_env>.py`. Inherit from existing environments, override desired functions and/or add your reward functions.
4. Register your environment in `humanoid/envs/__init__.py`.
5. Modify or tune other parameters in your `cfg` or `cfg_train` as needed. To remove a reward term, set its scale to zero. Avoid modifying the parameters of other environments!
6. If you want a new robot/environment to perform sim2sim, you may need to modify `humanoid/scripts/sim2sim.py`:
- Check the joint mapping of the robot between MJCF and URDF.
- Change the initial joint position of the robot according to your trained policy.
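Putting steps 1–5 together, a new configuration and its registration might look like the following; everything here (`MyBotCfg`, `MyBotEnv`, the asset path, the joint name) is a hypothetical placeholder rather than code from this repository:
```python
# envs/my_bot/my_bot_config.py -- hypothetical example of steps 1-3
from humanoid.envs.base.legged_robot_config import LeggedRobotCfg, LeggedRobotCfgPPO

class MyBotCfg(LeggedRobotCfg):
    class asset(LeggedRobotCfg.asset):
        file = "resources/robots/my_bot/urdf/my_bot.urdf"  # asset added in step 2

    class init_state(LeggedRobotCfg.init_state):
        default_joint_angles = {"l_hip_pitch_joint": 0.0}  # one entry per actuated joint

    class control(LeggedRobotCfg.control):
        stiffness = {"joint": 40.0}  # PD gains
        damping = {"joint": 1.0}

class MyBotCfgPPO(LeggedRobotCfgPPO):
    class runner(LeggedRobotCfgPPO.runner):
        experiment_name = "MyBot_ppo"
        run_name = "v1"

# humanoid/envs/__init__.py -- registration (step 4)
# task_registry.register("my_bot_ppo", MyBotEnv, MyBotCfg(), MyBotCfgPPO())
```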
## Acknowledgment
The implementation of livelybot_rl_control relies on resources from the [legged_gym](https://github.com/leggedrobotics/legged_gym) and [humanoid-gym](https://github.com/roboterax/humanoid-gym) projects.