<div align="center">
<h1>🤗 HE-Drive</h1>
<h2> Human-Like End-to-End Driving with Vision Language Models</h2> <br>
<strong>We will open-source the complete code after the paper is accepted!</strong> <br><br>
<a href='https://arxiv.org/abs/2410.05051'><img src='https://img.shields.io/badge/arXiv-HE_Drive-green' alt='arxiv'></a>
<a href='https://jmwang0117.github.io/HE-Drive/'><img src='https://img.shields.io/badge/Project_Page-HE_Drive-green' alt='Project Page'></a>
</div>
## 📢 News
- [2024/10/08]: 🔥 We released the HE-Drive paper on arXiv!
<br>
## 📜 Introduction
**HE-Drive** is a groundbreaking end-to-end autonomous driving system that prioritizes human-like driving characteristics, ensuring both temporal consistency and comfort in the generated trajectories. It combines sparse perception, which extracts key 3D spatial representations, with a DDPM-based motion planner that generates multi-modal trajectories and a VLM-guided trajectory scorer that selects the most comfortable option. Compared with existing solutions, HE-Drive significantly reduces collision rates, improves computational speed, and delivers the most comfortable driving experience on real-world data.
<p align="center">
<img src="misc/overview.png" width = 100% height = 100%/>
</p>
<br>
<p align="center">
<img src="misc/scoring.png" width = 100% height = 100%/>
</p>
<br>
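The control flow described above can be summarized in a short sketch. This is purely illustrative pseudocode under assumed names (`denoise_step`, `vlm_comfort_score` are hypothetical stand-ins), not the released implementation:

```python
import numpy as np

# Illustrative sketch of the HE-Drive pipeline: DDPM denoising produces
# multi-modal candidate trajectories; a scorer picks the most comfortable one.
HORIZON, STEPS, MODES = 6, 50, 3  # waypoints, denoising steps, trajectory modes

def denoise_step(traj, t, cond):
    """Stub for one DDPM reverse step conditioned on perception features."""
    return traj - 0.02 * traj + 0.001 * np.random.randn(*traj.shape)

def vlm_comfort_score(traj):
    """Stub for the VLM-guided scorer; here, low jerk stands in for comfort."""
    jerk = np.diff(traj, n=3, axis=0)
    return -np.abs(jerk).sum()

def plan(perception_features):
    candidates = []
    for _ in range(MODES):
        traj = np.random.randn(HORIZON, 2)      # start from Gaussian noise
        for t in reversed(range(STEPS)):        # iterative DDPM denoising
            traj = denoise_step(traj, t, perception_features)
        candidates.append(traj)
    return max(candidates, key=vlm_comfort_score)  # most comfortable trajectory

best = plan(perception_features=None)
print(best.shape)  # (6, 2): x/y waypoints of the selected trajectory
```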
## 📚 Citing
```bibtex
@article{wang2024he,
  title={HE-Drive: Human-Like End-to-End Driving with Vision Language Models},
  author={Wang, Junming and Zhang, Xingyu and Xing, Zebin and Gu, Songen and Guo, Xiaoyang and Hu, Yang and Song, Ziying and Zhang, Qian and Long, Xiaoxiao and Yin, Wei},
  journal={arXiv preprint arXiv:2410.05051},
  year={2024}
}
```
Please star ⭐️ this project if it helps you. We put great effort into developing and maintaining it 😊.
## 🛠️ Installation
> [!NOTE]
> Installation steps follow [SparseDrive](https://github.com/swc-17/SparseDrive)
### Set up a new virtual environment
```bash
conda create -n hedrive python=3.8 -y
conda activate hedrive
```
### Install dependency packages
```bash
hedrive_path="path/to/hedrive"
cd ${hedrive_path}
pip3 install --upgrade pip
pip3 install torch==1.13.0+cu116 torchvision==0.14.0+cu116 torchaudio==0.13.0 --extra-index-url https://download.pytorch.org/whl/cu116
pip3 install -r requirement.txt
```
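After the install finishes, a quick check (generic PyTorch usage, not specific to this repo) confirms the CUDA 11.6 build is active:

```python
import torch

print(torch.__version__)          # expected: 1.13.0+cu116
print(torch.cuda.is_available())  # should be True with a working CUDA setup
if torch.cuda.is_available():
    print(torch.cuda.get_device_name(0))  # your GPU model
```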
### Compile the deformable_aggregation CUDA op
```bash
cd projects/mmdet3d_plugin/ops
python3 setup.py develop
cd ../../../
```
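To verify the op compiled, try importing the plugin ops package from `${hedrive_path}`; the exact symbol names live in the ops `setup.py`, so treat this as a sketch:

```python
# Run from the repository root so the `projects` package is on sys.path.
# The package path below is an assumption; check the ops setup.py if it fails.
try:
    from projects.mmdet3d_plugin import ops  # noqa: F401
    print("deformable_aggregation op importable")
except ImportError as err:
    print(f"CUDA op not built correctly: {err}")
```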
### Prepare the data
Download the [nuScenes dataset](https://www.nuscenes.org/nuscenes#download) and the CAN bus expansion, put the CAN bus expansion in `/path/to/nuscenes`, and create symbolic links:
```bash
cd ${hedrive_path}
mkdir data
ln -s path/to/nuscenes ./data/nuscenes
```
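A small check of the symlinked layout can save a failed conversion run later; the subdirectory names below are the standard nuScenes ones plus the CAN bus expansion:

```python
from pathlib import Path

root = Path("data/nuscenes")
# Standard nuScenes folders, plus can_bus from the CAN bus expansion.
for sub in ("samples", "sweeps", "maps", "v1.0-trainval", "can_bus"):
    print(f"{sub:15s} {'ok' if (root / sub).exists() else 'MISSING'}")
```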
Pack the meta-information and labels of the dataset and generate the required pkl files under `data/infos`. Note that the data converter also generates `map_annos`, with a default `roi_size` of (30, 60); if you want a different range, modify `roi_size` in `tools/data_converter/nuscenes_converter.py`.
```bash
sh scripts/create_data.sh
```
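To spot-check the result, load one of the generated info files; the filename below follows the SparseDrive-style converter convention and may differ in your checkout:

```python
import pickle

# Assumed output name from the SparseDrive-style converter; adjust if needed.
with open("data/infos/nuscenes_infos_train.pkl", "rb") as f:
    infos = pickle.load(f)

# Depending on converter version, this is a list of per-sample dicts or a
# dict with an "infos" key; handle both when inspecting.
samples = infos if isinstance(infos, list) else infos["infos"]
print(len(samples), "samples")
print(sorted(samples[0].keys()))
```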
### Prepare the 3D representation
> [!NOTE]
> Generate the 3D representation using the SparseDrive second-stage checkpoint!
### Commence training
```bash
# train
sh scripts/train.sh
```
### Install Ollama and Llama 3.2-Vision 11B
> [!NOTE]
> Download Ollama 0.4, then run:
```bash
ollama run llama3.2-vision:11b
```
> [!IMPORTANT]
> Llama 3.2 Vision 11B requires at least 8 GB of VRAM.
>
> Please prepare at least 10 sets of VQA templates to complete the dialogue, focusing the Llama knowledge domain on driving-style assessment, as in the sketch below.
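As a starting point for those templates, the Ollama Python client (`pip install ollama`) can send a camera frame plus one driving-style question to the model. The prompt text and image path here are placeholders, not the paper's actual templates:

```python
import ollama  # requires a running Ollama >= 0.4 server

# Placeholder VQA template: one of the >= 10 driving-style prompts you prepare.
prompt = (
    "You are a driving-style assessor. Given this front-camera frame, rate "
    "the comfort of the planned trajectory from 1 to 10 and explain briefly."
)

response = ollama.chat(
    model="llama3.2-vision:11b",
    messages=[{
        "role": "user",
        "content": prompt,
        "images": ["path/to/front_cam.jpg"],  # placeholder image path
    }],
)
print(response["message"]["content"])
```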
### Commence testing
```bash
# test
sh scripts/test.sh
```
## 💽 Dataset
- [x] nuScenes
- [x] Real-World Data
- [x] OpenScene/NAVSIM
## 🙏 Acknowledgement
Many thanks to these excellent open source projects:
- [SparseDrive](https://github.com/swc-17/SparseDrive)
- [DP](https://github.com/real-stanford/diffusion_policy)
- [DP3](https://github.com/YanjieZe/3D-Diffusion-Policy)
- [OpenScene](https://github.com/OpenDriveLab/OpenScene)
- [NAVSIM](https://github.com/autonomousvision/navsim)