强化学习无人机对抗附python代码.zip_电子对抗强化学习资源-CSDN文库

共111个文件

py：54个

pyc：16个

txt：13个

版权申诉

python

163 浏览量 2024-05-21 23:30:33 上传评论收藏 5.31MB ZIP 举报

资源推荐

资源详情

资源评论

收起资源包目录

强化学习无人机对抗附python代码.zip （111个子文件）

checkpoint 167B

.DS_Store 6KB

Rule-coupled vs Random.gif 3.29MB

Rule-coupled vs Selfplay.gif 2.22MB

.gitignore 84B

tag-dqn_21500_1.h5 27KB

tag-dqn_21500_0.h5 27KB

tag-dqn_21500_2.h5 27KB

openai-maddpg.iml 398B

run_parameters.json 368B

settings.json 76B

README.md 1KB

README.md 745B

README.md 744B

README.md 2B

not-zip-safe 1B

PKG-INFO 280B

PKG-INFO 262B

environment.py 16KB

competition_3v3.py 16KB

environment-tmp.py 15KB

competition_3v3-tmp.py 13KB

environment.py 13KB

tf_util.py 12KB

distributions.py 12KB

simple_world_comm.py 11KB

rendering.py 11KB

angle_3v3.py 10KB

maddpg.py 9KB

dqn_tag.py 7KB

core.py 7KB

simple_crypto.py 6KB

simple_tag_v1.py 6KB

simple_adversary.py 6KB

simple_tag_v1.py 6KB

simple_tag.py 6KB

simple_push.py 4KB

simple_spread.py 4KB

simple_speaker_listener.py 3KB

simple_reference.py 3KB

replay_buffer.py 3KB

dqn.py 2KB

multi_discrete.py 2KB

simple.py 2KB

make_env.py 2KB

policy.py 2KB

interactive.py 1KB

__init__.py 467B

setup.py 426B

setup.py 408B

__init__.py 407B

scenario.py 309B

__init__.py 145B

__init__.py 0B

distributions.cpython-35.pyc 21KB

rendering.cpython-35.pyc 16KB

tf_util.cpython-35.pyc 13KB

environment.cpython-35.pyc 11KB

competition_3v3.cpython-35.pyc 10KB

maddpg.cpython-35.pyc 7KB

core.cpython-35.pyc 6KB

simple_tag_yuan_v2.cpython-35.pyc 6KB

simple_tag_v1.cpython-35.pyc 5KB

replay_buffer.cpython-35.pyc 4KB

multi_discrete.cpython-35.pyc 3KB

simple.cpython-35.pyc 2KB

__init__.cpython-35.pyc 1KB

scenario.cpython-35.pyc 668B

__init__.cpython-35.pyc 459B

__init__.cpython-35.pyc 434B

reward setting 219B

LICENSE.txt 1KB

SOURCES.txt 880B

SOURCES.txt 215B

新建.txt 99B

readme.txt 34B

共 111 条

# Multiagent reinforcement learning algorithms for multiple-UAV confrontation This is the source code of "Efficient training techniques for multi-agent reinforcement learning in combatant tasks", we construct a multi-agent confrontation environment originated from a combatant scenario of multiple unman aerial vehicles. To begin with, we consider to solve this confrontation problem with two types of MARL algorithms. One is extended from the classical deep Q-network for multi-agent settings (MADQN). The other one is extended from the state-of-art multi-agent reinforcement method, multi-agent deep deterministic policy gradient (MADDPG). We compare the two methods for the initial confrontation scenario and find that MADDPG outperforms MADQN. Then with MADDPG as the baseline, we propose three efficient training techniques, i.e., scenario-transfer training, self-play training and rule-coupled training. ![image](https://github.com/sanjinzhi/multiagent-confrontation/blob/master/Rule-coupled%20vs%20Random.gif) Rule-coupled red agents vs Random-move blue agents ![image](https://github.com/sanjinzhi/multiagent-confrontation/blob/master/Rule-coupled%20vs%20Selfplay.gif) Rule-coupled red agents vs Blue agents trained by self-play

评论收藏

内容反馈

版权申诉