# Dynamic Coverage Control of UAV Swarm under Connectivity Preservation
Based on MARL (Multi-Agent Reinforcement Learning),
this project provides a dynamic coverage control algorithm for UAV swarm.
Our task is to plan the flight route of the UAV swarm
so that all discrete PoIs (Points of Interest) can be covered for a certain period of time.
Considering the particularity of UAV swarm control,
this project focuses on analyzing how to maintain the communication connectivity of the swarm during execution.
## 1.Dynamic Point Coverage Environment
### Problem Formulation
There are $N$ UVAs, $x_i^t$ and $M$ PoIs, $p_j$.
Each PoI has certain detection requirements,
and each UAV can provide detection capabilities within a certain range of space.
We use the concept of energy and power to describe,
the UAV swarm can provide power for point $p_j$,
the power value changes with the movement of the UAV swarm.
Over time, the obtained by PoI $p_j$ will accumulate as energy.
When the energy obtained by $p_j$ is greater than required,
the task is considered to be completed.
Problem can be formulated as follow,
$$
P_{i,j}^t(x^t, p_j) = M_p \exp [-(x_i^t-p_j)^2 / r^2] , \|x_i^t-p_j\| \< r
$$
, where $M_p$ is denoted by the peak power of the sensor, $r$ is the detection radius of the sensor.
$$
E^t_{N,j} = \int_0^t \sum_{i=1}^N Pow^\tau_{i,j} d\tau
$$
,where $E^r_j$ is denoted by the required energy of $p_j$.
When $\forall j, [E_{N,j}^t\geq E^r_j]$,
the task is considered to be completed.
### Reinforcement Environment
We build a dynamic coverage environment based on [Multiagent-Particle-Envs](https://github.com/openai/multiagent-particle-envs).
Class `CoverageWorld` inherits from `multiagent.core.world`, in whose `step()`, the power and energy are calculated, and the PoI state is updated.
`multiagent/scenarios/coverage.py` describes the dynamic coverage scenario.
`multiagent/render.py` has been modified to display in real time the current power obtained by PoIs and the communication between the UAVs.
Some other changes, such as adding connectivity maintaining constraints, revising action according to constraints, will be mentioned later.
## 2. Dynamic Control based MADDPG
The agent's observations include its own position and velocity,
as well as the relative positions of other agents and PoIs.
The actions of the agent include forward, backward, left, and right, and keeping still.
As a purely cooperative scenario, the rewards of all agents are the same and are set as follows,
$$
R^t = R_{task}(\|M_d^t-M_d^{t-1}\|) + R_{done}^t + R_{dist}\sum_{j\in M_d^t} \min_i \\|x_i^t-p_j\\|_2
$$
,where $M_d^t$ is the set of done PoIs at time t, the first term represents a one-time reward for completing the coverage of a single poI;
$R_{done}^t$ is one-time reward for task completion, which is equal to 0 only when the task is completed;
The third item is the sum of distance of undone PoIs and its nearest agent.
The third term of the reward is crucial because it solves the problem of reward sparsity.
The trained resulted is displayed as follow, (2 and 3 is unde connectivity preservation)
<img src="https://github.com/zhaozijie2022/images/blob/master/dynamic-coverage-control/cov1.gif" width="250px"> <img src="https://github.com/zhaozijie2022/images/blob/master/dynamic-coverage-control/cov2.gif" width="250px"> <img src="https://github.com/zhaozijie2022/images/blob/master/dynamic-coverage-control/cov3.gif" width="250px">
## To be Continued
机智的程序员zero
- 粉丝: 2451
- 资源: 4700
最新资源
- Call The Roll.zip
- java课程设计-基于SSM框架的酒店客房管理系统源码+数据库脚本
- 9013,8550等SPICE模型
- 基于大数据、人工智能的招聘大数据分析展示系统-前端可视化.zip
- 具有执行器饱和及故障的航天器姿态主动容错控制;容错控制;航天器姿态;执行器故障及饱和
- (2025)Microsoft.WorldLockingTools插件Unity版本v2.7.3完整版(包括四个包)
- comsol复合材料各向异性相场模型,采用固体力学、亥姆霍兹pde接口,各向异性断裂能,提供mph文件
- matlab仿真,级联h桥储能变流器,下垂控制离网运行仿真,vsg控制,同步发电机控制,离网模式并网模式下均可以运行,可以实现预同步,实现平滑并网
- Flink流批一体数据处理快速集成开发框架 不仅能够快速构建基于Java的Flink流批一体应用程序,实现异构数据库实时同步和ETL,还可以让Flink SQL变得极其简单,玩转Flink
- 复旦大数据学院课程作业人工智能,分布式系统,自然语言处理,高级大数据解析,计算机网络,数据可视化.zip
- 单极倍频载波移相调制matlab仿真,9级联,19电平
- (2025)Unity图像识别包 OpenCV for Unity 2.6.0
- 大数据分析屏01234.zip
- Openmetadata之1.6.1最新版本安装-yellowcong
- 基于非奇异终端滑模控制的二自由度机械臂系统 1理论说明文档:进行机械臂轨迹跟踪控制器理论设计说明 2NTSMC控制:是通过引入非线性辅助项,将滑模控制器的终端项从奇异点(可能导致系统决策和切问
- 大数据存储与管理 (华中科技大学计算机科学与技术学院).zip
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈