DAT对抗优化训练大规模鲁棒深度神经网络python代码.zip

共10个文件

py：9个

pdf：1个

版权申诉

matlab

161 浏览量 2022-06-15 21:10:16 上传评论收藏 535KB ZIP 举报

资源推荐

资源详情

资源评论

收起资源包目录

Distributed Adversarial Training to Robustify Deep Neural Networks at Scale论文python代码.zip （10个子文件）

Distributed Adversarial Training to Robustify Deep Neural Networks at Scale.pdf 717KB

dat-master

models.py 8KB

main.py 11KB

utils.py 2KB

lamb.py 4KB

attack.py 5KB

dataset.py 5KB

wide_resnet.py 3KB

cifar_resnet18.py 4KB

quantization.py 2KB

Distributed Adversarial Training to Robustify Deep Neural Networks at Scale

Gaoyuan Zhang

1,*

Songtao Lu

1,*

Yihua Zhang

Xiangyi Chen

Pin-Yu Chen

Quanfu Fan

Lee Martie

Lior Horesh

Mingyi Hong

Sijia Liu

1,2

IBM Research, Yorktown Heights, NY 10598

Michigan State University, East Lansing, MI 48824

University of Minnesota, Minneapolis, MN 55455

Equal Contribution

Abstract

Current deep neural networks (DNNs) are vulner-

able to adversarial attacks, where adversarial per-

turbations to the inputs can change or manipulate

classiﬁcation. To defend against such attacks, an

effective and popular approach, known as adver-

sarial training (AT), has been shown to mitigate

the negative impact of adversarial attacks by virtue

of a min-max robust training method. While ef-

fective, it remains unclear whether it can success-

fully be adapted to the distributed learning con-

text. The power of distributed optimization over

multiple machines enables us to scale up robust

training over large models and datasets. Spurred by

that, we propose distributed adversarial training

(DAT), a large-batch adversarial training frame-

work implemented over multiple machines. We

show that DAT is general, which supports training

over labeled and unlabeled data, multiple types

of attack generation methods, and gradient com-

pression operations favored for distributed opti-

mization. Theoretically, we provide, under stan-

dard conditions in the optimization theory, the

convergence rate of DAT to the ﬁrst-order station-

ary points in general non-convex settings. Empir-

ically, we demonstrate that DAT either matches

or outperforms state-of-the-art robust accuracies

and achieves a graceful training speedup (e.g., on

ResNet–50 under ImageNet). Codes are available

at https://github.com/dat-2022/dat.

1 INTRODUCTION

The rapid increase of research in DNNs and their adoption

in practice is, in part, owed to the signiﬁcant breakthroughs

made with DNNs in computer vision [Alom et al., 2018].

Yet, with the apparent power of DNNs, there remains a se-

rious weakness of robustness. That is, DNNs can easily be

manipulated (by an adversary) to output drastically differ-

ent classiﬁcations and can be done so in a controlled and

directed way. This process is known as an adversarial attack

and considered as one of the major hurdles in using DNNs

in security critical and real-world applications [Goodfellow

et al., 2015, Szegedy et al., 2013, Carlini and Wagner, 2017,

Papernot et al., 2016, Kurakin et al., 2016, Eykholt et al.,

2018, Xu et al., 2019b].

Methods to train DNNs being robust against adversarial

attacks are now a major focus in research [Xu et al., 2019a].

But most of them are far from satisfactory [Athalye et al.,

2018] with the exception of the adversarial training (AT)

approach [Madry et al., 2017]. AT is a min-max robust

training method that minimizes the worst-case training loss

at adversarially perturbed examples. AT has inspired a wide

range of state-of-the-art defenses [Zhang et al., 2019b, Sinha

et al., 2018, Boopathy et al., 2020, Carmon et al., 2019,

Shafahi et al., 2019, Zhang et al., 2019a], which ultimately

resort to min-max optimization. However, different from

standard training, AT is more computationally intensive and

is difﬁcult to scale.

Motivation and challenges.

First, although a ‘fast’ ver-

sion of AT (we call Fast AT) was developed in [Wong et al.,

2020] where an iterative inner maximization solver is re-

placed by a simpliﬁed (single-step) solution, it may suffer

several problems compared to AT: unstable robust learn-

ing performance [Li et al., 2020], over-sensitive to learning

rate schedule [Rice et al., 2020], and catastrophic forgetting

of robustness against strong attacks [Andriushchenko and

Flammarion, 2020]. As a result, AT is still the dominant

robust training protocol across applications. Spurred by that,

we propose DAT, a new approach to speed up AT by allow-

ing for scaling batch size with distributed machines. Second,

existing AT-type methods are generally built on centralized

optimization. The need of AT in a distributed setting arises

when centralized robust training becomes infeasible or in-

effective. For example, training data are distributed as they

Accepted for the 38

Conference on Uncertainty in Artiﬁcial Intelligence (UAI 2022).

arXiv:2206.06257v1 [cs.LG] 13 Jun 2022

cannot centrally be stored at a single machine due to their

size or privacy. Or computing units are distributed as they

allow large-batch optimization to improve the scalability of

training.

1x128 1x512 3x512 6x512

Number of Nodes x Batch Size

Accuracy

Figure 1: Robust accuracy (RA) and standard test accuracy (TA)

of AT vs. scaled batch size under (ImageNet, ResNet-50) using

distributed machines.

While designing a distributed solution is important, doing

so effectively is non-trivial. Figure 1 demonstrates an exam-

ple: When scaling batch size with the number of computing

nodes, the conventional AT method yields a large perfor-

mance drop in both robust and standard accuracies. Thus,

the adaptation of AT to distributed learning leaves many

unanswered questions. In this work, we aim to design a

principled and theoretically-grounded (large-batch) DAT

framework by making full use of the computing capability

of multiple data-locality (distributed) machines, and show

that DAT expands the capacity of data storage and the com-

putational scalability. Furthermore, due to the existence of

many variants of AT, it requires a careful and systematic

study on distributed AT in its formulation, methodology,

theory and performance evaluation.

Contributions. We list our main contributions below.

(i)

We provide a general algorithmic framework for DAT,

which supports multiple (large-batch) distributed variants

of AT, e.g., supervised AT and semi-supervised AT.

(ii)

In theory, we quantify how descent errors from multiple

sources (gradient estimation, quantization, adaptive learning

rate, and inner maximization oracle) affect the convergence

of DAT. We prove that the convergence speed of DAT to the

ﬁrst-order stationary points in general non-convex settings at

a rate of

O(1/

√

T )

, where

is the total number of iterations.

This result matches the standard convergence rate of classic

training algorithms, e.g., stochastic gradient descent (SGD),

for only the minimization problems.

(iii)

In practice, we make a comprehensive empirical study

on DAT, showing its effectiveness to (1) robust training

over ImageNet, (2) provably robust training by randomized

smoothing, (3) robust training with unlabeled data, (4) ro-

bust pretraining + ﬁnetuning, and (5) robust training across

different computing and communication conﬁgurations.

2 RELATED WORK

Training robust classiﬁers.

AT [Madry et al., 2017], the

ﬁrst known min-max optimization-based defense, has in-

spired a wide range of other effective defenses. Examples

include adversarial logit pairing [Kannan et al., 2018], input

gradient or curvature regularization [Ross and Doshi-Velez,

2018, Moosavi-Dezfooli et al., 2019], trade-off between

robustness and accuracy (TRADES) [Zhang et al., 2019b],

distributionally robust training [Sinha et al., 2018], dynamic

adversarial training [Wang et al., 2019b], robust input attri-

bution regularization [Boopathy et al., 2020], certiﬁably ro-

bust training [Wong and Kolter, 2017], and semi-supervised

robust training [Stanforth et al., 2019, Carmon et al., 2019].

In particular, some recent works proposed fast but approxi-

mate AT algorithms, such as ‘free’ AT [Shafahi et al., 2019],

you only propagate once (YOPO) [Zhang et al., 2019a], and

fast gradient sign method (FGSM) based AT [Wong et al.,

2020]. These algorithms achieve speedup in training by sim-

plifying the inner maximization step of AT, but are designed

for centralized model training. A few works made empirical

efforts to scale AT up by using multiple computing nodes

[Xie et al., 2019, Kang et al., 2019, Qin et al., 2019], they

were limited to speciﬁc use cases and lacked a thorough

study on when and how distributed learning helps, either in

theory or in practice.

Distributed model training.

Distributed optimization

has been found to be effective for the standard training

of machine learning models [Dean et al., 2012, Goyal et al.,

2017, You et al., 2019, Chen et al., 2020]. In contrast to cen-

tralized optimization, distributed learning enables increas-

ing the batch size proportional to the number of computing

nodes/machines. However, it is challenging to train a model

via large-batch optimization without incurring accuracy loss

compared to the standard training with same number of

epochs [Krizhevsky, 2014, Keskar et al., 2016]. To tackle

this challenge, it was shown in [You et al., 2017b, 2018,

2019] that adaptation of learning rates to the increase of the

batch size is an essential mean to boost the performance

of large-batch optimization. A layer-wise adaptive learn-

ing rate strategy was then proposed to speed up the train-

ing as well as preserve the accuracy. Although these works

have witnessed several successful applications of distributed

learning in training standard image classiﬁers, they leave

the question of how to build robust DNNs with DAT open.

In this paper, we show that the power of layer-wise adaptive

learning rate also applies to DAT. Since distributed learn-

ing introduces machine-machine communication overhead,

another line of work [Alistarh et al., 2017, Yu et al., 2019,

Bernstein et al., 2018, Wangni et al., 2018, Stich et al., 2018,

Wang et al., 2019a] focused on the design of communication-

efﬁcient distributed optimization algorithms.

The study on distributed learning is extensive, but the prob-

lem of distributed min-max optimization is less explored,

with some exceptions [Srivastava et al., 2011, Notarnicola

et al., 2018, Hanada et al., 2017, Tsaknakis et al., 2020, Liu

et al., 2019a,b]. A key difference to our work is that none of

the aforementioned literature studied the large-batch min-

max optimization with its applications to training robust

DNNs, neither theoretically nor empirically. While there

are recent proposed algorithms for training Generative Ad-

versarial Nets (GANs) [Liu et al., 2019a,b], training robust

DNNs against adversarial examples is intrinsically different

from GAN training. In particular, training robust DNNs re-

quires inner maximization with respect to each training data

rather than empirical maximization with respect to model

parameters. Such an essential difference leads to different

optimization goals, algorithms, convergence analyses and

implementations.

3 PROBLEM FORMULATION

In this section, we ﬁrst review the standard setup of adver-

sarial training (AT) [Madry et al., 2017], and then propose a

general min-max setup for distributed AT (DAT).

Adversarial training.

AT [Madry et al., 2017] is a min-

max optimization method for training robust ML/DL models

against adversarial examples [Goodfellow et al., 2015]. For-

mally, AT solves the problem

minimize

(x,y)∈D



maximize

kδk

∞

≤

(θ, x + δ; y)



(1)

where

θ ∈ R

denotes the vector of model parame-

ters,

δ ∈ R

is the vector of input perturbations within



∞

ball of the given radius



, namely,

kδk

∞

≤ 

(x, y) ∈ D

corresponds to the training example

with

label

in the dataset

, and



represents a pre-deﬁned

training loss, e.g., the cross-entropy (CE) loss. The ratio-

nale behind problem (1) is that the model

is robustly

trained against the worst-case loss induced by the adversari-

ally perturbed samples. It is worth noting that the AT prob-

lem (1) is different from conventional stochastic min-max

optimization problems, e.g., GANs training [Goodfellow

et al., 2014]. Note that in (1), the stochastic sampling corre-

sponding to the expectation over

(x, y) ∈ D

is conducted

prior to the inner maximization operation. Such a differ-

ence leads to the sample-speciﬁc adversarial perturbation

δ(x)

= maximize

kδk

∞

≤

(θ, x + δ; y).

Distributed AT (DAT).

Let us consider a popular

parameter-server model of distributed learning [Dean et al.,

2012]. Formally, there exist

workers each of which has

access to a local dataset

(i)

, and thus

D = ∪

i=1

(i)

There also exists a server/master node (e.g., one of workers

could perform as server), which collects local information

(e.g., individual gradients) from the other workers to up-

date the model parameters

. Spurred by (1), DAT solves

problems of the following generic form,

minimize

i=1

(θ; D

(i)

(x,y)∈D

(i)



λ(θ; x, y) + max

kδk

∞

≤

φ(θ, δ; x, y)



(2)

where

denotes the local cost function at the

th worker,

is a robustness regularizer against the input perturbation

and

λ ≥ 0

is a regularization parameter that strikes a balance

between the training loss and the worst-case robustness

regularization. In (2), if

M = 1

(1)

= D

λ = 0

and

φ = 

, then the DAT problem reduces to the AT problem

(1). We cover two categories of (2).

DAT with labeled

data: In (2), we consider

φ(θ, δ; x, y) = (θ, x + δ; y)

with labeled training data

(x, y) ∈ D

(i)

for

i ∈ [M ]

. Here

[M]

denotes the integer set

{1, 2, . . . , M}



DAT with

unlabeled data: In (2), different from DAT with labeled

data, we augment

(i)

with an unlabeled dataset, and deﬁne

the robust regularizer

as the pseudo-labeled worst-case

CE loss [Carmon et al., 2019] or the TRADES regularizer

[Stanforth et al., 2019, Zhang et al., 2019b].

4 METHODOLOGIES

At the ﬁrst glance, distributed learning seems being natu-

rally applied since problem (2) is decomposable over mul-

tiple workers. Yet, the actual case is much more complex.

First

, in contrast to standard AT, DAT allows for using a

times larger batch size to update the model parameters

in (2). Thus, given the same number of epochs, DAT

takes

fewer gradient updates than AT. Although there

exist some large-batch model training techniques for solv-

ing min-only problems [You et al., 2017a,b, 2018, 2019,

Goyal et al., 2017, Keskar et al., 2016], it remains unclear

if they are effective to DAT due to its min-max optimiza-

tion nature.

Second

, either AT or distributed learning has

its own challenges. In AT, for ease of attack generation, i.e.,

conducting inner maximization of (2), fast gradient sign

method (FGSM) was leveraged to improve its computation

efﬁciency [Wong et al., 2020]. In distributed learning, gradi-

ent compression [Alistarh et al., 2017, Yu et al., 2019] was

used for reducing communication overhead. Thus, it also

remains unclear whether these customizations are adaptable

to DAT. In a nutshell, the distributed min-max optimization-

based robust training algorithm has not been well studied

previously, particularly in the use of different types of attack

generators (inner maximization oracles), gradient quantiza-

tion, large-batch size, and adaptive learning rate. Although

either of the standalone techniques was studied separately,

justifying their coherent integration ‘actually works’ (both

practically and theoretically) is quite demanding.

Algorithmic framework of DAT.

DAT follows the frame-

work of distributed learning with parameter server. In what

follows, we elaborate on its key components through its

meta-form shown by Algorithm 1 (see its detailed version

in Algorithm A1). DAT contains three algorithmic blocks.

In the ﬁrst block, every distributed worker calls for a max-

imization oracle to obtain the adversarial perturbation for

each sample within a data batch, then computes the gradient

of the local cost function

in (2) with respect to (w.r.t.)

model parameters

. And every worker is allowed to quan-

tize/compress the local gradient prior to transmission to

the server. In the second block, the server aggregates the

local gradients, and transmits the aggregated gradient (or

the quantized gradient) to the other workers. In the third

block, the model parameters are eventually updated by a

minimization oracle at each worker based on the received

gradient information from the server.

Algorithm 1

Meta-version of DAT (Alg. A1 in Supplement)

1: for Worker i = 1, 2, . . . , M do  Block 1

2: Sample-wise attack generation (A1)

3: Local gradient computation (A2)

4: Worker-server communication

5: end for

6: Gradient aggregation at server (A3)  Block 2

7: Server-worker communication

8: for Worker i = 1, 2, . . . , M do  Block 3

9: Model parameter update (A4)

10: end for

Large-batch challenge in DAT and a layerwise adaptive

learning rate (LALR) solution.

In DAT, the aggregated

gradient (Step 6 in Algorithm 1) is built on the data batch

that is

times larger than the standard AT. This leads

to a large-batch challenge in min-max optimization. This

challenge can also be veriﬁed from Fig. 1. To overcome the

large-batch challenge, we adopt the technique of layerwise

adaptive learning rate (LALR), backed up by the recent

successful applications to the standard training of large-

scale image classiﬁcation and language modeling networks

with large data batch [You et al., 2019, 2017b].

To be more speciﬁc, the model training recipe using LALR

becomes

t+1,i

= θ

t,i

−

τ(kθ

t,i

) · η

t,i

· u

t,i

, ∀i ∈ [h], (3)

where

t,i

denotes the

th-layer parameters at iteration

, with

= [θ

t,1

, . . . , θ

t,h

]

is the number of lay-

ers,

is a descent direction computed based on the ﬁrst-

order gradient w.r.t. model parameters

τ(kθ

t,i

) =

min{max{kθ

t,i

, c

}, c

}

is a layerwise scaling factor of

the adaptive learning rate

t,i

, and

= 0

and

= 10

are set in our experiments (see Appendix 4.1 for some abla-

tion studies on hyperparameter selection).

In (3), the speciﬁc form of the descent direction

is de-

termined by the optimizer employed. For example, if the

adaptive momentum (Adam) method is used, then

given by the exponential moving average of past gradients

scaled by square root of exponential moving averages of

squared past gradients [Reddi et al., 2018, Chen et al., 2018].

Such a variant of (3) that uses Adam as the base algorithm

is also known as LAMB [You et al., 2019] in standard train-

ing. However, it was elusive if the advantage of LALR is

preserved in large-batch min-max optimization. As will be

evident later, the effectiveness of LALR in DAT can be

justiﬁed from both theoretical and empirical perspectives.

The rationale is that the layer-wise adaptive learning rate

smooths the optimization trajectory so that a larger learn-

ing rate can be used without causing sharp optima even in

distributed min-max optimization.

Other add-ons for DAT.

In what follows, we illustrate

two add-ons to improve computation and communication

efﬁciency of DAT.

Inner maximization: Iterative vs. one-shot solution. In

DAT, each worker calls for an inner maximization oracle to

generate adversarial perturbations (Step 2 of Algorithm 1).

We specify two solvers of perturbation generation: iterative

projected gradient descent (PGD) and one-shot (projected)

FGSM [Goodfellow et al., 2015, Wong et al., 2020]. Our

experiments will show that FGSM together with LALR

works well in DAT. We also remark that other techniques

[Shafahi et al., 2019, Zhang et al., 2019a] can also be used to

simplify inner maximization, however, we focus on FGSM

since it is computationally lightest.

Gradient quantization. In contrast to standard AT, DAT

may call for worker-server communications (Steps 4 and

7 of Algorithm 1). That is, if a single-precision ﬂoating-

point data type is used, then DAT needs to transmit

32d

bits per worker-server communication at each iteration. Re-

call that

is the dimension of

. In order to reduce the

communication cost, DAT has the option to quantize the

transmitted gradients using a ﬁxed number of bits fewer than

. We specify the gradient quantization operation as the

randomized quantizer [Alistarh et al., 2017, Yu et al., 2019].

In Sec. 6 we will show that DAT, combined with gradient

quantization, still leads to a competitive performance. For

example, the robust accuracy of ResNet-50 trained by a

bit DAT (performing quantization at Step 4 of Algorithm 1)

for ImageNet is just

0.55%

lower than the robust accuracy

achieved by the

-bit DAT. It is also worth mentioning that

the All-reduce communication protocol can be regarded as

a special case of the parameter-server setting in Algorithm 1

when every worker performs as a server. In this case, the

communication network becomes fully connected and only

the worker-server communication (Step 4 of Algorithm 1)

is needed. Please refer to Appendix 1 for more details on

gradient quantization.

5 CONVERGENCE ANALYSIS OF DAT

Although standard AT has been proved with convergence

guarantees [Wang et al., 2019b, Gao et al., 2019], none

of existing work addressed the convergence of DAT and

took into account LALR and gradient quantization, even

in the standard AT setup. Different from AT, DAT needs to

quantify the descent errors from multiple sources (such as

gradient estimation, quantization, adaptive learning rate, and

inner maximization oracle). Before showing the challenges

of proving the convergence rate guarantees, we ﬁrst give the

following assumptions.

Assumptions.

Deﬁning

Ψ(θ)

i=1

(θ; D

(i)

)

(2), we measure the convergence of DAT by the ﬁrst-order

stationarity of

. Prior to convergence analysis, we impose

the following assumptions: (

)

Ψ(θ)

is with layer-wise

Lipschitz continuous gradients; (

)

φ)

in (2) is strongly

concave with respect to

and with Lipschitz continuous

gradients within the perturbation constraint; (

) Stochas-

tic gradient is unbiased and has bounded variance for each

worker denoted by

. Note that the validity of (

) could

be justiﬁed from [Sinha et al., 2018, Wang et al., 2019b] by

imposing a strongly convex regularization into the neigh-

borhood of

is needed for tractability of analysis. We

refer readers to Appendix 2.1 for more justiﬁcations on our

assumptions (A1)-(A3).

Technical challenges.

In theory, the incorporation of

LALR makes the analysis of min-max optimization highly

non-trivial. The fundamental challenge lies in the nonlinear

coupling between the biased adaptive gradient estimate re-

sulted from LALR and the additional error generated from

alternating update in DAT. From (3), we can see that the

updated

is based on the normalized gradient, while if we

perform convergence by applying the gradient Lipschitz con-

tinuity, the descent of the objective is measured by

∇Ψ(θ

)

This mismatch in the magnitude results in the bias term. The

situation here is even worse, since the maximization prob-

lem cannot be solved exactly, the size of the bias depends on

how close between the output of the oracle and the optimal

solution w.r.t. δ given θ.

We have proposed a new descent lemma (Lemma 2 in Ap-

pendix) to measure the decrease of the objective value in

the context of alternative optimization, and showed that the

bias error resulted from the layer-wise normalization can be

compensated by large-batch training (Theorem 1). Prior to

our work, we are not aware of any established convergence

analysis for large-batch min-max optimization.

Convergence rate.

In Theorem 1, we present the sub-

linear rate of DAT.

Theorem 1.

Suppose that assumptions

hold, the

inner maximizer of DAT provides a

-approximate solution

(i.e., the



-norm of inner gradient is upper bounded by

), and the learning rate is set by

∼ O(1/

√

T )

, then

{θ

}

t=1

generated by DAT yields the convergence rate

t=1

Ek∇

Ψ(θ

√

+ min

(

√

)

+ ε

, (4)

where

denotes the number of quantization bits, and

B = min{|B

(i)

|, ∀t, i}

stands for the smallest batch size

per worker.

Proof: Please see Appendix 3. 

The error rate given by (4) involves four terms. The term

O(1/

√

MB)

characterizes the beneﬁt of using the large

per-worker batch size

and

computing nodes in DAT.

It is introduced since the variance of adaptive gradients

(i.e.,

) is reduced by a factor

1/MB

, where

1/M

corre-

sponds to the linear speedup by

machines. In (4), the

term

min{

√

}

arises due to the variance of compressed

gradients, and the other two terms imply the dependence

on the number of iterations

as well as the

-accuracy of

the inner maximization oracle. We highlight that our conver-

gence analysis (Theorem 1) is not barely a combination of

LALR-enabled standard training analysis [You et al., 2019,

2017b] and adversarial training convergence analysis [Wang

et al., 2019b, Gao et al., 2019]. Different from the previous

work, we address the fundamental challenges in (a) quan-

tifying the descent property of the objective value at the

presence of multi-source errors during alternating min-max

optimization, and (b) deriving the theoretical relationship

between large data batch (across distributed machines) and

the eventual convergence error of DAT.

6 EXPERIMENTS

We empirically evaluate DAT and show its success in train-

ing robust DNNs across multiple applications, which in-

clude

adversarially robust ImageNet training,



prov-

ably robust training by randomized smoothing,

semi-

supervised robust training with unlabeled data,

robust

transfer learning,

DAT using different communication

protocols.

评论收藏

内容反馈

版权申诉

天天Matlab科研工作室

粉丝: 3w+
资源: 7259

DAT对抗优化训练大规模鲁棒深度神经网络python代码.zip

鲁棒自适应动态规划仿真代码.zip

基于自监督对比学习的深度神经网络对抗鲁棒性提升.pdf

网络游戏-一种工件台微动MIMO鲁棒模糊神经网络滑模控制方法.zip

无人机的鲁棒姿态控制器附matlab代码.zip

ruolubang_鲁棒约束_鲁棒_鲁棒优化_鲁棒优化模型_鲁棒优化cplex.zip

多旋翼无人机姿态控制系统的鲁棒设计附matlab代码.zip.zip

ruolubang_鲁棒约束_鲁棒_鲁棒优化_鲁棒优化模型_鲁棒优化cplex_源码.zip

CCGRO-toy-case-master_列与约束_鲁棒调度_鲁棒_鲁棒调度_列约束算法.zip

[ECCV2022]边界框不准确的鲁棒目标检测_Python_Cuda_.zip

电气代码：044微电网两阶段鲁棒优化经济调度方法.zip

网络游戏-基于滑模补偿的微陀螺仪鲁棒神经网络控制系统及方法.zip

matlab程序：微电网两阶段鲁棒优化经济调度方法.zip

基于Python完美复现微电网两阶段鲁棒优化经济调度方法源码+项目说明+超详细代码注释.zip

基于Pytorch实现RISTDnet红外小目标检测网络算法源码(强鲁棒性).zip

Python-特征去噪提高对抗鲁棒性

考虑风力发电不确定性的机组分布鲁棒优化matlab参考代码.zip

论文研究-不确定与损毁情景下可靠性设施选址鲁棒优化模型与算法研究.pdf

鲁棒深度学习.zip

Matlab 基于支持向量机(SVM)的数据回归预测 SVM回归

Matlab 基于BP神经网络的数据分类预测 BP分类

LSTM时间序列神经网络预测MATLAB代码

ADRC控制器仿真 simulink 2017a版本

2022建模国赛代码(三天坚持不易) 包括K-meas算法、bp预测、回归预测,(python和matlab做的).zip

matlab2020b ubuntu.txt

基于蚁群算法的三维路径规划(matlab实现)

基于智能优化算法的双层优化求解(matlab代码)

调频连续波（FMCW）雷达二维FFT代码matlab

基于蚁群算法的二维路径规划(matlab实现)

美赛各题常用算法程序与参考代码.rar

最新资源