Behl, Baydin and Torr
to problems beyond toy scales, due to the difficulty of assessing whether MAML is unsuitable for a complex task or whether the hyperparameters are insufficiently tuned.
In this paper, we provide a conceptually simple solution to this problem, by introducing an extension of the MAML algorithm that incorporates adaptive tuning of both the learning rate α and the meta-learning rate β. Our aim is to make it possible to use MAML with no or significantly less hyperparameter tuning, and thus to reduce the need for grid search. We also
aim to make the algorithm converge in fewer iterations. The solution we propose is based
on the hypergradient descent (HD) algorithm (Baydin et al., 2018), which automatically
updates a learning rate by performing gradient descent on the learning rate alongside original
optimization steps. The proposed algorithm does not need any extra gradient computations,
and just involves storing the gradients from the previous optimization step.
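Concretely, the HD rule nudges the learning rate in proportion to the dot product of the current and previous gradients. The sketch below illustrates this on a generic gradient function; the function name, the toy quadratic demo, and the hyperparameter values are our own illustrative choices, not prescriptions from the original algorithm:

```python
import numpy as np

def hypergradient_descent(grad_f, theta, alpha=0.01, beta=1e-4, steps=100):
    """Minimal sketch of hypergradient descent (Baydin et al., 2018).

    The learning rate alpha is itself updated by gradient descent, using
    only the stored gradient from the previous step (no extra gradient
    computations are required).
    """
    prev_grad = np.zeros_like(theta)
    for _ in range(steps):
        g = grad_f(theta)
        # Hypergradient: d(loss)/d(alpha) = -g . prev_grad, so gradient
        # descent on alpha adds beta * (g . prev_grad).
        alpha = alpha + beta * np.dot(g, prev_grad)
        theta = theta - alpha * g
        prev_grad = g
    return theta, alpha

# Toy demo: minimize f(x) = ||x||^2 (gradient 2x) from a fixed start.
theta_final, alpha_final = hypergradient_descent(
    lambda x: 2.0 * x, np.array([5.0, 5.0]))
```

On this quadratic, α grows while successive gradients stay aligned and then settles, so convergence is faster than with the initial fixed learning rate.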
2. Related Work
Our work is primarily related to the subfields of hyperparameter optimization and meta-learning. Hyperparameter optimization typically uses parallel runs to populate a selected grid of hyperparameter values (e.g., a range of learning rates), or more advanced techniques such as Bayesian optimization (Snoek et al., 2012) and model-based approaches (Bergstra et al., 2013; Hutter et al., 2013). An interesting line of research, which also inspired our approach in this paper, is to use gradient-based optimization for the tuning of hyperparameters (Bengio, 2000). Recent work in this area includes reversible learning (Maclaurin et al., 2015), which allows gradient-based optimization of hyperparameters through a training run consisting of multiple iterations, and hypergradient descent (Baydin et al., 2018), which achieves a similar optimization in an online, per-gradient-update fashion.
Meta-learning is often referred to as “learning to learn” (Thrun and Pratt, 2012), meaning that a learning procedure (most often gradient-based) is able to improve aspects of the learning process itself, such as the optimizer, hyperparameters like the learning rate, and initializations. In this sense, the “meta” concept of meta-learning has aspects in common with hyperparameter optimization. The MAML model (Finn et al., 2017), on which we base our method, relies on meta-optimization through gradient descent in a model-agnostic way. Another recent method, Meta-SGD (Li et al., 2017), performs online optimization of a per-parameter learning rate vector α, to which the authors refer as learning both the learning rate and the update direction (the per-parameter nature being able to modify the direction), and of the model parameters θ, using a single hyper-learning rate β. Our work differs from Meta-SGD in that we perform simultaneous online optimization of both MAML learning rates α and β, which are both scalars.
3. Model-Agnostic Meta-Learning (MAML)
The MAML algorithm, given model parameters θ, aims to adapt to a new task T_t with SGD:

$$\theta'_t = \theta - \alpha \nabla_{\theta}\, \mathcal{L}_{\mathcal{T}_{\mathrm{train}(t)}}(f_{\theta}) \,, \qquad (1)$$
where t is the task number and α is the learning rate. T_train(t) and T_test(t) denote the training and test sets within task t. The tasks are sampled from a task distribution p(T). The meta-objective is:
$$\min_{\theta}\; \mathcal{L}_{\mathcal{T}_{\mathrm{test}(t)}}\big(f_{\theta'_t}\big) = \mathcal{L}_{\mathcal{T}_{\mathrm{test}(t)}}\Big(f_{\theta - \alpha \nabla_{\theta} \mathcal{L}_{\mathcal{T}_{\mathrm{train}(t)}}(f_{\theta})}\Big) \qquad (2)$$
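To make Eqs. (1)-(2) concrete, the following sketch runs the inner adaptation step and the resulting meta-update on toy quadratic tasks, for which the meta-gradient of Eq. (2) is available in closed form. The quadratic losses, task construction, and all names here are our own illustrative assumptions, not the tasks or losses used in the paper:

```python
import numpy as np

rng = np.random.default_rng(0)

def adapted_params(theta, alpha, c_train):
    # Inner adaptation, Eq. (1): for the toy loss L_t(theta) = ||theta - c_t||^2,
    # the gradient is 2 (theta - c_t), so theta'_t = theta - 2 alpha (theta - c_t).
    return theta - alpha * 2.0 * (theta - c_train)

def maml_meta_step(theta, alpha, beta, tasks):
    # Outer update on the meta-objective of Eq. (2), averaged over tasks.
    # For this quadratic, d(theta'_t)/d(theta) = (1 - 2 alpha) I, so the
    # per-task meta-gradient is 2 (theta'_t - c_test) (1 - 2 alpha).
    meta_grad = np.zeros_like(theta)
    for c_train, c_test in tasks:
        adapted = adapted_params(theta, alpha, c_train)
        meta_grad += 2.0 * (adapted - c_test) * (1.0 - 2.0 * alpha)
    return theta - beta * meta_grad / len(tasks)

# Toy demo: eight tasks, each a (train optimum, test optimum) pair.
tasks = [(rng.normal(size=2), rng.normal(size=2)) for _ in range(8)]
theta = np.array([5.0, -3.0])
alpha, beta = 0.1, 0.05
before = np.mean([np.sum((adapted_params(theta, alpha, ctr) - cte) ** 2)
                  for ctr, cte in tasks])
for _ in range(200):
    theta = maml_meta_step(theta, alpha, beta, tasks)
after = np.mean([np.sum((adapted_params(theta, alpha, ctr) - cte) ** 2)
                 for ctr, cte in tasks])
```

After meta-training, the initialization θ yields a lower average post-adaptation test loss across tasks than the starting point did, which is exactly the objective of Eq. (2).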