Reinforcement Learning 14
class_link: https://icml.cc/Conferences/2018/Schedule?showParentSession=3462
Title: RLlib: Abstractions for Distributed Reinforcement Learning
Author: Eric Liang · Richard Liaw · Robert Nishihara · Philipp Moritz · Roy Fox · Ken Goldberg ·
Joseph Gonzalez · Michael Jordan · Ion Stoica
Abstract: Reinforcement learning (RL) algorithms involve the deep nesting of highly irregular
computation patterns, each of which typically exhibits opportunities for distributed computation.
We argue for distributing RL components in a composable way by adapting algorithms for top-
down hierarchical control, thereby encapsulating parallelism and resource requirements within
short-running compute tasks. We demonstrate the benefits of this principle through RLlib: a
library that provides scalable software primitives for RL. These primitives enable a broad range of
algorithms to be implemented with high performance, scalability, and substantial code reuse.
RLlib is available as part of the open source Ray project at http://rllib.io/.
Link: http://proceedings.mlr.press/v80/liang18b/liang18b.pdf
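The abstract's core idea, encapsulating parallelism inside short-running compute tasks coordinated top-down by a driver, can be illustrated without RLlib itself. The sketch below uses only the Python standard library (not RLlib's or Ray's actual API); `rollout` and `train_step` are hypothetical names, and the "policy" is a stand-in scalar, purely to show the control pattern: a driver launches short rollout tasks in parallel and aggregates their results.

```python
import random
from concurrent.futures import ThreadPoolExecutor

def rollout(policy_params, seed, horizon=5):
    # Hypothetical short-running task: simulate one trajectory and
    # return its total "reward" (a toy stand-in for environment interaction).
    rng = random.Random(seed)
    return sum(policy_params * rng.random() for _ in range(horizon))

def train_step(policy_params, num_workers=4):
    # Top-down hierarchical control: the driver owns the loop, farms out
    # short tasks to workers, and aggregates the results synchronously.
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        returns = list(pool.map(rollout,
                                [policy_params] * num_workers,
                                range(num_workers)))
    return sum(returns) / len(returns)

avg_return = train_step(1.0)
```

In RLlib proper, the same shape appears with Ray tasks and actors instead of threads, which is what lets each component declare its own parallelism and resource needs.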
Title: IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner
Architectures
Author: Lasse Espeholt · Hubert Soyer · Remi Munos · Karen Simonyan · Vlad Mnih · Tom Ward ·
Yotam Doron · Vlad Firoiu · Tim Harley · Iain Dunning · Shane Legg · koray kavukcuoglu
Abstract: In this work we aim to solve a large collection of tasks using a single reinforcement
learning agent with a single set of parameters. A key challenge is to handle the increased amount
of data and extended training time. We have developed a new distributed agent IMPALA
(Importance Weighted Actor-Learner Architecture) that not only uses resources more efficiently in
single-machine training but also scales to thousands of machines without sacrificing data
efficiency or resource utilisation. We achieve stable learning at high throughput by combining
decoupled acting and learning with a novel off-policy correction method called V-trace. We
demonstrate the effectiveness of IMPALA for multi-task reinforcement learning on DMLab-30 (a
set of 30 tasks from the DeepMind Lab environment (Beattie et al., 2016)) and Atari57 (all
available Atari games in Arcade Learning Environment (Bellemare et al., 2013a)). Our results show
that IMPALA is able to achieve better performance than previous agents with less data, and
crucially exhibits positive transfer between tasks as a result of its multi-task approach.
Link: http://proceedings.mlr.press/v80/espeholt18a/espeholt18a.pdf
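The V-trace correction mentioned in the abstract has a closed-form target: v_s = V(x_s) + Σ_t γ^(t−s) (Π c_i) δ_t V, where δ_t V = ρ_t (r_t + γ V(x_{t+1}) − V(x_t)) and ρ_t, c_i are importance ratios truncated at levels ρ̄, c̄. A minimal NumPy sketch of that target computation follows; the function name and argument layout are illustrative, not the paper's reference implementation.

```python
import numpy as np

def vtrace_targets(behavior_logp, target_logp, rewards, values,
                   bootstrap_value, gamma=0.99, rho_bar=1.0, c_bar=1.0):
    # Truncated importance ratios pi/mu (computed from log-probabilities).
    rhos = np.exp(target_logp - behavior_logp)
    clipped_rhos = np.minimum(rho_bar, rhos)
    clipped_cs = np.minimum(c_bar, rhos)

    T = len(rewards)
    values_tp1 = np.append(values[1:], bootstrap_value)
    # Importance-weighted TD errors: delta_t = rho_t (r_t + gamma V_{t+1} - V_t).
    deltas = clipped_rhos * (rewards + gamma * values_tp1 - values)

    # Backward recursion: (v_s - V_s) = delta_s + gamma c_s (v_{s+1} - V_{s+1}).
    acc = 0.0
    vs_minus_v = np.zeros(T)
    for t in reversed(range(T)):
        acc = deltas[t] + gamma * clipped_cs[t] * acc
        vs_minus_v[t] = acc
    return values + vs_minus_v
```

When the behavior and target policies coincide (ratios of 1) and the truncation levels are at least 1, the targets reduce to the on-policy n-step return, which is the sanity check the paper itself notes.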
Title: Mix & Match - Agent Curricula for Reinforcement Learning
Author: Wojciech Czarnecki · Siddhant Jayakumar · Max Jaderberg · Leonard Hasenclever · Yee
Teh · Nicolas Heess · Simon Osindero · Razvan Pascanu
Abstract: We introduce Mix & Match (M&M) – a training framework designed to facilitate
rapid and effective learning in RL agents that would be too slow or too challenging to train
otherwise. The key innovation is a procedure that allows us to automatically form a curriculum