没有合适的资源?快使用搜索试试~ 我知道了~
资源推荐
资源详情
资源评论
![pdf](https://img-home.csdnimg.cn/images/20210720083512.png)
![pdf](https://img-home.csdnimg.cn/images/20210720083512.png)
![pdf](https://img-home.csdnimg.cn/images/20210720083512.png)
![rar](https://img-home.csdnimg.cn/images/20210720083606.png)
![pdf](https://img-home.csdnimg.cn/images/20210720083646.png)
![pdf](https://img-home.csdnimg.cn/images/20210720083512.png)
![pdf](https://img-home.csdnimg.cn/images/20210720083512.png)
![pdf](https://img-home.csdnimg.cn/images/20210720083512.png)
![pdf](https://img-home.csdnimg.cn/images/20210720083512.png)
![zip](https://img-home.csdnimg.cn/images/20210720083736.png)
![pdf](https://img-home.csdnimg.cn/images/20210720083512.png)
![rar](https://img-home.csdnimg.cn/images/20210720083606.png)
![zip](https://img-home.csdnimg.cn/images/20210720083736.png)
![pdf](https://img-home.csdnimg.cn/images/20210720083512.png)
![pdf](https://img-home.csdnimg.cn/images/20210720083512.png)
![pdf](https://img-home.csdnimg.cn/images/20210720083512.png)
![zip](https://img-home.csdnimg.cn/images/20210720083736.png)
![zip](https://img-home.csdnimg.cn/images/20210720083736.png)
![exe](https://img-home.csdnimg.cn/images/20210720083343.png)
![zip](https://img-home.csdnimg.cn/images/20210720083736.png)
![rar](https://img-home.csdnimg.cn/images/20210720083606.png)
![rar](https://img-home.csdnimg.cn/images/20210720083606.png)
![](https://csdnimg.cn/release/download_crawler_static/77485314/bg1.jpg)
Deep Learning for Two-Sided Matching
∗
Sai Srivatsa Ravindranath
a
, Zhe Feng
a
, Shira Li
b
, Jonathan Ma
b
, Scott D. Kominers
c
, and
David C. Parkes
a
a
John A. Paulson School of Engineering and Applied Sciences, Harvard University
saisr,zhe_feng,parkes@g.harvard.edu
b
Harvard College
shirali@alumni.harvard.edu, jonathan.q.ma@gmail.com
c
Harvard Business School
kominers@fas.harvard.edu
July 9, 2021
Abstract
We initiate the use of a multi-layer neural network to model two-sided matching and to
explore the design space between strategy-proofness and stability. It is well known that both
properties cannot be achieved simultaneously but the efficient frontier in this design space is
not understood. We show empirically that it is possible to achieve a good compromise between
stability and strategy-proofness—substantially better than that achievable through a convex
combination of deferred acceptance (stable and strategy-proof for only one side of the market)
and randomized serial dictatorship (strategy-proof but not stable).
1 Introduction
Two-sided matching markets, such as Uber, Airbnb, stock markets, and dating apps, play a significant
role in today’s world. As a result, there is a tremendous and rising interest to design better mechanisms
for two-sided matching. The seminal work of Gale and Shapley [
14
] introduced a simple mechanism
for stable matching in two-sided markets—Deferred-acceptance (DA)—which has since has been
applied in doctor-hospital matching [
25
], school choice [
3
,
22
,
2
], and the matching of cadets to
their branches of military service [
30
,
29
]. DA is stable, i.e., no pair of agents mutually prefer each
other to their DA partners. On the other hand, DA is not strategy-proof (SP); that is, under fully
general preferences, it is always possible that some agent can mis-report her preferences to obtain a
better matching than she would receive under the DA mechanism.
1
Another well-known mechanism,
random serial dictatorship (RSD), is SP but not stable.
2
More generally, it is well-known that it is
impossible to achieve both stability and strategy-proofness in two-sided matching [
9
,
24
]. At the
same time, there is little understanding of the nature of this tradeoff beyond point solutions such as
∗
This work is supported in part through an AWS Machine Learning Research Award.
1
As we discuss below, DA is strategy-proof for agents on one side of the market, but not for agents on both sides
of the market simultaneously.
2
Indeed, RSD is typically studied for one-sided assignment problems rather than for two-sided matching mechanisms.
For two-sided matching, not only is RSD unstable, but it can also fail to match participants in a way that is better
than receiving no match at all. We adopt RSD as a benchmark in this paper—despite its flaws—because we are not
aware of more suitable SP mechanisms for two-sided matching.
1
arXiv:2107.03427v1 [cs.GT] 7 Jul 2021
![](https://csdnimg.cn/release/download_crawler_static/77485314/bg2.jpg)
DA and RSD—and we are not aware of any work that has attempted to map out the tradeoff more
generally.
Inspired by the recent development of deep learning for optimal auction design [
10
,
8
,
28
,
23
],
we initiate the study of multi-player neural networks to model two-sided matching. We show how
to use machine learning to characterize the frontier curve of the tradeoff between stability and
strategy-proofness. The main challenges of applying neural networks to two-sided matching come
from handling the ordinal preference inputs, and in identifying suitable differentiable surrogates for
approximate strategy-proofness and stability.
For randomized matching mechanisms, the strongest SP concept is that of ordinal strategy-
proofness. This aligns incentives with truthful reporting whatever the utility function of an agent.
Ordinal SP is equivalent to a property of first-order stochastic dominance (FOSD) [
11
], which means
that agents have a weakly higher chance of getting their top-ranked choices when they report their
preferences truthfully. Our metric for SP quantifies the degree to which FOSD is violated. For this,
we adopt an adversarial approach, seeking to augment the training data with suitable defeating
mis-reports. We also provide a metric to quantify the degree to which stability is violated. We show
that a loss function built from these quantities can be trained through SGD, and illustrate the use of
the framework to identify the stability-strategyproofness frontier. We run simulations to validate the
efficiency of our approach, demonstrating for different preference distributions that our approach can
strike much better trade-offs between stability and strategyproofness than the convex combination
of DA and RSD.
Related work.
This work lies at the intersection of two-sided matching [
27
] and the role of
machine learning within economics [
6
]. The matching mechanisms learned by our neural networks are
randomized, approximately strategy-proof, and approximately stable. Budish et al. [
7
] and Mennle
and Seuken [
19
,
20
] discuss different notions of approximate strategy-proofness in the context of
matching and allocation. In this work, we focus on ordinal SP and its analog of FOSD [
11
]. This is
a strong and widely-used SP concept in the presence of ordinal preferences.
Classic results of Dubins and Freedman [
9
] and Roth [
24
] show that it is impossible to achieve both
stability and strategy-proofness in two-sided matching simultaneously—although strategy-proofness
for one side of the market is achieved by the Deferred Acceptance (DA) mechanism.
3
Thus market
designers have looked at mechanisms that relax one or both of these conditions. The Random serial
dictatorship (RSD) mechanism [
1
] is SP but typically fails to produce stable outcomes (and indeed, it
may even fail to be individually rational).
4
On the other hand, Roth et al. [
26
] study the polytope of
stable matchings and provide a wide class of stable matchings beyond the well-known DA outcome.
The stable improvement cycles mechanism of Erdil and Ergin [
12
], meanwhile, achieves as much
efficiency as possible on top of stability, but fails to be SP even for one side of the market. Finally, a
series of results have shown that DA becomes SP for both sides of the market in certain large-market
limit contexts; these results typically also require additional, structural assumptions on market
participants’ preferences (see, e.g., [16, 17, 18]).
This work belongs to the emerging literature on machine learning for economic design. Narasimhan
et al. [
21
] utilize support vector machines to search for good mechanisms among the weighted polytope
mechanisms. Recently, many papers [
10
,
13
,
15
,
8
,
28
,
23
] apply deep neural networks to optimal
auction design and facility location problems. In this work, and inspired by the neural network
3
Alcalde and Barberà [
4
] also showed the impossibility of individually rational, Pareto efficient, and SP allocation
rules. Alva and Manjunath [5] extended this result to randomized matching contexts.
4
The Top Trading Cycles (TTC) mechanism is likewise SP—but it effectively treats the market as one-sided,
producing outcomes that do not reflect the other side’s preferences.
2
![](https://csdnimg.cn/release/download_crawler_static/77485314/bg3.jpg)
architecture proposed by Dütting et al. [
10
], we use neural networks to model the design of two-sided
matching markets.
2 Preliminaries
Let
W
be a set of
n
workers and
F
a set of
m
firms, and suppose that each worker can be matched
to at most one firm and each firm to at most one worker. A matching
µ
is a set of (worker, firm)
pairs, with each worker and firm participating in at most one match. Let
B
denote the set of all
matchings. If a worker or firm remains unmatched, we say that it is matched to
⊥
. If (
w, f
)
∈ µ
, then
µ
matches
w
to
f
, and we write
µ
(
w
) =
f
and
µ
(
f
) =
w
. We write (
w, ⊥
)
∈ µ
(resp. (
⊥, f
)
∈ µ
) to
denote that w (resp. f ) is unmatched.
Each worker has a strict preference order
w
over the set
F
=
F ∪ {⊥}
. Each firm has a
strict preference order
f
over the set
W
=
W ∪ {⊥}
. Worker
w
(firm
f
) prefers remaining
unmatched to being matched with a firm (worker) that is ranked below
⊥
(the agents ranked
below
⊥
are unacceptable). If worker
w
prefers firm
f
to
f
0
then we represent this as
f
w
f
0
,
and similarly for the preferences of a firm. Let
P
denote the set of all preference profiles, with
= (
1
, . . . ,
n
,
n+1
,
n+m
)
∈ P
denoting the preference profile that comprises all workers and
firms.
A pair (
w, f
) forms a blocking pair for matching
µ
if
w
and
f
prefer each other to their partners
in
µ
(or
⊥
in the case that either or both are unmatched). A matching
µ
is stable if and only if
there are no blocking pairs. A matching
µ
is individually rational (IR) if it is not blocked by any
individual, i.e., there is no worker or firm that finds its partner unacceptable and prefers ⊥.
5
2.1 Randomized matchings
We work with randomized matching mechanisms
g
that map preference profiles
to distributions on
matchings, denoted
g
(
)
∈ 4
(
B
). This provides for differentiable mechanisms. Here,
4
(
B
) denotes
the probability simplex on the set of matchings.
We write
r ∈
[0
,
1]
(n+1)×(m+1)
to define the marginal probability
r
wf
≥
0 with which worker
w
is matched with firm
f
, for each
w ∈ W
and each firm
f ∈ F
. We require
P
f
0
∈F
r
wf
0
= 1 for all
w ∈ W
, and
P
w
0
∈W
r
w
0
f
= 1 for all
f ∈ F
. For notational simplicity, we also write
g
wf
(
) to denote
the marginal probability of matching worker w (or ⊥) and firm f (or ⊥).
Theorem 1
(Birkhoff von-Neumann)
.
Given any randomized matching
r
, there exists a distribution
on matchings, ∆(B), with marginal probabilities equal to r.
The following definition is standard [7] and generalizes stability to randomized matchings.
Definition 2 (Ex ante justified envy). A randomized matching r causes ex ante justified envy if
(1) some worker
w
prefers
f
over some (fractionally) matched firm
f
0
(including
f
0
=
⊥
) and firm
f
prefers
w
over some (fractionally) matched worker
w
0
(including
w
0
=
⊥
) (“
w
has envy towards
w
0
" and “f has envy towards f
0
"), or
(2) some worker
w
finds a (fractionally) matched
f
0
∈ F
unacceptable, i.e.
r
wf
0
>
0 and
⊥
w
f
0
,
or some firm f finds a (fractionally) matched w
0
∈ W unacceptable, i.e. r
w
0
f
> 0 and ⊥
f
w
0
.
A randomized matching
r
is ex ante stable if and only if it does not cause any ex ante justified
envy. Ex ante stability reduces to the standard concept of stability for a deterministic matching.
Part (2) of the definition of ex ante justified envy captures the idea that a randomized matching
5
Stability precludes empty matchings. For example, if a matching
µ
leaves a worker
w
and a firm
f
unmatched,
where w finds f acceptable and f finds w acceptable, then (w, f) is a blocking pair to µ.
3
剩余14页未读,继续阅读
资源评论
![avatar-default](https://csdnimg.cn/release/downloadcmsfe/public/img/lazyLogo2.1882d7f4.png)
![avatar](https://profile-avatar.csdnimg.cn/ace77722cc904668be9c7ee0feb247ba_dwf1354046363.jpg!1)
![avatar-vip](https://csdnimg.cn/release/downloadcmsfe/public/img/user-vip.1c89f3c5.png)
易小侠
- 粉丝: 6515
- 资源: 9万+
上传资源 快速赚钱
我的内容管理 展开
我的资源 快来上传第一个资源
我的收益
登录查看自己的收益我的积分 登录查看自己的积分
我的C币 登录后查看C币余额
我的收藏
我的下载
下载帮助
![voice](https://csdnimg.cn/release/downloadcmsfe/public/img/voice.245cc511.png)
![center-task](https://csdnimg.cn/release/downloadcmsfe/public/img/center-task.c2eda91a.png)
最新资源
- Hadoop 2.6 版本API压缩包
- "数据库管理系统实战与实现细节"
- Adobe Flash Player 29.0.0.140 for Internet Explorer
- "AndyTiming:轻便时序图绘制工具"
- 汇编语言编程入门与进阶指南
- 架构实施与实践交流大会纪要
- 本设计以STM32F1单片机为控制核心,设计并制作了简易多功能液体容器.设计主要包括单片机控制模块、电源模块、超声波测距模块、T
- HexView(Vector)版本的1.12.05
- aircrack-ng的1.0rc1版本,适用于Windows系统
- 为开发人员打造的低代码开发平台,将复杂的工作简单化、重复的工作自动化,提高质量、效率、可维护性
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
![feedback](https://img-home.csdnimg.cn/images/20220527035711.png)
![feedback](https://img-home.csdnimg.cn/images/20220527035711.png)
![feedback-tip](https://img-home.csdnimg.cn/images/20220527035111.png)
安全验证
文档复制为VIP权益,开通VIP直接复制
![dialog-icon](https://csdnimg.cn/release/downloadcmsfe/public/img/green-success.6a4acb44.png)