A Unifying Framework for Spectrum-Preserving
Graph Sparsification and Coarsening
Gecia Bravo-Hermsdorff*
Princeton Neuroscience Institute
Princeton University
Princeton, NJ, 08544, USA
geciah@princeton.edu
Lee M. Gunderson*
Department of Astrophysical Sciences
Princeton University
Princeton, NJ, 08544, USA
leeg@princeton.edu
Abstract
How might one “reduce” a graph? That is, generate a smaller graph that preserves
the global structure at the expense of discarding local details? There has been
extensive work on both graph sparsification (removing edges) and graph coarsening
(merging nodes, often by edge contraction); however, these operations are currently
treated separately. Interestingly, for a planar graph, edge deletion corresponds to
edge contraction in its planar dual (and more generally, for a graphical matroid
and its dual). Moreover, with respect to the dynamics induced by the graph
Laplacian (e.g., diffusion), deletion and contraction are physical manifestations
of two reciprocal limits: edge weights of 0 and ∞, respectively. In this work, we
provide a unifying framework that captures both of these operations, allowing one
to simultaneously sparsify and coarsen a graph while preserving its large-scale
structure. The limit of infinite edge weight is rarely considered, as many classical
notions of graph similarity diverge. However, its algebraic, geometric, and physical
interpretations are reflected in the Laplacian pseudoinverse L†, which remains finite
in this limit. Motivated by this insight, we provide a probabilistic algorithm that
reduces graphs while preserving
L
†
, using an unbiased procedure that minimizes
its variance. We compare our algorithm with several existing sparsification and
coarsening algorithms using real-world datasets, and demonstrate that it more
accurately preserves the large-scale structure.
1 Motivation
Many complex structures and phenomena are naturally described as graphs (eg,¹ brains, social
networks, the internet, etc). Indeed, graph-structured data are becoming increasingly relevant to
the field of machine learning [2, 3, 4]. These graphs are frequently massive, easily surpassing our
working memory, and often the computer’s relevant cache [5]. It is therefore essential to obtain
smaller approximate graphs to allow for more efficient computation.
Graphs are defined by a set of nodes V and a set of edges E ⊆ V × V between them, and are often
represented as an adjacency matrix A with size |V| × |V| and density ∝ |E|. Reducing either of
these quantities is advantageous: graph “coarsening” focuses on the former, aggregating nodes while
respecting the overall structure, and graph “sparsification” on the latter, preferentially retaining the
important edges.
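The reciprocal limits mentioned in the abstract can be checked numerically: deleting an edge corresponds to weight 0, while contracting it corresponds to weight → ∞, under which the Laplacian L itself diverges but its pseudoinverse L† (and derived quantities such as effective resistance) remains finite. A minimal NumPy sketch on a hypothetical triangle graph (not from the paper) illustrating this:

```python
import numpy as np

def laplacian(weights, n):
    """Build the graph Laplacian L from a dict {(u, v): weight} on n nodes."""
    L = np.zeros((n, n))
    for (u, v), w in weights.items():
        L[u, u] += w
        L[v, v] += w
        L[u, v] -= w
        L[v, u] -= w
    return L

def effective_resistance(L_pinv, u, v):
    """R(u, v) = (e_u - e_v)^T L† (e_u - e_v)."""
    return L_pinv[u, u] + L_pinv[v, v] - 2 * L_pinv[u, v]

# Triangle graph on nodes {0, 1, 2} with unit weights, except edge
# (0, 1), whose weight w we push toward infinity (edge contraction).
for w in [1.0, 1e4, 1e8]:
    L = laplacian({(0, 1): w, (0, 2): 1.0, (1, 2): 1.0}, 3)
    Ld = np.linalg.pinv(L)  # stays finite even as w -> infinity
    print(w, effective_resistance(Ld, 0, 1), effective_resistance(Ld, 0, 2))

# As w grows, R(0, 1) -> 0 (nodes 0 and 1 are effectively merged),
# and R(0, 2) -> 1/2, its value in the contracted graph, where the
# two remaining unit edges act in parallel.
```

Setting w = 0 instead recovers edge deletion, so both operations are indeed points on the same one-parameter family of edge-weight perturbations.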
∗ Both authors contributed equally to this work.
¹ The authors agree with the sentiment of the footnote on page xv of [1], viz, omitting superfluous full stops
to obtain a more efficient compression of, eg: videlicet, exempli gratia, etc.
Preprint. Under review.
arXiv:1902.09702v4 [cs.DM] 23 Aug 2019