isca2016.7z resource - CSDN Library

54 files in total

pdf: 54

isca

2016

Uploaded 2019-05-11 09:31:28 · 27.71 MB · 7Z


isca2016.7z (54 files)

isca2016

07551422.pdf 380KB

07551409.pdf 836KB

07551414.pdf 564KB

07551431.pdf 605KB

07551434.pdf 542KB

07551432.pdf 1.17MB

07551378.pdf 896KB

07551417.pdf 754KB

07551416.pdf 1.16MB

07551428.pdf 1.47MB

07551399.pdf 1.11MB

07551390.pdf 801KB

07551385.pdf 385KB

07551392.pdf 827KB

07551405.pdf 439KB

07551424.pdf 967KB

07551407.pdf 1.74MB

07551435.pdf 918KB

07551404.pdf 790KB

07551426.pdf 540KB

07551391.pdf 320KB

07551430.pdf 724KB

07551412.pdf 409KB

07551408.pdf 629KB

07551418.pdf 447KB

07551386.pdf 751KB

07551403.pdf 556KB

07551400.pdf 850KB

07551413.pdf 991KB

07551384.pdf 2.17MB

07551379.pdf 1.01MB

07551395.pdf 694KB

07551415.pdf 3.81MB

07551429.pdf 1.03MB

07551410.pdf 275KB

07551423.pdf 1.44MB

07551397.pdf 1.09MB

07551380.pdf 479KB

07551398.pdf 842KB

07551402.pdf 730KB

07551433.pdf 363KB

07551396.pdf 2.4MB

07551381.pdf 422KB

07551425.pdf 662KB

07551421.pdf 657KB

07551388.pdf 2.22MB

07551373.pdf 133KB

07551332.pdf 113KB

07551411.pdf 321KB

07551387.pdf 1004KB

07551394.pdf 1.13MB

07551419.pdf 728KB

07551389.pdf 1.68MB

07551406.pdf 342KB

Dynamo: Facebook’s Data Center-Wide Power Management System

Qiang Wu, Qingyuan Deng, Lakshmi Ganesh, Chang-Hong Hsu∗, Yun Jin, Sanjeev Kumar†, Bin Li, Justin Meza, and Yee Jiun Song

Facebook, Inc.

∗University of Michigan

Abstract—Data center power is a scarce resource that

often goes underutilized due to conservative planning. This

is because the penalty for overloading the data center power

delivery hierarchy and tripping a circuit breaker is very high,

potentially causing long service outages. Recently, dynamic

server power capping, which limits the amount of power

consumed by a server, has been proposed and studied as a way

to reduce this penalty, enabling more aggressive utilization

of provisioned data center power. However, no real at-scale

solution for data center-wide power monitoring and control

has been presented in the literature.

In this paper, we describe Dynamo – a data center-wide

power management system that monitors the entire power

hierarchy and makes coordinated control decisions to safely

and efﬁciently use provisioned data center power. Dynamo

has been developed and deployed across all of Facebook’s

data centers for the past three years. Our key insight is that

in real-world data centers, different power and performance

constraints at different levels in the power hierarchy necessitate

coordinated data center-wide power management.

We make three main contributions. First, to understand

the design space of Dynamo, we provide a characterization

of power variation in data centers running a diverse set of

modern workloads. This characterization uses ﬁne-grained

power samples from tens of thousands of servers and spanning

a period of over six months. Second, we present the detailed

design of Dynamo. Our design addresses several key issues

not addressed by previous simulation-based studies. Third,

the proposed techniques and design have been deployed

and evaluated in large scale data centers serving billions of

users. We present production results showing that Dynamo

has prevented 18 potential power outages in the past 6

months due to unexpected power surges; that Dynamo enables

optimizations leading to a 13% performance boost for a

production Hadoop cluster and a nearly 40% performance

increase for a search cluster; and that Dynamo has already

enabled an 8% increase in the power capacity utilization

of one of our data centers with more aggressive power

subscription measures underway.

Keywords—data center; power; management.

I. INTRODUCTION

Warehouse-scale data centers consist of many thousands

of machines running a diverse set of workloads and comprise

the foundation of the modern web. The power delivery infrastructure

supplying these data centers is equipped with power

breakers designed to protect the data center from damage

due to electrical surges. While tripping a power breaker

ultimately protects the physical infrastructure of a data center,

its application-level effects can be disastrous, leading to long

service outages at worst and degraded user experience at best.

Given how severe the outcomes of tripping a power breaker

are, data center operators have traditionally taken a conservative

approach by over-provisioning data center power,

provisioning for worst-case power consumption, and further

adding large power buffers [1], [2]. While such an approach

ensures safety and reliability with high conﬁdence, it is

wasteful in terms of power infrastructure utilization – a scarce

data center resource. For example, it may take several years to construct a new power delivery infrastructure and every megawatt of power capacity can cost around 10 to 20 million USD [2], [3].

∗ Work was performed while employed by Facebook, Inc.

† Currently with Uber, Inc.

Under-utilizing data center power is especially inefﬁcient

because power is frequently the bottleneck resource limiting

the number of servers that a data center can house. It is

even more so with the recent trend of increasing server

power density [4], [5]. Figure 1 shows that server peak power

consumption nearly doubled going from the 2011 server (24-

core Westmere-based) to the 2015 server (48-core Haswell-

based) at Facebook. This trend has led to the proliferation of

ghost spaces in data centers: unused, and unusable, space [6].

To help improve data center efﬁciency, over-subscription

of data center power has been proposed in recent years [1],

[6], [7]. With over-subscription, the planned peak data center

power demand is intentionally allowed to surpass data center

power supply, under the assumption that correlated spikes

in server power consumption are infrequent. However, this

exposes data centers to the risk of tripping power breakers

due to highly unpredictable power spikes (e.g., a natural

disaster or a special event that causes a surge in user activity

for a service). To make matters worse, a power failure in

one data center could cause a redistribution of load to other

data centers, tripping their power breakers and leading to a

cascading power failure event.

Figure 1. The measured power consumption (in watts) as a function of server CPU utilization for two generations of web servers used at Facebook. One set of data points was measured from a 24-core Westmere-based web server (24×L5639@2.13GHz, 12GB RAM, 2×1G NIC) from 2011, while the • data points were measured from a 48-core Haswell-based web server (48×E5-2678v3@2.50GHz, 32GB RAM, 1×10G NIC) from 2015. Both servers were running a real web server workload. We varied the server processor utilization by changing the rate of requests sent to the server. Note that the 2015 server power was measured using an on-board power sensor while the 2011 server power was measured using a Yokogawa power meter.

2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture

DOI 10.1109/ISCA.2016.48

Therefore, in order to achieve both power safety and improved power utilization with over-subscription, power

capping, or peak power management techniques, have been

proposed [4], [8]. These techniques (1) continually monitor

server power consumption and (2) use processor and memory

dynamic voltage and frequency scaling (DVFS) to suppress

server power consumption if it approaches its power or thermal

limit.

Most prior work focused on server-level [8], [9], [10] or

ensemble-level [4] power management. They typically used

hardware P-states [11] or DVFS to control peak power

or thermal characteristics for individual or small groups of

servers, with control actions decided locally in isolation.

There have been fewer studies on data center-wide power

management to monitor all levels of the power hierarchy and

make coordinated control decisions. Some recent work [12],

[13] looked into the problem of data center-wide power management

and the issues of coordination across different levels

of controllers. However, their proposed techniques and design

were limited because of the simpliﬁed system setup in their

simulation-based studies, which were based on either pure

simulation or a small test-bed of fewer than 10 servers. These

prior works did not address many key issues for data center-

wide power management in a real production environment

with tens or hundreds of thousands of servers.

In this paper, we describe Dynamo – a data center-wide

power management system that monitors the entire power

hierarchy and makes coordinated control decisions to safely

and efﬁciently use provisioned data center power. Our key

insight is that in real-world data centers, different power

and performance constraints at different levels in the power

hierarchy necessitate coordinated, data center-wide power

management. We make three main contributions:

1. To understand the design space of Dynamo, we provide

a characterization of power variation in data centers running

a diverse set of modern workloads. Using ﬁne-grained power

samples from tens of thousands of servers for over six months,

we quantify the power variation patterns across different

levels of aggregation (from Rack to MSB) and across different

time scales (from a few seconds to tens of minutes). Based on

these results, as well as our study of power breaker characteristics,

we find that to prevent real-world power failures, the

controller power reading cycle should be fast – on the order

of a few seconds – as opposed to minutes as suggested by

previous work.

2. We describe the design of a data center-wide power

management system in a real production environment. Our

design addresses several key issues not dealt with by previous

simulation-based studies, such as (1) scalable communication

between controller and controllee, (2) application- and

service-aware capping actions, and (3) coordination of multiple

controller instances with heterogeneous workload and

data dependence.

3. We deploy and evaluate Dynamo in large scale data

centers serving billions of users. We report a rich set of results

from real-life power capping events during critical power

limiting scenarios. We also describe real use cases where

Dynamo enables optimizations (such as Turbo Boost and aggressive

power over-subscription) leading to performance and

capacity improvements. For example, Dynamo can improve

the performance of a production Hadoop cluster by up to

13% and has enabled us to accommodate 8% more servers

under the same power constraints via more aggressive power

budgeting in an existing data center.

Dynamo has been deployed across all of Facebook’s data

centers for the past three years. In the rest of this paper, we

describe its design and share how it has enabled us to greatly

improve our data center power utilization.

II. BACKGROUND

In this section we describe what data center power delivery

infrastructure is, what it means to oversubscribe its power,

and under what conditions such oversubscription can cause

power outages. We also discuss the implications these factors

have on the design of a data center-wide power management

system.

A. Data Center Power Delivery

Figure 2 shows the power delivery hierarchy in a typical

Facebook data center, based on Open Compute Project (OCP)

speciﬁcations [14]. The local power utility supplies the data

center with 30 MW of power. An on-site power sub-station

feeds the utility power to the Main Switch Boards (MSBs).

Each MSB is rated at 2.5 MW for IT equipment power and

has a standby generator that provides power in the event of a

utility outage.

A data center typically spans four rooms, called suites,

where racks of servers are arranged in rows. Up to four MSBs

provide power to each suite. In turn, each MSB supplies up to

four 1.25 MW Switch Boards (SBs). From each SB, power is

fed to the 190 kW Reactive Power Panels (RPPs) stationed at

the end of each row of racks.

Each RPP supplies power to (1) the racks in its row and

(2) a set of Direct Current Uninterruptible Power Supplies

(DCUPS). Each DCUPS provides 90 s of power backup to six

racks. The rack power shelf is rated at 12.6 kW. Depending

on the server speciﬁcations, there can be anywhere between 9

and 42 servers per rack.

Here we describe the typical Facebook-owned data center based on

OCP speciﬁcations [14]; Facebook also leases data centers where the

power delivery hierarchy matches more traditional ones described in

literature [1]. The traditional model uses Power Distribution Units (PDUs)

and PDU Breakers in place of SBs and RPPs.

#

#

&&

"

),',-%/(

),',0+(

),',%-.(

'-%.(

)

)-

)-

&

)

"

!

"



&

)

&&



)-

&

!

&

Figure 2. Typical Facebook data center power delivery infrastructure [14].

Figure 3. Power breaker trip time as a function of power usage.

The dots represent raw data based on manufacturer testing for

different power breakers used at Facebook. The lines show the

lower-bound on trip time per device type.

Notice that power is oversubscribed at each level: a power

device supplies less power than its children draw at peak.

For example, an MSB supplies 2.5 MW to up to four SBs

that draw 4 × 1.25 = 5 MW at peak. Each power device

is equipped with a circuit breaker that trips if the device’s

power draw exceeds the breaker’s rated power. But breakers

do not trip instantly. We measured the amount of time it

took to trip the breakers at each level of our power delivery

hierarchy under different amounts of power overdraw, shown

in Figure 3 (note the ﬁgure’s logarithmic y-axis). Our results

corroborate measurements reported in prior literature [7].
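The oversubscription arithmetic in this paragraph can be checked with a short calculation. The sketch below uses only the MSB and SB ratings quoted in the text; the function name `oversubscription_ratio` is a hypothetical helper, not something taken from Dynamo.

```python
# Rated capacities quoted in the text, in megawatts.
MSB_RATED_MW = 2.5
SB_RATED_MW = 1.25
MAX_SBS_PER_MSB = 4


def oversubscription_ratio(parent_rated: float, child_rated: float, n_children: int) -> float:
    """Ratio of the children's combined rated power to the parent's rated power.

    A value above 1.0 means the parent cannot supply all children at peak.
    """
    return (child_rated * n_children) / parent_rated


ratio = oversubscription_ratio(MSB_RATED_MW, SB_RATED_MW, MAX_SBS_PER_MSB)
print(f"MSB -> SB oversubscription: {ratio:.1f}x")  # MSB -> SB oversubscription: 2.0x
```

A ratio of 2.0 means that, with four SBs attached, an MSB can supply only half of what its children could draw if they all peaked at once.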

We ﬁnd that a breaker trips when both (1) the current

through the breaker exceeds the breaker’s rated current and

(2) the breaker sustains the overdrawn power for a period

of time inversely proportional to the overdraw amount. In

other words, though circuit breakers trip quickly under large

power spikes, they sustain low amounts of overdrawn power

for long periods of time. For example, RPPs and Racks

sustain a 10% power overdraw for around 17 minutes. In

addition, as Figure 3 shows, lower-level devices in the power

delivery hierarchy sustain relatively more power overdraw

than higher-level devices. For example, an RPP sustains a 40%

power overdraw for around 60 s while an MSB sustains only

a 15% power overdraw for the same period of time.
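To make the relationship between overdraw and trip time concrete, here is a minimal sketch of a conservative time-budget lookup. It contains only the approximate data points quoted in this paragraph, not the full curves of Figure 3, and it assumes that a larger overdraw never trips later than a smaller one. The table `SUSTAIN_POINTS` and the function `conservative_sustain_seconds` are hypothetical names introduced for illustration.

```python
from typing import Optional

# (overdraw fraction, approximate sustain time in seconds), as quoted in the text.
SUSTAIN_POINTS = {
    "RPP": [(0.10, 17 * 60), (0.40, 60)],
    "MSB": [(0.15, 60)],
}


def conservative_sustain_seconds(device: str, overdraw: float) -> Optional[float]:
    """Return a conservative time budget before the breaker may trip.

    Picks the quoted point with the smallest overdraw that is >= the observed
    overdraw. Assuming larger overdraws trip sooner, that point's time is a
    safe lower bound. Returns None when the observed overdraw exceeds every
    quoted point, because the text gives no bound for that case.
    """
    if overdraw <= 0:
        return float("inf")  # at or below rated power: no trip expected
    candidates = [(o, t) for o, t in SUSTAIN_POINTS[device] if o >= overdraw]
    if not candidates:
        return None
    return min(candidates)[1]


print(conservative_sustain_seconds("RPP", 0.08))  # 1020
print(conservative_sustain_seconds("RPP", 0.25))  # 60
print(conservative_sustain_seconds("MSB", 0.20))  # None
```

Rounding the overdraw up to the next quoted point is deliberately pessimistic: a 25% RPP overdraw is given the 60 s budget of a 40% overdraw rather than an interpolated value.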

Because power oversubscription occurs at every level of the

power delivery hierarchy, it is insufﬁcient for power capping

techniques to monitor any single device or subset of devices

in a data center. Instead, techniques must take a holistic

approach, coordinating action across all devices in the power

delivery hierarchy. In addition, because some slack exists in

how long it takes circuit breakers to trip, opportunities exist

for minimizing the impact on server performance while still

ensuring power safety. These observations inform the design

of Dynamo that we discuss in Section III.

B. Power Variation Characterization

In Section II-A, we observed that power breaker trip time

varies as a function of power overdraw. A key design consideration

for power capping techniques, then, is how fast to

respond to power overdraw in order to guarantee protection

from tripping circuit breakers. To do this, we must answer the question, how quickly does power draw change in a real-world production data center?

Figure 4. An illustration of calculated power variation for a time window. The maximum power variation is the difference between the maximum and minimum power values in the time window. Here, v1 and v2 are the maximum power variations for time windows 1 and 2, respectively.

Since we cannot safely overload the power delivery hierarchy

in our data centers, we instead measure and extrapolate

power variation to power-oversubscribed scenarios. We next

present a large-scale study characterizing the power variations

of servers at Facebook. We collected ﬁne-grained power

values (every 3 seconds) for every server in one data center

suite (roughly 30 K servers) for over six months. We also

examine coarser-grained power values (every 1 minute) for

all servers in all our data centers (on the order of hundreds of

thousands) for nearly 3 years.

To quantify power variation, we deﬁne the power slope,

which measures the rate at which power can increase in a

speciﬁc time window (from 3 seconds to 600 seconds) for

different levels of the power hierarchy (Figure 2). Figure 4

illustrates how the metrics are calculated. For each time

window, we calculate the worst-case power variation as the

difference of the maximum and minimum power values in

that time window.
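The per-window calculation described above can be sketched as follows. The text does not say whether the windows overlap, so this sketch assumes consecutive, non-overlapping windows. The sample readings and the function name `max_power_variations` are hypothetical.

```python
from typing import List


def max_power_variations(samples: List[float], window: int) -> List[float]:
    """Max-minus-min power for each consecutive, non-overlapping window.

    `window` is the window length in samples; a trailing partial window is
    dropped. Non-overlapping windows are an assumption of this sketch.
    """
    if window < 2:
        raise ValueError("window must cover at least two samples")
    variations = []
    for start in range(0, len(samples) - window + 1, window):
        chunk = samples[start:start + window]
        variations.append(max(chunk) - min(chunk))
    return variations


# Hypothetical rack power readings in watts, one every 3 seconds.
readings = [5000.0, 5200.0, 5100.0, 5600.0, 5400.0, 5300.0, 5350.0, 5450.0]
v = max_power_variations(readings, window=4)
print(v)  # [600.0, 150.0]
```

Dividing each value by the average power during peak hours would give the normalized variation that Figure 5 plots on its x-axis.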

We analyze the ﬁne-grained power data for each level

of the power hierarchy (rack, RPP, SB, and MSB) over six

months. To simplify the analysis, we partition the data into

pieces, study data from two-week time periods, and combine

results from multiple time periods. Figure 5 shows the

summarized results on power variations and allows us to make

the following two observations. First, larger time windows

have generally larger power variations. Second, the higher

the power hierarchy level, the smaller the relative power

variation, due to load multiplexing. For example, at the rack

level, the worst-case power variations range from 10% to

50% for different time windows, while at the MSB level, the

worst-case power variations range from 1% to 6% for the

same time windows.

A third observation is that power variations also depend on

the application. We randomly selected a group of servers from

several services at Facebook (web server, cache, MySQL

database, news feed, Hadoop, and f4/photo storage [15]; with

30 servers per service) and conducted a similar analysis (for

one speciﬁc time window of 60 s). Figure 6 summarizes

the results. We see that different applications have different

power variation characteristics. For example, servers running

f4/photo storage have the lowest median (50th percentile, or p50) variations but the highest p99 variations among all services we have studied.

Figure 5. The measured power variations over different time windows for power devices at the racks, RPPs, SBs, and MSBs. The x-axis is the power variation normalized to the average power during peak hours. The y-axis is the cumulative distribution function. Lines of different colors represent different time windows (from 3 s to 600 s). For convenience, we also list the 99th percentile (p99) power variation values for each case.

Figure 6. The measured power variations for several services in Facebook at the server level. The default time window is 60 s. For convenience, we also show the median (50th percentile, or p50) and p99 power variation values for each case.

C. Design Implications

Combining the observations from Sections II-A and II-B,

we derive the following design implications for a data center-

wide power management system.

Sub-minute power sampling. Such a system needs to

sample power at a sub-minute interval. Figure 5 shows that

3% (at MSB level) to 30% (at rack level) power demand

increases have been observed within a 60 s interval. Such

swings in power demand are large enough to trip a breaker

within a few minutes according to Figure 3.

2-minute power capping time. Such a system also needs

to react to power demand spikes in no more than 2 minutes

(and potentially much less to guarantee safety). Figure 3

shows that breakers trip for a ∼5% MSB power overdraw

in as little as 2 minutes. Within this 2-minute interval a

power management system must issue appropriate capping

commands and ensure that power settles.

We next discuss Dynamo, our data center-wide power

management system. We design Dynamo to sample data at

the granularity of a few seconds and conservatively target 10 s

of time for control actions and power settling time. This is an

important distinction from prior work that sampled power at

a coarser granularity of several minutes [1].
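As a rough check of these timing targets, the sketch below compares a worst-case reaction time against the 2-minute capping deadline. It uses a simple model that is an assumption of this sketch: a spike occurring just after one reading is seen only at the next reading, after which control actions and settling take the full target time. The 3 s sampling interval and the 3-minute comparison value are illustrative picks for "a few seconds" and "several minutes".

```python
def worst_case_reaction_seconds(sample_interval_s: float, control_and_settle_s: float) -> float:
    """Worst-case delay from a power spike to settled, capped power.

    Simplified model (an assumption): one full sampling interval to observe
    the spike, plus the time for control actions and power settling.
    """
    return sample_interval_s + control_and_settle_s


DEADLINE_S = 120.0  # the 2-minute power capping time derived in the text

fast = worst_case_reaction_seconds(3.0, 10.0)    # few-second sampling, 10 s target
slow = worst_case_reaction_seconds(180.0, 10.0)  # hypothetical 3-minute sampling

print(fast, fast <= DEADLINE_S)  # 13.0 True
print(slow, slow <= DEADLINE_S)  # 190.0 False
```

Under this model, sampling every few seconds leaves a wide margin inside the 2-minute window, while sampling at a granularity of minutes can miss the deadline before any control action starts.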

III. DYNAMO ARCHITECTURE

This section describes the design of Dynamo. We start with

an overview and then describe in detail the components that

Dynamo comprises: the agent, the leaf power controller, and

the higher-level power controllers.

A. Overview of Dynamo’s Design

As mentioned in Section I, our goal is to design a data

center-wide power management system that monitors the

entire power hierarchy. For a practical system to be deployed

to tens or hundreds of thousands of servers, it needs to be

(1) efﬁcient and reliable and (2) scalable to handle a power

hierarchy of extremely large size.

At a high level, Dynamo has two major components:

Dynamo agent. The agent is a light-weight program deployed

to every server in the data center. It reads power,

executes power capping/uncapping commands, and communicates

with the controllers (discussed next). Since a Dynamo

agent needs to run on every server, it is designed to be as

simple as possible. We place most of the intelligence of the

system in the controller. As a result, Dynamo agents do not

communicate with one another and only communicate with

Dynamo controllers.

Dynamo controllers. The controllers run on a group of

dedicated servers, monitor data from the Dynamo agents under

their control, and are responsible for protecting data center

power devices. We take a distributed approach in the design

of the Dynamo controllers. For every physical power device

in the hierarchy that needs protection there is a matching

controller instance monitoring and controlling (directly or

indirectly) the set of downstream servers for that device. In

this way, there is a hierarchy of Dynamo controllers that

mirrors the topology of the data center’s power hierarchy.

Multiple levels of controllers coordinate to ensure the safety

of the entire power hierarchy.
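The idea of a controller hierarchy that mirrors the power hierarchy can be illustrated with a small tree. This is a minimal sketch, not Dynamo's implementation: the `Controller` class, its fields, and all limits and power readings are hypothetical, and the sketch only aggregates power and flags violations without making any capping decision.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Controller:
    """One controller instance per protected power device (illustrative)."""
    device: str
    limit_w: float
    children: List["Controller"] = field(default_factory=list)
    # Only leaf controllers hold per-server readings reported by agents.
    server_power_w: Dict[str, float] = field(default_factory=dict)

    def total_power_w(self) -> float:
        """Power drawn by all servers downstream of this device."""
        own = sum(self.server_power_w.values())
        return own + sum(c.total_power_w() for c in self.children)

    def over_limit(self) -> List[str]:
        """Names of devices in this subtree whose draw exceeds their limit."""
        found = [self.device] if self.total_power_w() > self.limit_w else []
        for child in self.children:
            found.extend(child.over_limit())
        return found


# Hypothetical limits: two 1000 W racks under an oversubscribed 1500 W parent.
rack_a = Controller("rack-A", limit_w=1000.0, server_power_w={"s1": 450.0, "s2": 450.0})
rack_b = Controller("rack-B", limit_w=1000.0, server_power_w={"s3": 400.0, "s4": 400.0})
rpp = Controller("RPP", limit_w=1500.0, children=[rack_a, rack_b])

print(rpp.over_limit())  # ['RPP']
```

In this example neither rack exceeds its own limit, yet the parent does, which is the situation that purely local control would miss and that motivates coordinating controllers across levels.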

Figure 7 illustrates how Dynamo’s major components interact

with each other. Each power device at the lowest level

is assigned a leaf power controller. The leaf power controller

communicates directly with Dynamo agents on all downstream

servers of that power device. We use the Thrift [16]

remote procedure call (RPC) service for efﬁcient and reliable

