Energy-EfficientResourceUtilizationforHeterogeneousEmbeddedComputingSystems资源-CSDN文库

113 浏览量 2021-02-22 09:35:08 上传评论收藏 1.06MB PDF 举报

嵌入式计算系统由于其在电子控制单元（ECUs）中的应用，通常会表现出异构分布式多核架构的特点，这些系统往往需要响应来自应用层面的多种复杂计算需求。例如，汽车电子和航空电子系统等复杂嵌入式系统往往拥有超过60个ECUs，每个ECU负责处理不同大小和紧急程度的多种任务。随着任务复杂度和功能需求的增加，如何在这些系统中实现能量效率和有效资源利用的优化成为了研究的重点。在这篇研究论文中，研究者们针对异构和分布式多核嵌入式系统中的能量效率与资源有效利用的联合优化问题展开了研究。该问题被认为是一个多约束和多变量的优化问题，由于缺乏封闭形式的解决方案，研究者提出了基于拉格朗日理论的功率分配和负载平衡策略。当拉格朗日方法无法完全解决问题时，采用数据拟合方法首先确定核心速度，然后通过拉格朗日方法解决负载平衡调度问题。研究论文给出了几个数值示例，用以展示所提方法的有效性，并演示了各个因素对优化系统的影响。最终，模拟和实际评估证明了理论结果与实际结果的一致性。据作者所知，这是首次将负载平衡、能量效率、硬件异构性和应用异构性相结合在异构和分布式嵌入式系统中的研究工作。研究的动机在于复杂嵌入式系统中异构分布式多核架构对各种应用层复杂计算请求的响应能力。系统模型被视为一个完全异构的模型，即从硬件角度看，所有节点具有不同的最大速度和功率消耗水平；而从应用角度看，它们可以采用不同的调度策略。研究者在论文中明确表示，目标是解决一个本质上具有多约束和多变量的优化问题，并为该问题提出基于拉格朗日理论的解决方案。在面临无法通过拉格朗日方法完全解决的问题时，研究者引入了数据拟合方法来确定核心速度，随后通过拉格朗日方法来解决负载平衡调度问题。此篇论文涉及到的关键词包括嵌入式和分布式系统、能量效率、有效资源利用、负载分配、功率分配和队列模型等。在引言部分，作者提出了研究的动机，解释了为什么需要在异构和分布式嵌入式系统中研究能量效率和资源有效利用的问题，这与当前复杂电子控制单元中面临的挑战密切相关。本文所提出的策略聚焦于如何通过系统性的方法，协调不同节点间的资源分配，以减少总体的能耗并提高资源利用效率。在考虑硬件异构性的前提下，不同节点具有不同的性能和功耗特性，因此需要根据实际应用场景，采用不同的调度策略来优化性能和能效。同时，研究者也强调了负载平衡的重要性，负载平衡是确保在多核系统中各个核心任务负载均匀分配的关键。良好的负载平衡策略能够避免一些核心过载而另一些空闲的情况发生，这不仅能够提高系统的响应速度，也能够减少因负载不均导致的能效损失。在论文中，作者使用了拉格朗日理论来构建数学模型，利用这种方法可以同时处理多个变量，并且在一定的约束条件下找到最优解。由于问题的复杂性，作者还引入了数据拟合方法，以便更精确地确定核心速度，进而应用拉格朗日方法解决负载平衡调度问题。作者通过数值示例和实际评估来验证理论分析的可行性，并且给出了多个实验结果，以展示不同因素对系统优化的影响，包括硬件异构性和应用异构性如何影响系统的整体性能。这项研究不仅具有理论价值，而且对于实际应用也具有重要的意义，为未来在嵌入式系统中的资源优化利用提供了新的思路和方法。

资源推荐

资源详情

资源评论

Energy-Efﬁcient Resource Utilization for

Heterogeneous Embedded Computing Systems

Jing Huang, Renfa Li, Senior Member, IEEE, Jiyao An, Member, IEEE,

Derrick Ntalasha, Fan Yang, and Keqin Li, Fellow, IEEE

Abstract—In this paper, the joint optimization problem with energy efﬁciency and effective resource utilization is investigated for

heterogeneous and distributed multi-core embedded systems. The system model is considered to be fully a heterogeneous model, that

is, all nodes have different maximum speeds and power consumption levels from the perspective of hardware while they can employ

different scheduling strategies from the perspective of applications. Since the concerned problem by nature is a multi-constrained and

multi-variable optimization problem in which a closed-form solution cannot be obtained, our aim is to propose a power allocation and

load balancing strategy based on Lagrange theory. Furthermore, when the problem cannot be fully solved by Lagrange approach, a

data ﬁtting method is employed to obtain core speed ﬁrst, and then load balancing schedule is solved by Lagrange method. Several

numerical examples are given to show the effectiveness of the proposed method and to demonstrate the impact of each factor to the

present optimization system. Finally, simulation and practical evaluations show that the theoretical results are consistent with the

practical results. To the best of our knowledge, this is the ﬁrst work that combines load balancing, energy efﬁciency, hardware

heterogeneity and application heterogeneity in heterogeneous and distributed embedded systems.

Index Terms—Embedded and distributed systems, energy efﬁciency, effective resource utilization, load distribution, power allocation,

queueing model

1INTRODUCTION

1.1 Motivation

typical complex embedded system will have a hetero-

geneous distributed multi-core architecture that can

respond to a variety of complicated computational requests

at the application level. It is common for complex embedded

systems, such as automotive electronics and avionics sys-

tems, to have over 60 Electronic Control Units (ECUs)[30],

with each ECU dedicated to handling numerous tasks of

different sizes and levels of urgency. As the complexity of

embedded systems continues to increase to meet the

demands of modern applications for increased computa-

tional power and performance, the need for energy efﬁciency

and effective resource utilization will become increasingly

signiﬁcant. Current and future embedded systems must

be able to assign general tasks to nodes in a manner that

improves resource utilization without affecting dedicated

tasks. Power must be allocated reasonably to each node in

order to achieve minimum power usage by the system.

Attaining optimal allocation of tasks and power in a distrib-

uted system is a well-known multi-variable optimization

problem. In light of these issues, the development of hetero-

geneous distributed embedded systems is challenging.

In heterogeneous systems, the architecture of each node

may differ, so the characteristics of nodes may vary. Each

node might have different maximum and minimum core

speed, or a different power consumption level [29]. The per-

formance of the overall system can be inﬂuenced by any

node. Therefore, to achieve energy efﬁciency in heteroge-

neous environments, the characteristics of each node must

be considered carefully. From the point of view of distrib-

uted systems, each node is assigned preloaded dedicated

tasks, and each task may have different task arrival rate and

task size. To achieve effective utilization of resources, a dis-

tributed system requires an efﬁcient load balancing algo-

rithm that can assign tasks appropriately to each node. From

the point of view of embedded systems, dedicated tasks exe-

cuted on speciﬁed nodes are more important or urgent than

general tasks. Moreover, each class of dedicated tasks has a

different degree of urgency. To utilize all the available

resources efﬁciently, each node should be set with an appro-

priate scheduling policy corresponding to the degree of

urgency of dedicated tasks assigned to it. From the point of

view of the overall system, computing performance is a vital

metric when a system’s Quality of Service (QoS) is being

evaluated. Thus, the QoS still needs to be guaranteed.

Balancing all of these factors is a challenge for the develop-

ment of heterogeneous distributed and embedded systems

 J. Huang, R. Li, J. An, D. Ntalasha, and F. Yang are with the College of

Computer Science and Electronic Engineering of Hunan University,

National Supercomputing Center in Changsha, Key Laboratory for Embed-

ded and Network Computing of Hunan Province, Changsha 410082,

China. E-mail: jingh@hnu.edu.cn, lirenfa@vip.sina.com, anbobcn@aliyun.

com, dbntalasha@gmail.com, yangfanf117@126.com.

 K. Li is with the College of Computer Science and Electronic Engineering

of Hunan University, National Supercomputing Center in Changsha, Key

Laboratory for Embedded and Network Computing of Hunan Province,

Changsha 410082, China Department of Computer Science, State Univer-

sity of New York, New Paltz, NY 12561. E-mail: lik@newpaltz.edu.

Manuscript received 6 Sept. 2016; revised 28 Mar. 2017; accepted 5 Apr. 2017.

Date of publication 11 Apr. 2017; date of current version 15 Aug. 2017.

Recommended for acceptance by A. Yakovlev.

For information on obtaining reprints of this article, please send e-mail to:

reprints@ieee.org, and reference the Digital Object Identiﬁer below.

Digital Object Identiﬁer no. 10.1109/TC.2017.2693186

1518 IEEE TRANSACTIONS ON COMPUTERS, VOL. 66, NO. 9, SEPTEMBER 2017

0018-9340 ß 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.

See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

that are both energy efﬁcient and making the best use of

resources. Although there are many studies of the diverse

aspects this problem, most of the existing research don’t con-

sider these factors jointly. Therefore, it is important to study

how energy efﬁciency and high resource utilization can be

achieved together on heterogeneous and distributed embed-

ded systems.

1.2 Our Contributions

In this paper, we study the problem of assigning a set of

general tasks to the computing nodes of a computational

heterogeneous distributed embedded system, wherein each

node is preloaded with a different number of dedicated

tasks, equipped with a DVFS feature. The structure of the

system is shown in Fig. 1. A node can be treated as a compu-

tational unit, which may include processor, memory etc.

Changing a node from its sleep state to a running state

takes a long time [1]. In embedded environments, a node

may be assigned important tasks that cannot be delayed.

Consequently, we don’t have the option to put an embed-

ded node to sleep, even if its core is not working. In our

investigations, to balance the power consumption and time

delay, we assume that a core continues to run at a low fre-

quency even when it is idle. Clearly, the power consump-

tion differs when the core is working and when it is not

working. Therefore, the cores can be considered to have two

distinct modes [25]:

 Core busy-power: The power consumption of a core

when there are tasks running on the core, is the

major power consumption of a core.

 Core idle-power: The power consumption of a core

when there is no task running.

We view each node as an M/M/1 queueing model with

inﬁnite waiting queue capacity [24], and deﬁne three queue-

ing disciplines-Discipline 1, Discipline 2, and Discipline 3-

each one of which could be employed by any node. The

details of the disciplines are as follows:

 Discipline 1: All general tasks and dedicated tasks on

this node are scheduled on a ﬁrst-come, ﬁrst-served

basis, without priority. We identify this discipline as,

“dedicated tasks without priority.”

 Discipline 2: On this node, the queueing principle is

that dedicated tasks are always scheduled before

general tasks. All tasks are executed without inter-

ruption. We identify this discipline as, “prioritized

dedicated tasks without preemption.”

 Discipline 3: Dedicated tasks are always scheduled

before general tasks on this node, with preemption.

We term this discipline as, “prioritized dedicated

tasks with preemption.”

Our aim is to ﬁnd the minimum overall power consump-

tion of the system, along with the response time of general

tasks, within an acceptable range. Our major contributions

are as follows:

 To the best of our knowledge, this work is the ﬁrst

study of the minimum power consumption problem

in heterogeneous distributed embedded systems

that considers the load distribution in combination

with the characteristics, queuing discipline, and idle

speed of each node.

 We propose an algorithm for ﬁnding the optimal

load distribution and power allocation scheme of the

system, such that the overall power consumption of

the system is minimized.

 We are the ﬁrst to take the optimal solutions as train-

ing data to ﬁt the relationship between the task size

and core speed, and then use optimal load balancing

to solve the problem when the problem cannot be

solved by a Lagrangian system. Experimental results

show this strategy to be efﬁcient.

 Based on our algorithm, we show the inﬂuence of dif-

ferent parameters on the optimal power allocation

and load distribution. These parameters include idle

speed of core, as well as power consumption expo-

nent a, preloaded tasks, queueing discipline, and

number of nodes in the system. We provide numeri-

cal examples to demonstrate the effectiveness of our

algorithm for each parameter. Furthermore, we give

an example where all parameters are different. Simu-

lation and practical evaluations show that the theoret-

ical results are consistent with the practical results.

Our study focuses on a well-deﬁned, multi-constrained,

and multi-variable optimization problem. The investigation

in this paper has made signiﬁcant contribution to high-

performance and energy-efﬁcient computing in modern het-

erogeneous and distributed embedded systems.

2RELATED WORK

Because energy efﬁciency is a primary concern for embed-

ded systems, especially for systems with limited power, this

topic has been studied extensively, and a large body of litera-

ture exists [2], [3], [4], [5], [6]. In recent years, supercomputer

operators also have paid considerable attention on energy

efﬁciency because supercomputers have very large power

requirements. While supercomputers are focused on perfor-

mance as their most signiﬁcant metric, the technique used by

embedded systems to achieve energy efﬁciency is similar to

that of supercomputers. Energy efﬁciency is about making

power consumption proportional to system utilization [20]

in a manner that decreases unnecessary energy loss. There

are many approaches to achieving power reduction. Most

Fig. 1. System structure.

HUANG ET AL.: ENERGY-EFFICIENT RESOURCE UTILIZATION FOR HETEROGENEOUS EMBEDDED COMPUTING SYSTEMS 1519

commonly, dynamic voltage and frequency scaling (DVFS)

[22], [23] is implemented at the operating system level to

manage power and to regulate the frequency and voltage of

CPUs. Generally speaking, two DVFS techniques exist for

multi-core systems: One is global DVFS, which scales the fre-

quency and voltage of all the cores simultaneously, and the

other is local DVFS, which regulates the frequency and volt-

age on a per core basis [7]. Experiments indicated that local

DVFS could achieve better performance than global DVFS

[8], [9], but it is more complicated.

The energy efﬁciency of embedded systems has been stud-

ied by a number of researchers. Because the architectures and

applications for embedded systems are quite diverse,

researchers have needed to establish various theories to study

the problem of energy efﬁciency in these different systems. In

[26], the authors investigated the tradeoff between inter-appli-

cation concurrency with performance and power consump-

tion under various system conﬁgurations. They proposed a

runtime optimization approach to achieve energy efﬁciency,

implemented on a real platform called Odroid XU- 3. In [27],

the minimum energy consumption was obtained based on a

running model generated through regression-based learning

of energy/performance trade-offs between different comput-

ing resources in the system. In [28], to support application

quality of service and to save energy, an energy-efﬁcient soft

real-time CPU scheduler for mobile devices was proposed

that primarily ran multimedia applications.

In addition to embedded computing, energy efﬁciency

also plays an important role in cloud computing, which is

marked by huge and increasing power consumption. The

techniques for achieving energy efﬁciency used in multi-

core embedded systems and cloud computing systems are

similar. Therefore, they could learn from each other. In [10],

the author used DVFS and workload dependent dynamic

power management to improve system performance and to

reduce energy consumption. In [11], based on a cooperative

game-theoretical approach and DVFS technology, the

authors investigated the problem of allocating tasks onto a

computational grid, with the aim of minimizing simulta-

neously the energy consumption and the makespan. In [12],

the authors also employed a game-theoretic approach to

study the problem of minimizing energy consumption in a

distributed system.

An efﬁcient load balancing strategy is a key component

to building out any distributed architecture. The complexi-

ties are reﬂected in the extensive body of literature on the

topic, as exempliﬁed by the excellent reference collection

given in [13]. The purpose of load balancing is to assign

tasks appropriately to nodes in terms of the workload and

computing power of each node. In [15], researchers pro-

posed a fault tolerant, hybrid load balancing strategy for a

heterogeneous grid computing environment. In [16], the

authors addressed the problem of optimal load balancing of

tasks when power is constrained.

The queueing discipline has also been studied widely. In

[14], two types of cases were considered, namely, systems

with and without special tasks. The authors addressed the

problem of minimizing the average response time of generic

tasks. Both [17] and [18] studied optimal load distribution in

heterogeneous distributed computer systems with both

generic and dedicated applications. In [17], each node was

modeled as an M/G/1 non-preemptive queuing system,

and was applied to several types of dedicated tasks, while

in [18], each node was treated as an M/M/1 non-

preemptive queuing system. The authors of [19] assumed

that each node was preloaded with dedicated tasks, and

three conditions were taken into account: Dedicated tasks

without priority, and prioritized dedicated tasks with and

without preemption. Each node was treated as an M/G/1

queueing system, and the authors focused on the problem

of optimal load balancing of general tasks.

In distributed heterogeneous embedded systems, in

order to achieve energy efﬁciency and effective utilization

of resources, it is necessary to consider the combination of

node heterogeneity, applications urgency (priority of tasks,

which might be different for each node), energy efﬁciency,

and the idle CPU state. To the best of our knowledge, pres-

ent studies on load balancing and energy efﬁciency have

not considered fully all of these factors together.

3SYSTEM MODEL AND PROBLEM FORMULATION

3.1 Power Model

The power dissipation of an embedded processor core

mainly consists of three parts, namely, dynamic, static, and

short-circuits consumption, among which dynamic power

consumption is the dominant component. The dynamic

power consumption can be expressed by P ¼ kCV

f where

k is an activity factor, C is the loading capacitance, V is the

supply voltage, and f is the clock frequency. Given that

s / f and f / V , then P

/ s

, where a

is around 3 [21].

For ease of discussion, we model the power allocated to pro-

cessor core with speed s

as s

The core busy-power is different from core idle-power. There

are implied energy-frequency and frequency-performance

relations. In this paper, the performance (speed) is deﬁned

as the number of instructions a core can perform per second

(IPS). Therefor, the dynamic power is s

when the core is

working at frequency f

and the corresponding speed is s

When a core is not working, because there are no instruc-

tions to perform, it is inappropriate to deﬁne the core speed

directly. In that case, our research focuses on the power con-

sumption rather than core speed. Therefore, when the core

is idle, we assume the speed to be s

, corresponding to a

frequency f

, such that s

equals the actual power of the

core, i.e., s

¼ CV

. A processor core still consumes

some amount of basic power P



that includes static power

dissipation, short circuit power dissipation, and other lea-

kages and wasted power. Therefore, the power model can

be formulated as

¼ðs

þ P



Þr

þðs

þ P



Þð1  r

¼ r

þ 1  r

ðÞs

þ P





r þ





1

þ 1 



r þ



þ P



(1)

3.2 Queueing Model

The queueing model is used to formulate and study the prob-

lem of power allocation and load balancing in a heteroge-

neous distributed embedded environments. Taking n as the

number of heterogeneous embedded computing nodes

1520 IEEE TRANSACTIONS ON COMPUTERS, VOL. 66, NO. 9, SEPTEMBER 2017

剩余13页未读，继续阅读

评论收藏

内容反馈

weixin_38670501

粉丝: 8
资源: 975

Energy-Efficient Resource Utilization for Heterogeneous Embedded...

最新资源

Energy-Efficient Resource Utilization for Heterogeneous Embedded...

Novel Resource Allocation Algorithm for Energy-Efficient Cloud Computing in Heterogeneous Environment

Energy Efficient Embedded Video Processing Systems 无水印pdf

Energy-Efficient Resource Allocation for Heterogeneous Cognitive Radio Networks

Energy Efficient Task Assignment with Guaranteed Probability Satisfying Timing Constraints for Embedded Systems

5GAA-WhitePaper-ITS-spectrum-utilization.pdf 英文

An Energy-Efficient Cooperative Spectrum Sensing Scheme based on D-S Theory in Cognitive Radio Sensor Networks

Grid Computing

IBM-Resource-Utilization-Analyser

藏经阁-Improving Resource Efficiency.pdf

C1-Enhance memory utilization with dmemfs.pdf

The efficient-parallel stripe noise removal algorithm with low resource

RUE - Resource Utilization Explorer-开源

3d-bin-space-utilization:用于3d装箱算法的平方空间利用核心

lookbusy 1.4 - a synthetic load generator for Linux systems

可穿戴无线传感器

Cloud and Fog Computing in 5G Mobile Networks-IET(2017).pdf

Modern Operating Systems 3rd

韩国崇实大学 Ad hoc and Sensor Networks 英文课件

system architecture

Protocols and Algorithms for Aperiodic Wireless Control Systems

ARINC 413A-1976-GUIDANCE FOR AIRCRAFT ELECTRICAL POWER UTILIZATION and .pdf

Personalization-Analytics-Busineess-Utilization-of-Net-Promoter-Score

Data-driven resource allocation with traffic load prediction

基于新能源汽车技术原理探讨其优缺点的分析.pdf

Modern Operating Systems,Tanenbaum,3rd

project-setup-team-solidaridad-utilisation:由GitHub Classroom创建的project-setup-team-solidaridad-utilization

最新资源