Parallel Computing Efficiency: Climbing the Learning Curve
H.T. Kumm and R.M. Lea
Aspex Microsystems Ltd.
Brunel University
Uxbridge, Middx UB8 3PH
United Kingdom
Abstract
Parallel computing shows considerable potential to deliver cost-effective solutions to applications with high performance requirements. However, current experience shows that performance often falls well below user expectations and below what would be considered cost-effective. Therefore, MPC hardware and software designers need to focus on the factors which affect parallel computing efficiency, if they are to narrow the gap between potential and delivered performance. Many analyses of parallel computing efficiency which have been reported are mainly too restrictive to provide helpful results. Thus, a different approach is required which would help parallel architecture and algorithm designers to climb the learning curve associated with parallel computing efficiency.

In this paper, a performance analysis is introduced which makes a distinction between natural parallelism (which is inherent to an application) and applied parallelism (which can be observed when an application is implemented on an MPC). By highlighting parallel computing bottlenecks, the analysis helps MPC hardware and software designers to identify causes of inefficiency. Furthermore, helpful efficiency measures can be defined.

The paper demonstrates the usefulness of the analysis and concludes that it could form the basis of a methodology which formalises the improvement of parallel computing efficiency.
1 Introduction
Many applications, for example scientific and engineering simulation, on-line signal and data processing, data visualisation, and context-sensitive data and knowledge retrieval, require very high performance (e.g. up to teraflops [SP93]). Since a sequential teraflop computer would require clock-cycles of less than 1 ps, it soon became clear that only parallel architectures have the potential to deliver such performance. In fact, a rapid growth in the market for parallel computers can be observed and it is predicted that this market will reach a value of $1000M by 1996 [Zor92].
Despite the considerable potential of parallel computing, the observed performance for practical applications is often far below user expectations. In fact, sustained performance often falls below 10% of peak performance and values of less than 1% are not exceptional [CK92].

In order to provide users with some means to compare different MPCs, numerous parallel processing benchmarks have been proposed. Although these are useful measures of application performance, they do not offer insight into whether performance could be improved, nor do they give any indication of how to climb the learning curve in order to improve parallel computing efficiency.
In contrast, a helpful performance evaluation should give the user a realistic expectation of the gains which could be achieved by implementing an application on a given MPC. Furthermore, it should provide guidance on how efficient a certain implementation is and where efficiency bottlenecks are. Thus, it should provide answers to the following questions.

1. What is the potential performance of the application?

2. What performance can be delivered when the application is executed by a given MPC?

3. How can the achieved performance be improved?
One of the most widely used evaluation criteria for parallel processing is the relative speed-up, Sr, of the execution of an application on a parallel computer when compared to the execution time required on a sequential computer. It is normally defined as the fraction of sequential execution time, Ts, over parallel execution time, Tp, as follows:

    Sr = Ts / Tp    (1)

The problem of the achievable speed-up Sr has been widely discussed since the early days of parallel computing. However, these analyses cover only specific groups of applications and provide an answer to question 1 only (see section 2).
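Equation (1) can be illustrated with a minimal sketch; the timing values below are hypothetical examples, not measurements from the paper:

```python
def relative_speedup(t_seq: float, t_par: float) -> float:
    """Relative speed-up Sr = Ts / Tp as in equation (1)."""
    if t_par <= 0:
        raise ValueError("parallel execution time must be positive")
    return t_seq / t_par

# Hypothetical timings: 120 s sequentially, 6 s on a parallel machine.
sr = relative_speedup(120.0, 6.0)
print(sr)  # 20.0
```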
In order to gain further insight, it is helpful to make a clear distinction between the following two forms of parallelism:

Natural parallelism defines the parallelism which is inherent to an application and thus the maximal parallelism which could be achieved for that application; it is the parallelism of an ideal program executed on an ideal MPC.

Applied parallelism defines the parallelism which is actually achieved when an application is executed on a specific parallel computer; thus, it defines the parallelism of a real program executed on a real MPC.
Owing to the inevitable restrictions of a real MPC, applied parallelism will normally fall below the natural parallelism of an application. Therefore, it is useful to distinguish between a natural speed-up, Sn, as the theoretical maximum speed-up for a given application, and the applied speed-up, Sa, which can be achieved when the application is executed on a specific MPC. On this basis, three measures of parallel computing efficiency, η, can be defined.
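The excerpt ends before the three efficiency measures are stated, so the sketch below is only an assumption about how Sn, Sa, and the processor count N might combine into efficiency ratios; the function names and formulas are illustrative, not the paper's definitions:

```python
def efficiency(speedup: float, n_processors: int) -> float:
    """Conventional parallel efficiency: speed-up per processor (assumption,
    not the paper's definition)."""
    return speedup / n_processors

def implementation_efficiency(s_applied: float, s_natural: float) -> float:
    """Fraction of the natural (ideal) speed-up that a real MPC delivers
    (illustrative assumption)."""
    return s_applied / s_natural

# Hypothetical values: N = 64 processors, Sn = 60, Sa = 24.
print(efficiency(60.0, 64))                   # 0.9375
print(implementation_efficiency(24.0, 60.0))  # 0.4
```

Under these assumed definitions, applied parallelism falling below natural parallelism shows up directly as an implementation efficiency below 1.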