计算机体系结构-量化研究方法3rd资源-CSDN文库

共16个文件

pdf：16个

体系结构

4星 · 超过85%的资源需积分: 9 183 浏览量 2008-03-15 00:11:30 上传评论 1 收藏 7.96MB RAR 举报

资源推荐

资源详情

资源评论

收起资源包目录

计算机体系结构-量化研究方法3rd.rar （16个子文件）

计算机体系结构-量化研究方法3rd

1558605967-appendix-E.pdf 140KB

1558605967-appendix-C.pdf 234KB

chap03.2001.pdf 1.15MB

chap02.pdf 630KB

1558605967-appendix-H.pdf 1.23MB

chap04.2001.pdf 788KB

1558605967-appendix-G.pdf 271KB

1558605967-appendix-I.pdf 88KB

1558605967-appendix-F.pdf 72KB

chap05.pdf 1.04MB

chap08.pdf 879KB

1558605967-appendix-D.pdf 156KB

chap06.2001.pdf 1.11MB

AppendixA.pdf 1007KB

chap01.2001.pdf 857KB

chap07.pdf 836KB

H.1

Introduction H-2

H.2

Basic Techniques of Integer Arithmetic H-2

H.3

Floating Point H-13

H.4

Floating-Point Multiplication H-17

H.5

Floating-Point Addition H-21

H.6

Division and Remainder H-27

H.7

More on Floating-Point Arithmetic H-33

H.8

Speeding Up Integer Addition H-37

H.9

Speeding Up Integer Multiplication and Division H-45

H.10

Putting It All Together H-58

H.11

Fallacies and Pitfalls H-62

H.12

Historical Perspective and References H-63

Exercises H-69

Computer Arithmetic

by David Goldberg

Xerox Palo Alto Research Center

The Fast drives out the Slow even if the Fast is wrong.

W. Kahan

H-2

■

Appendix H

Computer Arithmetic

Although computer arithmetic is sometimes viewed as a specialized part of CPU

design, it is a very important part. This was brought home for Intel in 1994 when

their Pentium chip was discovered to have a bug in the divide algorithm. This

ﬂoating-point ﬂaw resulted in a ﬂurry of bad publicity for Intel and also cost them

a lot of money. Intel took a $300 million write-off to cover the cost of replacing

the buggy chips.

In this appendix we will study some basic ﬂoating-point algorithms, includ-

ing the division algorithm used on the Pentium. Although a tremendous variety of

algorithms have been proposed for use in ﬂoating-point accelerators, actual

implementations are usually based on reﬁnements and variations of the few basic

algorithms presented here. In addition to choosing algorithms for addition, sub-

traction, multiplication, and division, the computer architect must make other

choices. What precisions should be implemented? How should exceptions be

handled? This appendix will give you the background for making these and other

decisions.

Our discussion of ﬂoating point will focus almost exclusively on the IEEE

ﬂoating-point standard (IEEE 754) because of its rapidly increasing acceptance.

Although ﬂoating-point arithmetic involves manipulating exponents and shifting

fractions, the bulk of the time in ﬂoating-point operations is spent operating on

fractions using integer algorithms (but not necessarily sharing the hardware that

implements integer instructions). Thus, after our discussion of ﬂoating point, we

will take a more detailed look at integer algorithms.

Some good references on computer arithmetic, in order from least to most

detailed, are Chapter 4 of Patterson and Hennessy [1994]; Chapter 7 of Hama-

cher, Vranesic, and Zaky [1984]; Gosling [1980]; and Scott [1985].

Readers who have studied computer arithmetic before will ﬁnd most of this sec-

tion to be review.

Ripple-Carry Addition

Adders are usually implemented by combining multiple copies of simple com-

ponents. The natural components for addition are

half adders

and

full adders

The half adder takes two bits

and

as input and produces a sum bit

and a

carry bit

out

as output. Mathematically,

= (

) mod 2

and

out



(

)/2



where





is the ﬂoor function. As logic equations,

and

out

where

means

∧

and

means

∨

. The half adder is also called a (2,2)

adder, since it takes two inputs and produces two outputs. The full adder is a

(3,2) adder and is deﬁned by

= (

) mod 2,

out



(

)/2



, or the

logic equations

H.1 Introduction

H.2 Basic Techniques of Integer Arithmetic

■

H.2.1

abc

H.2.2

out

The principal problem in constructing an adder for

-bit numbers out of

smaller pieces is propagating the carries from one piece to the next. The most

obvious way to solve this is with a

ripple-carry adder,

consisting of

full

adders, as illustrated in Figure H.1. (In the ﬁgures in this appendix, the least-sig-

niﬁcant bit is always on the right.) The inputs to the adder are

–1

–2

⋅ ⋅ ⋅

and

–1

–2

⋅ ⋅ ⋅

, where

–1

–2

⋅ ⋅ ⋅

represents the number

–1

–2

⋅ ⋅ ⋅

. The

output of the

th adder is fed into the

input of the

next adder (the (

+ 1)-th adder) with the lower-order carry-in

set to 0. Since

the low-order carry-in is wired to 0, the low-order adder could be a half adder.

Later, however, we will see that setting the low-order carry-in bit to 1 is useful

for performing subtraction.

In general, the time a circuit takes to produce an output is proportional to the

maximum number of logic levels through which a signal travels. However, deter-

mining the exact relationship between logic levels and timings is highly technol-

ogy dependent. Therefore, when comparing adders we will simply compare the

number of logic levels in each one. How many levels are there for a ripple-carry

adder? It takes two levels to compute

from

and

. Then it takes two more

levels to compute

from

, and so on, up to

. So there are a total of 2

levels. Typical values of

are 32 for integer arithmetic and 53 for double-

precision ﬂoating point. The ripple-carry adder is the slowest adder, but also the

cheapest. It can be built with only

simple cells, connected in a simple, regular

way.

Because the ripple-carry adder is relatively slow compared with the designs

discussed in Section H.8, you might wonder why it is used at all. In technologies

like CMOS, even though ripple adders take time O(

), the constant factor is very

small. In such cases short ripple adders are often used as building blocks in larger

adders.

Figure H.1 Ripple-carry adder, consisting of n full adders.The carry-out of one full

adder is connected to the carry-in of the adder for the next most-signiﬁcant bit. The car-

ries ripple from the least-signiﬁcant bit (on the right) to the most-signiﬁcant bit (on the

left).

n–1

Full

adder

n–1

n–2

Full

adder

Full

adder

Full

adder

H-4 ■ Appendix H Computer Arithmetic

Radix-2 Multiplication and Division

The simplest multiplier computes the product of two unsigned numbers, one bit at

a time, as illustrated in Figure H.2(a). The numbers to be multiplied are a

n–1

n–2

⋅ ⋅ ⋅ a

and b

n–1

n–2

⋅ ⋅ ⋅ b

, and they are placed in registers A and B, respectively.

Multiply Step (i) If the least-significant bit of A is 1, then register B, containing b

n–1

n–2

⋅ ⋅ ⋅

, is added to P; otherwise 00 ⋅ ⋅ ⋅ 00 is added to P. The sum is placed back

into P.

Figure H.2 Block diagram of (a) multiplier and (b) divider for n-bit unsigned inte-

gers. Each multiplication step consists of adding the contents of P to either B or 0

(depending on the low-order bit of A), replacing P with the sum, and then shifting both

P and A one bit right. Each division step involves ﬁrst shifting P and A one bit left, sub-

tracting B from P, and, if the difference is nonnegative, putting it into P. If the difference

is nonnegative, the low-order bit of A is set to 1.

Carry-out

Shift

n + 1

Shift

(a)

(b)

评论收藏

内容反馈

worldseeker

2012-04-14

东西还不错，但是每个章节是分开的，这样不是很方便，要是合起来就好了
天蓝控

2011-09-20

很全，虽然是扫描版的
minilin2

2012-05-08

当时的考试用书比较全面也比较抽象

BigBangBug

粉丝: 6
资源: 22

计算机体系结构-量化研究方法3rd

计算机体系结构 量化研究方法

计算机体系结构—量化研究方法

计算机体系结构量化研究方法

计算机体系结构-量化研究方法第三版 中文版

计算机体系结构 - 量化研究方法 第三版（中文版和英文版）

《计算机体系结构-量化研究方法》全书课后习题答案

[计算机体系结构-量化研究方法].5th.[John L. Hennessy&David A. Patterson]1

计算机体系结构量化分析3rd

计算机体系结构的量化研究

计算机体系结构量化研究方法(第四版)

计算机体系结构量化研究方法第四版

《计算机体系结构-量化研究方法》-第五版 以及课后习题答案

计算机体系结构-量化研究方法 中文第四版 英文第四版 英文第五版

计算机体系结构—量化研究方法（第5版）_体系结构_

计算机体系结构－量化研究方法_第6版（英文版）.pdf

计算机体系结构-量化研究方法_计算机体系结构量化研究方法pdf_

计算机体系结构：量化研究方法 第4版

体系结构\计算机体系结构：量化研究方法(原版)

计算机体系结构 量化研究方法 (第5版)

计算机体系结构：量化研究方法（第5版）

计算机体系结构-量化研究方法(二)_

计算机体系结构-量化研究方法(第四版)

计算机系统结构-量化研究方法

计算机体系结构-量化研究方法3th

计算机体系结构-量化研究方法第六版

计算机体系结构-量化研究(第5版)-答案与附录

最新资源

计算机体系结构量化研究方法

计算机体系结构-量化研究方法第三版中文版

计算机体系结构 - 量化研究方法第三版（中文版和英文版）

《计算机体系结构-量化研究方法》-第五版以及课后习题答案

计算机体系结构-量化研究方法中文第四版英文第四版英文第五版

计算机体系结构：量化研究方法第4版

计算机体系结构量化研究方法 (第5版)