A Fast File System for UNIX*
Marshall Kirk McKusick, William N. Joy†,
Samuel J. Leffler‡, Robert S. Fabry
Computer Systems Research Group
Computer Science Division
Department of Electrical Engineering and Computer Science
University of California, Berkeley
Berkeley, CA 94720
ABSTRACT
A reimplementation of the UNIX file system is described. The reimplementation
provides substantially higher throughput rates by using more flexible allocation policies
that allow better locality of reference and can be adapted to a wide range of peripheral
and processor characteristics. The new file system clusters data that is sequentially
accessed and provides two block sizes to allow fast access to large files while not wasting
large amounts of space for small files. File access rates of up to ten times faster than the
traditional UNIX file system are experienced. Long needed enhancements to the
programmers' interface are discussed. These include a mechanism to place advisory locks
on files, extensions of the name space across file systems, the ability to use long file
names, and provisions for administrative control of resource usage.
Revised February 18, 1984
CR Categories and Subject Descriptors: D.4.3 [Operating Systems]: File Systems Management – file
organization, directory structures, access methods; D.4.2 [Operating Systems]: Storage Management –
allocation/deallocation strategies, secondary storage devices; D.4.8 [Operating Systems]: Performance –
measurements, operational analysis; H.3.2 [Information Systems]: Information Storage – file organization
Additional Keywords and Phrases: UNIX, file system organization, file system performance, file system
design, application program interface.
General Terms: file system, measurement, performance.
*UNIX is a trademark of Bell Laboratories.
†William N. Joy is currently employed by: Sun Microsystems, Inc, 2550 Garcia Avenue, Mountain View, CA
94043
‡Samuel J. Leffler is currently employed by: Lucasfilm Ltd., PO Box 2009, San Rafael, CA 94912
This work was done under grants from the National Science Foundation under grant MCS80-05144, and the
Defense Advanced Research Projects Agency (DoD) under ARPA Order No. 4031 monitored by Naval
Electronic System Command under Contract No. N00039-82-C-0235.
SMM:05-2 A Fast File System for UNIX
TABLE OF CONTENTS
1. Introduction
2. Old file system
3. New file system organization
3.1. Optimizing storage utilization
3.2. File system parameterization
3.3. Layout policies
4. Performance
5. File system functional enhancements
5.1. Long file names
5.2. File locking
5.3. Symbolic links
5.4. Rename
5.5. Quotas
Acknowledgements
References
1. Introduction
This paper describes the changes from the original 512 byte UNIX file system to the new one
released with the 4.2 Berkeley Software Distribution. It presents the motivations for the changes, the
methods used to effect these changes, the rationale behind the design decisions, and a description of the new
implementation. This discussion is followed by a summary of the results that have been obtained,
directions for future work, and the additions and changes that have been made to the facilities that are available
to programmers.
The original UNIX system that runs on the PDP-11† has simple and elegant file system facilities.
File system input/output is buffered by the kernel; there are no alignment constraints on data transfers and
all operations are made to appear synchronous. All transfers to the disk are in 512 byte blocks, which can
be placed arbitrarily within the data area of the file system. Virtually no constraints other than available
disk space are placed on file growth [Ritchie74], [Thompson78].*
When used on the VAX-11 together with other UNIX enhancements, the original 512 byte UNIX file
system is incapable of providing the data throughput rates that many applications require. For example,
applications such as VLSI design and image processing do a small amount of processing on large
quantities of data and need to have a high throughput from the file system. High throughput rates are also needed
by programs that map files from the file system into large virtual address spaces. Paging data in and out of
the file system is likely to occur frequently [Ferrin82b]. This requires a file system providing higher bandwidth
than the original 512 byte UNIX one, which provides only about two percent of the maximum disk
bandwidth, or about 20 kilobytes per second per arm [White80], [Smith81b].
Modifications have been made to the UNIX file system to improve its performance. Since the UNIX
file system interface is well understood and not inherently slow, this development retained the abstraction
and simply changed the underlying implementation to increase its throughput. Consequently, users of the
system have not been faced with massive software conversion.
Problems with file system performance have been dealt with extensively in the literature; see
[Smith81a] for a survey. Previous work to improve the UNIX file system performance has been done by
[Ferrin82a]. The UNIX operating system drew many of its ideas from Multics, a large, high performance
†DEC, PDP,VAX, MASSBUS, and UNIBUS are trademarks of Digital Equipment Corporation.
*In practice, a file's size is constrained to be less than about one gigabyte.
operating system [Feiertag71]. Other work includes Hydra [Almes78], Spice [Thompson80], and a file
system for a LISP environment [Symbolics81]. A good introduction to the physical latencies of disks is
described in [Pechura83].
2. Old File System
In the file system developed at Bell Laboratories (the ''traditional'' file system), each disk drive is
divided into one or more partitions. Each of these disk partitions may contain one file system. A file
system never spans multiple partitions.† A file system is described by its super-block, which contains the
basic parameters of the file system. These include the number of data blocks in the file system, a count of
the maximum number of files, and a pointer to the free list, a linked list of all the free blocks in the file
system.
Within the file system are files. Certain files are distinguished as directories and contain pointers to
files that may themselves be directories. Every file has a descriptor associated with it called an inode. An
inode contains information describing ownership of the file, time stamps marking last modification and
access times for the file, and an array of indices that point to the data blocks for the file. For the purposes
of this section, we assume that the first 8 blocks of the file are directly referenced by values stored in an
inode itself*. An inode may also contain references to indirect blocks containing further data block indices.
In a file system with a 512 byte block size, a singly indirect block contains 128 further block addresses, a
doubly indirect block contains 128 addresses of further singly indirect blocks, and a triply indirect block
contains 128 addresses of further doubly indirect blocks.
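Putting these numbers together gives the largest file this addressing scheme can describe. The sketch below (Python used for illustration, not code from the system) adopts this section's assumptions of 8 direct blocks and 4 byte block addresses:

```python
# Maximum file size addressable by the traditional inode scheme:
# 8 direct blocks plus singly, doubly, and triply indirect blocks,
# with 512 byte blocks and 128 addresses per indirect block.
# (The direct-block count of 8 is this section's assumption; as the
# footnote notes, real systems varied between 5 and 13.)

BLOCK_SIZE = 512
DIRECT_BLOCKS = 8
ADDRS_PER_INDIRECT = BLOCK_SIZE // 4  # 4 byte block addresses -> 128

max_file_bytes = (DIRECT_BLOCKS
                  + ADDRS_PER_INDIRECT
                  + ADDRS_PER_INDIRECT ** 2
                  + ADDRS_PER_INDIRECT ** 3) * BLOCK_SIZE
print(max_file_bytes)  # 1082200064, just over one gigabyte
```

The total of roughly one gigabyte agrees with the earlier footnote that a file's size is constrained to be less than about one gigabyte.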
A 150 megabyte traditional UNIX file system consists of 4 megabytes of inodes followed by 146
megabytes of data. This organization segregates the inode information from the data; thus accessing a file
normally incurs a long seek from the file's inode to its data. Files in a single directory are not typically
allocated consecutive slots in the 4 megabytes of inodes, causing many non-consecutive blocks of inodes to
be accessed when executing operations on the inodes of several files in a directory.
The allocation of data blocks to files is also suboptimum. The traditional file system never transfers
more than 512 bytes per disk transaction and often finds that the next sequential data block is not on the
same cylinder, forcing seeks between 512 byte transfers. The combination of the small block size, limited
read-ahead in the system, and many seeks severely limits file system throughput.
The first work at Berkeley on the UNIX file system attempted to improve both reliability and
throughput. The reliability was improved by staging modifications to critical file system information so
that they could either be completed or repaired cleanly by a program after a crash [Kowalski78]. The file
system performance was improved by a factor of more than two by changing the basic block size from 512
to 1024 bytes. The increase was because of two factors: each disk transfer accessed twice as much data,
and most files could be described without need to access indirect blocks since the direct blocks contained
twice as much data. The file system with these changes will henceforth be referred to as the old file system.
This performance improvement gave a strong indication that increasing the block size was a good
method for improving throughput. Although the throughput had doubled, the old file system was still using
only about four percent of the disk bandwidth. The main problem was that although the free list was
initially ordered for optimal access, it quickly became scrambled as files were created and removed. Eventually
the free list became entirely random, causing files to have their blocks allocated randomly over the
disk. This forced a seek before every block access. Although old file systems provided transfer rates of up
to 175 kilobytes per second when they were first created, this rate deteriorated to 30 kilobytes per second
after a few weeks of moderate use because of this randomization of data block placement. There was no
way of restoring the performance of an old file system except to dump, rebuild, and restore the file system.
Another possibility, as suggested by [Maruyama76], would be to have a process that periodically
†By ''partition'' here we refer to the subdivision of physical space on a disk drive. In the traditional file
system, as in the new file system, file systems are really located in logical disk partitions that may overlap. This
overlapping is made available, for example, to allow programs to copy entire disk drives containing multiple file
systems.
*The actual number may vary from system to system, but is usually in the range 5-13.
reorganized the data on the disk to restore locality.
3. New file system organization
In the new file system organization (as in the old file system organization), each disk drive contains
one or more file systems. A file system is described by its super-block, located at the beginning of the file
system's disk partition. Because the super-block contains critical data, it is replicated to protect against
catastrophic loss. This is done when the file system is created; since the super-block data does not change,
the copies need not be referenced unless a head crash or other hard disk error causes the default super-block
to be unusable.
To insure that it is possible to create files as large as 2^32 bytes with only two levels of indirection, the
minimum size of a file system block is 4096 bytes. The size of file system blocks can be any power of two
greater than or equal to 4096. The block size of a file system is recorded in the file system's super-block so
it is possible for file systems with different block sizes to be simultaneously accessible on the same system.
The block size must be decided at the time that the file system is created; it cannot be subsequently changed
without rebuilding the file system.
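The choice of 4096 bytes as the minimum block size can be checked directly. Assuming 4 byte block addresses (an assumption; the text does not state the address size here), a 4096 byte indirect block holds 1024 addresses, so the doubly indirect block alone spans 2^32 bytes:

```python
# With 4096 byte blocks and (assumed) 4 byte block addresses, two
# levels of indirection suffice to reach a 2^32 byte file: the
# doubly indirect block addresses 1024 singly indirect blocks,
# each addressing 1024 data blocks of 4096 bytes.

BLOCK_SIZE = 4096
ADDRS_PER_INDIRECT = BLOCK_SIZE // 4  # 1024

double_indirect_span = ADDRS_PER_INDIRECT ** 2 * BLOCK_SIZE
print(double_indirect_span == 2 ** 32)  # True
```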
The new file system organization divides a disk partition into one or more areas called cylinder
groups. A cylinder group is comprised of one or more consecutive cylinders on a disk. Associated with
each cylinder group is some bookkeeping information that includes a redundant copy of the super-block,
space for inodes, a bit map describing available blocks in the cylinder group, and summary information
describing the usage of data blocks within the cylinder group. The bit map of available blocks in the
cylinder group replaces the traditional file system's free list. For each cylinder group a static number of inodes
is allocated at file system creation time. The default policy is to allocate one inode for each 2048 bytes of
space in the cylinder group, expecting this to be far more than will ever be needed.
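The default inode allocation policy amounts to a one-line calculation; the sketch below is an illustration, not the allocator itself, and the example size is hypothetical:

```python
# Default policy: one inode for each 2048 bytes of space in the
# cylinder group.

def default_inode_count(group_bytes, bytes_per_inode=2048):
    return group_bytes // bytes_per_inode

# e.g. a hypothetical 150 megabyte file system's worth of space:
print(default_inode_count(150 * 1024 * 1024))  # 76800
```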
All the cylinder group bookkeeping information could be placed at the beginning of each cylinder
group. However, if this approach were used, all the redundant information would be on the top platter. A
single hardware failure that destroyed the top platter could cause the loss of all redundant copies of the
super-block. Thus the cylinder group bookkeeping information begins at a varying offset from the
beginning of the cylinder group. The offset for each successive cylinder group is calculated to be about one track
further from the beginning of the cylinder group than the preceding cylinder group. In this way the
redundant information spirals down into the pack so that any single track, cylinder, or platter can be lost without
losing all copies of the super-block. Except for the first cylinder group, the space between the beginning of
the cylinder group and the beginning of the cylinder group information is used for data blocks.†
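The spiraling offset can be sketched as a simple calculation. The geometry values below are hypothetical, and this is not the exact 4.2BSD formula, which the text does not give:

```python
# Each successive cylinder group places its bookkeeping information
# about one track further from the start of the group, wrapping
# within a cylinder, so no single track, cylinder, or platter holds
# every copy of the super-block.  Offsets are in sectors.

def bookkeeping_offset(group_index, sectors_per_track, tracks_per_cylinder):
    sectors_per_cylinder = sectors_per_track * tracks_per_cylinder
    return (group_index * sectors_per_track) % sectors_per_cylinder

# Successive groups land on successive tracks of their cylinder:
print([bookkeeping_offset(i, 32, 19) for i in range(4)])  # [0, 32, 64, 96]
```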
3.1. Optimizing storage utilization
Data is laid out so that larger blocks can be transferred in a single disk transaction, greatly increasing
file system throughput. As an example, consider a file in the new file system composed of 4096 byte data
blocks. In the old file system this file would be composed of 1024 byte blocks. By increasing the block
size, disk accesses in the new file system may transfer up to four times as much information per disk
transaction. In large files, several 4096 byte blocks may be allocated from the same cylinder so that even larger
data transfers are possible before requiring a seek.
The main problem with larger blocks is that most UNIX file systems are composed of many small
files. A uniformly large block size wastes space. Table 1 shows the effect of file system block size on the
amount of wasted space in the file system. The files measured to obtain these figures reside on one of our
†While it appears that the first cylinder group could be laid out with its super-block at the ''known'' location,
this would not work for file systems with block sizes of 16 kilobytes or greater. This is because of a
requirement that the first 8 kilobytes of the disk be reserved for a bootstrap program and a separate requirement that the
cylinder group information begin on a file system block boundary. To start the cylinder group on a file system
block boundary, file systems with block sizes larger than 8 kilobytes would have to leave an empty space
between the end of the boot block and the beginning of the cylinder group. Without knowing the size of the file
system blocks, the system would not know what roundup function to use to find the beginning of the first
cylinder group.
time sharing systems that has roughly 1.2 gigabytes of on-line storage. The measurements are based on the
active user file systems containing about 920 megabytes of formatted space.
Space used    % waste    Organization
775.2 Mb      0.0        Data only, no separation between files
807.8 Mb      4.2        Data only, each file starts on 512 byte boundary
828.7 Mb      6.9        Data + inodes, 512 byte block UNIX file system
866.5 Mb      11.8       Data + inodes, 1024 byte block UNIX file system
948.5 Mb      22.4       Data + inodes, 2048 byte block UNIX file system
1128.3 Mb     45.6       Data + inodes, 4096 byte block UNIX file system

Table 1 - Wasted space as a function of block size.
The space wasted is calculated to be the percentage of space on the disk not containing user data. As the
block size on the disk increases, the waste rises quickly, to an intolerable 45.6% waste with 4096 byte file
system blocks.
To be able to use large blocks without undue waste, small files must be stored in a more efficient way.
The new file system accomplishes this goal by allowing the division of a single file system block into one
or more fragments. The file system fragment size is specified at the time that the file system is created;
each file system block can optionally be broken into 2, 4, or 8 fragments, each of which is addressable. The
lower bound on the size of these fragments is constrained by the disk sector size, typically 512 bytes. The
block map associated with each cylinder group records the space available in a cylinder group at the
fragment level; to determine if a block is available, aligned fragments are examined. Figure 1 shows a piece of
a map from a 4096/1024 file system.

Bits in map         XXXX  XXOO  OOXX  OOOO
Fragment numbers     0-3   4-7  8-11 12-15
Block numbers          0     1     2     3

Figure 1 - Example layout of blocks and fragments in a 4096/1024 file system.

Each bit in the map records the status of a fragment; an ''X'' shows that the fragment is in use, while an ''O''
shows that the fragment is available for allocation. In this example, fragments 0-5, 10, and 11 are in use,
while fragments 6-9 and 12-15 are free. Fragments of adjoining blocks cannot be used as a full block,
even if they are large enough. In this example, fragments 6-9 cannot be allocated as a full block; only
fragments 12-15 can be coalesced into a full block.
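The aligned-fragment check can be sketched against the map of Figure 1; this is an illustration in Python, not the 4.2BSD allocator:

```python
# Scan a cylinder group fragment map for a fully free, block-aligned
# run of fragments.  1 marks a fragment in use ("X" in Figure 1),
# 0 a free fragment ("O").  FRAGS_PER_BLOCK is 4 for a 4096/1024
# file system.

FRAGS_PER_BLOCK = 4

def find_free_block(frag_map):
    """Return the number of a fully free, aligned block, or None."""
    for block in range(len(frag_map) // FRAGS_PER_BLOCK):
        start = block * FRAGS_PER_BLOCK
        if not any(frag_map[start:start + FRAGS_PER_BLOCK]):
            return block
    return None

# The map of Figure 1: XXXX XXOO OOXX OOOO
frag_map = [1,1,1,1, 1,1,0,0, 0,0,1,1, 0,0,0,0]
print(find_free_block(frag_map))  # 3
```

Fragments 6-9 are free but straddle blocks 1 and 2, so only block 3 (fragments 12-15) qualifies as a free full block.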
On a file system with a block size of 4096 bytes and a fragment size of 1024 bytes, a file is
represented by zero or more 4096 byte blocks of data, and possibly a single fragmented block. If a file system
block must be fragmented to obtain space for a small amount of data, the remaining fragments of the block
are made available for allocation to other files. As an example consider an 11000 byte file stored on a
4096/1024 byte file system. This file would use two full size blocks and one three fragment portion of
another block. If no block with three aligned fragments is available at the time the file is created, a full size
block is split, yielding the necessary fragments and a single unused fragment. This remaining fragment can
be allocated to another file as needed.
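The arithmetic for the 11000 byte example can be written out directly; a sketch using the sizes from the example:

```python
# Lay out a file on a 4096/1024 byte file system: whole 4096 byte
# blocks, plus one fragmented block covering the tail of the file
# in 1024 byte fragments.

BLOCK_SIZE = 4096
FRAG_SIZE = 1024

def file_layout(size_bytes):
    """Return (full_blocks, tail_fragments) for a file of this size."""
    full_blocks, tail = divmod(size_bytes, BLOCK_SIZE)
    tail_fragments = -(-tail // FRAG_SIZE)  # ceiling division
    return full_blocks, tail_fragments

print(file_layout(11000))  # (2, 3): two full blocks and three fragments
```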
Space is allocated to a file when a program does a write system call. Each time data is written to a
file, the system checks to see if the size of the file has increased*. If the file needs to be expanded to hold
the new data, one of three conditions exists:
1) There is enough space left in an already allocated block or fragment to hold the new data. The new
data is written into the available space.
2) The file contains no fragmented blocks (and the last block in the file contains insufficient space to
hold the new data). If space exists in a block already allocated, the space is filled with new data. If
the remainder of the new data contains more than a full block of data, a full block is allocated and the
*A program may be overwriting data in the middle of an existing file in which case space would already have
been allocated.