RobustChip-LevelClockTreeSynthesisforSOCDesigns

需积分: 21 170 浏览量 2008-09-29 10:24:48 上传评论收藏 320KB PDF 举报

### 基于SOC设计的稳健芯片级时钟树综合技术 #### 摘要与背景在现代的系统级芯片(System-on-a-Chip, SOC)设计中，一个关键问题是如何进行有效的芯片级时钟树综合(Chip-Level Clock Tree Synthesis, CCTS)。随着集成度不断提高，SOC设计已成为65nm/45nm VLSI技术中的主流趋势，并预计未来将持续发展。为了确保整个SOC的性能、稳定性和可靠性，CCTS是物理设计过程中不可或缺的一环。 #### 芯片级时钟树综合的重要性 CCTS的主要目标之一是合并芯片上不同知识产权(Intellectual Property, IP)模块的时钟树，使得整个时钟树在所有工艺角落(process corners)下具有较小的时钟偏移(skew)。这种平衡有助于在整个设计空间内实现良好的时序收敛，从而保证了芯片在各种条件下的正常运行。另一个重要的方面是减少不同IP之间存在关键路径的关键时钟树之间的时钟分歧(clock divergence)，以降低这些路径上的最大可能时钟偏移，进而提高芯片的整体产量。 #### 本文提出的方法及贡献本文提出了一种有效的CCTS算法，该算法能够同时减少多角落时钟偏移以及关键路径之间的时钟分歧。据我们所知，这是首次尝试解决这一实际中非常重要的问题。实验结果表明，我们的方法能够在不显著增加缓冲区面积或线路长度的情况下，实现10%-31%(平均20%)的时钟分歧减少，并且时钟偏移降低了16-64皮秒(相当于1GHz时钟周期时间的1.6%-6.4%)。 #### 算法细节 1. **多角落时钟偏移减少**：为了减少时钟偏移，该算法考虑了所有可能的工艺角落，包括但不限于最佳情况(best-case)、最坏情况(worst-case)以及中间的各种情况。通过动态调整时钟树的结构和参数，可以在不同条件下保持稳定的时钟信号。 2. **时钟分歧最小化**：对于存在关键路径的IP模块之间的时钟分歧问题，算法采用了一种创新的平衡策略。这包括但不限于时钟树的重新路由、缓冲区的优化配置以及频率调节等措施，以确保时钟信号在这些关键路径上的一致性。 3. **资源优化**：为了解决资源占用问题，本文提出的算法通过智能选择和放置缓冲区来最小化额外的缓冲器面积和线路长度的增加。这种优化不仅减少了硬件资源的需求，而且有助于提高整体的电路性能。 #### 实验验证为了验证所提方法的有效性，作者进行了广泛的实验测试。通过对多个案例的研究，结果显示该方法能够显著改善SOC设计中的时钟性能，同时保持较低的资源消耗。 #### 结论本文介绍了一种针对SOC设计中的芯片级时钟树综合问题的有效解决方案。通过同时减少多角落时钟偏移和关键路径间的时钟分歧，所提出的算法不仅提高了时钟信号的稳定性，还增强了整个SOC的设计质量和生产效率。这一成果对于当前和未来的SOC设计领域具有重要意义，有望推动更复杂、更高性能的集成电路的发展。

资源推荐

资源详情

资源评论

Robust Chip-Level Clock Tree Synthesis for SOC Designs

Anand Rajaram

Department of ECE

University of Texas at Austin, Texas

anandr@mail.utexas.edu

David Z. Pan

Department of ECE

University of Texas at Austin, Texas

dpan@ece.utexas.edu

ABSTRACT

A key problem that arises in System-on-a-Chip (SOC) designs

of today is the Chip-level Clock Tree Synthesis (CCTS). CCTS

is done by merging all the clock trees belonging to diﬀerent

IPs per chip speciﬁcations. A primary requirement of CCTS

is to balance the sub-clock-trees belonging to diﬀerent IPs such

that the entire tree has a small skew across all process cor-

ners. This helps in timing closure across all the design cor-

ners. Another important requirement of CCTS is to reduce

clock divergence between IPs that have critical timing paths

between them, thereby reducing maximum possible clock skew

in the critical paths and thus improves yield. In this work,

we propose eﬀective CCTS algorithms to simultaneously re-

duce multi-corner skew and clock divergence. To the best of

our knowledge, this is the ﬁrst work that attempts to solve this

practically important problem. Experimental results on several

testcases indicate that our methods achieve 10%-31%(20% on

average) clock divergence reduction and between 16-64ps skew

reduction (1.6%-6.4% of cycle time for a 1GHz clock) with less

than 0.5% increase in buﬀer area/wirelength compared to ex-

isting CTS algorithms.

Categories and Subject Descriptors

B.7.2 [Hardware]: Integrated Circuits

General Terms

Algorithms

Keywords

Clock Network, Chip-level CTS, Physical Design

1. INTRODUCTION

A System-on-a-Chip (SOC) design can be deﬁned as “an IC,

signs to provide full functionality for an application” [1]. In

today’s 65nm/45nm VLSI technologies, SOC designs have be-

come increasingly common and the trend is expected to con-

tinue in the future [2]. Most SOC physical design closure is

done in a hierarchical fashion [1]. In such a methodology, diﬀer-

ent logical and physical partitions of the chip are timing closed

independently [1–4] followed by a chip-level timing closure step.

This chip-level timing closure includes CCTS in which a chip-

level clock tree is synthesized to drive all the block-level clock

trees. The primary objective of CCTS is that the full clock tree,

which includes the chip-level and all the block-level clock trees,

Permission to make digital or hard copies of all or part of this work for

personal or classroom use is granted without fee provided that copies are

not made or distributed for proﬁt or commercial advantage and that copies

bear this notice and the full citation on the ﬁrst page. To copy otherwise, to

republish, to post on servers or to redistribute to lists, requires prior speciﬁc

permission and/or a fee.

should be balanced and have less skew across all design corners.

Satisfying this requirement is relatively easy when considering

only the nominal delay corner. However, timing closure in most

practical chips involve verifying timing across several corners

that represent several global variation eﬀects. This implies that

the clock trees should have small skews across all the design

corners. This is a very challenging task primarily because of

the possible diﬀerence in the way the delays of the diﬀerent

sub-clock-trees scale, either because of diﬀerence in the clock

structures or the relative signiﬁcance of cell and interconnect

delays. Another objective of CCTS is to minimize the clock

divergence for the IPs with critical path between them. This

helps to minimize skew variation between the critical timing

paths between the IPs and thus improves the overall yield. In

this work, we propose eﬀective algorithms with the objective

of addressing the above two objectives.

2. MOTIVATION

2.1 Signiﬁcance of Clock Divergence Reduction

The signiﬁcance of reducing clock divergence between registers

in timing-critical paths is well known. For a given overall delay,

the lesser the divergent delay between the such register-pairs,

the lesser is the value of maximum skew (and skew variation)

that can be seen between them. The same principle is also

applicable at the chip-level where diﬀerent sub-blocks interact

with each other instead of register pairs.

2.2 Impact of Sub-block Clock Pin Location

Unlike hard IPs, the clock pins of the soft-IPs can be changed

speciﬁc to a given chip and ﬂoorplan. This ﬂexibility can be

used towards clock divergence reduction between critical IPs.

Figure 1 shows a simple example where the clock pin assign-

ment might make a diﬀerence in clock divergence reducing.

Case

Critical Paths

Case

Critical Paths

Divergence point

between A,B

Figure 1: Pin location in Case B will result in reduced

clock divergence between A and B.

2.3 Multi-corner skew reduction problem

Consider Figure 2 where only two sub-blocks are present. The

squares in the sub-blocks represent clock sinks. The left-side

block has bigger buﬀers with longer interconnects and the right-

side block has smaller buﬀers with shorter interconnect. Let us

assume that both sub-clock-trees have identical delays in the

nominal corner. However, their delays across other corners will

be diﬀerent, mainly because of the diﬀerence in the intercon-

nect lengths and buﬀer sizes. To balance these two sub-clock-

720

40.4

本内容试读结束，登录后可阅读更多

下载后可阅读完整内容，剩余3页未读，立即下载

评论收藏

内容反馈

unit2

粉丝: 1
资源: 2

Robust Chip-Level Clock Tree Synthesis for SOC Designs

CTS(Clock Tree Synthesis)重要参考

A Practical and Robust Bump-mapping Technique for Today's GPUs

Feature Tracking for Robust Structure-from-Motion

讲稿_Robust Multi-Modality Multi-Object Tracking.docx

演示-Robust Multi-Modality Multi-Object Tracking.pptx

Complete and robust no-fit polygon generation for the irregular stock cutting problem

Robust Real-time Object Detection 论文 整理ppt 及一篇相关中文论文

Robust scale-adaptive meanshift for tracking

Robust Optimization-Directed Design

Robust Controller Design for Parallel Multi-Inverter Systems Using Synthesis

Robust Collaborative Learning of Patch-level and Image-level

Simple, Accurate, and Robust Projector-Camera Calibration.pdf

robust multi-period portfolio selection .pdf

Robust Observer-Based Fault Diagnosis for Nonlinear System Using MATLAB

2012-Robust Low-Cost Control Scheme of Direct-Drive Gearless Tra

Synthesis and Scripting Techniques for Designing Multi-Async Clock

JEP162A-01：2021 System Level ESD Part II：Implementation of Effective ESD Robust Designs - 完整英文电子版（137页）.pdf

Qt 5实现串口调试助手 （源工程文件、0积分下载）

【SystemVerilog】路科验证V2学习笔记（全600页）.pdf

AutoSAR标准协议4.2.2

光伏-储能并网系统仿真.rar

XCP协议的规范文档

GD32替换STM32注意事项.pdf

NPPJSONViewer.zip

蓝牙BLE协议中文版.pdf

CANoe通过CAPL脚本实现自动测试

最新资源

Robust Real-time Object Detection 论文整理ppt 及一篇相关中文论文

Qt 5实现串口调试助手（源工程文件、0积分下载）