0% found this document useful (0 votes)

10 views

Week 6 Lecture Material_watermark

The document discusses timing closure in chip design, focusing on the integration of placement and routing solutions to meet geometric and timing constraints. It covers components such as timing-driven placement, routing, and physical synthesis, and emphasizes the importance of static timing analysis (STA) for ensuring that setup and hold time constraints are met. Additionally, it introduces the Zero-Slack Algorithm for optimizing gate and wire delays while establishing timing budgets during the physical design process.

Uploaded by

R INI BHANDARI

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views

Week 6 Lecture Material_watermark

Uploaded by

R INI BHANDARI

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 100

Lecture 32: TIMING CLOSURE (PART 1)

PROF. INDRANIL SENGUPTA

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
Introduction
• The layout of a chip must satisfy:
– Geometric constraints (e.g. non-overlapping cells and routability)
– Timing constraints of the design (e.g. setup and hold constraints)
• The optimization process that meets the above requirements
and constraints is often called timing closure.
• Integrates placement and routing solutions with specialized
methods to improve circuit performance.

2
Components of Timing Closure
1. Timing-driven placement
 Minimizes signal delays when assigning locations to circuit elements.
2. Timing-driven routing
 Minimizes signal delays when selecting routing topologies and specific routes.
3. Physical synthesis
 Sizing transistors or gates to decrease the delay or increase the drive strength of a
gate.
 Inserting buffers into nets to decrease propagation delays.
 Restructuring the circuit along its critical paths.

3
Background
• For many years, signal propagation delay in logic gates was
the main contributor to circuit delay, while wire delay was
negligible.
– Cell placement and wire routing did not affect circuit performance.
• Technology scaling post-1990 significantly increased the
relative impact of wire-induced delays.
– High-quality placement and routing have become critical for timing
closure.

4
Background
15% delay  Mid 80 Scenario
 Most of the input to output delay
85% delay
of the logic is due to gate delay.
50% delay
 Mid 90 Scenario
50% delay  Half of input to output delay of the
logic is due to wire delay.

80% delay  Today’s Scenario

 Most of input to output delay of
20% delay the logic is due to wire delay.

5
Quick Recap of Setup and Hold Times
• Timing optimization tools adjust propagation delays through
circuit components, with the primary goal of satisfying timing
constraints. Two ways:
– Setup (long-path) constraints: Amount of time a data input signal
should be stable before the clock edge for each storage element.
– Hold (short-path) constraints: Amount of time a data input signal
should be stable after the clock edge at each storage element.

6
(a) Setup Constraints
• Ensure that no signal transition occurs too late.
• Initial phases of timing closure focus on these types of
constraints:
tcycle ≥ tcombDelay + tsetup + tskew
• Checking whether a circuit meets setup constraints requires
estimating how long signal transitions will take to propagate
from one storage element to the next.
– Typically uses Static Timing Analysis.

7
• What is Static Timing Analysis?
– Propagates actual arrival times (AAT) and required arrival times (RAT)
to the terminals of every gate or cell.
– Can quickly identify timing violations, and diagnose them by tracing
out critical paths in the circuit that are responsible for these timing
failures.
– Models propagation of signal transitions with the worst possible delay.
– Typically excludes false paths from the analysis.

8
– For every timing point x in the circuit netlist, the timing slack is
computed as:
SLACK(x) = RAT(x) – AAT(x)
– Positive slack means timing has been met; negative means violation.
– Guided by slack values, physical synthesis restructures the netlist to
make it more suitable for high-performance layout implementation.
• Gates lying on critical paths can be upsized to propagate signals faster.
• Buffers may be inserted into long critical wires.
• The netlist tree can be restructured to decrease the overall depth.

9
Hold-time Constraints
• Ensure that signal transitions do not occur too early.
– Hold violations can occur when a signal path is too short, allowing a
receiving flip-flop to capture the signal at the same cycle instead of
the next cycle.
• Hold-time constraint is given by:
tcombDelay ≥ thold + tskew
– Clock skew affects hold-time constraints significantly more than setup
constraints. So, hold-time constraints are typically enforced after
synthesizing the clock network.

10
END OF LECTURE 32

11
Lecture 33: TIMING CLOSURE (PART 2)

PROF. INDRANIL SENGUPTA

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
Timing Analysis and Performance Constraints
• Almost all digital ICs are synchronous Finite State Machines (FSM).
– Transitions occur at a set clock frequency.
– A sequential circuit, unrolled in time:

Combinational Combinational Combinational

Logic FF Logic FF Logic FF
Copy 1 Copy 2 Copy 3

Clock

2
• The maximum clock frequency for a given design depends upon:
– Gate delays, which are the signal delays due to gate transitions.
– Wire delays, which are the delays associated with signal propagation
along wires.
– Clock skew.
• Need to quickly estimate sequential circuit timing:
– Perform static timing analysis (STA).
– Assume clock skew is negligible, postpone until after clock network
synthesis.

3
Static Timing Analysis
• We represent a combinational logic netlist as a directed acyclic graph (DAG).
• The inputs are annotated with times 0, 0 and 0.6 time units respectively, at
which signal transitions occur relative to the start of the clock cycle.
• The gate and wire delays are also shown.

a <0> (0.15) (0.2)

y (2) w (2) (0.2) f
(0.1)
b <0> (0.1) x (1) (0.3) (0.25)
z (2)
c <0.6> (0.1)

4
DAG Representation
• The graph has one vertex for each input and output, as well as one vertex
for each logic gate.
• A source node s is introduced with a directed edge to each input.
• Vertices corresponding to logic gates are labeled with the respective gate
delays.
• Directed edges from the source to the inputs are labeled with transition
times, and directed edges between gate vertices are labeled with wire
delays.

5
a <0> (0.15) (0.2)
y (2) w (2) (0.2) f
(0.1)
b <0> (0.1) x (1) (0.3) (0.25)
z (2)
c <0.6> (0.1)

a (0) (0.15) y (2)

(0) (0.1) (0.2)

s (0) b (0) (0.1) x (1) w (2) (0.2) f (0)

(0.6) (0.3) (0.25)

DAG
c (0) (0.1) z (2)

6
Actual Arrival Time (AAT)
• The AAT of a given node v V, denoted as AAT(v), is defined as the latest
transition time at v measured from the beginning of the clock cycle.
– By convention, AAT(v) records the arrival time at the output side of node v.
– In the previous example, AAT(x) = 0.1 + 1 = 1.1, AAT(y) = 1.1 + 0.1 + 2 = 3.2
• Formal definition:
AAT (v ) = max ( AAT (u ) + t (u , v ) )
u∈FI ( v )

where FI(v) is the set of all nodes from which there exists a directed edge
to v, and t(u,v) is the delay corresponding to the (u,v) edge.

7
• All AAT values in the DAG can be computed in O(|V| + |E|) time.
– Linear in number of gates are edges.
• This linear scaling of runtime makes STA applicable to modern designs
with hundreds of millions of gates.
a (0) (0.15) y (2)
A0 A 3.2
(0) (0.1) (0.2)

s (0) b (0) (0.1) x (1) w (2) (0.2) f (0)

A0
A0 A 1.1 (0.3) (0.25) A 5.65 A 5.85
(0.6)

c (0) (0.1) z (2)

A 0.6 A 3.4

8
Required Arrival Time (RAT)
• The RAT of a given node v V, denoted as RAT(v), is defined as the time by
which the latest transition at a given node v must occur in order for the
circuit to operate correctly within a given clock cycle.
– Unlike AATs, which are determined from multiple paths from upstream inputs
and flip-flop outputs, RATs are determined from multiple paths to downstream
outputs and flip-flop inputs.
• Formal definition:
RAT (v ) = min (RAT (u ) − t (u, v) )
u∈FO ( v )
where FO(v) is the set of all vertices with a directed edge from v.

9
• It is assumed that the RAT values for the outputs are given.
• For the example, suppose that RAT(f) = 5.5 .

a (0) (0.15) y (2)

R 0.95 R 3.1
(0) (0.1) (0.2)

s (0) b (0) (0.1) x (1) w (2) (0.2) f (0)

R -0.35 R -0.35 R 0.75 (0.3) (0.25) R 5.3 R 5.5
(0.6)

c (0) (0.1) z (2)

R 0.95 R 3.05

10 10
Slack Computation
• The correct operation of the chip with respect to setup constraints (e.g.
maximum path delay), requires that AAT at each node does not exceed RAT.
– That is, for all vertices v V, we must have AAT(v) ≤ RAT(v).
• The slack of a node v is computed as:
slack (v ) = RAT (v ) − AAT (v )
– Critical paths or critical nets are signals that have negative slack.
– Non-critical paths or non-critical nets have positive slack.

11
Final Result with Slacks A: AAT
Computed R: RAT
S: Slack
a (0) (0.15) y (2)
A0 A 3.2
(0) R 0.95 (0.1) R 3.1 (0.2)
S 0.95 S -0.1
s (0) b (0) (0.1) x (1) w (2) (0.2) f (0)
A0 A0
R -0.35 A 1.1 A 5.65 A 5.85
(0.6) R -0.35 R 0.75 (0.3) (0.25) R 5.3 R 5.5
S -0.35 S -0.35 S -0.35 S -0.35 S -0.35
c (0) (0.1) z (2)
A 0.6 A 3.4
R 0.95 R 3.05
S 0.35 S -0.35

12 12
Current Practice
• In modern designs, separate timing analyses are performed for the cases of
rise delay (rising transitions) and fall delay (falling transitions).
• Signal integrity extensions to STA consider changes in delay due to switching
activity on neighboring wires of the path under analysis.
– For signal integrity analysis, the STA tool keeps track of windows (intervals) of
AATs and RATs.
– Typically executes multiple timing analysis iterations before these timing
windows stabilize.
• Statistical STA is a generalization of STA where gate and wire delays are
modeled by random variables and represented by probability distributions.

13
Drawbacks of STA
1. Assumption of a clock.
Not applicable to asynchronous subsystems.

2. Assumption that all paths are sensitizable.

Optimization tools waste considerable runtime and chip resources (e.g.
power, area, speed) satisfying phantom constraints.
– False paths, which are never activated.
– Multi-cycle paths, where signal transitions do not need to finish within one
clock cycle.

14
END OF LECTURE 33

15
Lecture 34: TIMING CLOSURE (PART 3)

PROF. INDRANIL SENGUPTA

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
Delay Budgeting with the Zero-Slack Algorithm
• In timing-driven physical design, both gate and wire delays must be
optimized to obtain a timing-correct layout.
• There exists a dilemma:
– Timing optimization requires knowledge of capacitive loads, and hence the
actual wire length.
– Wire lengths are unknown until placement and routing are completed.
• Timing budgets are used to establish delay and wire length constraints for
each net, for guiding placement and routing to a timing-correct result.
– Best-known approach to timing budgeting is the Zero Slack Algorithm.

2
Basic Idea
• Some notations:
– Consider a netlist consisting of logic gates v1, v2, …, vn
– Consider a set of nets e1, e2, …, em, where ei is the output net of gate vi.
– Let t(v) and t(e) denote gate delay and wire delay, respectively.

3
• The ZSA takes the netlist as input, and tries to decrease positive slacks of all
nodes to zero by increasing t(v) and t(e) values.
• These increased delay values together constitutes the Timing Budget TB(v) of
node v, which should not be exceeded during placement and routing.
TB(v) = t(v) + t(e)
• If TB(v) is exceeded, then the place-and-route tool typically:
(i) decrease the wirelength of e, or (ii) changes the size of gate v.
– The delay impact of a wire or gate size change can be estimated using the Elmore
delay model.

4
• If most arcs (branches) of a timing path are within budget, then the path
may meet its timing constraints even if some arcs exceed their budgets.
– Thus, another approach to satisfying the timing budget is rebudgeting.

• The zero slack algorithm shall be explained with the help of an illustrative
example.

5
Basic Steps in ZSA
1. Determine the initial slacks of all the nodes, and select a node vmin with
minimum positive slack slackmin.
2. Find a path of vertices that dominates slackmin, i.e. any change in the
delays in vertices along the path will cause slackmin to change.
3. Evenly distribute the slack by increasing TB(v) for each vertex v in the
path. Each budget increment will decrement the slack value of a vertex.
By repeating the process, the slack of each node in V will end up at zero.
The resulting timing budgets at all nodes are the final output of ZSA.

6
Example
• Use the zero-slack algorithm to distribute slack
• Format: <AAT, Slack, RAT>, [timing budget]
O1: <13,4,17>
I1 <1,4,5> [0] <3,4,7> [0] O2: <6,8,14>
2
I2
<0,5,5> [0]
<7,4,11> [0]
4 <13,4,17> [0]
6 O1
I3
<1,6,7> [0]

<6,8,14> [0]
3 0 O2
I4 <3,5,8> [0] <6,5,11> [0]

7 7
Example
• Find the path with the minimum non-zero slack (MARKED IN RED).

O1: <13,4,17>
I1 <1,4,5> [0] <3,4,7> [0] O2: <6,8,14>
2
I2
<0,5,5> [0]
<7,4,11> [0]
4 <13,4,17> [0]
6 O1
I3
<1,6,7> [0]

<6,8,14> [0]
3 0 O2
I4 <3,5,8> [0] <6,5,11> [0]

8 8
Example
• Find the path with the minimum non-zero slack.
• Distribute the slacks and update the timing budgets.
O1: <17,0,17>
I1 <1,0,1> [1] <4,0,4> [1] O2: <6,8,14>
2
I2
<0,2,2> [0]
<9,0,9> [1]
4 <16,0,16> [1]
6 O1
I3
<1,4,5> [0]

<6,8,14> [0]
3 0 O2
I4 <3,4,7> [0] <6,4,10> [0]

9 9
Example
• Find the path with the minimum non-zero slack.
• Distribute the slacks and update the timing budgets.
O1: <17,0,17>
I1 <1,0,1> [1] <4,0,4> [1] O2: <6,8,14>
2
I2
<0,2,2> [0]
<9,0,9> [1]
4 <16,0,16> [1]
6 O1
I3
<1,4,5> [0]

<6,8,14> [0]
3 0 O2
I4 <3,4,7> [0] <6,4,10> [0]

10 10
Example
• Find the path with the minimum non-zero slack.
• Distribute the slacks and update the timing budgets.
O1: <17,0,17>
I1 <1,0,1> [1] <4,0,4> [1] O2: <6,8,14>
2
I2
<0,0,0> [2]
<9,0,9> [1]
4 <16,0,16> [1]
6 O1
I3
<1,4,5> [0]

<6,8,14> [0]
3 0 O2
I4 <3,4,7> [0] <6,4,10> [0]

11 11
Example
• Find the path with the minimum non-zero slack.
• Distribute the slacks and update the timing budgets.
O1: <17,0,17>
I1 <1,0,1> [1] <4,0,4> [1] O2: <6,8,14>
2
I2
<0,0,0> [2]
<9,0,9> [1]
4 <16,0,16> [1]
6 O1
I3
<1,4,5> [0]

<6,8,14> [0]
3 0 O2
I4 <3,4,7> [0] <6,4,10> [0]

12 12
Example
• Find the path with the minimum non-zero slack.
• Distribute the slacks and update the timing budgets.
O1: <17,0,17>
I1 <1,0,1> [1] <4,0,4> [1] O2: <6,8,14>
2
I2
<0,0,0> [2]
<9,0,9> [1]
4 <16,0,16> [1]
6 O1
I3
<1,2,3> [2]

<6,8,14> [0]
3 0 O2
I4 <3,2,5> [0] <6,2,8> [2]

13 13
Example
• Find the path with the minimum non-zero slack.
• Distribute the slacks and update the timing budgets.
O1: <17,0,17>
I1 <1,0,1> [1] <4,0,4> [1] O2: <6,8,14>
2
I2
<0,0,0> [2]
<9,0,9> [1]
4 <16,0,16> [1]
6 O1
I3
<1,2,3> [2]

<6,8,14> [0]
3 0 O2
I4 <3,2,5> [0] <6,2,8> [2]

14 14
Example
• Find the path with the minimum non-zero slack.
• Distribute the slacks and update the timing budgets.
O1: <17,0,17>
I1 <1,0,1> [1] <4,0,4> [1] O2: <10,4,14>
2
I2
<0,0,0> [2]
<9,0,9> [1]
4 <16,0,16> [1]
6 O1
I3
<1,0,1> [3]

<10,4,14> [0]
3 0 O2
I4 <3,1,4> [0] <7,0,7> [3]

15 15
Example
• Find the path with the minimum non-zero slack.
• Distribute the slacks and update the timing budgets.
O1: <17,0,17>
I1 <1,0,1> [1] <4,0,4> [1] O2: <14,0,14>
2
I2
<0,0,0> [2]
<9,0,9> [1]
4 <16,0,16> [1]
6 O1
I3
<1,0,1> [3]

<10,0,10> [4]
3 0 O2
I4 <3,0,3> [1] <7,0,7> [3]

16 16
A Modification: Early Mode Analysis
• ZSA uses late-mode analysis with respect to setup constraints, i.e. the
latest times by which signal transitions can occur for the circuit to operate
correctly.
• Correct operation also depends on satisfying hold-time constraints on the
earliest signal transition times.
• Early-mode analysis considers these constraints.

17
How it Works?
• To correctly analyze this timing constraint, the earliest actual arrival time
of signal transitions at each node must be determined.
• The required arrival time of a sequential element in early mode is the time
at which the earliest signal can arrive and still satisfy the library-cell hold-
time requirement.
• For each gate v, AATEM(v) ≥ RATEM(v) must be satisfied.
– AATEM(v) is the earliest actual arrival time of a signal transition at gate v
– RATEM(v) is the required arrival time in early mode at gate v

18
• The early-mode slack can be defined as:
slackEM(v) = AATEM(v) – RATEM(v)

• When adapted to early-mode analysis, ZSA is also called the near zero-
slack algorithm.
– The modified algorithm seeks to decrease TB(v) by decreasing t(v) or t(e), so
that all nodes have minimum early-mode timing slacks.
– Since t(v) and t(e) cannot be negative, node slacks may not necessarily all
become zero.

19
To Summarize
• In practice, if the delay of a node does not satisfy its early-mode timing
budget, the delay constraint can be satisfied by adding additional delay
(padding) to appropriate components.
– The additional delay can violate late-mode timing constraints.
• Thus, a circuit should be first designed with ZSA and late-mode analysis.
Early-mode analysis may then be used to confirm that early-mode
constraints are satisfied, or to guide circuit modifications to satisfy such
constraints.

20
END OF LECTURE 34

21
Lecture 35: TIMING CLOSURE (PART 4)

PROF. INDRANIL SENGUPTA

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
False Paths
 Paths that physically exist in a design but are not logic
/functional paths.
 These paths never get sensitized under any input conditions.
 An example is shown on the next slide.

2
An example:
The path of length 400 is never exercised.
u x
200 1 200 1
MUX fi MUX
0 0
v y
100 100

3
Multi-cycle Paths
• Data paths that require more than one clock period for
execution.

2 clock period delay

4
Timing Analysis Problems
• We want to determine the true critical paths of a circuit in
order to:
– Determine the minimum cycle time for which the circuit will function.
– Identify critical paths from performance optimization – do not try to
optimize the wrong (non-critical) paths
• Implications:
– Do not want false paths (produced by static delay analysis).
– Delay model is worst case model.

5
Functional Timing Analysis
• Estimate when the output of a given circuit gets stable.

0
Combinational
block
0
clock 0 T

6
Why Timing Analysis?
• Timing verification
– Verifies whether a design meets a given timing constraint.
• Example: cycle-time constraint
• Timing optimization
– Needs to identify critical portion of a design for further optimization.
• Critical path identification
• In both cases, higher the accuracy, the better.

7
Timing Analysis - Basics
• Naïve approach - Simulate all input vectors with SPICE
– Accurate, but too expensive.
• Gate-level timing analysis
– Less accurate than SPICE due to the level of abstraction, but much more
efficient.
– Scenario:
• Gate/wire delays are pre-characterized (accuracy loss).
• Perform timing analysis of a gate-level circuit assuming the gate/wire
delays.

8
Gate-level Timing Analysis
False path z • A naive approach is topological analysis.
aware – Easy longest-path problem
arr(z)? 1 – Linear in the size of a network
• Not all paths can propagate signal events.
– False paths
1 – If all longest paths are false, topological
analysis gives delay overestimate.
Functional timing analysis = false-path-
x1 x2 aware timing analysis
– Compute false-path-aware arrival time
arr(x1)=0 arr(x2)=0

9
Example: 2-bit Carry-skip Adder
c_in s0

Length 5 Length 1
a0
b0 s1
1
0
a1 c_out
b1

10
False Path Analysis - Basics
• Is a path responsible for delay?
– If the answer is no, can ignore the path for delay computation.
• Check the falsity of long paths until we find the longest true path.
– How can we determine whether a path is false?

• Delay underestimation is unacceptable.

– Can lead to overlooking a timing violation.
• Delay overestimation is not desirable, but acceptable.
– Topological analysis can give overestimate, but never give underestimate.

11
Possible Approach :: Boolean Difference
fi-1 fi Fi+1

• Path P = {f0, f1, f2, … , fn}

∂f i
gives conditions under which node fi is “sensitive” to node fi-1
∂f i −1

• So output P is sensitive to f0 if

12
Example :: Static False Path
u x fj
200 1 200 1
MUX fi MUX
0 0
v y
100 100

∂f i ∂f j
The path is not sensitizable and hence is false. Hence, ⋅ =0
∂u ∂x

13
Definitions
• Given a simple gate (i.e. AND, OR, NAND, NOR), a controlling value on an
input determines the output of the gate independent of the other inputs.
• Given a simple gate (i.e. AND, OR, NAND, NOR), a non-controlling value on
an input cannot determine the output of the gate independent of the
other inputs.
– 0 is a controlling value for AND gate; 1 is non-controlling value for AND gate.
• Controlling / non-controlling value is merely a specialization of the
Boolean difference to simple gates.

14
a
f
b

a
g
b

15
Controlling/Non-Controlling Values
Controlled value of AND
0 0 1

Controlling value of AND Non-Controlling value of AND

Controlled value of OR
1 1 0

Controlling value of OR Non-Controlling value of OR

16
END OF LECTURE 35

17
Lecture 36: TIMING CLOSURE (PART 5)

PROF. INDRANIL SENGUPTA

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
Static Sensitization
• A path is statically-sensitizable if there exists an input vector such that
all the side inputs to the path are set to non-controlling values.
– This is independent of gate delays. The longest true path
is of length 2?
1 Controlling value!
0
t=0
t=0
These paths are not
1 statically-sensitizable
t=0 0

2
Static Sensitization (contd.)
• The (dashed) path is responsible for delay!
• Delay underestimation by static sensitization (delay = 2 when true
delay = 3)
– incorrect condition

1
0
1 2 3
0 2
0

3
What is Wrong with Static Sensitization?
• The idea of forcing non-controlling values to side inputs is
okay, but timing was ignored.
– The same signal can have a controlling value at one time and a non-
controlling value at another time.

• How about timing simulation as a correct method?

4
Timing Simulation
0
2 2
2 3
1
1 1
1
4
0 4
Implies that delay = 0 for these inputs
BUT!

5
0
2 2
2 3
1
1 3 4
1
1
4->2
0 2
Implies that delay = 4 with the same set of inputs.

6
What is Wrong with Timing Simulation?
• If gate delays are reduced, delay estimates can increase.
• Not acceptable since
– Gate delays are just upper-bounds, actual delay is in [0,d].
• Delay uncertainty due to manufacturing.
– We are implicitly analyzing a family of circuits where gate delays are
within the upper-bounds.

7
Monotone Speedup Property
• Definition: For any circuit C, if
a) C’ is obtained from C by reducing some gate delays, and
b) delay_estimate(C’) ≤ delay_estimate(C),
then delay_estimate has Monotone Speedup property.

Timing simulation does not have this property.

8
Timing Simulation Revisited
0 2
2
3
1
1 4
1
1
4
0 4
means that the rising signal occurs anywhere
4 between t = -∞ and t = 4.

9
What we just saw …
• Timed 3-valued (0,1,X) simulation
– called X-valued simulation.
• Monotone speedup property is satisfied.

10
SAT Based False Path Analysis
• Satisfiability (SAT) solvers are used for solving a wide range of
problems.
• Modern SAT solvers run very fast and can handle a large
number of variables.
• Basically, given a Boolean function F in product-of-sum form, a
SAT solver tries to find some assignment of the variables for
which F = 1.

11
The SAT Formulation
Decision problem:
Is there an input vector under which the output gets stable only after t = T ?
Idea:
1. Characterize the set of all input vectors S(T) that make the output stable
no later than t = T.
2. Check if S(T) contains S = all possible input vectors.
This check is solved as a SAT problem:
Is S \ S(T) empty?  set difference + emptiness check
• Let F and F(T) be the characteristic functions of S and S(T)
• Is F !F(T) satisfiable?

12
Example
d
g
a
b e f
c

Assume all the PIs arrive at t = 0, all gate delays = 1.

Is the output stable at time t > 2?

13
g(1,t=2) : the set of input vectors under which
g gets stable to value = 1 no later than t =2
d
g
a
b e f Onset:
stabilized by t=2?
c
g(1,t=2) = d(1,t=1) ∩ f(1,t=1)
= (a(0,t=0) ∩ b(0,t=0)) ∩ (c(1,t=0) ∪ e(1,t=0))
= !a!b(c ∪ ∅) = !a!bc = S1(t=2)
g(1,t=∞) = on-set = !a!bc = g(1,t=2) = S1

14
g(0,t=2) : the set of input vectors under which
g gets stable to value = 0 no later than t=2
d
g
a
b e f

c
g(0,t=2) = d(0,t=1) ∪ f(0,t=1)
= (a(1,t=0) ∪ b(1,t=0)) ∪ (c(0,t=0) ∩ e(0,t=0))
= (a+b) + (!c ∩ ∅) = a+b = S0(t=2)
g(0,t=∞) = off-set = a+b+!c = S0

15
g(0,t=2) : the set of input vectors under which
g gets stable to 0 no later than t=2
d
g
a
b e Offset:
f NOT stabilized by t=2
under abc = 000
c
g(0,t=2) = a+b
g(0,t=∞) = offset = a+b+!c
g(0,t=∞) \ g(0,t=2) = (a+b+!c) !(a+b) = !a !b !c = satisfiable

16
Summary
• False-path-aware arrival time analysis is well-understood.
– Practical algorithms exist.
• Can handle industrial circuits easily.
• Remaining problems:
– Incremental analysis (make it so that a small change in the circuit
does not make the analysis start all over).
– Integration with logic optimization.
– DSM issues such as cross-talk-aware false path analysis.

17
END OF LECTURE 36

18
Lecture 37: TIMING DRIVEN PLACEMENT

PROF. INDRANIL SENGUPTA

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
Timing Driven Placement (TDP)
• TDP optimizes circuit delay, either to satisfy all timing constraints, or to
achieve the greatest possible clock frequency.
• It uses the results of STA to identify critical nets and attempts to improve
signal propagation delay through those nets.
• TDP minimizes one or both of the following: WNS = min (slack ( τ) )
τ∈Τ
a) Worst negative-slack (WNS)
b) Total negative slack (TNS)
TNS = ∑ slack ( τ)
τ∈Τ, slack ( τ ) < 0

where T is the set of timing endpoints (i.e. primary outputs, or inputs to

flip-flops).

2
Techniques for Timing-Driven Placement
• Algorithmic techniques for TDP can be categorized as net-based,
path-based, or integrated.
• Two types of net-based techniques:
1. Delay budgeting, which assigns upper bounds to the timing or length of
individual nets.
2. Net weighting, which assign higher priorities to critical nets during placement.
• Path-based techniques seek to shorten or speedup all timing-critical paths
rather than individual nets.
– More accurate but does not scale to large designs because number of paths
can grow exponentially with number of gates (e.g. multiplier).

3
• Both path-based and net-based approaches rely on support within the
placement algorithm, and require a dedicated infrastructure for
incremental calculation of timing statistics and parameters.
• Integrated techniques typically use constraint-driven mathematical
formulation in which STA results are incorporated as constraints and
possibly in the objective function.
• In practice, some industrial flows do not incorporate timing-driven
methods during initial placement because timing information can be quite
inaccurate until locations are available.
– Instead, subsequent placement iterations, especially during detailed
placement, perform timing optimizations.

4
Net Based Techniques
• These approaches impose either quantitative priorities that
reflect timing criticality (net weights), or upper bounds on the
timing of nets in the form of net constraints (delay budgets).
• Net weights are more effective at the early design stages,
while delay budgets are more meaningful if timing analysis is
more accurate.

5
(a) Net Weighting
• A traditional placer optimizes total wirelength and routability.
• To account for timing, a placer can minimize the total weighted wirelength,
where each net is assigned a net weight.
– The higher the net weight is, the more timing-critical the net is.
• Net weights can be assigned either statically or dynamically to improve the
timing.

6
Static Net Weights
• They are computed before placement and do not change.
• They are usually based on slack: the more critical the net (i.e. smaller
slack), greater is the weight.
• Static net weights can be either discrete:
 ω if slack > 0
w= 1 where ω1 > 0, ω2 > 0, and ω2 > ω1
ω 2 if slack ≤ 0
• Or they can be continuous:
α
 slack 
w = 1 − 
 t 
where t is the longest path delay and α is a criticality exponent.

7
• Alternatively, net weights can be assigned based on sensitivity, as:
w = wo + α( slack target − slack ) ⋅ s wSLACK + β ⋅ s w
TNS

where w0 is the original net weight TNS: Total Negative Slack

slack is the computed slack value of the net WNS: Worst Negative Slack
slacktarget is the target slack of the design
swSLACK is the slack sensitivity to the weight of the net
swTNS is the TNS sensitivity to the net weight
α and β are constants bounds on the net weight change that control the
tradeoff between WNS and TNS.

8
Dynamic Net Weights
• They are computed during placement iterations and keep an updated timing
profile.
• This can be more effective than static net weights, since they are computed
before placement, and can become outdated when net lengths change.
• Estimated slack of a net at iteration k can be computed as:
slack k = slack k −1 − s LDELAY ⋅ ∆L
where ΔL is the change in wirelength between iterations (k-1) and k
slackk is the slack at iteration k
sLDELAY is the delay sensitivity to the wirelength

9
• After the timing information has been updated, the net weights should be
adjusted accordingly.
– This incremental method of weight modification is based on previous iterations.
• The net criticality at iteration k is computed as:
1
 2 (υ k −1 + 1) if among the top 3% of critical nets
υk =
1
 υ k −1
2 otherwise
• And then the net weight is updated as:
wk = wk −1 ⋅ (1 + υ k )

10
Integrated Technique using Linear Programs
• Unlike net-based methods, where the timing requirements are mapped to
net weights or net constraints, path-based methods directly optimize the
design’s timing.
– As the number of paths can grow quickly, this method is much slower than
net-based approaches.
• To improve scalability, timing analysis may be captured by a set of
constraints and an optimization objective.
– For example, in a linear programming framework.

11
• In the context of timing-driven placement, a linear program (LP) minimizes
a function of slack (e.g. TNS), subject to two main types of constraints:
1. Physical constraints, which define the locations of the cells.
2. Timing constraints, which define the slack requirements.

• In addition, some electrical constraints may also be incorporated.

12
Physical Constraints:
• Given a set of cells V and the set of nets E, we define the notations:
– xv and yv denote the center of cell v V
– Ve denotes the set of cells connected to net e E

– left(e), right(e), bottom(e), and top(e) respectively denote the coordinates

of the left, right, bottom, and top boundaries of e’s bounding box
– δx(v,e) and δy(v,e) denote pin offsets from xv and yv for v’s pin connected to e

13
• Then, for all v ∈ Ve: left (e) ≤ xv + δ x (v, e)
Every pin of a given
right (e) ≥ xv + δ x (v, e)
net e must be
bottom (e) ≤ yv + δ y (v, e) contained within e’s
top (e) ≥ yv + δ y (v, e) bounding box.

• Then, e’s half-parameter wire-length (HPWL) is defined as

L(e) = right (e) − left (e) + top (e) − bottom(e)

14
Timing Constraints:
• For timing constraints, let
– tGATE(vi,vo) be the gate delay from an input pin vi to the output pin vo for cell v
– tNET(e,uo,vi) be net e’s delay from cell u’s output pin uo to cell v’s input pin vi
– AAT(vj) be the arrival time on pin j of cell v

• For every input pin vi of cell v, the arrival time at vi is the arrival time at
the previous output pin u0 of cell u plus the net delay:
AAT (vi ) = AAT (u o ) + t NET (u o , vi )

15
• For every output pin v0 of cell v, the arrival time at v0 should be greater
than or equal to the arrival time plus gate delay of each input vi. That is,
for each input vi of cell v,
AAT (vo ) ≥ AAT (vi ) + t GATE (vi , vo )

• For every pin τp in a sequential cell τ, the slack is computed as the

difference between the required arrival time RAT(τp ) and actual arrival
time AAT(τp ),
slack ( τ p ) ≤ RAT ( τ p ) − AAT ( τ p )
• Upper bound all pin slacks by zero (or a small positive value),
slack(τp) ≤ 0

16
Objective Functions:
a) Optimize the total negative slack (TNS) max : ∑ slack (τ
τ p ∈Pins ( τ ), τ∈Τ
p)

where Pins(τ) is the set of pins of cell τ, and

T is the set of all sequential elements or endpoints.
b) Optimize the worst negative slack (WNS)
max : WNS
where WNS ≤ slack(τp) for all pins.
c) Optimize some combination of wirelength and slack
where E is the set of all nets, α is a constant
min : ∑ L(e) − α ⋅ WNS
between 0 and 1 that trades off WNS and e∈E
wirelength, and L(e) is the HPWL of net e.

17
END OF LECTURE 37

STA Prime Time
No ratings yet
STA Prime Time
125 pages
Static Timing Analysis
100% (2)
Static Timing Analysis
100 pages
Sta Vlsi
100% (2)
Sta Vlsi
40 pages
Determination of Dissolved Oxygen by Winkler Titrattion
50% (2)
Determination of Dissolved Oxygen by Winkler Titrattion
10 pages
Booklet Gems and Gemstones PDF
100% (4)
Booklet Gems and Gemstones PDF
12 pages
11 Timing Analysis Logic
No ratings yet
11 Timing Analysis Logic
55 pages
Lecture 5 Update
No ratings yet
Lecture 5 Update
27 pages
Lecture-5 - update
No ratings yet
Lecture-5 - update
51 pages
Lecture 3 STA
No ratings yet
Lecture 3 STA
55 pages
2020 12 Concept of Timing Analysis
No ratings yet
2020 12 Concept of Timing Analysis
28 pages
Digital VLSI Design Timing Analysis: Semester B, 2021-22 Lecturer: Zvika Webb 21 March 2022
100% (1)
Digital VLSI Design Timing Analysis: Semester B, 2021-22 Lecturer: Zvika Webb 21 March 2022
86 pages
file-3
No ratings yet
file-3
71 pages
Digital VLSI Design Timing Analysis: Semester A, 2018-19 Lecturer: Dr. Adam Teman
No ratings yet
Digital VLSI Design Timing Analysis: Semester A, 2018-19 Lecturer: Dr. Adam Teman
72 pages
Lecture 5-6 - Static Timing Analysis
No ratings yet
Lecture 5-6 - Static Timing Analysis
71 pages
PPT2-UNIT 4
No ratings yet
PPT2-UNIT 4
43 pages
Static Timimg Analysis Mat
No ratings yet
Static Timimg Analysis Mat
45 pages
Static Timing Analysis - Suresh
No ratings yet
Static Timing Analysis - Suresh
33 pages
12-Set Up and Hold Violations-13!11!2024
No ratings yet
12-Set Up and Hold Violations-13!11!2024
40 pages
Timing Analysis in Physical Design
100% (2)
Timing Analysis in Physical Design
32 pages
Static Timing Analysis (STA) Basics
No ratings yet
Static Timing Analysis (STA) Basics
12 pages
STA Basics
100% (2)
STA Basics
13 pages
Setup Hold Time
100% (1)
Setup Hold Time
28 pages
_STA__(1)[1]
No ratings yet
_STA__(1)[1]
70 pages
Timing Issues in Digital ASIC Design
No ratings yet
Timing Issues in Digital ASIC Design
101 pages
Static Timing Analysis
No ratings yet
Static Timing Analysis
4 pages
EDA unit 4
No ratings yet
EDA unit 4
8 pages
016 Timing Analysis
No ratings yet
016 Timing Analysis
33 pages
Lec 8
No ratings yet
Lec 8
4 pages
Sta Vlsi PDF
No ratings yet
Sta Vlsi PDF
40 pages
Lec45 Full
No ratings yet
Lec45 Full
18 pages
STA
No ratings yet
STA
123 pages
Static Timing Analysis Updated
No ratings yet
Static Timing Analysis Updated
15 pages
Sta
100% (1)
Sta
91 pages
1 RTL2GDS Sta
No ratings yet
1 RTL2GDS Sta
75 pages
VDF Project Part2 (2021)
No ratings yet
VDF Project Part2 (2021)
62 pages
Fundamental STA
No ratings yet
Fundamental STA
47 pages
Assignment 2 Grandhi Vyasa Maharshi
No ratings yet
Assignment 2 Grandhi Vyasa Maharshi
7 pages
STA - Part 1
No ratings yet
STA - Part 1
24 pages
STA Intel
No ratings yet
STA Intel
81 pages
STA - Booklet
No ratings yet
STA - Booklet
34 pages
Static Timing Analysis and Timing Violations of Sequential Circuits
No ratings yet
Static Timing Analysis and Timing Violations of Sequential Circuits
7 pages
Chapter 5 Static Timing Analysis
100% (4)
Chapter 5 Static Timing Analysis
126 pages
TimingAnalysis Presentation v1 1
No ratings yet
TimingAnalysis Presentation v1 1
70 pages
Static Timing Analysis
No ratings yet
Static Timing Analysis
71 pages
All Ac379 An
No ratings yet
All Ac379 An
43 pages
Timing
100% (1)
Timing
30 pages
Logic Synthesis: Timing Analysis
No ratings yet
Logic Synthesis: Timing Analysis
33 pages
Static Timing Analysis Basics by Selva Kumar
67% (3)
Static Timing Analysis Basics by Selva Kumar
59 pages
Sta-types of Paths
No ratings yet
Sta-types of Paths
12 pages
Synthesis - 07 - 23
No ratings yet
Synthesis - 07 - 23
102 pages
Timing Simulation
100% (3)
Timing Simulation
3 pages
Sta Notes
No ratings yet
Sta Notes
33 pages
Technology in Telecommunications Networks
From Everand
Technology in Telecommunications Networks
Tanushri Kaniyar
No ratings yet
Circuit bench - 100 shields for arduino
From Everand
Circuit bench - 100 shields for arduino
Newton C. Braga
No ratings yet
Reference Guide To Useful Electronic Circuits And Circuit Design Techniques - Part 2
From Everand
Reference Guide To Useful Electronic Circuits And Circuit Design Techniques - Part 2
Kerwin Mathew
No ratings yet
An Introduction To Digital Design
From Everand
An Introduction To Digital Design
Jason King
2/5 (1)
Analog Dialogue, Volume 48, Number 1: Analog Dialogue, #13
From Everand
Analog Dialogue, Volume 48, Number 1: Analog Dialogue, #13
Analog Dialogue
4/5 (1)
Stair Lighting Timer
From Everand
Stair Lighting Timer
Dr. Hidaia Mahmood Alassouli
No ratings yet
BICSI RCDD Registered Communications Distribution Designer Exam Prep And Dumps RCDD-001 Exam Guidebook Updated Questions
From Everand
BICSI RCDD Registered Communications Distribution Designer Exam Prep And Dumps RCDD-001 Exam Guidebook Updated Questions
Byte Books
No ratings yet
WAN TECHNOLOGY FRAME-RELAY: An Expert's Handbook of Navigating Frame Relay Networks
From Everand
WAN TECHNOLOGY FRAME-RELAY: An Expert's Handbook of Navigating Frame Relay Networks
Mamta Devi
No ratings yet
Analog Dialogue, Volume 47, Number 2
From Everand
Analog Dialogue, Volume 47, Number 2
Analog Dialogue
No ratings yet
Analog Dialogue, Volume 45, Number 4: Analog Dialogue, #4
From Everand
Analog Dialogue, Volume 45, Number 4: Analog Dialogue, #4
Analog Dialogue
No ratings yet
Week 01 Assignment Solution
No ratings yet
Week 01 Assignment Solution
5 pages
Week 10 pmrf
No ratings yet
Week 10 pmrf
11 pages
Week 9 doubt session
No ratings yet
Week 9 doubt session
8 pages
NPTEL - Week - 8 - v1 reevalution assignment solution
No ratings yet
NPTEL - Week - 8 - v1 reevalution assignment solution
7 pages
noc20_cs18_assigment_3
No ratings yet
noc20_cs18_assigment_3
1 page
VPY_assigment_2
No ratings yet
VPY_assigment_2
1 page
NPTEL_Assign_week6_V
No ratings yet
NPTEL_Assign_week6_V
10 pages
noc20_cs18_assigment_5
No ratings yet
noc20_cs18_assigment_5
1 page
Week9 pbs
No ratings yet
Week9 pbs
5 pages
Week10 assignment solution
No ratings yet
Week10 assignment solution
5 pages
BEC501 - Management notes
No ratings yet
BEC501 - Management notes
33 pages
Week7 pbs
No ratings yet
Week7 pbs
4 pages
DC_week_6_assignment
No ratings yet
DC_week_6_assignment
6 pages
Week5 pbs
No ratings yet
Week5 pbs
5 pages
Counter Design Using Different Flipflops
No ratings yet
Counter Design Using Different Flipflops
20 pages
electronic part1
No ratings yet
electronic part1
25 pages
electronic part2
No ratings yet
electronic part2
20 pages
Introduction TO MEMORY SYSYTEM
No ratings yet
Introduction TO MEMORY SYSYTEM
24 pages
Ade Assignment Second
No ratings yet
Ade Assignment Second
3 pages
booth encodng
No ratings yet
booth encodng
2 pages
Module1 pcs notes
No ratings yet
Module1 pcs notes
39 pages
Module 1 Notes Upto Marginal Density Function
No ratings yet
Module 1 Notes Upto Marginal Density Function
11 pages
Module 5 Pcs Notes
No ratings yet
Module 5 Pcs Notes
31 pages
Zener voltage regulator values
No ratings yet
Zener voltage regulator values
1 page
Module-5 EPC
No ratings yet
Module-5 EPC
27 pages
Module 4 CT Question Bank
No ratings yet
Module 4 CT Question Bank
1 page
Tutorial 3
No ratings yet
Tutorial 3
1 page
Ade Values To Be Written in Lab Manu
No ratings yet
Ade Values To Be Written in Lab Manu
3 pages
JK Flipflop Using VHDL
No ratings yet
JK Flipflop Using VHDL
2 pages
Verilog Parameters and Operators
No ratings yet
Verilog Parameters and Operators
25 pages
Vocabulary - Revisión Del Intento - Tema 9
No ratings yet
Vocabulary - Revisión Del Intento - Tema 9
2 pages
3RD QUARTER SCIENCE 6 FRICTION Lesson Exemplar
No ratings yet
3RD QUARTER SCIENCE 6 FRICTION Lesson Exemplar
5 pages
The Effects and Solutions of Drugs and Substance Abuse
No ratings yet
The Effects and Solutions of Drugs and Substance Abuse
9 pages
Especificacion de Materiales y Tabla de Compatibilidad
No ratings yet
Especificacion de Materiales y Tabla de Compatibilidad
2 pages
Experiment-1: Term - I Physics Practicals
No ratings yet
Experiment-1: Term - I Physics Practicals
15 pages
Journal Homepage: - : Manuscript History
No ratings yet
Journal Homepage: - : Manuscript History
17 pages
Consumers Perception Towards Organic Products in Dehradun
No ratings yet
Consumers Perception Towards Organic Products in Dehradun
63 pages
Science Presentation
No ratings yet
Science Presentation
12 pages
OP1 Field Notebook 1v5a
No ratings yet
OP1 Field Notebook 1v5a
254 pages
Kari Naal
0% (1)
Kari Naal
7 pages
Solar Car Aerodynamic Design For Optimal Cooling and High Efficiency
No ratings yet
Solar Car Aerodynamic Design For Optimal Cooling and High Efficiency
8 pages
53 531 - Submersible - Pumps - Submittals
No ratings yet
53 531 - Submersible - Pumps - Submittals
2 pages
Download full Microbial Spoilage of Foods H. A. Modi ebook all chapters
100% (9)
Download full Microbial Spoilage of Foods H. A. Modi ebook all chapters
50 pages
Recovered PDF 312 PDF
No ratings yet
Recovered PDF 312 PDF
1 page
RV Drop Route's Only on 24th Feb & 27th Feb 2025
No ratings yet
RV Drop Route's Only on 24th Feb & 27th Feb 2025
50 pages
Answers Heat Mass Transfer I
No ratings yet
Answers Heat Mass Transfer I
8 pages
Led Cube
No ratings yet
Led Cube
9 pages
Logic Activutues
No ratings yet
Logic Activutues
6 pages
AMC Agreement 2021-22
No ratings yet
AMC Agreement 2021-22
7 pages
Family Medicine - Seminar - Consultation
No ratings yet
Family Medicine - Seminar - Consultation
4 pages
Department of EXTC Engineering Problem Based Learning Experiment No 10
No ratings yet
Department of EXTC Engineering Problem Based Learning Experiment No 10
6 pages
A Technique To Guide Replacement of Multiunit Abutments Supporting An Existing Implant-Supported Fixed Complete Denture
No ratings yet
A Technique To Guide Replacement of Multiunit Abutments Supporting An Existing Implant-Supported Fixed Complete Denture
4 pages
Fan H 3 Aluminium 288
No ratings yet
Fan H 3 Aluminium 288
6 pages
TD 3 Pert
No ratings yet
TD 3 Pert
1 page
Eco-Friendly Paint - tcm18-175384 PDF
No ratings yet
Eco-Friendly Paint - tcm18-175384 PDF
4 pages
Spell - Bee Words
No ratings yet
Spell - Bee Words
1 page
p-7770-sv Svendborg Brake
No ratings yet
p-7770-sv Svendborg Brake
1 page
03 Study Focus 2
No ratings yet
03 Study Focus 2
10 pages

Week 6 Lecture Material_watermark

Uploaded by

Week 6 Lecture Material_watermark

Uploaded by

Lecture 32: TIMING CLOSURE (PART 1)

PROF. INDRANIL SENGUPTA

80% delay  Today’s Scenario

PROF. INDRANIL SENGUPTA

Combinational Combinational Combinational

a <0> (0.15) (0.2)

a (0) (0.15) y (2)

(0) (0.1) (0.2)

s (0) b (0) (0.1) x (1) w (2) (0.2) f (0)

(0.6) (0.3) (0.25)

s (0) b (0) (0.1) x (1) w (2) (0.2) f (0)

c (0) (0.1) z (2)

a (0) (0.15) y (2)

s (0) b (0) (0.1) x (1) w (2) (0.2) f (0)

c (0) (0.1) z (2)

2. Assumption that all paths are sensitizable.

PROF. INDRANIL SENGUPTA

PROF. INDRANIL SENGUPTA

2 clock period delay

• Delay underestimation is unacceptable.

• Path P = {f0, f1, f2, … , fn}

Controlling value of AND Non-Controlling value of AND

Controlling value of OR Non-Controlling value of OR

PROF. INDRANIL SENGUPTA

• How about timing simulation as a correct method?

Timing simulation does not have this property.

Assume all the PIs arrive at t = 0, all gate delays = 1.

PROF. INDRANIL SENGUPTA

where T is the set of timing endpoints (i.e. primary outputs, or inputs to

where w0 is the original net weight TNS: Total Negative Slack

• In addition, some electrical constraints may also be incorporated.

– left(e), right(e), bottom(e), and top(e) respectively denote the coordinates

• Then, e’s half-parameter wire-length (HPWL) is defined as

• For every pin τp in a sequential cell τ, the slack is computed as the

where Pins(τ) is the set of pins of cell τ, and

You might also like