# Probabilistic Simulation for Reliability Analysis of CMOS VLSI Circuits

FARID N. NAJM, MEMBER, IEEE, RICHARD BURCH, PING YANG, SENIOR MEMBER, IEEE, AND IBRAHIM N. HAJJ, SENIOR MEMBER, IEEE

Abstract—A novel current-estimation approach is developed to support the analysis of electromigration (EM) failures in power supply and ground buses of CMOS VLSI circuits. It uses the original concept of *probabilistic simulation* to efficiently generate accurate estimates of the *expected current waveform* required for electromigration analysis. As such, the approach is *pattern-independent* and relieves the designer of the tedious task of specifying logical input waveforms. This approach has been implemented in the program CREST (CuRrent ESTimator) which has shown excellent accuracy and dramatic speedups compared to traditional approaches. We describe the approach and its implementation, and present the results of numerous CREST runs on real circuits.

## I. INTRODUCTION

THE RELIABILITY of integrated circuits is a major concern for the electronics industry. As higher levels of integration are used, the minimum line width and line separation will decrease thereby increasing the chip failure rate. This indicates that the importance of reliability can only increase in the future. It is, therefore, imperative that circuits are designed with reliability in mind.

This work addresses *electromigration* (EM), [1], [2] which is a major reliability problem caused by the transport of atoms in a metal line due to the electron flow. Under persistent current stress, this can cause deformations of the metal leading to either short or open circuits. The failure rate due to EM depends on the current density in the metal lines and is usually expressed as a median time-to-failure (MTF). There is a definite need for CAD tools that predict the susceptibility of a given design to EM failures.

A simulation tool, SPIDER [3], has been developed to estimate the MTF for each section of a metal bus corresponding to any user-selected interconnect signal. It requires the user to specify current sources to load the metal bus at specified contact points. Using these current sources, SPIDER extracts an equivalent resistance net-

F. N. Najm was with the Coordinated Science Laboratory, University of Illinois at Urbana-Champaign, Urbana, IL. He is now with the VLSI Design Laboratory, Texas Instruments, Inc., Dallas, TX 75265.

R. Burch and P. Yang are with the VLSI Design Laboratory, Texas Instruments, Inc., Dallas, TX 75265.

I. N. Hajj is with the Coordinated Science Laboratory, University of Illinois at Urbana-Champaign, Urbana, IL 61801.

IEEE Log Number 8932823.

work to represent the bus, simulates the network using SPICE [4] to determine the current density in each section, and then estimates the MTF of each section using models developed in [5]. The user, however, is left with the problem of specifying current sources. This can be very hard to do for a big chip, especially for CMOS circuits, because they draw current only during switching transients. Hence, there is a need for a CAD tool that derives these currents.

We present a new technique for solving this *current estimation* problem for CMOS circuits. This has been implemented in the program CREST (CuRrent ESTimator) and has proven to be very effective both in terms of accuracy and speed. We focus our attention on the power and ground buses, and derive loading currents for them to be used for MTF estimation. These busses are the usual, although not only, locations of severe EM failures. Preliminary results of this works have been presented in [6], [7].

It is important to understand exactly what information about the current is needed for EM analysis. CMOS circuits, as pointed out above, draw current only during switching, and, therefore, produce a non-dc current waveform. It is well known [5] that, in the presence of such waveforms, the MTF due to EM is dependent on the shape of the waveform and not simply on its time-average (i.e., area). So a simple averaging approach is unacceptable. On the other hand, the MTF is the combined effect of a large number of current waveforms, corresponding to the variety of logical waveforms that can be applied at the circuit inputs during typical operation. It would be insufficient, therefore, to use a standard timing simulator to derive the current corresponding to a single set of logical input transitions. It is also obvious that redoing a such a simulation for every possible input transition is impractical, since for a circuit with n inputs, the number of possible transitions at the inputs is  $2^{2n}$ .

CREST overcomes this problem by deriving an *expected current waveform*; this is a waveform whose value at a given time is the weighted average of all possible current values at that time, as shown in Fig. 1. Such a waveform is a good compromise between an *unacceptable* time-average and an *insufficient* single-transition estimate of the current, and provides an appropriate current estimate for EM analysis.

Manuscript received January 24, 1989. This work was supported by Texas Instruments, Inc., and the U.S. Air Force Rome Air Development Center. This paper was recommended by Editor M. R. Lightner.

IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN, VOL. 9, NO. 4, APRIL 1990



Fig. 1. Four actual current waveforms (dashed) and the corresponding expected current waveform (solid).

To derive this waveform, CREST considers a userspecified range (or set) of possible input values and/or transitions and calculates the expected current waveform *over* this range. Rather than enumerating the set of inputs and averaging the corresponding current results (which would be impractical), CREST uses *statistical* information about the inputs to directly derive the required expected current waveform. The resulting methodology is what we call a *probabilistic simulation* of the circuit. In general, it can be slightly more time consuming than standard timing simulation, but it needs to be applied only once, resulting in significant speedup.

Several simplifying assumptions and/or approximations will be made in the following sections to make the problem computationally tractable. Whenever possible, we will attempt to justify these assumptions. However, for lack of space, this will not always be possible, and the reader will be referred to appropriate references. Nevertheless, we will offer a verification of the overall approach on a *global* scale, by comparing the end result of the simulation (expected waveform) from CREST with that derived using SPICE.

The rest of this paper is organized as follows. The next section gives a system overview to introduce some basic concepts and the overall simulation strategy. Section III explains the basic current estimation algorithm as part of the simulation of standard CMOS gates. Pass-transistor circuits are handled in Section IV. Section V discusses supergates, required to handle signal dependence. Section VI presents implementation issues and results, and the last section draws some conclusions and indicates the direction of future research. The appendix discusses a basic graph reduction operation that will be frequently used.

## **II. SYSTEM OVERVIEW**

We have developed a new current estimation approach that derives the expected current waveform directly from probabilistic information about the inputs. A single eventdriven simulation, similar in some aspects to timing simulation, is the basis of the technique. Before going into the details, we introduce some basic concepts and provide the theoretical groundwork.

Consider a set  $\Omega$ , each element of which represents a combination of *logical waveforms* to be applied at the cir-

cuit inputs;<sup>1</sup> this is the range of inputs over which the expected current waveform is to be derived. If certain probabilities are assigned to the elements of  $\Omega$ , then we can think of it as a *probability space* [8]. Associated with each element of  $\Omega$  is an actual current waveform that the circuit would draw if subjected to that combination of inputs. This association (or mapping) defines a *stochastic process* i(t) whose *mean* E[i(t)] is the expected current waveform to be derived. Likewise, every input node  $N_i$  has associated with it a stochastic process  $x_{N_i}(t)$  that embodies the different possible logic waveforms allowed at  $N_i$ . These in turn define other processes at the internal nodes of the circuit.

The technique to be presented takes the user's input specification (defining the processes  $x_{N_i}(t)$ ) and uses the circuit topology (which defines the mapping from logical inputs to current waveforms) to derive the corresponding processes at internal nodes, decipher the statistics of i(t), and derive its mean. This, somewhat abstract, description will be made more concrete in Section II-2.1.

## 2.1. Probability Waveforms

We build on the concept of *signal probabilities* [9], which has recently become popular in the testing field [10]. This can be summarized as follows: a probability value is assigned to each input node to indicate the probability that it is high. The circuit topology is then used to propagate these values so that the probability at any internal node is derived. The probability space in this case is a subset of the set  $\Omega$  defined above, because it contains steady-state values only, and not waveforms.

We extend the signal probabilities concept by defining *transition probabilities*. The transition probability of a signal N at time t is the probability of a low-to-high transition from  $t^-$  (just before t) to  $t^+$  (just after t), denoted  $P_{N,lh}(t)$ . Given these probabilities at the inputs, then internal node transition probabilities can be derived from them.

Using transition probabilities we can compactly describe a large set of logic waveforms. For example: input N is high with probability 0.76 at time 0 ( $P_{N,h}(0) = 0.76$ ), switches low-to-high with probability 0.5 at time 2 ns ( $P_{N,h}(2 \text{ ns}) = 0.5$ ), is then high with probability 0.35 at time 3 ns ( $P_{N,h}(3 \text{ ns}) = 0.35$ ), etc. Such an alternating sequence of signal probabilities and transition probabilities will be referred to as a *probability waveform*. An example of a probability waveform and the few logical waveforms it represents is shown in Fig. 2. CREST uses such a waveform as a representation<sup>2</sup> of a stochastic process at a node. The site of a transition prob-

Strictly speaking, if the circuit contains memory elements, then its initial state also affects the structure of  $\Omega$ .

<sup>&</sup>lt;sup>2</sup>This, in fact, is an *incomplete* representation, but is sufficiently accurate for our purposes. It would be prohibitively expensive to maintain a complete representation.



Fig. 2. A probability waveform (bottom) represents four logical waveforms (top).

ability in this waveform will also be referred to as a *transition edge*, shown as an arrow in Fig. 2. A transition edge is an important part of a probability waveform because it may cause current to be drawn. Transition edges will also be called *probabilistic events*, or simply *events*.

An event at a node N at time t is described by the three probabilities:  $P_{N,h}(t^-)$ ,  $P_{N,lh}(t)$ , and  $P_{N,h}(t^+)$ . It is easy to see that these are enough to describe the statistics of the event since other probabilities can be derived from them. For instance, the probability of a high-to-low transition can be easily derived using the identity  $P_h(t^+) - P_h(t^-) = P_{lh}(t) - P_{hl}(t)$ , which is obtained from simple probability theory.

# 2.2. Simulation Algorithm

CREST is a *probabilistic* simulator since it operates on probabilistic, rather than logical, signals. The simulation algorithm itself, however, is deterministic. The user specifies probability waveforms at the primary inputs of the circuit, which are propagated into the circuit to derive the expected current waveform.

The waveforms specified at the circuit inputs constitute the primary means for the user to influence the simulation. They allow one to study the reliability of the circuit under "typical" operating conditions. As described above, every probability waveform consists of a sequence of transition edges, or events, tagged with signal and transition probabilities. If these probabilities are set to 0's and 1's, then the waveform becomes a simple logic waveform. Thus if enough is known about the activity at the circuit inputs, the user may fine tune the simulation to the point where logical waveforms are specified at some inputs (say, clock inputs). Otherwise, if little is known about the inputs, then one can allow a large variety of possible signals by specifying appropriate probability waveforms.

To prepare a set of probability waveform inputs, there are at least two options. If one is comfortable with the probability waveform concept, then these inputs may be simply entered as sequences of real numbers between 0 and 1. Otherwise, we assume that the user has available a representative set of logical input waveforms from previous logic, timing, or fault simulation runs on his design. The corresponding set of probability waveforms can then be obtained as follows. For a given input node, consider all the logical waveforms that it may experience. Then, for every time point, take the average of the values in all these waveforms, considering high to be 1 and low to be 0. When the signal is not changing, the value thus obtained is the required probability waveform value at that time. When the signal *is* changing, from low to high, then the fraction of the logic waveforms in which it makes that transition at that time gives the required transition probability in its probability waveform.

Given the input probability waveforms, an event-driven simulation approach, similar to what is commonly employed by logic or timing simulators, is used to propagate them throughout the circuit. The circuit is divided into gates. When a probabilistic event occurs on the input to a gate, the expected current pulse caused by the event is estimated, and the appropriate probabilistic event is created on the output of the gate. The expected current pulses from individual gates are summed to create the expected current waveform drawn by the circuit. The program operates on a transistor description of the circuit, which it partitions into primitive gates of two major types: standard CMOS gates and pass transistor gates. Current pulse estimation and probabilistic event propagation for each of these gate types will be further discussed in Sections III and IV, respectively.

## 2.3. Signal Dependence

A major problem with any tool dealing with a number of statistical quantities is keeping track of, and modeling, the *correlation* or *dependence* among them, CREST is no exception. Dependence between signals in CREST is of two types. The first is a dependence existing between the two values at the same node at two distinct time points, this will be referred to as *temporal dependence*. The second is the dependence existing between two signals if their nodes depend on the same fan-out stem in the circuit, this will be referred to as *spatial dependence*. A feedback loop involves both spatial and temporal dependencies; in general two signals may be dependent due to either or both of these types.

CREST accounts for temporal dependence in a limited sense. The dependence between two signal values separated by a single transition edge *is* accounted for—this, in fact, is the reason transition probabilities are introduced. The program, however, does not keep track of the dependence between signal values separated by more than one transition edge. It turns out that this approach is sufficiently accurate for our purposes because the probabilities at (and on both sides of) a transition edge are enough to derive the current pulse corresponding to it. To accurately keep track of temporal dependence between the signals at any two time points in a waveform would be equivalent to a complete representation of a stochastic process<sup>2</sup> and would be prohibitively expensive.

Spatial dependence is handled using the concept of a *supergate* [10], as explained in Section V. It is important to make the point now, however, that this reduces to simulating gates whose inputs are *independent*. The descrip-

tions of the simulation of primitive gates in the following sections will, therefore, assume that a gate's inputs are independent.

# 2.4. Graph Reduction

This section briefly introduces a *graph reduction* procedure that is central to the simulation algorithms to be presented, and which will be used frequently below. The Appendix is devoted to a precise formulation and description of this approach.

Simply stated, the need will frequently arise to derive the probability that a mesh of transistors, joining two specified points, provides a conducting path between them. An example is given in Fig. 3, where the p part of a CMOS complex gate is shown. Typically, the probabilities at the gate nodes of the transistors are known, and the probability at the gate output is required. The general methodology will be to consider a graph that is identical to the transistor mesh (Fig. 3) in which the edges are labeled with the probabilities of the gate nodes. This graph is then *reduced* to a single edge whose label(s) provide the required probabilities. In Fig. 3, the probability that the output is high is simply the probability that the edge  $\xi$  is conducting.

Actually, the edges are labeled by both signal *and* transition probabilities, as well as by the expected conductance of the transistors. After the graph is reduced, the resultant labels on  $\xi$  determine the event at the output, and its expected conductance is used to derive the expected current. The reader is referred to the Appendix for details of this reduction.

### **III. STANDARD GATE SIMULATION**

The term *standard gate* will be used to refer to a CMOS fully complementary gate, as shown in Fig. 4. The p-block or p-part (n-block or n-part) of a gate will be used to refer to the p- (n-) channel transistor mesh between its output node and the power supply (ground). A gate will be assumed to have spatially independent inputs. The general case is properly handled using the concept of a supergate, as described in Section II, with the independent-inputs-gate-solver used as a subroutine. Simulating a gate is the procedure of analyzing a gate that has certain events at its inputs to derive the corresponding output event and expected current pulse.

Given the events at the inputs of a gate at a certain time t, its output event can be easily derived as follows. Consider the p part of the gate and build a graph that represents it using the transistor gate probabilities to label the graph edges with the probabilities that each edge "is on," "was on," and "transitions from off to on." The graph reduction procedure described in the Appendix is then used to reduce the graph to a single edge between  $V_{dd}$  and the output node. It is obvious that the probabilities of the single edge  $\xi$  remaining at the end of the reduction give the required output event. The time of occurrence of this event will be derived at the end of this section.



Fig. 3. A typical graph reduction.



Fig. 4. A generic CMOS gate.

The current estimation procedure at each gate will now be discussed. The expected gate current pulse will be modeled by a *triangular pulse* that starts with a peak of  $E[I] \stackrel{\triangle}{=} E[i(t^+)]$  at time t and decays linearly to zero at time  $t + \tau$ . The rest of this section describes the derivation of E[I] and  $\tau$ .

We will focus on the *charging* current component and leave out the *direct* component which may be drawn through a path of p- and n-channel transistors during the transition. This policy has been adopted based on Veendrick's [11] work which suggests that if the gate is *well designed* then the direct current component may be neglected.

Consider the generic CMOS gate structure shown in Fig. 4. The figure shows the p-transistor block, the ntransistor block, and the output node capacitance split into two lumped capacitors  $C_p$  to  $V_{dd}$  and  $C_n$  to  $V_{ss}$ . Similarly, each internal node  $n_i$  has two capacitances  $C_{in}$  and  $C_{ip}$ . The values of these capacitances are derived from the circuit description and the transistor model parameters. On a low-to-high transition, the currents flowing through  $C_n$ and  $C_p$  at the output node are  $i_{p1}$  and  $i_{p2}$ , respectively, as shown in the figure. The corresponding  $i_{n1}$  and  $i_{n2}$  for a high-to-low transition are also shown. The currents  $i_{p2}$  and  $i_{n2}$  are discharging currents that redistribute locally, and we are interested in  $i = i_{p1} + i_{n1}$ . Of course, these currents are associated with the output node only, and the total gate current  $i_{tot}$  will be larger than *i*. However, the output current will play a central role in the derivation.

Let  $i_p = i_{p1} + i_{p2}$  and  $i_n = i_{n1} + i_{n2}$ . It is easy to verify that  $i_{p1} = i_p \times C_n/(C_p + C_n)$ , and  $i_{n1} = i_n \times C_p/(C_p + C_n)$ . Therefore

$$E[i(t)] = E[i_p(t)] \times \frac{C_n}{C_p + C_n} + E[i_n(t)] \times \frac{C_p}{C_p + C_n}.$$
(3.1)

Also, in particular, the value at the peak is

$$E[I] = E[I_p] \times \frac{C_n}{C_p + C_n} + E[I_n] \times \frac{C_p}{C_p + C_n}.$$
(3.2)

The values of  $E[I_p]$  and  $E[I_n]$  are derived as follows. For  $i_p$ , consider the p part of the gate, and let every transistor  $T_k$  be represented by a switch of on-conductance  $g_{\text{on},k}$ , where  $g_{\text{on},k}$  is (the maximum conductance) given by

$$g_{\text{on},k} = \frac{\beta_k}{2V_{\text{dd}}} \left( V_{\text{dd}} - V_T \right)^2 (1 + \lambda V_{\text{dd}}) \qquad (3.3)$$

where  $V_T$  is the magnitude of the transistor threshold voltage,  $\beta_k$  is its transconductance, and  $\lambda$  is the channel length modulation factor. This definition of  $g_{on,k}$  is used because we are interested in the peak current drawn, which occurs during saturation. The value of  $g_{on,k}$  is derived from the transistor model parameters given by the user.

Now let  $G_p(t)$  be the random conductance between the output node and  $V_{dd}$ .  $G_p$  is a function of the individual transistors' random conductances  $g_k$ , where  $g_k$  is 0 if the transistor is off and  $g_{on,k}$  if it is on. If an event occurs at the gate at time t, then the value of  $E[G_p(t^+)]$  and the previous state of the output node,  $V_o(t-)$ , will determine  $E[I_p]$ . Formally, we have  $E[I_p] = E[(V_{dd} - V_o(t^-)) \times G_p(t^+)]$ , which becomes

$$E[I_p] = V_{dd} \times E[G_p(t^+)]G_p(t^-) = 0] \times P(G_p(t^-) = 0)$$
(3.4)

where P(A) is the probability of the event A, and E[A|B]denotes the *conditional expected value* of A given B. The formula is correct because if  $G_p(t^-) = 0$  (1) then  $V_o(t^-)$ = 0 ( $V_{dd}$ ). Similarly for the n part of the gate, we get

$$E[I_n] = V_{dd} \times E[G_n(t^+) | G_n(t^-) = 0] \\ \times P(G_n(t^-) = 0).$$
(3.5)

If the gate inputs at  $t^+$  are independent of their values at  $t^-$  then  $E[G_p(t^+)|G_p(t^-) = 0] = E[G_p(t^+)]$  and the problem would be simplified. In this case the value of  $E[G_p]$  (or  $E[G_n]$ ) may be derived from the graph by considering a graph representation of the p-(n-) block of the gate using the conductances  $E[g_k]$  of the transistors and their gate node probabilities and performing a graph reduction. However, this is not true in general and the dependence between  $G(t^+)$  and  $G(t^-)$  should be taken into account.

To find the *conditional* expected value of  $G_p$ ,  $E[G_p(t^+)|G_p(t^-) = 0]$ , we perform the graph reduction using  $E[g_k(t^+)|G_p(t^-) = 0]$ , instead of  $E[g_k(t^+)]$ , for every transistor; likewise for  $G_n$ . If  $x_k$  is the gate node of transistor  $T_k$  in the p part, then it can be shown [12] that:

 $E[q_{1}(t^{+})]G(t^{-}) = 0]$ 

and, if  $T_k$  is in the *n* part, then

$$E[g_{k}(t^{+})|G_{n}(t^{-}) = 0]$$

$$= g_{on,k} \times \left\{ \frac{P_{x_{k},hh}(t)}{P_{x_{k},h}(t^{-})} + [P_{x_{k},lh}(t) - P_{x_{k},l}(t^{-})P_{x_{k},h}(t^{+})] \right\}$$

$$\times \frac{P(G_{n}(t^{-}) = 0|x_{k}(t^{-}) = 0)}{P(G_{n}(t^{-}) = 0)P(x_{k}(t^{-}) \neq 0)} \right\}. \quad (3.7)$$

Therefore, it takes an additional graph reduction for every gate input to compute the values  $P(G_p(t^-) = 0 | x_k(t^-) \neq 0)$  and  $P(G_n(t^-) = 0 | x_k(t^-) = 0)$ .

The derivation of  $E[G(t^+)|G(t^-) = 0]$  outlined above makes the implicit assumption that when the probability space is restricted by the condition  $G(t^-) = 0$ , the independence of the gate intputs is preserved. This may not always be true, and the implementation in CREST has a protection measure to safeguard against this consisting of a simple upper bound [12] on  $E[G(t^+)|G(t^-) = 0]$ .

Having found E[I] for the output node, the expected value of charge delivered to (or from) the output node capacitors is easily found as follows:

$$E[q] = V_{dd} \times C_n \times P_{o,lh}(t) + V_{dd} \times C_p \times P_{o,hl}(t)$$
(3.8)

where o is the output node. We now make the approximation that the time constant for charging or discharging the output node is the largest of the internal gate nodes. Consequently, the time span of the output node current represents the time span  $\tau$  of the total gate current. By the triangular pulse approximation:

$$\tau = 2 \times \frac{E[q]}{E[I]}.$$
 (3.9)

Next, the expected value of the charge delivered by the total gate charging current,  $E[q_{tot}]$  is derived using the capacitances at each internal node *j* as follows:

$$E[q_{\text{tot}}] \approx \sum_{j \in p \text{ block}} V_{\text{dd}} C_{jn} P_{j,lh} + \sum_{j \in n \text{ block}} V_{\text{dd}} C_{jp} P_{j,hl}.$$
(3.10)

Strictly speaking the probabilities  $P_{j, lh}$  and  $P_{j, hl}$  are hard to find; they are in fact *NP-hard* to find in general, based on [13]. We have, therefore, opted to use an upper bound of these probabilities to replace them in the equation. An upper bound of  $P_{j, lh}(P_{j, hl})$ , for a node in the p- (n-) block, is the probability that the conduction state between j and  $V_{dd}(V_{ss})$  goes from off-to-on. This is found by a graph reduction that repeats the work done to find  $P_{o, lh}$  for the output o for every internal node j. Finally, the peak total

$$= g_{\text{on},k} \times \left\{ \frac{P_{x_{k},l}(t)}{P_{x_{k},l}(t^{-})} + \left[ P_{x_{k},lh}(t) - P_{x_{k},l}(t^{-}) P_{x_{k},h}(t^{+}) \right] \times \frac{P(G_{p}(t^{-}) = 0 | x_{k}(t^{-}) \neq 0)}{P(G_{p}(t^{-}) = 0) P(x_{k}(t^{-}) = 0)} \right\}$$
(3.6)

current is found as

$$E[I_{\text{tot}}] = \frac{2E[q_{\text{tot}}]}{\tau} = \frac{E[q_{\text{tot}}]}{E[q]} \times E[I]. \quad (3.11)$$

The relationship between these quantities is depicted in Fig. 5.

Having derived the expected gate current pulse, the time of the new event at the output of this gate needs to be derived, as follows. If one considers a resistor R charging a capacitor C from a power supply V, then the current and voltage at the capacitor both reach their half-point at time  $0.693 \times RC$ . If we assume that the individual current pulses i(t) are exponentially decaying, rather than linear, then the switching time at the output,  $t_s$ , is  $0.693 \times (C_p + C_n)/G_p$  for a low-to-high transition, and  $0.693 \times (C_p + C_n)/G_n$  for a high-to-low transition. This  $t_s$  is, again, a random variable, and one is interested in  $E[t_s|V_o$  transitions]. Knowing that the duration of  $i_p$  (or  $i_n$ ) determines the gate delay, and that these currents deliver charge to both  $C_p$  and  $C_n$ , then if  $q_p$  and  $q_n$  are the charges delivered, we have

$$E[q_p] = V_{\rm dd} \times (C_p + C_n) \times P_{o,lh} \qquad (3.12)$$

and

$$E[q_n] = V_{dd} \times (C_p + C_n) \times P_{o,hl}. \quad (3.13)$$

The duration of the two pulses is derived as before as

$$\tau_p = 2 \times \frac{E[q_p]}{E[I_p]}$$
 and  $\tau_n = 2 \times \frac{E[q_n]}{E[I_n]}$ . (3.14)

Having found these values, the time delay can be shown [12] to be

$$E[t_s | V_o \text{ transitions}] = 0.35 \times \frac{\tau_p \times P_{o,lh} + \tau_n \times P_{o,hl}}{P_{o,lh} + P_{o,hl}}.$$
(3.15)

It is important to note that  $\tau_p$  and  $\tau_n$  are independent of the particular partitioning of  $(C_p + C_n)$ , which makes the timing estimate reliable.

# **IV. PASS-TRANSISTOR CIRCUITS**

Pass-transistors present a problem because they act as memory elements. The output of a pass-transistor network depends on more than just its inputs, the previous values at its output and internal nodes are also important. It is also no longer true that the probabilities of the path(s) from the output to  $V_{dd}$  or  $V_{ss}$  completely define the output event because the output may be disconnected from both. Current drawn through pass-transistors is a relatively small fraction of the overall current dissipated in a CMOS circuit and will be ignored. However, events must be propagated through them so that the gates downstream are correctly simulated.

To reduce their complexity, we break up pass-transistor networks into two types of primitive gates: *stages* and *wires*, as shown in Fig. 6. A concrete example of this



Fig. 5. Relationship between gate output current and gate total current pulses.



Fig. 7. The decomposition of a typical pass-transistor circuit into stages and wires.

decomposition is given in Fig. 7. Stages represent conducting paths that connect two nodes; while wires are used to tie together the outputs of two or more stages. These gates can be used to build complicated pass-transistor networks. This decomposition involves two simplifying assumptions: transistors are unidirectional, and memory states are significant only at the boundaries of stages and wires. The direction of each gate must be carefully assigned, and we use several levels of algorithms, including some rules from [14].

As in traditional logic simulation, the two logic values 0 and 1 are not enough to model pass-transistor circuits, and ways of representing *weak* 0 and 1 signals must be introduced. Nodes in a pass-transistor section of a circuit can have four valid states: conducting path to  $V_{dd}$  (high tied or *ht*), conducting path to  $V_{ss}$  (low tied or *lt*), charged with no conducting paths (high floating or *hf*), and discharged with no conducting paths (low floating or *lf*). An additional state is introduced: no path to either  $V_{dd}$  or  $V_{ss}$  (floating or *f*). Although there is redundant information, five probabilities are used to fully define the states of all nodes in a pass transistor structure:  $P_{ht}$ ,  $P_{lt}$ ,  $P_{hf}$ ,  $P_{lf}$ , and  $P_{f}$ .

Correspondingly, the set of probabilities needed to describe an event is also enlarged, these events will be referred to as *expanded events*. Expanded events must contain the probability that the node was high tied  $(P_{hr})$ , low tied  $(P_{h})$ , and high  $(P_{h})$  before and after the transition. Additionally, transition probabilities from low to high  $(P_{h})$ , low tied to high tied  $(P_{h \rightarrow h})$ , low tied to floating  $(P_{h \rightarrow f})$ , floating to high tied  $(P_{f \rightarrow h})$ , and floating to floating  $(P_{f \rightarrow f})$  are needed to propagate probability waveforms through the pass-transistors.

# 4.1. Stage Simulation

The simulation of a stage involves a graph reduction by which the stage is reduced to a single edge  $\xi$  between its input and output. This procedure (see the Appendix) provides the probabilities that this edge is on,  $P_{\xi,1}(t^+)$ , was on,  $P_{\xi,1}(t^-)$ , and transitions from off to on,  $P_{\xi,01}(t)$ . Other probabilities, such as on to off,  $P_{\xi,10}(t)$ , on to on,  $P_{\xi,11}(t)$ , and off to off,  $P_{\xi,00}(t)$ , can be derived from simple probability theory. If the stage has input x and output y then the following formulas can be derived [12]:

$$P_{y,ht}(t^{\pm}) = P_{\xi,1}(t^{\pm}) \times P_{x,ht}(t^{\pm})$$
(4.1)

$$P_{y,h}(t^{\pm}) = P_{\xi,1}(t^{\pm}) \times P_{x,h}(t^{\pm})$$
(4.2)

$$P_{y,h\to ht}(t) = P_{\xi,11}(t) \times P_{x,h\to ht}(t)$$
(4.3)

$$P_{y,f \to ht}(t) = P_{\xi,01}(t) \times P_{x,ht}(t^{+}) + P_{\xi,11}(t) \\ \times P_{x,f \to ht}(t)$$
(4.4)

$$P_{y,lt \to f}(t) = P_{\xi,10}(t) \times P_{x,lt}(t-) + P_{\xi,11}(t)$$

$$\times P_{-t_{1} \to s}(t)$$
(4.5)

$$P_{y,f \to f}(t) = P_{\xi,00}(t) + P_{\xi,01}(t) \times P_{x,f}(t^{+}) + P_{\xi,10}(t)$$

$$\times P_{x,f}(t^{-}) + P_{\xi,11}(t) \times P_{x,f \to f}(t).$$
 (4.6)

Notice that these formulas, while determining most of the stage output probabilities, do not finalize its simulation because the probabilities of the high floating or low floating states are not yet available. This is to be expected because the state of the floating output of a stage can be affected by other stages tied to the same wire gate. The simulation of a wire, to be discussed next, uses the quantities derived above for the stage(s) outputs to finalize their simulation and obtain the wire output probabilities.

# 4.2. Wire Simulation

If a wire has more than two inputs, one can think of it as a cascade of several wires with two inputs each. Consequently, it is sufficient to study the simulation of a wire with only two inputs x and y, and an output z. Since a wire is basically a wired-OR configuration, voltage division may arise when its inputs are x = lt and y = ht (or vice versa). In this case we make the assumption that the output is z = lt; this is based on the "weak pull-up to  $V_{dd}$ " configuration which is commonly used in CMOS and nMOS circuits. This assumption is implicit in the derivation of the following wire simulation formulas whose (tedious) derivation [12] will not be included here:

$$P_{z,ht} = P_{x,ht} \times P_{y,f} + P_{y,ht} \times P_{x,f} + P_{x,ht} \times P_{y,ht}$$

$$(4.7)$$

$$P_{z,h} = P_{x,h} + P_{y,h} - P_{x,h} \times P_{y,h}$$

$$P_{z,h \rightarrow ht}(t) = P_{x,ht}(t^{+}) \times [P_{y,h \rightarrow ht}(t) + P_{y,h \rightarrow f}(t)]$$

$$+ P_{y,ht}(t^{+}) \times [P_{x,h \rightarrow ht}(t) + P_{x,h \rightarrow f}(t)]$$

$$+ P_{x,f}(t^{+}) \times P_{y,t \rightarrow ht}(t) + P_{y,f}(t^{+})$$

$$\times P_{x,h \rightarrow ht}(t)$$

$$- P_{x,h \rightarrow ht}(t) \times P_{y,h \rightarrow f}(t) - P_{y,t \rightarrow ht}(t)$$

$$\times P_{x,h \rightarrow ht}(t)$$

$$- P_{x,h \rightarrow ht}(t) \times P_{y,h \rightarrow ht}(t)$$

$$(4.9)$$

$$P_{z,f \to ht} = P_{x,f \to ht} \times P_{y,f \to f} + P_{y,f \to ht}$$
$$\times P_{x,f \to f} + P_{x,f \to ht} \times P_{y,f \to ht}$$
(4.10)

$$P_{z,lt \to f}(t) = P_{x,f}(t^{+}) \times P_{y,lt \to f}(t) + P_{y,f}(t^{+})$$
$$\times P_{x,lt \to f}(t)$$
$$- P_{x,lt \to f}(t) \times P_{y,lt \to f}(t)$$
(4.11)

$$P_{z,f \to f} = P_{x,f \to f} \times P_{y,f \to f}.$$
(4.12)

When all the inputs of a wire have been considered, its simulation is finalized using the following two formulas [12]:

$$P_{z,lf}(t^{+}) \approx \begin{cases} P_{z,lt \to f}(t), \\ \text{if } P_{z,f}(t^{-}) = 0; \\ \frac{P_{z,lf}(t^{-})}{P_{z,f}(t^{-})} \times P_{z,f \to f}(t) + P_{z,lt \to f}(t), \\ \text{otherwise.} \end{cases}$$

$$(A.12)$$

$$P_{z,lf \rightarrow ht}(t^{+}) \approx \begin{cases} 0, & \text{if } P_{z,f}(t^{-}) = 0; \\ \frac{P_{z,lf}(t^{-})}{P_{z,f}(t^{-})} \times P_{z,f \rightarrow ht}(t) \\ & \text{otherwise.} \end{cases}$$
(4.14)

A minor independence assumption must be made in the derivation of these formulas; this is necessary because of the limited sense in which temporal dependence is accounted for (as mentioned in Section II-2.3).

Having derived the probabilities of the wire's output event, the only remaining unknown is the *time* of that event. The approach used by Horowitz [15] is suited to our probabilistic models. By using the expected conductance of a stage derived in the graph reduction step, and the internal capacitances of stages, a time constant can be derived for the pass-transistor network. This is used to derive the time of the output event.

# V. SUPERGATES

CREST assumes that the primary circuit inputs are *in-dependent*. The single-gate current estimation algorithms described above require that the probability waveforms,

at the inputs of each gate, be independent. This condition is violated if the circuit contains reconvergent fan out or feedback. To overcome this problem, we borrow the concept of *supergates* from [10]; a supergate is simply a subset of the circuit with independent inputs, an example is shown in Fig. 8. Supergate input nodes that are reconvergent fan-out stems, or that affect internal reconvergent fan-out stems of the supergate, are called *reconvergent fan-out input nodes*, (RFI) nodes. Nodes A and B in Fig. 8 are RFI nodes of that supergate.

If logic (not probability) waveforms are assigned to RFI nodes, then the signals at all internal supergate nodes become independent and the supergate can be easily simulated. Given the probabilistic events at the RFI inputs of a supergate, suppose the set of all possible logical transitions embodied by these events is generated (Fig. 9), and the supergate simulated for each of them. If the current results are summed up, weighted by the probability of each case, the required supergate currents would result.

Based on these observations, the simulation of a supergate in CREST is carried out by maintaining a set of different *simulation subprocesses*, each representing the result of a particular sequence of *logical* events at the supergate's RFI nodes. Every subprocess has a certain probability, namely the probability that the supergate has that state at that time. When a new event arrives at an RFI node it is applied to all existing subprocesses, some of which will cause the creation of new subprocesses. When two subprocesses are performing identical simulations, they are *merged* to produce a single new subprocess whose probability is the sum of their probabilities.

This approach, while acceptable for small supergates, is too expensive for larger ones. In these cases, CREST uses two heuristic parameters to reduce the performance penalties while maintaining acceptable accuracy.

1)  $\alpha_s$ : Limit supergate size to a user-specified *size* threshold,  $\alpha_s$ . By limiting supergate size, this indirectly limits the number of RFI nodes and, therefore, the number of simulation subprocesses.

2)  $\alpha_p$ : Terminate a simulation subprocess if its probability becomes less than a user-specified *probability threshold*,  $\alpha_p$ . This eliminates subprocesses that are likely to contribute very little to the current waveform.

The effectiveness of these heuristics will be discussed in the next section.

The formation of supergates is done as a preprocessing step, during the initial partitioning phase of the program. The details of that process are tedious and not interesting enough to warrant inclusion in this paper. Very briefly, if the circuit is combinational, then the process involves a forward and a backward sweep to discover the node dependencies and build the supergates. In a circuit with feedback, it is easy to see that every feedback loop is, strictly speaking, a single supergate. Therefore, a standard strongly-connected-components algorithm is first used to group feedback loops into temporary supergates, after which the circuit appears combinational, and supergates can be easily built. Since large feedback loops can



Fig. 8. A supergate schematic showing two RFI nodes A and B.



Fig. 9. The decomposition of a single event into logical transitions.

lead to huge supergates, we have used heuristics to break up large feedback blocks into smaller ones to maintain a reasonable execution time.

# VI. IMPLEMENTATION AND RESULTS

As mentioned above, the probabilistic simulation approach has been implemented in a program called CREST. The program is about 15 000 lines, written in C, and has been run on a variety of circuits and computer systems. It accepts a SPICE circuit description file, and requires another file that specifies the probability waveforms at the primary circuit inputs. Excellent accuracy and speed have been achieved on real circuits.

To assess the accuracy of the results, it is important to make a fair comparison with an *expected current waveform* derived using a valid simulation tool. To do so, we have generated the expected waveform for a variety of examples by running SPICE on every set of input voltage signals allowed by the probability vectors, weighting each resulting current waveform by the probability that the inputs producing it would occur, and summing the weighted waveforms to produce the required result. Since the number of required SPICE simulation runs grows exponentially with the number of circuit inputs, the comparisons to be presented below will necessarily be limited to medium sized circuits. There is no reason to suspect, however, that the accuracy observed on these circuits will deteriorate on larger ones.

Figs. 10 and 11 show the results for a single complex gate and an inverter chain, respectively. Important features of these two figures are the accuracy of the current waveform in Fig. 10 and the good timing performance in Fig. 11. Fig. 12 shows the current pulse for a typical passtransistor circuit. In Fig. 13 we show the result for a larger circuit—a 4-bit ALU with 154 MOSFET's involving a large number of pass-transistor gates and weak pull-up transistors. The ALU was run for logical (rather than probabilistic) inputs because it would take too long to run SPICE for all its possible inputs to allow a comparison with a probabilistic CREST run.

CREST simulation times for these and other circuits are compared with SPICE in Table I. SPICE was chosen for



Fig. 10. Complex gate current estimate.



Fig. 11. A 40-stage inverter chain current estimate.



Fig. 12. Current results for a typical pass-transistor circuit.



Fig. 13. Current results for a 4-bit ALU circuit with 154 MOSFET's.

these comparisons because it is a generally available tool, and as such provides a common frame of reference. Since SPICE is known to be slow on large circuits, we also offer absolute measures of timing in the form of CPU seconds. The table illustrates the dramatic gains available from CREST's probabilistic analysis when an expected waveform is needed. In fact, as the number of circuit inputs increases, the speedup of CREST compared to an approach based on logical inputs increases exponentially since the size of the inputs space increases as  $2^{2n}$ . For example, a (heuristic) CREST run on the 1839-transistor 34-bit ALU, which takes 13 s on a CONVEX 220, is

TABLE I EXECTION TIME COMPARISON BETWEEN CREST AND SPICE. TIME IS IN CPU SECONDS ON A SEQUENT. SIZE REFERS TO THE NUMBER OF TRANSISTORS

| Circuit      | Size | 0.8      | $\alpha_p$ | Time         |
|--------------|------|----------|------------|--------------|
| 2-bit ripple | 54   | $\infty$ | 0          | 11.4         |
| adder        |      | ł        | 0          | 2.6 (4.4X)   |
| 4-bit ripple | 200  | - 300    | 0          | 332.2        |
| adder        |      | x        | .01        | 29.0 (11.4X) |
|              |      | 3        | .1         | 10.6 (31.3X) |
|              |      | 1        | 0          | 5.6 (59.3X)  |
| 4-bit        | 648  | 9        | .0005      | 2403.1       |
| multiplier   |      | 1        | 0          | 297.5 (8.1X) |

equivalent to  $2^{144}$  SPICE runs. The gains possible for logical inputs are illustrated in the last two examples, where logical waveforms were used since the circuits were too large to examine for all inputs in SPICE.

We finally present the results to demonstrate the effectiveness of the supergate simulation heuristics. Three circuits were used for these simulations: a 2-bit adder (Fig. 14), a 4-bit adder (Fig. 15), and a 4-bit multiplier (Fig. 16). The waveforms are compared in the figures indicated and the timing and speedup results of the different runs are shown in Table II. The exact CREST simulation shown in Fig. 14(a) was 3000 times faster than SPICE. Fig. 14(b) shows a heuristic CREST run that is over four times faster with comparable current results. Fig. 15 shows the results of three different heuristic runs in CREST compared to a full-accuracy run. The first run merely eliminated extremely improbable supergate processes and obtained 11X speedup with virtually no accuracy loss. The second run combined both heuristics and achieved excellent accuracy with 31X speedup. The final run completely eliminated supergates and shows acceptable accuracy with a 59X speedup. Fig. 16 compares a fairly tight heuristic run and a relaxed heuristic run for a 4-bit parallel multiplier. In the tight run, supergates are allowed to grow up to nine gates and supergate simulation processes are ignored only if their probability is less than 0.0005. In the relaxed run, the supergate size limit is set to one gate, i.e., "no supergates." The relaxed heuristic ran eight times faster with comparable results. In general, the heuristic yielded comparable results with excellent improvements in speed. This was particularly true for the most complex circuit, the 4-bit multiplier.

In all examples tested, the results were excellent. Peak currents were within 20 percent, average currents were within 10 percent, and, as clearly shown in Fig. 11, timing estimates were within 10 percent of SPICE.

# VII. SUMMARY AND CONCLUSIONS

We have discussed the EM problem and stressed the need for an *expected current waveform* for MTF estimation. We have presented such a technique, based on a new so-called *probabilistic simulation* approach which has been implemented in the program CREST. The combined effects of a variety of input patterns, each of which would require separate SPICE simulations, can be derived with



Fig. 14. Current waveforms for the 2-bit ripple adder in Table II.



Fig. 15. Current waveforms for different heuristic CREST runs compared to the full-accuracy CREST run on the 4-bit ripple adder in Table II.

## IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN, VOL. 9, NO. 4, APRIL 1990



Fig. 16. Current waveforms for a fairly tight heuristic run and a relaxed heuristic run for the 4-bit multiplier in Table II.

 
 TABLE II

 Performance of the Supergate Heuristics. Time is in CPU Seconds on a VAX-11/780. Size Refers to the Number of Transistors

| Circuit    | Size | CREST | SPICE | Speedup |
|------------|------|-------|-------|---------|
| Decode1    | 20   | 1.1   | 642   | 584X    |
| Decode2    | 28   | 1.4   | 989   | 706X    |
| Decode3    | 36   | 1.9   | 1588  | 836X    |
| Invt10     | 20   | 1.0   | 221   | 221X    |
| Invt40     | 80   | 3.3   | 2094  | 635X    |
| Bridge     | 10   | 0.8   | 23237 | 29000X  |
| Pass1      | 8    | 0.8   | 282   | 352X    |
| Pass2      | 9    | 1.1   | 1505  | 1370X   |
| 4-bit ALU  | 154  | 10.9  | 3535  | 324X    |
| 34-bit ALU | 1839 | 145.0 | 86152 | 594X    |
|            |      |       |       |         |

a single simulation at no more expense than the simulation for a single vector. As such, our approach is pattern-independent and provides a dramatic speedup (ranging from  $200 \times to 29\ 000 \times$ ) over the conventional approach using SPICE. Furthermore, the speedup increases exponentially with the number of circuit inputs because a single probabilistic run covers an exponentially increasing number of logical runs. The waveforms produced agree well with results obtained from SPICE: peak currents are within 20 percent, average currents are within 10 percent, and timing estimates are within 10 percent.

CREST can handle general CMOS circuits, allowing pass transistors, reconvergent fan out, and feedback. Heuristics have been presented that help maintain speed in the presence of reconvergent fan out, without significantly affecting accuracy. The results of several CREST runs on real circuits have been presented.

The success of the heuristics leads to the conclusion that reconvergent fan-out paths and the corresponding signal dependence become less important for larger circuits (e.g., Fig. 16). This is supported by the intuitive observation that the effect of a reconvergent fan-out node becomes negligible if the reconvergent paths contain a large number of gates.

The expected current waveform derived by CREST has an obvious immediate use in an unrelated application: the derivation of the expected instantaneous power dissipation, as well as the overall power dissipation of the circuit. Apart from this obvious application, future work on CREST will address the simulation of circuits with truly bidirectional pass-transistors. Other extensions of the approach will also be aimed at estimating the voltage drop on the power and ground busses. This may require deriving more statistics of the current waveform, such as the waveform variance.

# APPENDIX GRAPH REDUCTION

This appendix describes a graph reduction procedure that is central to the simulation algorithms to be presented. Let G = (V, E) be an undirected connected graph with no self loops, possibly with parallel edges, in which two arbitrary vertices  $u, v \in V$  are specified. At time t, each edge  $e \in E$  is labeled with three probability values  $P_{e,1}(t^+)$ , which is the probability that the edge is on,  $P_{e,1}(t^{-})$ , the probability that the edge was on, and  $P_{e,01}(t)$ , the probability that it transitions from off to on. Furthermore, each edge has associated with it a random conductance value g(e) which is either 0, if the edge is off, or  $g_{on}(e)$ , if it is on. The equivalent conductance of a network of edges is defined to be that of an identical resistive network with the same conductances. The events "is on," "was on," and "transitions from off to on," as well as the random conductances, of any two distinct edges  $e_1$  and  $e_2$  are assumed to be independent.

Let  $\psi$  be a random variable that is 1 if there exists at least one path of on-edges in G between u and v, and 0 otherwise. The probabilities  $P_{\psi,1}(t^+)$ ,  $P_{\psi,1}(t^-)$ , and  $P_{\psi,01}(t)$  are defined as above. Finally, let  $G_G$  be the random conductance between u and v at  $t^+$ . We are interested in finding the values of  $P_{\psi,1}(t^+)$ ,  $P_{\psi,1}(t^-)$ ,  $P_{\psi,01}(t)$ , and  $E[G_G]$ .

Finding  $P_{\psi,1}(t^+)$ , which is perhaps the simplest to derive, is known to be *NP-hard*, see [16. p. 211]. The algorithms to be given will, therefore, include some approximations to maintain a reasonable execution time. The general methodology will be to reduce the size of the graph by removing edges and/or vertices, so that the probabilities of  $\psi$  and  $E[G_G]$  remain invariant, until the graph reduces to a single edge  $\xi$  between u and v, as shown in Fig. 3. The labels on this edge,  $P_{\xi,1}(t^+)$ ,  $P_{\xi,0}(t)$ , and  $E[g(\xi)]$  are the required answers. Hence, the term "graph reduction."

We will first describe solutions for a restricted class of graphs, namely for series-parallel graphs (see [pp. 197-199, 17]). For such a graph, one can accurately derive the probabilities of  $\psi$  by making parallel and series combinations of edges until a single edge remains. If  $e_p(e_s)$  is the parallel (series) combination of edges  $e_1$  and  $e_2$ , then the following formulas apply:

$$P_{e_{p,1}}(t^{\pm}) = P_{e_{1,1}}(t^{\pm}) + P_{e_{2,1}}(t^{\pm}) - P_{e_{2,1}}(t^{\pm})$$
(A.1)

$$P_{e_{s,1}}(t^{\pm}) = P_{e_{1,1}}(t^{\pm})P_{e_{2,1}}(t^{\pm})$$
(A.2)

$$P_{e_{p},01}(t) = P_{c_{1},01}(t)P_{e_{2},0}(t^{-}) + P_{e_{2},01}(t)P_{e_{1},0}(t^{-}) - P_{e_{1},01}(t)P_{e_{2},01}(t)$$
(A.3)

$$P_{e_{1},01}(t) = P_{e_{1},01}(t)P_{e_{2},1}(t^{+}) + P_{e_{2},01}(t)P_{e_{1},1}(t^{+}) - P_{e_{1},01}(t)P_{e_{2},01}(t).$$
(A.4)



Fig. 17. A generic node elimination step.

The derivation of  $E[G_G]$  is not as simple. To begin with, E[g(e)] is easily derived for every edge e as  $g_{on}(e) \times P_{e,1}$ . We then perform the series-parallel combinations using these values. In case of two edges in parallel, it is easy to see that

$$E[g(e_p)] = E[g(e_1)] + E[g(e_2)].$$
 (A.5)

In the series case, however, the problem is not as simple and we resort to the approximation:

$$E\left\lfloor\frac{1}{g(e)} \mid g(e) \neq 0\right\rfloor \approx \frac{1}{E[g(e) \mid g(e) \neq 0]} \quad (A.6)$$

where E[A|B] denotes the *conditional expected value* of A given B, to derive [12]

$$\frac{1}{E[g(e_s)]} \approx \frac{1}{E[g(e_1)]P_{e_{2,1}}} + \frac{1}{E[g(e_2)]P_{e_{1,1}}}.$$
 (A.7)

In case of a general, nonseries-parallel graph, and since the problem is *NP-hard* as pointed out above, we resort to another technique which is in line with the graph reduction methodology. This involves a *node elimination* technique which is the graph-domain operation corresponding to Gaussian elimination on the graph's adjacency matrix [18]. It works by removing a vertex from the graph and adding new edges between each of its neighbors, as shown in Fig. 17. Suppose a vertex v has neighbors  $v_1, \dots, v_n$  along edges  $e_1, \dots, e_n$ . Upon removing v, every two neighbors  $v_i$  and  $v_j$  share a new edge  $e_{ij}$  (Fig. 17) whose probabilities are derived by applying the series formulas (A.2) and (A.4) above to  $e_i$  and  $e_j$ . Using the approximation (A.6), the value of  $E[g(e_{ij})]$ is derived [12] as

$$\frac{1}{E[g(e_{ij})]} \approx \frac{1}{E[g(e_i)]P_{e_{j,1}}} + \frac{1}{E[g(e_j)]P_{e_{i,1}}} + \frac{1}{E[g(e_j)]P_{e_{i,1}}} + \frac{\sum_{k=1}^{n} E[g(e_k)]}{\frac{k \neq i,j}{E[g(e_i)]E[g(e_j)]}}.$$
 (A.8)

As a result, certain edges are *split* into two or more new edges which may not, therefore, be independent. We, however, ignore this fact and proceed with the graph reduction as before. Knowing that the problem is *NP-hard*, the resulting loss of accuracy is inevitable, but excellent results have been obtained in practice.

#### REFERENCES

- [1] F. M. d'Heurle, "Electromigration and failure in electronics: An introduction," *Proc. of IEEE*, vol. 59, Oct. 1971.
- [2] J. R. Black, "Electromigration failure modes in aluminum metallization for semiconductor devices," Proc. IEEE, vol. 57, Sept. 1969.
- [3] J. E. Hall, D. E. Hocevar, P. Yang, and M. J. McGraw, "SPIDER— A CAD system for modeling VLSI metallization patterns," *IEEE Trans. Computer-Aided Design*, vol. CAD-6, pp. 1023-1031, Nov. 1987
- [4] L. W. Nagel, "SPICE2: A computer program to simulate semiconductor circuits," Ph.D. dissertation, Dep. Elect. Eng., Univ. of Calif., Berkeley, 1975.
- [5] J. W. McPherson and P. B. Ghate, "A methodology for the calculation of continuous dc electromigration equivalents from transient current waveforms," in *Proc. Symp. on Electromigration of Metals*, New Orleans, LA, pp. 64-74, Oct. 7-12, 1984.
  [6] R. Burch, F. Najm, P. Yang, and D. Hocevar, "Pattern-independent of the second secon
- [6] R. Burch, F. Najm, P. Yang, and D. Hocevar, "Pattern-independent current estimation for reliability analysis of CMOS circuits," in *Proc.* ACM/IEEE 25th Design Automation Conf., pp. 294–299, June 12-15, 1988.
- [7] F. Najm, R. Burch, P. Yang, and I. Hajj, "CREST-A current estimator for CMOS circuits," in *Proc. IEEE Int. Conf. on Computer-Aided Design*, Santa Clara, CA, pp. 204-207, Nov. 7-10, 1988.
- [8] A. Papoulis, Probability, Random Variables, and Stochastic Processes. New York: McGraw-Hill, 1984.
- [9] K. P. Parker and E. J. McCluskey, "Probabilistic treatment of general combinational networks," *IEEE Trans. Comput.*, vol. C-24, pp. 668-670, June 1975.
- [10] S. C. Seth, L. Pan, and V. D. Agrawal, "PREDICT-Probabilistic estimation of digital circuit testability," in *Proc. IEEE 15th Ann. Int. Symp. on Fault-Tolerant Computing*, Ann Arbor, MI, pp. 220-225, June 1985.
- [11] H. J. M. Veendrick, "Short-circuit dissipation of static CMOS circuitry and its impact on the design of buffer circuits," *IEEE J. Solid-State Circuits*, vol. SC-19, Aug. 1984.
- [12] F. Najm, "Probabilistic simulation for reliability analysis of VLSI circuits," Ph.D. dissertation, Dep. Elect. and Computer Eng., Univ. of Illinois at Urbana-Champaign, June 1989.
- [13] F. Najm and I. Hajj, "The complexity of test generation at the transistor level," Rep. UILU-ENG-87-2280, Coordinated Science Lab., Univ. of Illinois at Urbana-Champaign, Dec. 1987.
  [14] N. P. Jouppi, "Derivation of signal flow direction in MOS VLSI,"
- [14] N. P. Jouppi, "Derivation of signal flow direction in MOS VLSI," IEEE Trans. Computer-Aided Design, vol. CAD-6, pp. 480-490, May 1987.
- [15] M. Horowitz, "Timing models for MOS pass networks," in Proc. IEEE Int. Symp. on Circuits and Systems, 1983.
- [16] M. R. Garey and D. S. Johnson, Computers and Intractability, A Guide to the Theory of NP-Completeness. New York: Freeman, 1979.
- [17] F. Harary, "Combinatorial problems in graphical enumeration," in Applied Combinatorial Mathematics, E. F. Beckenbach, Ed. New York; Wiley, 1984.
- York: Wiley, 1984.
  [18] S. Parter, "The use of linear graphs in gauss elimination," SIAM Rev., vol. 3, no. 2, pp. 119–130, Apr. 1961.



Farid N. Najm (M<sup>\*</sup>) received the B.E. degree (with distinction) in electrical engineering from the American University of Beirut in 1983, and the M.S. and Ph.D. degrees in electrical and computer engineering from the University of Illinois at Urbana-Champaign in 1986 and 1989, respectively.

He held temporary positions with ASEA Ltd. in Sweden in 1982 and The University of Warwick in England in 1984. He also worked as an electronics engineer with the American University

of Beirut from 1983 to 1984. While at the University of Illinois, he was a research assistant with the Digital and Analog Circuits Group at the Coordinated Science Laboratory. He held temporary positions with Texas Instruments Inc. in Dallas, TX, in 1987 and 1988. In 1989, he joined the VLSI Design Laboratory at Texas Instruments Inc. as a member of the technical staff. His research interests include design for reliability, probabilistic circuit analysis, timing analysis, test generation, design for testability, and circuit and timing simulation.

Dr. Najm is a member of ACM SIGDA.



**Richard Burch** received the B.S. degree in physics and the M.S.E.E. degree from the University of Oklahoma in 1984 and 1986, respectively. He is currently working towards the Ph.D. degree in electrical engineering at the University of Illinois at Urbana-Champaign, Urbana, IL.

In 1984, he joined Texas Instruments' Semiconductor Process and Design Center as a member of the technical staff. His current interests include research into circuit simulation, power and current estimation, and electromigration failure pre-

diction.



**Ping Yang** (M'81–SM'86) received the B.S. degree in physics in 1974 from the National Taiwan University, Taipei, Taiwan, and the M.S. and Ph.D. degrees in electrical engineering from the University of Illinois, Urbana, in 1978 and 1980, respectively.

From 1974 to 1976 he was an ensign in the Chinese Navy. He joined the Central Research Laboratory at Texas Instruments Incorporated, Dallas, TX. in 1980. His research interests are in the area of computer-aided design, circuits and

systems, large-scale integrated circuits and solid-state electronics. He is currently a Director of Design for Manufacturing Project in the Semiconductor Process and Design Center. He has coauthored two books and presented or published more than 80 articles in international scientific and technical journals and conferences.

Dr. Yang is a member of the IEEE Circuits and Systems Society Adcom (1988–1990), a Program Evaluator of IEEE-ABET/EAC (1988–1989), and a member of Phi Kappa Phi. He has served on the Program Committee of ICCAD, IEDM, DAC, and IEEE Workshop on Numerical Modeling of Processes and Devices for Integrated Circuits. He presently serves as special Section Editor of the *IEEE Circuits and Devices Magazine* devoted to Circuit Simulation and Modeling. In 1984, he served as section chairman of the IEEE Electron Devices Society in Dallas. He has participated as a lecturer in several short courses for continuing education.



**Ibrahim N. Hajj** (S'68-M'70-SM'82) received the B.E. degree from the American University of Beirut in 1964, the M.S. degree from the University of New Mexico, Albuquerque, in 1966, and the Ph.D. degree from the University of California, Berkeley, in 1970, all in electrical engineering.

From 1970 to 1978 he was with the Department of Electrical Engineering at the University of Waterloo, Waterloo, Ont., Canada. In 1978 he joined the University of Illinois at Urbana-Champaign,

where he is now a Professor of Electrical and Computer Engineering and a Research Professor at the Coordinated Science Laboratory. In 1987, he was a Visiting Professor at the Institute of Circuit Theory and Telecommunications of the Technical University of Denmark. His current research interests include computer-aided design of VLSI circuits, network synthesis, simulation, testing, statistical analysis, design for reliability, and parallel processing. He has authored or coauthored over one hundred publications on these subjects. He is a coauthor of a book. *Switch-Level Timing Simulation of MOS VLSI Circuits*, (Kluwer Academic Publishers, 1989).

Dr. Hajj served as an Associate Editor of the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS. He is a member of the Computer-Aided Network Design (CANDE) Committee, and is a member of Sigma Xi.