Basic VLSI Design Notes(1)
Basic VLSI Design Notes(1)
Moore’s Law:
The invention of the transistor by William B. Shockley, Walter H. Brattain
and John Bardeen of Bell Telephone Laboratories was followed by the
development of the Integrated Circuit (IC).
The very first IC emerged at the beginning of 1960 and since that time there
have already been four generations of ICs:
1) SSI (small scale integration)
2) MSI (medium scale integration)
3) LSI (large scale integration)
4) VLSI (very large scale integration).
5) ULSI (ultra large scale integration) which is characterized by complexities
in excess of 3 million devices on a single IC chip.
Over the past several years, Silicon CMOS technology has become the
dominant fabrication process for relatively high performance and cost
effective VLSI circuits.
The revolutionary nature of this development is indicated by the way in
which the number of transistors integrated in circuits on a single chip has
grown as indicated in Figure 1
STEP2:
A layer of silicon dioxide (Si02), typically 1µm thick, is grown all over the
surface of the wafer to protect the surface.
STEP3:
The surface is now covered with a photoresist which is deposited onto the
wafer and spun to achieve an even distribution of the required thickness.
STEP4:
The photoresist layer is then exposed to ultraviolet light through a mask which
defines those regions into which diffusion is to take place together with transistor
channels. Assume, for example, that those areas exposed to ultraviolet radiation are
polymerized (hardened), but that the areas required for diffusion are shielded by
the mask and remain unaffected.
STEP6:
The remaining photoresist is removed and a thin layer of Si02 is grown over the
entire chip surface and then polysilicon is deposited on top of this to form the gate
structure.
STEP7:
Further photoresist coating and masking allows the polysilicon to be patterned and
then the thin oxide is removed to expose areas into which n-type impurities are to
be diffused to form the source and drain as shown.
STEP9:
The whole chip then has metal (aluminum) deposited over its surface to a thickness
typically of 1 micro meter. This metal layer is then masked and etched to form the
required interconnection pattern.
CMOS FABRICATION
1. The p-well Process
Mask 1 - defines the areas in which the deep p-well diffusions are to take
place.
Mask 2 - defines the thinox regions, namely those areas where the thick
oxide is to be stripped and thin oxide grown to accommodate p- and n-
transistors and wires.
Mask 4 - A p-plus mask is now used to define all areas where p-diffusion is
to take place.
Mask 5 - This is usually performed using the negative form of the p-plus
mask and defines those areas where n-type diffusion is to take place.
N-well CMOS circuits are also superior to p-well because of the lower substrate
bias effects on transistor threshold voltage and inherently lower parasitic
capacitances associated with source and drain regions.
For the depletion mode transistor, the gate is connected to the source so it is
always on.
In this configuration the depletion mode device is called the pull-up (p.u.)
and the enhancement mode device the pull-down (p.d.) transistor.
When Vin is logic 0, enhancement nmos is OFF. Hence current flows
through depletion nmos to the output voltage i.e.Vout=Vdd or Vout=logic 1.
When Vin is logic 1, enhancement nmos is ON. Hence current flows from
Vdd to ground via enhancement and depletion nmos. Vout = 0 or logic 0.
Vin Vout
Logic 0 Logic 1
We must now ensure that for this input voltage we get out the same voltage
as would be the case for inverter 1 driven with input = Vdd·
i.e. Vout1=Vout2
Similarly,
In saturation region
------- (1)
In both cases the factor K is a technology-dependent parameter such that
The factor W/L is, of course, contributed by the geometry and it is common
practice to write
1. Region 1:
Vin =logic 0, we have the p-transistor fully turned on while the n-transistor is
fully turned off.
BICMOS Inverters
It consists of two bipolar transistors T1 and T2 with one nMOS transistor
T3, and one pMOS transistor T4, both being enhancement mode devices.
With Vin= 0 volts (GND) T3 is off so that T1 will be non-conducting. But T4
is on and supplies current to the base of T2 which will conduct and act as a
current source to charge the load CL toward +5 volts(Vdd). The output of the
inverter will rise to +5 volts less the base to emitter voltage VBE of T2.
With Vin = +5 volts (Vdd) T4 is off so that T2 will be non-conducting. But
T3 will now be on and will supply current to the base of T1 which will
Now, consider the case in which L = W, that is, a square of resistive material, then
Thus the actual values associated with the layers in a MOS circuit depend on the
thickness of the layer and the resistivity of the material forming the layer.
The unit is denoted □Cg and is defined the gate-to-channel capacitance of a MOS
transistor having W = L = feature size, that is, a 'standard' or 'feature size' square.
Consider the area defined in Figure. First, we must calculate the area relative to
that of a standard gate.
If we consider the case of one standard (feature size square) gate area capacitance
being charged through one feature size square of n channel resistance (that is,
through Rs for an nMOS pass transistor channel).
However, in practice, circuit wiring and parasitic capacitances must be allowed for
so that the figure taken for t is often increased by a factor of two or three so that for
5 µm circuit
𝜏 = 0.2 to 0.3 nsec is a typical design figure used in assessing likely worst case
delays.
INVERTER DELAYS:
If we consider a pair of cascaded inverters, then the delay over the pair will be
constant irrespective of the sense of the logic level transition of the input to the
first.
Assuming t = 0.3 nsec and making no extra allowances for wiring capacitance, we
have an overall delay of 𝜏 + 4𝜏 = 5𝜏. In general terms, the delay through a pair of
similar nMOS inverters is
When considering CMOS inverters, the nMOS ratio rule no longer applies, but we
must allow for the natural asymmetry of the usually equal size pull-up p-transistors
and the n-type pull-down transistors.
overall delay=2𝜏 + 5𝜏 = 7𝜏
In this analysis we assume that the p-device stays in saturation for the entire
charging period of the load capacitor CL.
The saturation current for the p-transistor is given by
𝟐𝑽𝑫𝑫 𝑪𝑳 𝟐𝑽𝑫𝑫 𝑪𝑳
𝝉𝒓 = =
𝜷𝒑 (𝑽𝑫𝑫 𝟎.𝟐𝑽𝑫𝑫 )𝟐 𝜷𝒑 (𝟎.𝟖𝑽𝑫𝑫 )𝟐
Super Buffers:
A positive going logic transition Vin at the input, it will be seen that the inverter
formed by T1 and T2 is turned on and, thus, the gate of T3 is pulled down toward 0
volt with a small delay. Thus, T3 is cut off while T4 (the gate of which is also
connected to Vin) is turned on and the output is pulled down quickly.
A negative going logic transition Vin at the input: when Vin drops to 0 volt, then
the gate of T3 is allowed to rise quickly to VDD· Thus, as T4 is also turned off by
Vin, T3 is made to conduct with Vin on its gate, that is, with twice the average
voltage that would apply if the gate was tied to the source as in the conventional
nMOS inverter.
Stick diagrams may be used to convey layer information through the use of a color
code for example, in the case of nMOS design, green for n-diffusion, red for
polysilicon, blue for metal, yellow for implant, and black for contact areas.
Inverter:
NAND Gate
Inverter:
NOR GATE:
When using nMOS switch logic, there is one restriction which must always
be observed no pass transistor gate input may be driven through one or more
pass transistors.
Logic levels propagated through pass transistors are degraded by threshold
voltage effects.
Since the signal out of pass transistor T1 does not reach a full logic 1, but
rather a voltage one transistor threshold below a true logic 1, this degraded
voltage would not permit the output of T2 to reach an acceptable logic 1
level.
And Operation:
OR operation:
a) CMOS
b) nMOS
Some of the most commonly used inverter circuit diagrams, the inverter
symbol, and the corresponding stick and symbolic diagrams is shown in the
figure below.
In this configuration the depletion mode device is called the pull-up (p.u.)
and the enhancement mode device the pull-down (p.d.) transistor.
𝑍 = = =8
𝑅 =𝑍 × 𝑅 = 8 × 10𝑘Ω = 80𝑘Ω
Similarly
𝑍 = = =1
𝑅 =𝑍 × 𝑅 = 1 × 10𝑘Ω = 10𝑘Ω
For the given dimension of the inverter 4: or 8:1 find the power dissipation
𝑍 = = =4
𝑅 =𝑍 × 𝑅 = 4 × 10𝑘Ω = 40𝑘Ω
Similarly
𝑍 = = =
𝑅 =𝑍 × 𝑅 = × 10𝑘Ω = 5𝑘Ω
The critical factor here is that the output voltage Vout must be near enough to
ground to turn off any following inverter-like stages, that is
𝑛𝑍𝑝.𝑑 𝑛𝑍𝑝.𝑑 1
= = = 0.2
𝑛𝑍𝑝.𝑑 + 4𝑛𝑍𝑝.𝑑 5𝑛𝑍𝑝.𝑑 5
.
nMOS Nand ratio= =
.
The ratio between Zp.u. and the sum of all the pull-down Zp.d must be 4:1.
This ratio must be adjusted appropriately if input signals are derived through
pass transistors.
nMOS Nand gate area requirements are considerably greater than those of a
corresponding nMOS inverter, since not only must pull-down transistors be
added in series to provide the desired number of inputs, but, as inputs are
added, so must there be a corresponding adjustment of the length of the pull-
up transistor channel to maintain the required overall ratio .
nMOS Nand gate delays are also increased in direct proportion to the
number of inputs added. If each pull-down transistor is kept to minimum
size , then each will present 1□Cg at its input, but if there are n such inputs,
then the length and resistance of the pull-up transistor must be increased by a
factor of n to keep the correct ratio.
Thus, delays associated with the nMOS Nand are
NOR gates:
A Parity Generator:
This implementation seems the obvious one, but it does suffer from the fact
that as the input line under consideration moves down in significance so the
complexity of the logic grows.
For example, we have shown only the top three lines in Figure, but it will be
seen that:
An requires one diffusion path and no switches.
An-1 requires two diffusion paths and two switches.
An-2 requires three diffusion paths and four switches.
By inspecting (or mapping from) Table , it will be seen that the following
expressions relate the two codes:
A suitable arrangement is set out in Figure below, and the only detailed
design required is that of a two input Exclusive-Or gate. Many arrangements
are possible to implement this operation
(Vin)t is clocked in by ɸ1 (or ɸ2) of the clock and charges the gate
capacitance Cg of the inverter to Vin.
If subscript t is taken to represent the time during which ɸ1 (say) is at logic 1
and subscript t + 1 is taken to indicate the period during which ɸ1 is at logic
0, then the available output will be ( 𝑉𝚤𝑛 )t+1 which will be maintained by the
stored charge on the gate until Cg discharges or until the next ɸ1 signal
occurs.
If uncomplemented storage is essential, the basic element is modified as
indicated in Figure below and will be seen to consist of six transistors for
nMOS and eight for CMOS.
Data clocked in on ɸ1 is stored on Cg1 and the corresponding output appears
at the output of inverter 1.
On ɸ2 this value is clocked into and stored by Cg2 and the output of inverter 2
then presents the 'true' form of the stored bit.
A passive bus rail is a floating rail to which signals may be connected from
drivers through series switches, for example, pass transistors, to propagate
along the bus and from which signals may be taken, also through pass
transistors.
General considerations
1. Lower unit cost compared with other approaches to the same requirement.
2. Higher reliability High levels of system integration usually greatly reduce
interconnections-a weak spot in any system.
3. Lower power dissipation, lower weight, and lower volume compared with
most other approaches to a given system.
4. Better performance-particularly in terms of speed power product.
5. Enhanced repeatability. There are fewer processes to control if the whole
system or a very large part of it is realized on a single chip.
6. The possibility of reduced design/development periods (particularly for
more complex systems) if suitable design procedures and design aids are
available.
Figure sets out the basic architecture of most, if not all, microprocessors
Now we will decompose the data path into a block diagram showing the
main subunits.
In doing this it is useful to anticipate a possible floor plan to show the
planned relative disposition of the subunits on the chip and thus on the mask
layouts.
A block diagram is presented in Figure below
4-bit shifter
Any general purpose n-bit shifter should be able to shift incoming data by up
to n - 1 place in a right-shift or left-shift direction.
If we now further specify that all shifts should be on an 'end-around' basis,
so that any bit shifted out at one end of a data word will be shifted in at the
other end of the word, then the problem of right shift or left shift is greatly
eased.
In fact, a moment's consideration will reveal, for a 4-bit word, that a 1-bit
shift right is equivalent to a 3-bit shift left and a 2-bit shift right is equivalent
to a 2-bit shift left, etc.
Thus we can achieve a capability to shift left or right by zero, one, two, or
three places by designing a circuit which will shift right only (say) by zero,
one, two, or three places.
1. Crossbar switch:
The arrangement is quite general and may be readily expanded to
accommodate n-bit inputs/outputs.
In fact, this arrangement is overkill in that any input line can be connected to
any or all output lines-if all switches are closed, then all inputs are connected
to all outputs in one glorious short circuit.
Furthermore, 16 control signals (sw00-sw15), one for each transistor switch,
must be provided to drive the crossbar switch, and such complexity is highly
undesirable.
1. Power dissipation:
The simple case of an nMOS 8:1 inverter which may be set out with a
minimum feature size pull-down transistor giving a total resistance from
Vdd to GND of 90 kΩ.
The maximum power dissipation for this particular design will thus be
2. Speed:
Take the simple case of one 8: 1 inverter driving another similar inverter.
The longest delays will occur when the output of the first stage is changing
from logic 0 (Lo) to logic 1 (Hi), that is, the Δ transition of the output.
The capacitances associated with the output and the input of the next stage
must charge through the pull-up resistance of the first stage as in Figure.
It is also obvious that during the complementary ▼ transition the same
capacitances must be discharged through the pull-down transistor of the first
stage.
Summery Table.
Power Area Speed
2λ*2λ pull down Δ=40𝜏
0.278mW 36λ2
16λ*2λ pull up ▼=5𝜏
2λ*6λ pull down Δ=21𝜏
0.744mW 24λ2
6λ*2λ pull up ▼=0.66𝜏
1. Area:
The area of a basic CMOS inverter is proportional to the total area occupied
by the p- and n-devices.
For fixed VDD and f, minimizing Psd requires minimizing CL which can be
achieved by minimizing the area A since CL is proportional to the gate areas
comprising A.
𝑊 𝜇 𝑊
=
𝐿 𝜇 𝐿
Assume Lp=Ln=2λ
𝜇
𝑊 = 𝑊
𝜇
Generally 𝜇 = 2 ∗ 𝜇
Hence
2𝜇
𝑊 = 𝑊
𝜇
𝑊
=2
𝑊
To compensate for the lower hole mobility µp, compared to electron mobility
µn
𝑊 = 2𝑊
Simulators are available for logic (switch level) simulation and timing
simulation.
Circuit simulation via such programs as SPICE is also possible, but may be
expensive in terms of computing time and therefore impractical for other
than small subsystems.
Recent advances in simulators have made it possible to use the software as 'a
probe' to examine the simulated responses on various parts of the circuit to
input stimuli also provided via the simulator.