Extended Abstracts of the 22nd (1990 International) Conference on Solid State Devices and Materials, Sendai, 1990, pp. 653-656

# A Versatile 50ps BICMOS Technology for Mixed CMOS/ECL ICs

H.Klose, B.Hoffmann, M.Kerber, K.Ziemann, I.Kerner, R. Schreiter, R.Koeppl

Siemens AG, Corporate Research and Development, Munich, FRG

H.Kaiser, K.Mueller

Siemens AG, Components Group, Munich, FRG

A high performance BICMOS process denoted B5C for the realization of high speed/high density integrated circuits is presented. The main features of this process are a 1.0 micron minimum feature size and a selfaligned bipolar transistor with 10 GHz cut-off frequency and 50 ps minimum CML stage-delay time. Moreover, bipolar transistors with 70V Early-voltage are available using one additional masking step. As a more complex performance demonstrator, a high speed 16k SRAM with 3.5ns address access time is presented. Both delay times show that B5C is one of the fastest BICMOS processes worldwide.

### 1. Introduction

In the past, the basic differences between MOS and BIPOLAR devices led to two separate processing worlds, each with its own equipment and factories. With the introduction of CMOS-specific process modules such as selfalignment into BIPOLAR processes, and of epitaxy into CMOS flowcharts, the synergy between the two process areas increased. This was the precondition for merging the technologies to form CMOS and BIPOLAR devices on the same chip. The circuit designer can thus realize high speed signal processing paths in BIPOLAR and high density/low power circuit parts such as SRAM cores in CMOS on one chip (e.g. [1]).

This paper presents a 1.0 micron BICMOS process with CMOS devices of 0.8 micron drawn gate length and a high performance selfaligned BIPOLAR transistor. The key electrical data is given and the performance features are demonstrated by respective circuits and demonstrators. Focus is put onto the trade-off between analog and digital performance of the bipolar devices.

### 2. Technology

Fig.1 sketches the schematic flowchart of B5C. The process modules added to the underlying CMOS process are the buried layer formation, the epi-deposition, the collector plug drive-in and the realization of the emitter/base structure. The main technological features of the isolation process block are selfaligned $n^+/p$ buried layers and wells, which give a low-resistance collector wiring and a tight buried layer pitch at a low mask count and thus low cost. The epi is a lightly doped n-type material.



Fig.1 Schematic B5C flowchart

For digital applications this epi-layer is overdoped by either the n- or the p-well [2]. For the use of the BIPOLAR transistor in analog circuits the n-well implant is blocked by one additional but noncritical masking step. Thus the sub-collector region is formed by the lightly doped n-epi.



Fig.2 Early voltage vs.  $V_{\mbox{CE}}$  for the digital and analog BIPOLAR transistor

Fig.2 compares the Early-voltages of the "digital" and "analog" BIPOLAR transistors. In contrast to common models, as implemented for instance in network-analysis programs like SPICE, the Early-voltage is not constant over $V_{CE}$ but passes through a maximum at a $V_{CE}$ of around 3V. The highest Early-voltages achieved are 32V and 85V for the "digital" and "analog" transistor respectively. The Early-voltage $V_A$ reads as

$V_A = Q_B \cdot (dQ_B/dV_{CB})^{-1} = Q_B/C_{BC}$

with $Q_B$ being the neutral base charge, $V_{CB}$ the base to collector voltage and $C_{BC}$ the base-collector junction capacitance. With increasing $V_{CB}$ the Early-voltage rolls off beyond a $V_{CE}$ of 3V, as $Q_B$ decreases due to pinching of the neutral base. $C_{BC}$, on the other hand, remains constant within this voltage regime because the base-collector depletion layer has reached the buried layer. In most analog circuits the $V_{CE}$ of the bipolar transistor is larger than or equal to 1V. Thus the respective worst case data for the Early-voltage are 20V for the "digital" and 75V for the "analog" BJT (Fig.2).
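To illustrate why these Early-voltages matter for analog circuits, the following minimal sketch converts the worst case values of 20V and 75V into a small-signal output resistance. It assumes the textbook relation r_o ≈ V_A / I_C, which is not stated in this paper, and the 100 µA bias current is an arbitrary value chosen for illustration only.

```python
# Sketch: small-signal output resistance from the Early voltage.
# Assumption (textbook relation, not from the paper): r_o ~ V_A / I_C.


def output_resistance(v_early: float, i_collector: float) -> float:
    """Return the approximate output resistance in ohms."""
    if i_collector <= 0:
        raise ValueError("collector current must be positive")
    return v_early / i_collector


I_C = 100e-6  # hypothetical bias current in A, for illustration only

# Worst case Early voltages for VCE >= 1 V, as quoted in the text.
WORST_CASE_VA = {"digital": 20.0, "analog": 75.0}

for name, va in WORST_CASE_VA.items():
    r_o = output_resistance(va, I_C)
    print(f"{name}: V_A = {va:.0f} V, r_o ~ {r_o / 1e3:.0f} kOhm")
```

Under this assumption the output resistance scales directly with the Early-voltage, so at equal bias current the "analog" transistor offers 75/20 = 3.75 times the output resistance of the "digital" one.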



Fig.3 Trade-off between Early-voltage, drive capability $j_K$ and gate-delay time $t_{del}$.

Fig.3 sketches the trade-off between Early-voltage, drive capability and the speed of an unloaded CML stage. The drive capability per unit area is described by the Kirk-current density $j_K$, which marks the 20% roll-off point of the cut-off frequency. The intermediate data points are experimental values obtained with a selfaligned pedestal implant of doubly ionized phosphorus through the emitter window. The figure shows that a high Early-voltage has to be paid for with both decreased speed and decreased drive capability. The degradation is more severe for the drive capability than for the delay time of the unloaded gate: the former is directly related to the Early-voltage via the sub-collector doping, whereas the minimum gate delay time depends more weakly on it because other quantities such as base resistance and junction capacitances also contribute.

| device | quantity                   | unit  | value |
|--------|----------------------------|-------|-------|
| BJT    | Aeff                       | µm²   | 2.1   |
|        | β                          | 1     | 100   |
|        | RS                         | Ω     | 280   |
|        | RC                         | Ω     | 66    |
|        | fT                         | GHz   | 10    |
|        | CJE                        | fF    | 17.5  |
|        | CJC                        | fF    | 10.3  |
|        | CJS                        | fF    | 60    |
|        | VEBO                       | V     | 3.5   |
|        | VBCO                       | V     | 12    |
|        | VCEO                       | V     | 6.5   |
| NMOS   | L                          | µm    | 0.8   |
|        | $I_D(V_{GS} = V_{DS} = 5V)$ | µA/µm | 380   |
|        | VT                         | V     | 0.74  |
| PMOS   | L                          | µm    | 0.8   |
|        | VT                         | V     | 0.9   |
|        | $I_D(V_{GS} = V_{DS} = 5V)$ | µA/µm | 190   |

Fig.4 Summary of the NMOS, PMOS and BJT device parameters

Thus raising the Early-voltage to a value of 75V means a speed degradation by a factor of 1.7 and a reduction in drive capability by a factor of 2.5 (see Fig.3).

Following the well formation, the MOS devices are formed with a drawn gate length of 0.8 microns. For improved hot-electron hardness an LDD implantation scheme is applied to the NMOS transistor. Then the base-emitter structure is realized using a double-polysilicon selfalignment scheme [3,4,5]. Finally a two-layer metal process module is applied. A third metal layer is optionally available, particularly for complex CML and ECL functions. The respective device data are summarized in Fig.4 for the BJT, the NMOS and the PMOS transistor. The cross sections of the active B5C devices obtained by SEM analysis are given in Fig.5.



Fig.6 CML gate delay time vs power consumption per stage



Fig.5 SEM analysis of the B5C devices (panels: PMOST, NMOST, BJT)

### 3. Basic Circuit-Evaluation

The basic performance data were determined via ringoscillators with CMOS, BICMOS (totempole) and pure BIPOLAR stages. The gate delay times of the latter stages are given in Fig.6. For a 1.0 × 3.0 µm² drawn emitter size the minimum gate delay time is 65ps at 600µA/gate. The low power performance for the same transistor layout is 200ps at 50µA/gate. For higher speeds larger transistors have to be used, e.g. with 0.8 × 20 µm² emitter size. Owing to the higher drive capability and the lower base resistance of the narrower 0.8 micron emitter stripe, which results in an effective emitter width of 0.4 micron, the gate delay time improves further to 50ps. The delay time of mixed 3-input NAND/NOR gates realized in pure CMOS is 360ps.
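As a minimal sketch of how a stage delay is commonly extracted from a ringoscillator, the example below assumes the standard relation t_d = 1/(2·N·f), with N the number of stages and f the oscillation frequency. The stage count and frequency are hypothetical, since the paper does not give them. The second helper multiplies current and delay for the two operating points quoted above; multiplying this product by the supply voltage, which is not stated here, would give the power-delay product.

```python
# Sketch: extracting the stage delay from a ring oscillator measurement.
# Assumption: the standard relation t_d = 1 / (2 * N * f) applies.


def stage_delay(n_stages: int, frequency_hz: float) -> float:
    """Return the average delay per stage in seconds."""
    if n_stages <= 0 or frequency_hz <= 0:
        raise ValueError("stage count and frequency must be positive")
    return 1.0 / (2.0 * n_stages * frequency_hz)


def current_delay_product(current_a: float, delay_s: float) -> float:
    """Return current times delay in coulombs."""
    return current_a * delay_s


# Hypothetical measurement: a 31-stage ring oscillating at 248 MHz.
t_d = stage_delay(31, 248e6)
print(f"stage delay ~ {t_d * 1e12:.0f} ps")

# Operating points (current per gate, gate delay) quoted in the text.
OPERATING_POINTS = [(600e-6, 65e-12), (50e-6, 200e-12)]
for current, delay in OPERATING_POINTS:
    q = current_delay_product(current, delay)
    print(f"{current * 1e6:.0f} uA, {delay * 1e12:.0f} ps -> {q * 1e15:.0f} fC")
```

For the quoted operating points the current-delay product falls from 39 fC at the speed-optimized point to 10 fC at the low power point, which is the usual trade-off along a delay versus power curve such as Fig.6.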

Fig.7 shows the load lines of equally sized CMOS, BICMOS totempole and pure BIPOLAR CML driving stages. The break-even point for the use of BICMOS totempoles instead of single stage CMOS drivers is approximately 0.5pF. With respect to the load dependence, the technologies CMOS:BICMOS:BIPOLAR relate as 6:3:1. This means that for high capacitive loading the BICMOS totempole is a factor of 2 faster than the pure CMOS stage, and the BIPOLAR stage is a factor of 6 faster than CMOS and a factor of 3 faster than a BICMOS totempole. For CML embedded in a CMOS environment, the delay needed to convert CML levels to CMOS voltage swings has of course to be taken into account. This takes approximately 1ns.
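The load-line comparison can be sketched with a linear model t = t0 + k·C_L. Only the slope ratio 6:3:1 and the 0.5pF break-even point are taken from the text. The absolute slope and the intrinsic delays t0 are hypothetical placeholders, chosen so that the CMOS/BICMOS crossover lands at 0.5pF. The level-conversion delay of embedded CML is not included.

```python
# Sketch: linear load-line model t(C_L) = t0 + slope * C_L.
# Slope ratio 6:3:1 and the 0.5 pF break-even come from the text;
# all absolute numbers below are hypothetical.
from dataclasses import dataclass


@dataclass(frozen=True)
class Driver:
    name: str
    t0_ps: float            # intrinsic (unloaded) delay in ps
    slope_ps_per_pf: float  # load sensitivity in ps/pF

    def delay_ps(self, load_pf: float) -> float:
        return self.t0_ps + self.slope_ps_per_pf * load_pf


def break_even_pf(a: Driver, b: Driver) -> float:
    """Return the load at which both drivers have equal delay."""
    return (b.t0_ps - a.t0_ps) / (a.slope_ps_per_pf - b.slope_ps_per_pf)


UNIT_SLOPE = 200.0  # hypothetical ps/pF for the BIPOLAR stage

cmos = Driver("CMOS", 200.0, 6 * UNIT_SLOPE)
bicmos = Driver("BICMOS", 500.0, 3 * UNIT_SLOPE)
bipolar = Driver("BIPOLAR", 100.0, 1 * UNIT_SLOPE)

print(f"CMOS/BICMOS break-even: {break_even_pf(cmos, bicmos):.2f} pF")
for load in (0.1, 1.0, 10.0):
    delays = ", ".join(
        f"{d.name} {d.delay_ps(load):.0f} ps" for d in (cmos, bicmos, bipolar)
    )
    print(f"C_L = {load} pF: {delays}")
```

In this model the delay ratios only approach 6:3:1 when the load term dominates the intrinsic delay, which is why the text restricts the statement to high capacitive loading.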



Fig.7 Loadlines of CMOS, BICMOS and BIPOLAR driving stages

### 4. Demonstrators

To demonstrate the performance of B5C beyond ringoscillator data, which give only a first impression of the performance of a technology, larger demonstrators have been designed, realized and evaluated. One example is a 16k SRAM [6] with a CMOS memory core and ECL peripherals. The chip micrograph of the SRAM is given in Fig.8.



Fig.8 Chip micrograph of the 16k SRAM

Fig.9 Input and output waveforms of the 16k SRAM (marked delay: 3.8 ns)

The measured input/output waveform for a read operation is depicted in Fig.9, where a readout time of 3.8ns was obtained. Thus, owing to the combination of the fast B5C devices with a well-balanced CMOS/ECL circuit mix, by far the fastest address access time for a 1.0 micron technology was obtained.

### 5. Conclusions

This paper described a versatile BICMOS process for high speed/high density applications. Gate delay times of 50ps for CML stages and an address access time of 3.8ns for a 16k SRAM show that B5C covers the whole performance regime of both high speed BIPOLAR and high density CMOS on one single chip. It was demonstrated experimentally that, with one extra masking step, good analog performance is achievable even in a high performance BICMOS environment.

### References

- [1] H. Klose, B. Zehner, A. Wieder, "BICMOS, a Technology for High Speed/High Density ICs", Proc. 1989 IEEE Int. Conf. on Comp. Design, ICCD'89, pp. 304-309.
- [2] H. Klose, T. Meister, B. Hoffmann, H. Kabza, J. Weng, "Well-optimization for high speed BICMOS Technologies", ESSDERC, 1988.
- [3] H. Kabza et al., "A 1um-Polysilicon Selfaligning Bipolar Process for Low-Power/High Speed Circuits", IEEE Electron Device Lett., Vol. 10, pp. 344-346.
- [4] Y. Kobayashi, C. Yamaguchi, Y. Amenuya and T. Sakai, "High performance LSI process technology: SST CBI-CMOS", Proceedings IEDM, San Francisco 1988, pp. 760-763.
- [5] H. Klose, T. Meister, B. Hoffmann, "Low Cost and High Performance BICMOS Processes: A Comparison", 1989 IEEE Bipolar Circuits and Technology Meeting, pp. 178-181.
- [6] W. Heimsch, R. Krebs, B. Pfaeffel, K. Ziemann, "A 3.8ns 16k BICMOS SRAM", IEEE J. Solid-State Circuits, Vol. 25, No. 1, pp. 48-54.