# A 500ps/8.5ns Array Read/Write Latency 1Mb Twin 1T1MTJ STT-MRAM designed in 90nm CMOS/40nm MTJ Process with Novel Positive Feedback S/A Circuit

T. Ohsawa<sup>1, 2</sup>, S. Miura<sup>3</sup>, H. Honjo<sup>3</sup>, S. Ikeda<sup>1, 4</sup>, T. Hanyu<sup>1, 4</sup>, H. Ohno<sup>1, 4</sup> and T. Endoh<sup>1, 2, 5</sup>

<sup>1</sup> Center for Innovative Integrated Electronic Systems, Tohoku University, <sup>2</sup> JST, ACCEL

Aramaki Aza-Aoba 468-1, Aoba-ku, Sendai, 980-0845 Japan; Phone: +81-22-796-3424 E-mail: t-ohsawa@cies.tohoku.ac.jp

<sup>3</sup> Green Platform Research Laboratories, NEC Corporation; Miyukigaoka 34, Tsukuba, Ibaraki, 305-8501 Japan

<sup>4</sup> Research Institute of Electrical Communication, Tohoku University; Katahira 2-1-1, Aoba-ku, Sendai, 980-8577 Japan

<sup>5</sup> Graduate School of Engineering, Tohoku University; Aramaki Aza-Aoba 468-1, Aoba-ku, Sendai, 980-0845 Japan

## Abstract

A positive feedback S/A is proposed that can achieve 500ps array read latency for 1Mb twin 1T1MTJ STT-MRAM designed in 90nm CMOS/40nm MTJ under 8.5ns write latency. This S/A also relaxes the requirement on MTJ's thermal stability factor  $\Delta = E/k_bT$  to ' $\Delta > 70$ ' from ' $\Delta > 90$ ' in the conventional S/A estimated with the condition of 1FIT error rate in 10 years for 256Mb array with the same design rule.

# 1. Introduction

Static power increase in cache memories is one of the bottlenecks in improving computer performance. It is expected that high-performance nonvolatile (NV) memories using spin-transfer-torque magnetic tunnel junctions (STT-MTJs) will be applied to the cache to reduce the static power drastically. There have been proposed several differential pair type STT-MRAM cells for performance-oriented applications [1]-[3]. However, their sizes are large for the high-density applications. A smaller twin 1T1MTJ cell was proposed [4]. However, the access time (4ns) is not fast enough. In this paper, we propose a S/A with positive feedback loop to latch the sensing nodes very fast. This scheme also provides substantially read disturb-free data sensing.

# 2. Cell and S/A Design

Fig.1 shows the proposed S/A scheme compared with the conventional one [5] for twin 1T1MTJ cell. The conventional sensing scheme relies on a differential amplifier such as a current mirror type, the sensing of which is rather slow. On the other hand, the proposed S/A can latch a pair of sensing nodes very fast, because the sensing is performed in a positive feedback loop that is formed in the current path.

Fig. 2 is the actual implementation of the cell and the S/A into the 2kb unit (256 rows by 8 columns). A S/A and a DQ buffer (DQB) are shared among eight columns. The unit is repeated 16 times in the row direction to make a 64kb cell array. Fig. 3 shows a 1Mb twin 1T1MTJ STT-MRAM macro consisting of sixteen 64kb cell arrays.

Fig. 4 shows the array read latency simulated for the 1Mb twin 1T1MTJ STT-MRAM with the proposed positive feedback S/A that is compared with the conventional current mirror S/A. The signal development times in which the voltage difference between a pair of bit lines (BL and BBL) is amplified to 80mV are compared. The proposed S/A can develop the signal in 500ps, while the conventional S/A takes 2.1ns for the signal amplification. Thus the positive feedback sensing scheme is shown to be effective in

high-speed signal sensing for twin 1T1MTJ STT-MRAM.

The scaling trend of MTJ switching time in the twin 1T1MTJ cell is shown in Fig. 5. In the simulations, the channel width and the gate length of the PFET selective device is kept constant at  $0.64\mu$ m and  $0.1\mu$ m, respectively. The cell can be switched in 8.5ns for 40nm size MTJ.

## 3. Read Disturb Failure Rate

Fig. 6 shows the simulated currents flowing through a pair of MTJs in a cell for the both sensing schemes. It is shown that the disturb current is suddenly attenuated due to the positive feedback loop in the proposed S/A, while the read disturb current continues to flow in the conventional sensing scheme. The read disturb-failure rate of 256Mb twin 1T1MTJ STT-MRAM with 512b parallel read using the proposed sensing scheme is shown in Fig. 7 as a function of the MTJ's energy barrier normalized by thermal energy  $E/k_bT$  compared with the retention-failure rate. The read disturb-failure rate is shown smaller than the retention-failure rate. On the other hand, the failure of the STT-MRAM with the conventional S/A is limited by read disturb and ' $E/k_bT > 90$ ' is required for realizing the macro whose failure rate is less than a few FIT. By adopting the proposed S/A, the required energy barrier is limited by retention and relaxed to ' $E/k_bT > 70$ ', which falls within a realizable range [6].

The 512b parallel write current of 256Mb twin 1T1MTJ STT-MRAM using the architecture was simulated to compare with 6TSRAMs as shown in Table I. It is much smaller than low operating power (LOP) SRAM and even smaller than low standby power (LSTP) one, due to the elimination of static current in cells.

### 4. Chip Features

Fig.8 shows the microphotograph of the 1Mb twin 1T1MTJ STT-MRAM measuring 1.4mm<sup>2</sup>, being 59% reduction from the 4T2MTJ counterpart [3] due to the cell size reduction and the elimination of PL drivers used for the power gating in the 4T2MTJ STT-MRAM. Fig. 9 is the operational waveforms of the chip. Table II summarizes the features of the 1Mb twin 1T1MTJ STT-MRAM.

### Acknowledgments

This work was supported in part by JSPS through FIRST program and by JST, ACCEL.

### References

- [1] S. Yamamoto and S. Sugahara, JJAP, 48 (2009) 043001.
- [2] K. Abe et al., SSDM, pp. 1144-1145, 2010.
- [3] T. Ohsawa et al., IEEE J. Solid-State Circuits, 48 (2013) 1511.
- [4] H. Noguchi et al., Symp. VLSI Circuits, pp. C108-C109, 2013.
- [5] J. -H. Song et al., Int. J. Circ. Theor. Appl. 39 (2011) 313.
- [6] K. Miura et al., Symp. VLSI Technology, pp. 214-215, 2011.





(b) Proposed S/A (positive feedback type)







Fig.3 1Mb twin 1T1MTJ STT-MRAM macro using the 2kb unit in Fig. 2.







Fig.9 Measured operational waveform of 1Mb twin 1T1MTJ STT-MRAM.









Fig.5 Scaling trend of MTJ switching time for the 1T1MTJ cell (PFET selective

respectively.

device size is unchanged at W/Lg= 0.64µm/0.1µm). P and AP stand for

parallel state and anti-parallel state,

0.98mm

1.44mm

Fig.4 Array read latency of the proposed S/A under the same condition as the conventional one.



Fig.7 256Mb memory failure rates in 10 years use. The parameters shown at the bottom of the graph are used to calculate the failure

rates by the equation at the top of the graph.

1.40mm<sup>2</sup> Fig.8 Photograph of fabricated 1Mb twin 1T1MTJ STT-MRAM macro.

Table II Features of 1Mb twin 1T1MT STT-MRAM.

| *                    |                                              |                    |                   |       |  |
|----------------------|----------------------------------------------|--------------------|-------------------|-------|--|
|                      |                                              | Twin               | 6TSRAM            |       |  |
|                      |                                              | 1T1MTJ<br>STT-MRAM | LOP               | LSTP  |  |
| Power supply voltage |                                              | 0.9 V              | 0.9V              |       |  |
|                      | Total Current/256Mbit<br>(Write cycle @15ns) | 77mA               | 1.46A             | 138mA |  |
| Components           | Write Current/512bit<br>Write Current/cell   | 70mA<br>136µA      | 0.18mA<br>0.36 μA |       |  |
|                      | Array Control Current/512bit                 | 7mA                | 4mA               |       |  |
|                      | Static Current/256Mbit                       | 0                  | 1.46A             | 134mA |  |

Table I 512b write current of 256Mb twin 1T1MTJ

STT-MRAM compared with 6T SRAMs.

|                 | 1Mb twin 1T1MTJ STT-MRAM                |  |  |
|-----------------|-----------------------------------------|--|--|
| Process         | 90nm CMOS (Lg=100nm) + MTJ              |  |  |
| Cell size       | 0.88µm <sup>2</sup>                     |  |  |
| Macro size      | 1.40mm <sup>2</sup> /1Mb                |  |  |
| Cell efficiency | 66%                                     |  |  |
| Supply voltage  | 1.0V                                    |  |  |
| Organization    | 64k word x 16bit                        |  |  |
| Access time     | 400ps (V <sub>dd</sub> =1.0V,RT)        |  |  |
| Read current    | 52mA (V <sub>dd</sub> =1.0V, 2ns cycle) |  |  |
| Write current   | 23mA (V <sub>dd</sub> =1.0V, 5ns cycle) |  |  |
|                 |                                         |  |  |