Chikayoshi Kamata
Toshiba
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Chikayoshi Kamata.
international electron devices meeting | 2012
Eiji Kitagawa; Shinobu Fujita; Kumiko Nomura; Hiroki Noguchi; Keiko Abe; Kazutaka Ikegami; Tadaomi Daibou; Y. Kato; Chikayoshi Kamata; Saori Kashiwada; Naoharu Shimomura; Junichi Ito; H. Yoda
We demonstrated lower power consumption of mobile CPU by replacing high-performance (HP)-SRAMs with spin transfer torque (STT)-MRAMs using perpendicular (p)-MTJ. The key points that enable the low power consumption are adapting run time power gating architecture (shown in Fig. 1), and satisfying both fast and low-power writing, namely, 3 nsec and 0.09 pJ, of p-MTJ cell (shown in Fig. 3). As shown in Table 1, only our developed p-MTJ has achieved 3 nsec, 0.09 pJ. Thanks to the fast and low-power p-MTJ, the power consumption of cache memory could be reduced by over 80% without degradation of performance.
international electron devices meeting | 2014
Kazutaka Ikegami; Hiroki Noguchi; Chikayoshi Kamata; Minoru Amano; Keiko Abe; Keiichi Kushida; Eiji Kitagawa; T. Ochiai; Naoharu Shimomura; Shogo Itai; Daisuke Saida; Chika Tanaka; Atsushi Kawasumi; Hiroyuki Hara; Junichi Ito; Shinobu Fujita
Since it has been difficult to increase clock frequency of processors due to power budget, there is a trend toward increase in number of processor cores and cache capacities (Fig. 1) to improve the processor performance. According to this trend, there have been two serious issues on the cache memories. One issue is large leakage power of SRAM-based cache (Ex. About 80% of average processor power in a mobile usage case [1]). Another one is large memory area of SRAM especially for last level cache (LLC) like L4 cache. Recently, eDRAM is used to reduce memory area for LLC (Fig. 1). However, gate length of eDRAM is difficult to be reduced less than 40-50 nm, and its power is not small due to frequent refresh (retention time ~ 100μs.). To reduce the cache power and decrease memory area further at the same time, advanced STT-MRAM based cache has been considered promising from theoretical analysis [2]. However, both low power and high density LLC have not been ever clarified based on a realistic MTJ (magnetic tunneling junction) integration and circuit design. This paper presents solutions for the power and memory density with more advanced STT-MRAM cell technologies by low-temperature process development and novel cache memory architecture based circuit design.
symposium on vlsi technology | 2014
Kazutaka Ikegami; Hiroki Noguchi; Chikayoshi Kamata; Minoru Amano; Keiko Abe; Keiichi Kushida; Eiji Kitagawa; T. Ochiai; Naoharu Shimomura; Atsushi Kawasumi; Hiroyuki Hara; Junichi Ito; Shinobu Fujita
We evaluated embedded perpendicular spin transfer torque magnetic random access memory (STT-MRAM) performance fabricated by magnetic tunnel junction (MTJ) -“Last process”, which is able to expand material and structural design space of CMOS and MTJ, by SPICE simulation and test chip measurement. By the post-layout simulation, we show that the delay increase by parasitics, which originates from fabricating MTJ on the upper metal layer is below 50ps and negligible for most applications. And from the test chip measurement, we demonstrated switching operation as fast as 4ns, below 1V for STT-MRAM.
international memory workshop | 2013
Atsushi Kawasumi; Keiichi Kushida; Hiroyuki Hara; Y. Unekawa; Keiko Abe; Kazutaka Ikegami; Hiroki Noguchi; Eiji Kitagawa; Chikayoshi Kamata; Saori Kashiwada; Y. Kato; Daisuke Saida; Naoharu Shimomura; Junichi Ito; Shinobu Fujita
Circuit techniques for energy-efficient STT MRAM, which is suitable for replacing SRAM L2 cache memories, are proposed. The waking-up from the power-down mode without any cycle penalties becomes possible by eliminating the voltage generator even at higher frequency than 100MHz. The read current variation caused by the generator elimination is mitigated by 50% using the adaptive pulse-driven read current control. The cross-coupled hierarchical switch reduces the unneeded read current by 66% and enhances the read margin.
IEEE Transactions on Magnetics | 2014
Daisuke Saida; Naoharu Shimomura; Eiji Kitagawa; Chikayoshi Kamata; Megumi Yakabe; Yuichi Osawa; Shinobu Fujita; Junichi Ito
The efficiency of the power reduction in a normally off mobile processor employing spin-transfer torque magnetoresistive random access memory-based cache memory strongly depends on the write charge, which is the value of the write pulsewidth multiplied by the write current. We studied the spin-transfer switching probability (Psw) in a perpendicular magnetic tunnel junction (p-MTJ) with the aim of achieving high-speed switching of less than a few nanoseconds at a low current below 100 μA. High-speed switching faster than 1 ns was observed at a write current below 100 μA. To understand the observed pulsewidth dependence of Psw, analysis using a macrospin model based on a dynamic switching was carried out. The analysis reproduced the nature of the Psw distributions quantitatively. In addition, the developed analysis suggested the effectiveness of miniaturizing the storage layer to reduce the switching current. The p-MTJ operated continuously without any error at a write pulsewidth of 2 ns and current of 49 μA.
international electron devices meeting | 2015
Kazutaka Ikegami; Hiroki Noguchi; Satoshi Takaya; Chikayoshi Kamata; Minoru Amano; Keiko Abe; Keiichi Kushida; Eiji Kitagawa; T. Ochiai; Naoharu Shimomura; Daisuke Saida; Atsushi Kawasumi; Hiroyuki Hara; Junichi Ito; Shinobu Fujita
MTJ-based cache memory is expected to reduce processor power significantly. However, write energy increases rapidly for high speed operation and considered not suited for lower level cache memory. In this work, we have developed L2 and L3 cache memory based on thermal stability factor engineered pMTJ with 2T-2MTJ and 1T-1MTJ memory cell and novel error handling scheme. These techniques reduce 75% energy with 2% performance overhead compared to SRAM-based l2 and L3 cache memory.
Archive | 2009
Takayuki Tsukamoto; Kohichi Kubo; Chikayoshi Kamata; Takahiro Hirai; Shinya Aoki; Toshiro Hiraoka
Archive | 2014
Shigeki Takahashi; Yuichi Ohsawa; Junichi Ito; Chikayoshi Kamata; Saori Kashiwada; Minoru Amano; Hiroaki Yoda
symposium on vlsi circuits | 2013
Hiroki Noguchi; Keiichi Kushida; Kazutaka Ikegami; Keiko Abe; Eiji Kitagawa; Saori Kashiwada; Chikayoshi Kamata; Atsushi Kawasumi; Hiroyuki Hara; Shinobu Fujita
Archive | 2013
Mitsuru Sato; Kohichi Kubo; Chikayoshi Kamata; Noriko Bota