Journal: IEEE Micro

Volume 45, Issue 6

4 -- 5Hsien-Hsin S. Lee. A 33-GW Pilgrimage to the Promised Land of Agent Workforce
6 -- 7Wonil Choi, Jie Zhang 0048. Special Issue on Cache Coherent Interconnects and Resource Disaggregation Techniques
8 -- 15Jinyeong Lim, Juncheol Ye, Jaehong Kim 0002, Hwijoon Lim, Hyunho Yeo, Junhyeok Jang, Myoungsoo Jung, Dongsu Han. Efficient Disaggregated Cloud Storage for Cold Videos With Neural Enhancement
16 -- 23Daegyu Han, Sungho Moon, Kyeungpyo Kim, Sung-Soon Park 0001, Beomseok Nam. Improving Remote File Access in Distributed Object Stores by Decoupling Metadata and Data Paths Using NVMe-oF
24 -- 35Miryeong Kwon, Donghyun Gouk, Eunjee Na, Jiseon Kim, Junhee Kim, Hyein Woo, Eojin Ryu, Hyunkyu Choi, Jinwoo Baek, Hanyeoreum Bae, Mahmut T. Kandemir, Myoungsoo Jung. Containerized In-Storage Processing and Computing-Enabled Solid-State Drive Disaggregation
36 -- 45Dongsuk Oh, Miryeong Kwon, Jiseon Kim, Eunjee Na, Junseok Moon, Hyunkyu Choi, Seonghyeon Jang, Hanjin Choi, Hongjoo Jung, Sangwon Lee 0014, Myoungsoo Jung. Compute Express Link Topology-Aware and Expander-Driven Prefetching: Unlocking Solid-State Drive Performance
46 -- 55Miryeong Kwon, Donghyun Gouk, Junhyeok Jang, Jinwoo Baek, Hyunwoo You, Sangyoon Ji, Hongjoo Jung, Junseok Moon, Seungkwan Kang, Seungjun Lee, Myoungsoo Jung. From Block to Byte: Transforming PCIe Solid-State Devices With Compute Express Link Memory Protocol and Instruction Annotation
56 -- 64Xiran Yang, Yifei Yu, Chuandong Li 0004, Jianqiang Zeng, Ke Zhou 0001, Diyu Zhou, Xiaolin Wang 0001, Zhenlin Wang 0003, Yingwei Luo. Ginkgo: A Learned-Index Enhanced Tiered Memory System
65 -- 72Zhihao Zhang, Weinan Liu, Zhenlong Song, Xinbiao Gan, Yue Yu 0001, Yiming Zhang 0003. Tiered Cache-Sharing Service for Virtual Machine Images Based on Memory Pool
73 -- 81Jaeyung Jun, HyunWoong Ahn, Joohee Lee, Jungmin Choi, Byungil Koh, Donguk Moon, Hoshik Kim. Improving SQL Join Algorithms for Distributed Systems: A Case Study of Compute Express Link-Based Multihost Shared Memory
82 -- 90Junhyeok Park 0002, Chang Gyu Lee, Soon Hwang, Seung-Jun Cha, Woosuk Chung, Youngjae Kim 0001. Maximizing Interconnect Bandwidth and Efficiency in Nonvolatile Memory, Express-Based Key-Value Solid-State Devices With Fine-Grained Value Transfer
91 -- 99Heetaek Jeong, Kanghyun Choi, Hamin Jang, Dongup Kwon, Eunjin Baek, Pyeongsu Park, Jangwoo Kim. 2: A Fast, Scalable, and Flexible Switching System for Emerging Interconnects
100 -- 107Derrick Quinn, Neel Patel, Mohammad Alian. Compute-Enabled CXL Memory Expansion for Efficient Retrieval Augmented Generation
108 -- 117Donghyun Gouk, Seungkwan Kang, Seungjun Lee, Jiseon Kim, Kyungkuk Nam, Eojin Ryu, Sangwon Lee 0014, Dongpyung Kim, Junhyeok Jang, Hanyeoreum Bae, Myoungsoo Jung. CXL-GPU: Pushing GPU Memory Boundaries with the Integration of CXL Technologies
119 -- 123Joshua J. Yi. A Review of Wisconsin Alumni Research Foundation v. Apple - Part VII
124 -- 126Shane Greenstein. Private Returns on Technology Adoption

Volume 45, Issue 5

4 -- 5Hsien-Hsin S. Lee. United Semiconductors of America
6 -- 8Chris Wilkerson. Special Issue on Contemporary Industry Products 2025
9 -- 19Ian Schneider, Hui Xu, Stephan Benecke, David A. Patterson 0001, Keguo Huang, Parthasarathy Ranganathan, Cooper Elsworth. An Introduction to Life-Cycle Emissions of Artificial Intelligence Hardware
20 -- 29Heng Liao, Bingyang Liu, Xianping Chen, Zhigang Guo, Chuanning Cheng, Jianbing Wang, Xiangyu Chen 0004, Peng Dong, Rui Meng, Wenjie Liu, Zhe Zhou 0002, Ziyang Zhang, Yuhang Gai, Cunle Qian, Yi Xiong, Zhongwu Cheng, Jing Xia, Yuli Ma, Xi Chen, Wenhua Du, Shizhong Xiao, Chungang Li, Yong Qin, Liudong Xiong, Zhou Yu, Lv Chen, Lei Chen, Buyun Wang, Pei Wu, Junen Gao, Xiaochu Li, Jian He, Shizhuan Yan, Bill McColl. UB-Mesh: A Hierarchically Localized nD-FullMesh Data Center Network Architecture
30 -- 42Satyam Srivastava, Akhil Arunkumar, Nithesh kurella, Amrit Panda, Gaurav Jain, Purushotham Kamath, Mark Wutzke, Arun Tiruvur, Mayank Mike Gupta, Ilya Soloveychik, Vamsi Darsi, Malav Dalal, Vinayak Patankar, Sasidhar Dudyala, Senthil Duraisamy, Santhosh Ramchandran, Raghav Venkatasubramanian, Yuwei Qin, Xin Wang, Jayaprakash Balachandran, Ali Murat Gok, Piotr Wojciechowski, Saliya Ekanayake, Chris Ng, Ranju Sarma, Shubhankit Rathore, Tristan Trouwen, Siwei Zhuang, Chris J. Nicol, Sudeep Bhoja. Corsair: An In-Memory Computing Chiplet Architecture for Inference-Time Compute Acceleration
43 -- 55Heetaek Jeong, Wonsik Lee, Eunjin Baek, Changsu Kim 0004, Changyeon Jo, Dongju Chae, Kanghyun Choi, Hamin Jang, Mohamed A. Elgammal, Sungmin Hong, Eriko Nurvitadhi, Dongup Kwon, Jangwoo Kim. MangoBoost Alice: Extremely Fast, Seamless, and Versatile FPGA-Accelerated DPU Solutions
56 -- 66Darshan Gandhi, Pushkar Nandkar, Nasim Farahini, Håkan Zeffer, John Long, Samuel Rydh, Matheen Musaddiq, Tuowen Zhao, Joshua Brot, Reid Goodbar, Yun Du, Mingran Wang, Raghu Prabhakar. Speculative Decoding on the SN40L Reconfigurable Dataflow Unit
67 -- 78Gurpreet Singh Kalsi, Hong Wang, Jason Howard, Joshua B. Fryman, Fabrizio Petrini, Daniel S. Klowden, Sanjaya Tayal, Anil Rao, Steve Pawlowski. BiFrost: A Composable, Resilient Interconnect Network Architecture for Scalable Artificial Intelligence Systems
79 -- 93Amin Firoozshahian, Joel Coburn, Ajit Punj, Aravind Sukumaran-Rajam, Colby Boyer, Rakesh Nattoji, Mahima Bathla, Bob Dreyer, Sujith Srinivasan, Harshitha Pilla, Michael Rotzin, Surendra Rajupalem, K. Rajesh Jagannath, Krishna Noru, Harikrishna Reddy, Chris Yang, Charlie Hong-Men Su, Charlie Cheng. A Comparative Analysis of Loosely and Tightly Coupled Accelerator Architectures for Machine Learning
94 -- 102Jianping Zeng 0001, Shuyi Pei, Da Zhang, Yuchen Zhou 0005, Amir Beygi, Xuebin Yao, Ramdas Kachare, Tong Zhang, Zongwang Li, Marie Nguyen, Rekha Pitchumani, Yang Soek Ki, Changhee Jung. Performance Characterizations and Usage Guidelines of Samsung CMM-H
103 -- 115Yejin Lee 0001, Alicia Golden, Anna Y. Sun, Basil Hosmer, Bilge Acun, Can Balioglu, Changhan Wang, Charles David Hernandez, Christian Puhrsch, Daniel Haziza, Driss Guessous, Francisco Massa, Jacob Kahn, Jeffrey Wan, Jeremy Reizenstein, Jiaqi Zhai, Joe Isaacson, Joel Schlosser, Juan Pino 0001, Kaushik Ram Sadagopan, Leonid Shamis, Linjian Ma, Min-Jae Hwang, Mingda Chen, Mostafa Elhoushi, Pedro Rodríguez 0001, Ram Pasunuru, Samuel Hsia, Scott Yih, Sravya Popuri, Xing Liu, Carole-Jean Wu. Characterizing and Efficiently Accelerating Multimodal Generation Model Inference
116 -- 125Manu Shamsa, Damion Searls, Samik Biswas, Steve Wong, Mariusz Oriol, Mateusz Duchalski, Rajesh Poornachandran, Jason Dutra, Felix Leung, Zoey Sun, Sam Whitlock, Pulkit A. Misra. Unlocking Hardware Potential With Usage Meters
127 -- 132Gadi Singer. Exploring Trends to Ride Artificial Intelligence's Explosive Growth Trajectory
134 -- 137Joshua J. Yi. A Review of Wisconsin Alumni Research Foundation v. Apple - Part VI
138 -- 140Shane Greenstein. Wireless Commercial Breakthrough

Volume 45, Issue 4

4 -- 5Hsien-Hsin S. Lee. Intelligence for Sale
6 -- 10Jun Yang 0002, Xulong Tang. Special Issue on Top Picks From the 2024 Computer Architecture Conferences
11 -- 18Lieven Eeckhout. Assessing Processor Sustainability Using the First-Order FOCAL Carbon Model
19 -- 28Jaylen Wang, Daniel S. Berger, Fiodar Kazhamiaka, Celine Irvene, Chaojie Zhang, Esha Choukse, Kali Frost, Rodrigo Fonseca, Brijesh Warrier, Chetan Bansal, Jonathan Stern, Ricardo Bianchini, Akshitha Sriraman. Enabling Sustainable Cloud Computing With Low-Carbon Server Design
29 -- 36Yuqi Xue, Yiqi Liu, Lifeng Nai, Jian Huang 0006. Hardware-Assisted Virtualization of Neural Processing Units for Cloud Platforms
37 -- 43Shuangliang David Chen, Saptadeep Pal, Rakesh Kumar 0002. Waferscale Network Switches
44 -- 53Nandeeka Nayak, Toluwanimi O. Odemuyiwa, Xinrui Wu, Michael Pellauer, Joel S. Emer, Christopher W. Fletcher. From TeAAL to FuseMax: Separation of Concerns for Attention Accelerator Design
54 -- 59Esha Choukse, Pratyush Patel, Chaojie Zhang, Aashaka Shah, Íñigo Goiri, Saeed Maleki, Rodrigo Fonseca, Ricardo Bianchini. Splitwise: Efficient Generative LLM Inference Using Phase Splitting
60 -- 71Alexander Shypula, Aman Madaan, Yimeng Zeng, Uri Alon 0002, Jacob R. Gardner, Milad Hashemi, Graham Neubig, Parthasarathy Ranganathan, Osbert Bastani, Amir Yazdanbakhsh. Automated High-Level Code Optimization for Warehouse Performance
72 -- 78Jaime Roelandts, Ajeya Naithani, Sam Ainsworth 0001, Timothy M. Jones 0001, Lieven Eeckhout. Scalar Vector Runahead: Removing the Shackles of Indirect Memory Chains on In-Order Cores
79 -- 86Martin Cochet, Karthik Swaminathan, Erik Loscalzo, Joseph Zuckerman, Maico Cassel dos Santos, Davide Giri, Alper Buyuktosunoglu, Tianyu Jia, David Brooks 0001, Gu-Yeon Wei, Kenneth L. Shepard, Luca P. Carloni, Pradip Bose. BlitzCoin: A Decentralized Hardware Solution for Power Management of Highly Heterogeneous Systems on Chip
87 -- 94Rhys Gretsch, Peiyang Song 0002, Advait Madhavan, Jeremy Lau, Timothy Sherwood. Delay-Space Arithmetic and Architecture
95 -- 102Zirui Neil Zhao, Adam Morrison 0001, Christopher W. Fletcher, Josep Torrellas. From Colocation to Exfiltration: Practical Cache Side-Channel Attacks in the Modern Public Cloud
104 -- 109Joshua J. Yi. A Review of Wisconsin Alumni Research Foundation v. Apple - Part V
110 -- 112Shane Greenstein. Prototype Competition and Breakthroughs

Volume 45, Issue 3

4 -- 5Hsien-Hsin S. Lee. Toward Disaggregated and Heterogenous AI Systems
6 -- 7Rob Aitken, Larry Yang. Special Issue on Hot Chips 2024
8 -- 14Gerard Williams, Pradeep Kanapathipillai. Qualcomm Oryon CPU in Snapdragon X Elite: Micro-Architecture and Design
15 -- 21Nadav Bonen, Arik Gihon, Leon Polishuk, Yoni Aizik, Yulia Okunev, Tsvika Kurts, Nithiyanandan Bashyam. Lunar Lake an Intel Mobile Processor: SoC Architecture Overview (2024)
22 -- 30Tomai Knopp, Jeffrey Chu, Sagheer Ahmad. AMD Versal AI Edge Series Gen 2
31 -- 40Michael D. Powell, Patrick Fleming, Venkidesh Iyer Krishna, Naveen Lakkakula, Subhiksha Ravisundar, Praveen Mosur, Arijit Biswas, Pradeep Dubey, Kapil Sood, Andrew Cunningham, Smita Kumar. Intel Xeon 6 Product Family
41 -- 48Alan Smith 0003, Vamsi Krishna Alla. AMD Instinct MI300X: A Generative AI Accelerator and Platform Architecture
49 -- 57Kaifan Wang, Jian Chen, Yinan Xu 0001, Zihao Yu, Wei He, Dan Tang, Ninghui Sun, Yungang Bao. XiangShan: An Open Source Project for High-Performance RISC-V Processors Meeting Industrial-Grade Standards
58 -- 65Kalhan Koul, Zhouhua Xie, Maxwell Strange, Sai Gautham Ravipati, Bo-Wun Cheng, Olivia Hsu, Po-Han Chen, Mark Horowitz, Fredrik Kjolstad, Priyanka Raina. Designing Programmable Accelerators for Sparse Tensor Algebra
66 -- 75Christopher J. Berry, Michael J. Becht, Tim E. Bubb, Howard Haynie, Robert J. Sonnelitter, Katie Seggerman, Jonathan Hsieh, Edward Malley, Mike Cadigan, Susan M. Eickhoff, Matthias Klein, Craig R. Walters, Christian G. Zoellin, Cédric Lichtenau. The IBM Telum II Processor
76 -- 85Younggeun Choi, Junyoung Park, Sang Min Lee 0014, Jeseung Yeon, Minho Kim, Changjae Park, Byeongwook Bae, Hyunmin Jeong, Hanjoon Kim, June Paik, Nuno P. Lopes, Sungjoo Yoo. FuriosaAI RNGD: A Tensor Contraction Processor for Sustainable AI Computing
86 -- 94Antti Rautakoura, Timo Hämäläinen 0001, Ari Kulmala. Three SoCs in Three Years: How to Get Agile
97 -- 102Joshua J. Yi. A Review of Wisconsin Alumni Research Foundation v. Apple - Part IV
103 -- 107Jianming Tong, Zishen Wan. Sipping Matcha of Security: A Fireside Chat With Mengjia Yan
108 -- 110Shane Greenstein. The Scramble After Breakthrough
112 -- 0Gary S. Tyson. Sally A. McKee

Volume 45, Issue 2

4 -- 5Hsien-Hsin S. Lee. Taiwan Semiconductor Manufacturing Company's $165 Billion Bet
6 -- 7Whit Schonbein, Joseph Schuchart. Special Issue on Hot Interconnects 31
8 -- 17Quentin Anthony, Benjamin Michalowicz, Jacob Hatef, Lang Xu, Mustafa Abduljabbar, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda 0001. Understanding and Characterizing Communication Characteristics for Distributed Transformer Models
18 -- 25Weiyang Wang, Manya Ghobadi. Spine-Free Networks for Large Language Model Training
26 -- 35Manjunath Gorentla Venkata, Valentine Petrov, Sergey Lebedev, Devendar Bureddy, Ferrol Aderholdt, Joshua Ladd, Gil Bloch, Mike Dubman, Gilad Shainer. Unified Collective Communication: A Unified Library for CPU, GPU, and DPU Collectives
36 -- 45Tu Tran, Goutham Kalikrishna Reddy Kuncham, Bharath Ramesh 0005, Shulei Xu, Hari Subramoni, Dhabaleswar K. Panda 0001. OHIO: Enhancing RDMA Scalability in Alltoall With Optimized Communication Overlap
46 -- 55Jinsun Yoo, William Won, Meghan Cowan, Nan Jiang, Benjamin Klenk, Srinivas Sridharan 0002, Tushar Krishna. Toward a Standardized Representation for Deep Learning Collective Algorithms
56 -- 64Cristina Olmedilla, Jesús Escudero-Sahuquillo, Pedro Javier García, Francisco J. Quiles 0001, Wenhao Sun, Long Yan, Yunping Lyu, José Duato. ECP: Improving the Accuracy of Congesting-Packets Identification in High-Performance Interconnection Networks
65 -- 66Ryusuke Egawa, Yasutaka Wada. Special Issue on COOL Chips
67 -- 77Jueun Jung, Seungbin Kim, Bokyoung Seo, Wuyoung Jang, Sangho Lee, Jeongmin Shin, Donghyeon Han, Kyuho Jason Lee. A Mobile Semantic Lidar SLAM Processor With Artificial-Intelligence-Based 3-D Perception and Spatiotemporal-Aware Computing
78 -- 89Hoai Luan Pham, Vu Trung Duong Le, Tuan Hai Vu, Van Duy Tran, Van Tinh Nguyen, Thi Diem Tran, Yasuhiko Nakashima. MRCA 2.0: An Area-Optimized Multigrained Reconfigurable Cryptographic Accelerator for Securing Blockchain-Based Internet of Things Systems
90 -- 100Reoma Matsuo, Yuya Degawa, Hidetsugu Irie, Shuichi Sakai, Ryota Shioya. Flexible Approximate Computing for Mitigating Branch Divergence in GPUs
102 -- 113Kyungsoo Lee, Sohyun Kim, Joohee Lee, Donguk Moon, Rakie Kim, Honggyu Kim, Hyeongtak Ji, Yunjeong Mun, Youngpyo Joo. Improving Key-Value Cache Performance With Heterogeneous Memory Tiering: A Case Study of Compute-Express-Link-Based Memory Expansion
114 -- 117Joshua J. Yi. A Review of Wisconsin Alumni Research Foundation v. Apple - Part III
118 -- 120Shane Greenstein. Artificial Intelligence and the Jevons Paradox
122 -- 124Doug Burger, Paul Chow, Joel S. Emer, Mark D. Hill, James C. Hoe, Masato Motomura. Derek Chiou

Volume 45, Issue 1

4 -- 5Hsien-Hsin S. Lee. Rise of the Agentic AI Workforce
6 -- 8Debendra Das Sharma, Nam Sung Kim. Special Issue on Interconnects for Chiplet Integration Technologies
9 -- 15Boyd Phelps, Arif Khan. Disaggregated Designs: Technology Challenges and Enablers
16 -- 25Peter Z. Onufryk, Swadesh Choudhary. UCIe: Standard for an Open Chiplet Ecosystem
26 -- 34Au Huynh, Kent Stahn, Manuel Mota, Christian de Verteuil, Jennifer Pyon, Reza Movahedinia. UCIe Standard: Enhancing Die-to-Die Connectivity in Modern Packaging
35 -- 40Tony Chan Carusone, Dustin Dunwell, Sundeep Gupta, Letizia Giuliano, Adrien Auge, Michael Klempa, Sue Hung Fung. Co-Design of Interchiplet, Package, and System Interconnect Protocols
41 -- 47Wei Tang 0010, Chester Liu, Zhengya Zhang. Energy-Efficient Parallel Interconnects for Chiplet Integration
48 -- 56Durand Jarrett-Amor, Tony Chan Carusone. A Comparison of Single-Ended, NRZ Unidirectional Signaling and Single-Ended, NRZ Simultaneous-Bidirectional Signaling for Die-to-Die Links
57 -- 66Alan Smith 0003, Gabriel H. Loh, Samuel Naffziger, John J. Wuu, Nathan Kalyanasundharam, Eric Chapman, Raja Swaminathan, Tyrone Huang, Wonjun Jung, Alexander Kaganov, Hugh McIntyre, Ramon Mangaser. Interconnect Design for Heterogeneous Integration of Chiplets in the AMD Instinct MI300X Accelerator
67 -- 74Sridhar Muthrasanallur, Yervant Zorian. A Test, Debug, and Silicon Lifecycle Management Architecture for a UCIe-Based Open Chiplet Ecosystem
76 -- 86Thommas K. S. Flores, Ivanovitch Silva, Mariana Azevedo, Thaís Medeiros, Morsinaldo Medeiros, Daniel G. Costa, Paolo Ferrari, Emiliano Sisinni. Advancing Tiny Machine Learning Operations: Robust Model Updates in the Internet of Intelligent Vehicles
87 -- 94Tomasz Szydlo, Marcin Nagy. Management of TinyML-Enabled Internet of Things Devices
95 -- 100Joshua J. Yi. A Review of Wisconsin Alumni Research Foundation v. Apple - Part II
101 -- 103Shane Greenstein. Spillovers, Bottlenecks, and More Invention After Invention
104 -- 112Mariam Elgamal, Yueying Lisa Li. Measuring What Matters: A Fireside Chat With Joel Emer