


default search action
26th HPCA 2020: San Diego, CA, USA
- IEEE International Symposium on High Performance Computer Architecture, HPCA 2020, San Diego, CA, USA, February 22-26, 2020. IEEE 2020, ISBN 978-1-7281-6149-5
- Mohsen Imani, Mohammad Samragh Razlighi, Yeseong Kim, Saransh Gupta, Farinaz Koushanfar
, Tajana Rosing:
Deep Learning Acceleration with Neuron-to-Memory Transformation. 1-14 - Mingyu Yan, Lei Deng
, Xing Hu, Ling Liang, Yujing Feng, Xiaochun Ye, Zhimin Zhang, Dongrui Fan
, Yuan Xie:
HyGCN: A GCN Accelerator with Hybrid Architecture. 15-29 - Ismail Akturk, Ulya R. Karpuzcu:
ACR: Amnesic Checkpointing and Recovery. 30-43 - Jingwen Leng, Alper Buyuktosunoglu, Ramon Bertran
, Pradip Bose, Quan Chen, Minyi Guo, Vijay Janapa Reddi:
Asymmetric Resilience: Exploiting Task-Level Idempotency for Transient Error Recovery in Accelerator-Based Systems. 44-57 - Eric Qin, Ananda Samajdar, Hyoukjun Kwon
, Vineet Nadella, Sudarshan Srinivasan, Dipankar Das, Bharat Kaul, Tushar Krishna:
SIGMA: A Sparse and Irregular GEMM Accelerator with Flexible Interconnects for DNN Training. 58-70 - Nader Sehatbakhsh
, Baki Berkay Yilmaz, Alenka G. Zajic
, Milos Prvulovic:
EMSim: A Microarchitecture-Level Simulation Tool for Modeling Electromagnetic Side-Channel Signals. 71-85 - Elaheh Sadredini
, Reza Rahimi, Marzieh Lenjani, Mircea Stan
, Kevin Skadron
:
Impala: Algorithm/Architecture Co-Design for In-Memory Multi-Stride Pattern Matching. 86-98 - Ting-Ru Lin, Drew Penney, Massoud Pedram, Lizhong Chen:
A Deep Reinforcement Learning Framework for Architectural Exploration: A Routerless NoC Case Study. 99-110 - Hamza Omar, Omer Khan:
IRONHIDE: A Secure Multicore that Efficiently Mitigates Microarchitecture State Attacks for Interactive Applications. 111-122 - Nader Sehatbakhsh
, Baki Berkay Yilmaz, Alenka G. Zajic, Milos Prvulovic:
A New Side-Channel Vulnerability on Modern Computers by Exploiting Electromagnetic Emanations from the Power Management Unit. 123-138 - Wenjie Xiong
, Jakub Szefer:
Leaking Information Through Cache LRU States. 139-152 - Mohammad Reza Jokar, Junyi Qiu, Frederic T. Chong
, Lynford L. Goddard, John M. Dallesasse
, Milton Feng, Yanjing Li:
Baldur: A Power-Efficient and Scalable Network Using All-Optical Switches. 153-166 - Rajiv Nishtala, Vinicius Petrucci, Paul M. Carpenter, Magnus Själander:
Twig: Multi-Agent Task Management for Colocated Latency-Critical Cloud Services. 167-179 - Reid Pinkham, Shuqing Zeng, Zhengya Zhang
:
QuickNN: Memory and Performance Optimization of k-d Tree Based Nearest Neighbor Search for 3D Point Clouds. 180-192 - Tirthak Patel
, Devesh Tiwari:
CLITE: Efficient and QoS-Aware Co-Location of Multiple Latency-Critical Jobs for Warehouse Scale Computers. 193-206 - Amirhossein Mirhosseini, Brendan L. West, Geoffrey W. Blake, Thomas F. Wenisch:
Q-Zilla: A Scheduling Framework and Core Microarchitecture for Tail-Tolerant Microservices. 207-219 - Yujeong Choi, Minsoo Rhu:
PREMA: A Predictive Multi-Task Scheduling Algorithm For Preemptible Neural Processing Units. 220-233 - Priyank Faldu
, Jeff Diamond, Boris Grot
:
Domain-Specialized Cache Management for Graph Analytics. 234-248 - Bahar Asgari
, Ramyad Hadidi
, Tushar Krishna, Hyesoon Kim, Sudhakar Yalamanchili:
ALRESCHA: A Lightweight Reconfigurable Sparse-Computation Accelerator. 249-260 - Zhekai Zhang, Hanrui Wang, Song Han, William J. Dally:
SpArch: Efficient Architecture for Sparse Matrix Multiplication. 261-274 - Farzaneh Zokaee, Lei Jiang:
Mitigating Voltage Drop in Resistive Memories by Dynamic RESET Voltage Regulation and Partition RESET. 275-286 - Jie Zhang, Gyuyoung Park, David Donofrio, John Shalf
, Myoungsoo Jung:
DRAM-Less: Hardware Acceleration of Data Processing with New Memory. 287-302 - Xin Xin, Youtao Zhang, Jun Yang:
ELP2IM: Efficient and Low Power Bitwise Operation Processing in DRAM. 303-314 - Keni Qiu, Nicholas Jao, Mengying Zhao, Cyan Subhra Mishra
, Gulsum Gudukbay
, Sethu Jose, Jack Sampson, Mahmut Taylan Kandemir, Vijaykrishnan Narayanan
:
ResiRCA: A Resilient Energy Harvesting ReRAM Crossbar-Based Accelerator for Intelligent Embedded Processors. 315-327 - Tae Jun Ham, Sungjun Jung, Seonghak Kim, Young H. Oh, Yeonhong Park, Yoonho Song
, Jung-Hun Park, Sanghee Lee, Kyoung Park, Jae W. Lee, Deog-Kyoon Jeong:
A3: Accelerating Attention Mechanisms in Neural Networks with Approximation. 328-341 - Linghao Song
, Fan Chen, Youwei Zhuo, Xuehai Qian, Hai Li, Yiran Chen:
AccPar: Tensor Partitioning for Heterogeneous Deep Learning Accelerators. 342-355 - Donald Kline Jr., Jiangwei Zhang, Rami G. Melhem, Alex K. Jones
:
FLOWER and FaME: A Low Overhead Bit-Level Fault-map and Fault-Tolerance Approach for Deeply Scaled Memories. 356-368 - Yuezhi Che, Rujia Wang:
Multi-Range Supported Oblivious RAM for Efficient Block Data Retrieval. 369-382 - Ipoom Jeong
, Seihoon Park, Changmin Lee, Won Woo Ro:
CASINO Core Microarchitecture: Generating Out-of-Order Schedules Using Cascaded In-Order Scheduling Windows. 383-396 - Ajeya Naithani
, Josué Feliu
, Almutaz Adileh, Lieven Eeckhout:
Precise Runahead Execution. 397-410 - Liang Zhu, Chao Chen, Zihao Su, Weiguang Chen, Tao Li, Zhibin Yu:
BBS: Micro-Architecture Benchmarking Blockchain Systems through Machine Learning and Fuzzy Set. 411-423 - Mehdi Alipour, Stefanos Kaxiras, David Black-Schaffer, Rakesh Kumar:
Delay and Bypass: Ready and Criticality Aware Instruction Scheduling in Out-of-Order Processors. 424-434 - Yunfan Li, Lizhong Chen:
EquiNox: Equivalent NoC Injection Routers for Silicon Interposer-Based Throughput Processors. 435-446 - Mayank Parasar, Hossein Farrokhbakht, Natalie D. Enright Jerger
, Paul V. Gratz
, Tushar Krishna, Joshua San Miguel
:
DRAIN: Deadlock Removal for Arbitrary Irregular Networks. 447-460 - Karthik Sangaiah, Michael Lui, Ragh Kuttappa, Baris Taskin, Mark Hempstead:
SnackNoC: Processing in the Communication Layer. 461-473 - Kyle Shiflett, Dylan Wright, Avinash Karanth
, Ahmed Louri:
PIXEL: Photonic Neural Network Accelerator. 474-487 - Udit Gupta
, Carole-Jean Wu, Xiaodong Wang, Maxim Naumov, Brandon Reagen, David Brooks, Bradford Cottel, Kim M. Hazelwood, Mark Hempstead, Bill Jia, Hsien-Hsin S. Lee, Andrey Malevich, Dheevatsa Mudigere, Mikhail Smelyanskiy, Liang Xiong, Xuan Zhang
:
The Architectural Implications of Facebook's DNN-Based Personalized Recommendation. 488-501 - Changmin Lee, Wonjae Shin, Dae Jeong Kim, Yongjun Yu, Sung-Joon Kim, Taekyeong Ko, Deokho Seo, Jongmin Park, Kwanghee Lee, Seongho Choi, Namhyung Kim, Vishak G, Arun George, Vishwas V, Donghun Lee, Kang-Woo Choi, Changbin Song, Dohan Kim, Insu Choi, Ilgyu Jung, Yong Ho Song, Jinman Han:
NVDIMM-C: A Byte-Addressable Non-Volatile Memory Module for Compatibility with Standard DDR Memory Interfaces. 502-514 - Daniel Richins, Dharmisha Doshi, Matthew Blackmore, Aswathy Thulaseedharan Nair, Neha Pathapati, Ankit Patel, Brainard Daguman, Daniel Dobrijalowski, Ramesh Illikkal, Kevin Long, David Zimmerman, Vijay Janapa Reddi:
Missing the Forest for the Trees: End-to-End AI Application Performance in Edge Data Centers. 515-528 - Xiaoming Chen, Yinhe Han, Yu Wang:
Communication Lower Bound in Convolution Accelerators. 529-541 - Xingyao Zhang
, Shuaiwen Leon Song, Chenhao Xie, Jing Wang, Weigong Zhang, Xin Fu
:
Enabling Highly Efficient Capsule Networks Processing Through A PIM-Based Architecture Design. 542-555 - Marzieh Lenjani, Patricia Gonzalez-Guerrero, Elaheh Sadredini
, Shuangchen Li, Yuan Xie, Ameen Akel, Sean Eilert, Mircea R. Stan
, Kevin Skadron
:
Fulcrum: A Simplified Control and Access Mechanism Toward Flexible and Practical In-Situ Accelerators. 556-569 - Gurunath Kadam
, Danfeng Zhang
, Adwait Jog:
BCoal: Bucketing-Based Memory Coalescing for Efficient and Secure GPUs. 570-581 - Xiaowei Ren, Daniel Lustig, Evgeny Bolotin, Aamer Jaleel, Oreste Villa, David W. Nellans:
HMG: Extending Cache Coherence Protocols Across Modern Hierarchical Multi-GPU Systems. 582-595 - Trinayan Baruah, Yifan Sun
, Ali Tolga Dinçer, Saiful A. Mojumder, José L. Abellán, Yash Ukidave, Ajay Joshi, Norman Rubin, John Kim
, David R. Kaeli:
Griffin: Hardware-Software Support for Efficient Page Migration in Multi-GPU Systems. 596-609 - Jianbo Dong, Zheng Cao
, Tao Zhang, Jianxi Ye, Shaochuang Wang, Fei Feng, Li Zhao, Xiaoyong Liu, Liuyihan Song, Liwei Peng, Yiqun Guo, Xiaowei Jiang, Lingbo Tang, Yin Du, Yingya Zhang, Pan Pan, Yuan Xie:
EFLOPS: Algorithm and System Co-Design for a High Performance Distributed Training Platform. 610-622 - Jawad Haj-Yahya, Yanos Sazeides, Mohammed Alser, Efraim Rotem, Onur Mutlu:
Techniques for Reducing the Connected-Standby Energy Consumption of Mobile Devices. 623-636 - Jieming Yin, Subhash Sethumurugan, Yasuko Eckert, Chintan Patel, Alan Smith, Eric Morton, Mark Oskin, Natalie D. Enright Jerger
, Gabriel H. Loh:
Experiences with ML-Driven Design: A NoC Case Study. 637-648 - Evangelos Vasilakis, Vassilis Papaefstathiou, Pedro Trancoso
, Ioannis Sourdis:
Hybrid2: Combining Caching and Migration in Hybrid Memory Systems. 649-662 - Seikwon Kim, Wonsang Kwak, Changdae Kim, Daehyeon Baek
, Jaehyuk Huh:
Charge-Aware DRAM Refresh Reduction with Value Transformation. 663-676 - Jian Chen, Ying Zhang, Xiaowei Jiang, Li Zhao, Zheng Cao
, Qiang Liu:
DWT: Decoupled Workload Tracing for Data Centers. 677-688 - Nitish Kumar Srivastava, Hanchen Jin, Shaden Smith, Hongbo Rong, David H. Albonesi, Zhiru Zhang
:
Tensaurus: A Versatile Accelerator for Mixed Sparse-Dense Tensor Computations. 689-702 - Jian Weng, Sihao Liu
, Zhengrong Wang, Vidushi Dadu, Tony Nowatzki:
A Hybrid Systolic-Dataflow Architecture for Inductive Matrix Algorithms. 703-716 - Adrián Barredo
, Juan M. Cebrian
, Miquel Moretó
, Marc Casas
, Mateo Valero:
Improving Predication Efficiency through Compaction/Restoration of SIMD Instructions. 717-728

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.