


default search action
51st MICRO 2018: Fukuoka, Japan
- 51st Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2018, Fukuoka, Japan, October 20-24, 2018. IEEE Computer Society 2018, ISBN 978-1-5386-6240-3

1A: Accelerators
- Anurag Mukkara, Nathan Beckmann, Maleen Abeydeera, Xiaosong Ma, Daniel Sánchez:

Exploiting Locality in Graph Analytics through Hardware-Accelerated Traversal Scheduling. 1-14 - Xuda Zhou, Zidong Du, Qi Guo, Shaoli Liu, Chengsi Liu, Chao Wang, Xuehai Zhou, Ling Li, Tianshi Chen, Yunji Chen

:
Cambricon-S: Addressing Irregularity in Sparse Neural Networks through A Cooperative Software/Hardware Approach. 15-28 - Youwei Zhuo, Jinglei Cheng

, Qinyi Luo, Jidong Zhai, Yanzhi Wang, Zhongzhi Luan, Xuehai Qian:
CSE: Parallel Finite State Machines with Convergence Set Enumeration. 29-41 - Dani Voitsechov, Oron Port, Yoav Etsion

:
Inter-Thread Communication in Multithreaded, Reconfigurable Coarse-Grain Arrays. 42-54 - Tao Chen, Shreesha Srinath, Christopher Batten, G. Edward Suh

:
An Architectural Framework for Accelerating Dynamic Parallel Algorithms on Reconfigurable Hardware. 55-67
1B: Microarchitecture
- Sizhuo Zhang, Andrew Wright, Thomas Bourgeat

, Arvind:
Composable Building Blocks to Open up Processor Design. 68-81 - Hideki Ando:

Performance Improvement by Prioritizing the Issue of the Instructions in Unconfident Branch Slices. 82-94 - Alberto Ros

, Stefanos Kaxiras:
The Superfluous Load Queue. 95-107 - Almutaz Adileh, David J. Lilja, Lieven Eeckhout:

Architectural Support for Probabilistic Branches. 108-120 - Hidetsugu Irie, Toru Koizumi, Akifumi Fukuda, Seiya Akaki, Satoshi Nakae, Yutaro Bessho, Ryota Shioya, Takahiro Notsu, Katsuhiro Yoda, Teruo Ishihara, Shuichi Sakai

:
STRAIGHT: Hazardless Processor Architecture Without Register Renaming. 121-133
2A: ML Accelerators
- Mostafa Mahmoud, Kevin Siu, Andreas Moshovos:

Diffy: a Déjà vu-Free Differential Deep Neural Network Accelerator. 134-147 - Youngeun Kwon

, Minsoo Rhu:
Beyond the Memory Wall: A Case for Memory-Centric HPC System for Deep Learning. 148-161 - Xingyao Zhang

, Chenhao Xie
, Jing Wang, Weidong Zhang, Xin Fu
:
Towards Memory Friendly Long-Short Term Memory Networks (LSTMs) on Mobile GPUs. 162-174 - Youjie Li, Jongse Park, Mohammad Alian

, Yifan Yuan, Zheng Qu, Peitian Pan, Ren Wang, Alexander G. Schwing, Hadi Esmaeilzadeh, Nam Sung Kim:
A Network-Centric Hardware/Algorithm Co-Design to Accelerate Distributed Training of Deep Neural Networks. 175-188 - Chunhua Deng, Siyu Liao, Yi Xie, Keshab K. Parhi

, Xuehai Qian, Bo Yuan:
PermDNN: Efficient Compressed DNN Architecture with Permuted Diagonal Matrices. 189-202
2B: Compilers and Programming Languages
- Po-An Tsai, Yee Ling Gan, Daniel Sánchez:

Rethinking the Memory Hierarchy for Modern Languages. 203-216 - Mark C. Jeffrey

, Victor A. Ying
, Suvinay Subramanian
, Hyun Ryong Lee, Joel S. Emer, Daniel Sánchez:
Harmonizing Speculative and Non-Speculative Execution in Architectures for Ordered Parallelism. 217-230 - Sam Silvestro, Hongyu Liu, Tong Zhang, Changhee Jung, Dongyoon Lee

, Tongping Liu:
Sampler: PMU-Based Sampling to Detect Memory Errors Latent in Production Software. 231-244 - Steve Margerm, Amirali Sharifian, Apala Guha, Arrvindh Shriraman, Gilles Pokam:

TAPAS: Generating Parallel Accelerators from Parallel Programs. 245-257 - Qingrui Liu, Joseph Izraelevitz, Se Kwon Lee

, Michael L. Scott
, Sam H. Noh, Changhee Jung:
iDO: Compiler-Directed Failure Atomicity for Nonvolatile Memory. 258-270
3A: Memory Systems - I
- Srikant Bharadwaj, Guilherme Cox

, Tushar Krishna, Abhishek Bhattacharjee
:
Scalable Distributed Last-Level TLBs Using Low-Latency Interconnects. 271-284 - Ben Lin, Michael B. Healy, Rustam Miftakhutdinov, Philip G. Emma, Yale N. Patt:

Duplicon Cache: Mitigating Off-Chip Memory Bank and Bank Group Conflicts Via Data Duplication. 285-297 - Yaohua Wang, Arash Tavakkol, Lois Orosa

, Saugata Ghose, Nika Mansouri-Ghiasi, Minesh Patel, Jeremie S. Kim, Hasan Hassan, Mohammad Sadrosadati, Onur Mutlu
:
Reducing DRAM Latency via Charge-Level-Aware Look-Ahead Partial Restoration. 298-311 - Tri Minh Nguyen, Adi Fuchs, David Wentzlaff:

CABLE: A CAche-Based Link Encoder for Bandwidth-Starved Manycores. 312-325 - Seokin Hong, Prashant Jayaprakash Nair, Bülent Abali, Alper Buyuktosunoglu, Kyu-Hyoun Kim

, Michael B. Healy:
Attaché: Towards Ideal Memory Compression by Mitigating Metadata Bandwidth Overheads. 326-338
3B: GPGPU/GPU
- Vinson Young, Aamer Jaleel, Evgeny Bolotin, Eiman Ebrahimi, David W. Nellans, Oreste Villa:

Combining HW/SW Mechanisms to Improve NUMA Performance of Multi-GPU Systems. 339-351 - Seunghee Shin, Michael LeBeane, Yan Solihin, Arkaprava Basu:

Neighborhood-Aware Address Translation for Irregular GPU Applications. 352-363 - Yunho Oh

, Myung Kuk Yoon
, William J. Song
, Won Woo Ro:
FineReg: Fine-Grained Register File Management for Augmenting GPU Throughput. 364-376 - Farzad Khorasani, Hodjat Asghari Esfeden, Nael B. Abu-Ghazaleh

, Vivek Sarkar:
In-Register Parameter Caching for Dynamic Neural Nets with Virtual Persistent Processor Specialization. 377-389 - An Zou, Jingwen Leng, Xin He, Yazhou Zu, Christopher D. Gill, Vijay Janapa Reddi, Xuan Zhang

:
Voltage-Stacked GPUs: A Control Theory Driven Cross-Layer Solution for Practical Voltage Stacking in GPUs. 390-402
4A: Security - I
- Mao Ye

, Clayton Hughes
, Amro Awad
:
Osiris: A Low-Cost Mechanism to Enable Restoration of Secure Non-Volatile Memories. 403-415 - Gururaj Saileshwar, Prashant J. Nair, Prakash Ramrakhyani, Wendy Elsasser

, José A. Joao, Moinuddin K. Qureshi:
Morphable Counters: Enabling Compact Integrity Trees For Low-Overhead Secure Memories. 416-427 - Mengjia Yan

, Jiho Choi, Dimitrios Skarlatos, Adam Morrison, Christopher W. Fletcher, Josep Torrellas:
InvisiSpec: Making Speculative Execution Invisible in the Cache Hierarchy. 428-441 - Pengfei Zuo

, Yu Hua, Ming Zhao
, Wen Zhou, Yuncheng Guo:
Improving the Performance and Endurance of Encrypted Non-Volatile Main Memory through Deduplicating Writes. 442-454
4B: Storage Systems & Technologies
- Joonsung Kim, Pyeongsu Park, Jaehyung Ahn, Jihun Kim, Jong Kim, Jangwoo Kim:

SSDcheck: Timely and Accurate Prediction of Irregular Behaviors in Black-Box SSDs. 455-468 - Donghyun Gouk, Miryeong Kwon, Jie Zhang, Sungjoon Koh, Wonil Choi, Nam Sung Kim, Mahmut T. Kandemir, Myoungsoo Jung:

Amber*: Enabling Precise Full-System Simulation with Detailed Modeling of All SSD Resources. 469-481 - Wonil Choi, Myoungsoo Jung, Mahmut T. Kandemir:

Invalid Data-Aware Coding to Enhance the Read Performance of High-Density Flash Memories. 482-493 - Xing Hu, Matheus Ogleari, Jishen Zhao, Shuangchen Li, Abanti Basak, Yuan Xie:

Persistence Parallelism Optimization: A Holistic Approach from Memory Bus to RDMA Network. 494-506
5A: Memory Systems - II
- Tri Minh Nguyen, David Wentzlaff:

PiCL: A Software-Transparent, Persistent Cache Log for Nonvolatile Main Memory. 507-519 - Jungi Jeong, Chang Hyun Park

, Jaehyuk Huh, Seungryoul Maeng:
Efficient Hardware-Assisted Logging with Asynchronous and Direct-Update for Persistent Memory. 520-532 - Jagadish B. Kotra, Haibo Zhang, Alaa R. Alameldeen, Chris Wilkerson, Mahmut T. Kandemir:

CHAMELEON: A Dynamically Reconfigurable Heterogeneous Memory System. 533-545 - Esha Choukse, Mattan Erez

, Alaa R. Alameldeen:
Compresso: Pragmatic Main Memory Compression. 546-558 - Amna Shahab, Mingcan Zhu, Artemiy Margaritov

, Boris Grot
:
Farewell My Shared LLC! A Case for Private Die-Stacked DRAM Caches for Servers. 559-572
5B: Measurement, Modeling, and Simulation
- Zacharias Hadjilambrou, Shidhartha Das, Marco A. Antoniades

, Yiannakis Sazeides:
Leveraging CPU Electromagnetic Emanations for Voltage Noise Characterization. 573-585 - Hanhwi Jang, Jae-Eon Jo, Jaewon Lee, Jangwoo Kim:

RpStacks-MT: A High-Throughput Design Evaluation Methodology for Multi-Core Processors. 586-599 - Joshua San Miguel

, Karthik Ganesan
, Mario Badr, Chunqiu Xia, Rose Li, Hsuan Hsiao, Natalie D. Enright Jerger
:
The EH Model: Early Design Space Exploration of Intermittent Processor Architectures. 600-612 - Yirong Lv, Bin Sun, Qingyi Luo, Jing Wang, Zhibin Yu, Xuehai Qian:

CounterMiner: Mining Big Performance Data from Hardware Counters. 613-626 - Shenghsun Cho, Amoghavarsha Suresh, Tapti Palit

, Michael Ferdman, Nima Honarmand:
Taming the Killer Microsecond. 627-640
6A: Near Memory Computing
- Po-An Tsai, Changping Chen, Daniel Sánchez:

Adaptive Scheduling for Systems with Asymmetric Memory Hierarchies. 641-654 - Jiawen Liu, Hengyu Zhao, Matheus A. Ogleari, Dong Li, Jishen Zhao:

Processing-in-Memory for Energy-Efficient Neural Network Training: A Heterogeneous Approach. 655-668 - Haiyu Mao, Mingcong Song, Tao Li, Yuting Dai, Jiwu Shu:

LerGAN: A Zero-Free, Low Data Movement and PIM-Based GAN Architecture. 669-681 - Byungchul Hong, Yeonju Ro, John Kim

:
Multi-dimensional Parallel Training of Winograd Layer on Memory-Centric Architecture. 682-695 - Shuangchen Li, Alvin Oliver Glova, Xing Hu, Peng Gu, Dimin Niu, Krishna T. Malladi, Hongzhong Zheng, Bob Brennan, Yuan Xie:

SCOPE: A Stochastic Computing Engine for DRAM-Based In-Situ Accelerator. 696-709
6B: Near Memory Computing
- Da Zhang, Vilas Sridharan, Xun Jian

:
Exploring and Optimizing Chipkill-Correct for Persistent Memory Based on High-Density NVRAMs. 710-723 - Behzad Salami

, Osman S. Unsal, Adrián Cristal Kestelman:
Comprehensive Evaluation of Supply Voltage Underscaling in FPGA on-Chip Memories. 724-736 - Emre Ozer, Balaji Venu, Xabier Iturbe

, Shidhartha Das, Spyros Lyberis, John Biggs, Peter Harrod, John Penton:
Error Correlation Prediction in Lockstep Processors for Safety-Critical Systems. 737-748 - Bin Nie, Lishan Yang, Adwait Jog, Evgenia Smirni

:
Fault Site Pruning for Practical Reliability Analysis of GPGPU Applications. 749-761 - Michael B. Sullivan, Siva Kumar Sastry Hari, Brian Zimmer, Timothy Tsai, Stephen W. Keckler:

SwapCodes: Error Codes for Hardware-Software Cooperative GPU Pipeline Error Detection. 762-774
7: Best Papers
- Moinuddin K. Qureshi:

CEASER: Mitigating Conflict-Based Cache Attacks via Encrypted-Address and Remapping. 775-787 - Yatin A. Manerkar, Daniel Lustig, Margaret Martonosi, Aarti Gupta

:
PipeProof: Automated Memory Consistency Proofs for Microarchitectural Specifications. 788-801 - Mohammad Alian

, Seungwon Min, Hadi Asgharimoghaddam, Ashutosh Dhar, Dong Kai Wang
, Thomas Roewer, Adam J. McPadden, Oliver O'Halloran, Deming Chen, Jinjun Xiong
, Daehoon Kim, Wen-Mei W. Hwu, Nam Sung Kim:
Application-Transparent Near-Memory Processing Architecture with Memory Channel Network. 802-814 - Rui Zhang, Calvin Deutschbein

, Peng Huang, Cynthia Sturton:
End-to-End Automated Exploit Generation for Validating the Security of Processor Designs. 815-827
8A: Non-Conventional Architectures
- Yongshan Ding

, Adam Holmes, Ali Javadi-Abhari, Diana Franklin
, Margaret Martonosi, Frederic T. Chong
:
Magic-State Functional Units: Mapping and Scheduling Multi-Level Distillation Circuits for Fault-Tolerant Quantum Architectures. 828-840 - Sumitha George, Minli Julie Liao, Huaipan Jiang, Jagadish B. Kotra, Mahmut T. Kandemir, Jack Sampson, Vijaykrishnan Narayanan:

MDACache: Caching for Multi-Dimensional-Access Memories. 841-854 - Ananda Samajdar, Parth Mannan, Kartikay Garg, Tushar Krishna:

GeneSys: Enabling Continuous Learning through Neural Network Evolution in Hardware. 855-866
8B: Mobile and Embedded Architectures
- Prasanna Venkatesh Rengasamy, Haibo Zhang, Shulin Zhao, Nachiappan Chidambaram Nachiappan, Anand Sivasubramaniam, Mahmut T. Kandemir, Chita R. Das:

CritICs Critiquing Criticality in Mobile Apps. 867-880 - Moumita Dey

, Alireza Nazari, Alenka G. Zajic
, Milos Prvulovic:
EMPROF: Memory Profiling Via EM-Emanation in IoT and Hand-Held Devices. 881-893 - Behzad Boroujerdian, Hasan Genc, Srivatsan Krishnan, Wenzhi Cui, Aleksandra Faust

, Vijay Janapa Reddi:
MAVBench: Micro Aerial Vehicle Benchmarking. 894-907
9A: Domain-Specific Architectures
- Hongyuan Liu

, Mohamed Assem Ibrahim
, Onur Kayiran, Sreepathi Pai
, Adwait Jog:
Architectural Support for Efficient Large-Scale Automata Processing. 908-920 - Kevin Angstadt

, Arun Subramaniyan, Elaheh Sadredini
, Reza Rahimi, Kevin Skadron
, Westley Weimer, Reetuparna Das
:
ASPEN: A Scalable In-SRAM Architecture for Pushdown Automata. 921-932 - Kartik Hegde, Rohit Agrawal, Yulun Yao, Christopher W. Fletcher:

Morph: Flexible Acceleration for 3D CNN-Based Video Understanding. 933-946
9B: Security-II
- Caroline Trippel, Daniel Lustig, Margaret Martonosi:

CheckMate: Automated Synthesis of Hardware Exploits and Security Litmus Tests. 947-960 - Xian Zhang, Guangyu Sun, Peichen Xie, Chao Zhang, Yannan Liu, Lingxiao Wei, Qiang Xu

, Chun Jason Xue:
Shadow Block: Accelerating ORAM Accesses with Data Duplication. 961-973 - Vladimir Kiriansky, Ilia A. Lebedev, Saman P. Amarasinghe, Srinivas Devadas, Joel S. Emer:

DAWG: A Defense Against Cache Timing Attacks in Speculative Execution Processors. 974-987

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














