ASPLOS 2022: Lausanne, Switzerland
- Babak Falsafi, Michael Ferdman, Shan Lu, Thomas F. Wenisch:
ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022 - 4 March 2022. ACM 2022, ISBN 978-1-4503-9205-1
Session 1A: Accelerators
- Vidushi Dadu, Tony Nowatzki:
TaskStream: accelerating task-parallel workloads by recovering program structure. 1-13
- Zheng Qu, Liu Liu, Fengbin Tu, Zhaodong Chen, Yufei Ding, Yuan Xie:
DOTA: detect and omit weak attentions for scalable transformer acceleration. 14-26
- Dan Zhang, Safeen Huda, Ebrahim M. Songhori, Kartik Prabhu, Quoc V. Le, Anna Goldie, Azalia Mirhoseini:
A full-stack search technique for domain optimized deep learning accelerators. 27-42
- Qihang Chen, Boyu Tian, Mingyu Gao:
FINGERS: exploiting fine-grained parallelism in graph mining accelerators. 43-55
- Enrico Reggiani, Cristóbal Ramírez Lazo, Roger Figueras Bagué, Adrián Cristal, Mauro Olivieri, Osman Sabri Unsal:
BiSon-e: a lightweight and high-performance accelerator for narrow integer linear algebra computing on the edge. 56-69
Session 1B: Address and Memory
- Jialiang Zhang, Michael M. Swift, Jing Jane Li:
Software-defined address mapping: a case on 3D memory. 70-83
- Jovan Stojkovic, Dimitrios Skarlatos, Apostolos Kokolis, Tianyin Xu, Josep Torrellas:
Parallel virtualized memory translation with nested elastic cuckoo page tables. 84-97
- Brian Suchy, Souradip Ghosh, Drew Kersnar, Siyuan Chai, Zhen Huang, Aaron Nelson, Michael Cuevas, Alex Bernat, Gaurav Chaudhary, Nikos Hardavellas, Simone Campanoni, Peter A. Dinda:
CARAT CAKE: replacing paging via compiler/kernel cooperation. 98-114
- Zheng Dang, Shuibing He, Peiyi Hong, Zhenxin Li, Xuechen Zhang, Xian-He Sun, Gang Chen:
NVAlloc: rethinking heap metadata management in persistent memory allocators. 115-127
- Chang Hyun Park, Ilias Vougioukas, Andreas Sandberg, David Black-Schaffer:
Every walk's a hit: making page walks single-access cache hits. 128-141
Session 2A: GPU and Data Analytics
- Shweta Pandey, Aditya K. Kamath, Arkaprava Basu:
GPM: leveraging persistent memory from a GPU. 142-156
- Heejin Park, Felix Xiaozhu Lin:
GPUReplay: a 50-KB GPU stack for client ML. 157-170
- Keren Zhou, Yueming Hao, John M. Mellor-Crummey, Xiaozhu Meng, Xu Liu:
ValueExpert: exploring value patterns in GPU-accelerated applications. 171-185
- Gengyu Rao, Jingji Chen, Jason Yik, Xuehai Qian:
SparseCore: stream ISA and processor specialization for sparse computation. 186-199
- Lin Jiang, Zhijia Zhao:
JSONSki: streaming semi-structured data with bit-parallel fast-forwarding. 200-211
Session 2B: Privacy and Software Security
- Márton Erdos, Sam Ainsworth, Timothy M. Jones:
MineSweeper: a "clean sweep" for drop-in use-after-free prevention. 212-225
- Oleksii Oleksenko, Christof Fetzer, Boris Köpf, Mark Silberstein:
Revizor: testing black-box CPUs against speculation contracts. 226-239
- Tejas Kannan, Henry Hoffmann:
Protecting adaptive sampling from information leakage on low-power sensors. 240-254
- Haotian Zhang, Mengfei Ren, Yu Lei, Jiang Ming:
One size does not fit all: security hardening of MIPS embedded systems via static binary debloating for shared libraries. 255-270
- Haehyun Cho, Jinbum Park, Adam Oest, Tiffany Bao, Ruoyu Wang, Yan Shoshitaishvili, Adam Doupé, Gail-Joon Ahn:
ViK: practical mitigation of temporal memory safety violations through object ID inspection. 271-284
Session 3A: Hardware Security (1)
- Boyuan Yang, Ruirong Chen, Kai Huang, Jun Yang, Wei Gao:
Eavesdropping user credentials via GPU side channels on smartphones. 285-299
- Heiner Litz, Grant Ayers, Parthasarathy Ranganathan:
CRISP: critical slice prefetching. 300-313
Session 3B: Misc.
- Zirui Neil Zhao, Houxiang Ji, Adam Morrison, Darko Marinov, Josep Torrellas:
Pinned loads: taming speculative loads in secure processors. 314-328
- Peter W. Deutsch, Yuheng Yang, Thomas Bourgeat, Jules Drean, Joel S. Emer, Mengjia Yan:
DAGguise: mitigating memory timing side channels. 329-343
Session 4A: Systems for Machine Learning
- Geet Sethi, Bilge Acun, Niket Agarwal, Christos Kozyrakis, Caroline Trippel, Carole-Jean Wu:
RecShard: statistical feature-based memory optimization for industry-scale neural recommendation. 344-358
- Zhen Zheng, Xuanda Yang, Pengzhan Zhao, Guoping Long, Kai Zhu, Feiwen Zhu, Wenyi Zhao, Xiaoyong Liu, Jun Yang, Jidong Zhai, Shuaiwen Leon Song, Wei Lin:
AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures. 359-373
- Shixiong Zhao, Fanxin Li, Xusheng Chen, Tianxiang Shen, Li Chen, Sen Wang, Nicholas Zhang, Cheng Li, Heming Cui:
NASPipe: high performance and reproducible pipeline parallel supernet training via causal synchronous parallelism. 374-387
- Zihan Liu, Jingwen Leng, Zhihui Zhang, Quan Chen, Chao Li, Minyi Guo:
VELTAIR: towards high-performance multi-tenant deep learning services via adaptive compilation and scheduling. 388-401
- Abhinav Jangda, Jun Huang, Guodong Liu, Amir Hossein Nodehi Sabet, Saeed Maleki, Youshan Miao, Madanlal Musuvathi, Todd Mytkowicz, Olli Saarikivi:
Breaking the computation and communication abstraction barrier in distributed machine learning workloads. 402-416
Session 4B: Operating System
- Zhiyuan Guo, Yizhou Shan, Xuhao Luo, Yutong Huang, Yiying Zhang:
Clio: a hardware-software co-designed disaggregated memory system. 417-433
- David A. Cock, Abishek Ramdas, Daniel Schwyn, Michael Giardino, Adam Turowski, Zhenhao He, Nora Hossle, Dario Korolija, Melissa Licciardello, Kristina Martsenko, Reto Achermann, Gustavo Alonso, Timothy Roscoe:
Enzian: an open, general, CPU/FPGA platform for systems software research. 434-451
- Nils Asmussen, Sebastian Haas, Carsten Weinhold, Till Miemietz, Michael Roitzsch:
Efficient and scalable core multiplexing with M³v. 452-466
- Hugo Lefeuvre, Vlad-Andrei Badoiu, Alexander Jung, Stefan Lucian Teodorescu, Sebastian Rauch, Felipe Huici, Costin Raiciu, Pierre Olivier:
FlexOS: towards flexible OS isolation. 467-482
- Ruslan Nikolaev, Hassan Nadeem, Cathlyn Stone, Binoy Ravindran:
Adelie: continuous address space layout re-randomization for Linux drivers. 483-498
Session 5A: Quantum Computing
- Lei Xie, Jidong Zhai, Zhenxing Zhang, Jonathan Allcock, Shengyu Zhang, Yicong Zheng:
Suppressing ZZ crosstalk of Quantum computers through pulse and scheduling co-optimization. 499-513
- Tirthak Patel, Ed Younis, Costin Iancu, Wibe de Jong, Devesh Tiwari:
QUEST: systematically approximating Quantum circuits for higher output fidelity. 514-528
- Swamit S. Tannu, Poulami Das, Ramin Ayanzadeh, Moinuddin K. Qureshi:
HAMMER: boosting fidelity of noisy Quantum circuits by exploiting Hamming behavior of erroneous outcomes. 529-540
- Poulami Das, Aditya Locharla, Cody Jones:
LILLIPUT: a lightweight low-latency lookup-table decoder for near-term Quantum error correction. 541-553
- Gushu Li, Anbang Wu, Yunong Shi, Ali Javadi-Abhari, Yufei Ding, Yuan Xie:
Paulihedral: a generalized block-wise compiler optimization framework for Quantum simulation kernels. 554-569
Session 5B: Data Center and Cloud Services
- Wei Zhang, Quan Chen, Kaihua Fu, Ningxin Zheng, Zhiyi Huang, Jingwen Leng, Minyi Guo:
Astraea: towards QoS-aware and resource-efficient multi-stage GPU services. 570-582
- Alexander Fuerst, Stanko Novakovic, Iñigo Goiri, Gohar Irfan Chaudhry, Prateek Sharma, Kapil Arya, Kevin Broas, Eugene Bak, Mehmet Iyigun, Ricardo Bianchini:
Memory-harvesting VMs in cloud platforms. 583-594
- Tejun Heo, Dan Schatzberg, Andrew Newell, Song Liu, Saravanan Dhakshinamurthy, Iyswarya Narayanan, Josef Bacik, Chris Mason, Chunqiang Tang, Dimitrios Skarlatos:
IOCost: block IO control for containers in datacenters. 595-608
- Johannes Weiner, Niket Agarwal, Dan Schatzberg, Leon Yang, Hao Wang, Blaise Sanouillet, Bikash Sharma, Tejun Heo, Mayank Jain, Chunqiang Tang, Dimitrios Skarlatos:
TMO: transparent memory offloading in datacenters. 609-621
- Yawen Wang, Daniel Crankshaw, Neeraja J. Yadwadkar, Daniel S. Berger, Christos Kozyrakis, Ricardo Bianchini:
SOL: safe on-node learning in cloud platforms. 622-634
Session 6A: Accelerating Emerging Applications
- Nika Mansouri-Ghiasi, Jisung Park, Harun Mustafa, Jeremie S. Kim, Ataberk Olgun, Arvid Gollwitzer, Damla Senol Cali, Can Firtina, Haiyu Mao, Nour Almadhoun Alserr, Rachata Ausavarungnirun, Nandita Vijaykumar, Mohammed Alser, Onur Mutlu:
GenStore: a high-performance in-storage processing system for genome sequence analysis. 635-654
- Eyes Robson, Ceyu Xu, Lisa Wu Wills:
ProSE: the architecture and design of a protein discovery engine. 655-668
- Bangyan Wang, Lei Deng, Fei Sun, Guohao Dai, Liu Liu, Yu Wang, Yuan Xie:
A one-for-all and O(V log(V))-cost solution for parallel merge style operations on sorted key-value arrays. 669-682
- McKenzie van der Hagen, Brandon Lucia:
Client-optimized algorithms and acceleration for encrypted compute offloading. 683-696
Session 6B: Bugs (1)
- Theodoros Theodoridis, Manuel Rigger, Zhendong Su:
Finding missed optimizations through the lens of dead code elimination. 697-709
- Umang Mathur, Andreas Pavlogiannis, Hünkar Can Tunç, Mahesh Viswanathan:
A tree clock data structure for causal orderings in concurrent executions. 710-725
- Benjamin Reidys, Peng Liu, Jian Huang:
RSSD: defend against ransomware with hardware-isolated network-storage codesign and post-attack analysis. 726-739
- Xiang Cheng, David Devecsery:
Creating concise and efficient dynamic analyses with ALDA. 740-752
Session 7A: Serverless
- Rohan Basu Roy, Tirthak Patel, Devesh Tiwari:
IceBreaker: warming serverless functions better with heterogeneity. 753-767
- Yanan Yang, Laiping Zhao, Yiming Li, Huanyu Zhang, Jie Li, Mingyang Zhao, Xingzhen Chen, Keqiu Li:
INFless: a native serverless system for low-latency, high-throughput inference. 768-781
- Zijun Li, Yushi Liu, Linsong Guo, Quan Chen, Jiagan Cheng, Wenli Zheng, Minyi Guo:
FaaSFlow: enable efficient workflow execution for function-as-a-service. 782-796
- Dong Du, Qingyuan Liu, Xueqiang Jiang, Yubin Xia, Binyu Zang, Haibo Chen:
Serverless computing on heterogeneous computers. 797-813
- Qiangyu Pei, Shutong Chen, Qixia Zhang, Xinhui Zhu, Fangming Liu, Ziyang Jia, Yishuo Wang, Yongjie Yuan:
CoolEdge: hotspot-relievable warm water cooling for energy-efficient edge datacenters. 814-829
Session 7B: Bugs (2)
- Hamed Gorjiara, Guoqing Harry Xu, Brian Demsky:
Yashme: detecting persistency races. 830-845
- Muhui Jiang, Tianyi Xu, Yajin Zhou, Yufeng Hu, Ming Zhong, Lei Wu, Xiapu Luo, Kui Ren:
EXAMINER: automatically locating inconsistent instructions between real devices and CPU emulators for ARM. 846-858
- Tuo Li, Jia-Ju Bai, Yulei Sui, Shi-Min Hu:
Path-sensitive and alias-aware typestate analysis for detecting OS bugs. 859-872
- Zhangyu Chen, Yu Hua, Yongle Zhang, Luochangqi Ding:
Efficiently detecting concurrency bugs in persistent memory programs. 873-887
- Ziheng Liu, Shihao Xia, Yu Liang, Linhai Song, Hong Hu:
Who goes first? detecting Go concurrency bugs via message reordering. 888-902
Session 8A: Non-traditional Computing and Reconfigurable Hardware
- Dongmoon Min, Yujin Chung, Ilkwon Byun, Junpyo Kim, Jangwoo Kim:
CryoWire: wire-driven microarchitecture designs for cryogenic computing. 903-917
- Thilini Kaushalya Bandara, Dhananjaya Wijerathne, Tulika Mitra, Li-Shiuan Peh:
REVAMP: a systematic framework for heterogeneous CGRA realization. 918-932
- Yuanlong Xiao, Eric Micallef, Andrew Butt, Matthew Hofmann, Marc Alston, Matthew Goldsmith, Andrew Merczynski-Hait, André DeHon:
PLD: fast FPGA compilation to make reconfigurable acceleration compatible with modern incremental refinement software development. 933-945
- Jiacheng Ma, Gefei Zuo, Kevin Loughlin, Haoyang Zhang, Andrew Quinn, Baris Kasikci:
Debugging in the brave new world of reconfigurable hardware. 946-962
- Patricia Gonzalez-Guerrero, Meriam Gay Bautista, Darren Lyles, George Michelogiannakis:
Temporal and SFQ pulse-streams encoding for area-efficient superconducting accelerators. 963-976
Session 8B: Synthesis and Compilation
- Theodoros Theodoridis, Tobias Grosser, Zhendong Su:
Understanding and exploiting optimal function inlining. 977-989
- Hammad Ahmad, Yu Huang, Westley Weimer:
CirFix: automatically repairing defects in hardware design code. 990-1003
- Maaz Bin Safeer Ahmad, Alexander J. Root, Andrew Adams, Shoaib Kamil, Alvin Cheung:
Vector instruction selection for digital signal processors using program synthesis. 1004-1016
- Qian Zhang, Jiyuan Wang, Guoqing Harry Xu, Miryung Kim:
HeteroGen: transpiling C to heterogeneous HLS code with automated test generation and program repair. 1017-1029
- Yanju Chen, Junrui Liu, Yu Feng, Rastislav Bodík:
Tree traversal synthesis using domain-specific symbolic compilation. 1030-1042
Session 9A: Hardware Security (2)
- Jubayer Mahmod, Matthew Hicks:
SRAM has no chill: exploiting power domain separation to steal on-chip secrets. 1043-1055
- Gururaj Saileshwar, Bolin Wang, Moinuddin K. Qureshi, Prashant J. Nair:
Randomized row-swap: mitigating Row Hammer by breaking spatial correlation between aggressor and victim rows. 1056-1069
- Mark Zhao, Mingyu Gao, Christos Kozyrakis:
ShEF: shielded enclaves for cloud FPGAs. 1070-1085
- Jubayer Mahmod, Matthew Hicks:
Invisible bits: hiding secret messages in SRAM's analog domain. 1086-1098
Session 9B: Smart Networking
- Tushar Swamy, Alexander Rucker, Muhammad Shahbaz, Ishan Gaur, Kunle Olukotun:
Taurus: a data plane architecture for per-packet ML. 1099-1114
- Haggai Eran, Maxim Fudim, Gabi Malka, Gal Shalom, Noam Cohen, Amit Hermony, Dotan Levi, Liran Liss, Mark Silberstein:
FlexDriver: a network driver for your accelerator. 1115-1129
- Boris Pismenny, Liran Liss, Adam Morrison, Dan Tsafrir:
The benefits of general-purpose on-NIC memory. 1130-1147
- Sebastiano Miano, Alireza Sanaee, Fulvio Risso, Gábor Rétvári, Gianni Antichi:
Domain specific run time optimization for software data planes. 1148-1164