32nd PACT 2023: Vienna, Austria
- 32nd International Conference on Parallel Architectures and Compilation Techniques, PACT 2023, Vienna, Austria, October 21-25, 2023. IEEE 2023, ISBN 979-8-3503-4254-3
- Sawan Singh, Josué Feliu, Manuel E. Acacio, Alexandra Jimborean, Alberto Ros: CELLO: Compiler-Assisted Efficient Load-Load Ordering in Data-Race-Free Regions. 1-13
- Zhen Peng, Rizwan A. Ashraf, Luanzheng Guo, Ruiqin Tian, Gokcen Kestor: Automatic Code Generation for High-Performance Graph Algorithms. 14-26
- Aditya Agrawal, V. Krishna Nandivada: UWOmppro: UWOmp++ with Point-to-Point Synchronization, Reduction and Schedules. 27-38
- Alexander Brauckmann, Elizabeth Polgreen, Tobias Grosser, Michael F. P. O'Boyle: mlirSynth: Automatic, Retargetable Program Raising in Multi-Level IR Using Program Synthesis. 39-50
- Shubdeep Mohapatra, Biswabandan Panda: Drishyam: An Image is Worth a Data Prefetcher. 51-61
- Weiwei Jia, Jiyuan Zhang, Jianchen Shan, Yiming Du, Xiaoning Ding, Tianyin Xu: HugeGPT: Storing Guest Page Tables on Host Huge Pages to Accelerate Address Translation. 62-73
- Hussein Elnawawy, James Tuck, Gregory T. Byrd: PreFlush: Lightweight Hardware Prediction Mechanism for Cache Line Flush and Writeback. 74-85
- Hyokeun Lee, Kwanseok Choi, Hyuk-Jae Lee, Jaewoong Sim: SDM: Sharing-Enabled Disaggregated Memory System with Cache Coherent Compute Express Link. 86-98
- Jinfan Chen, Juan Gómez-Luna, Izzat El Hajj, Yuxin Guo, Onur Mutlu: SimplePIM: A Software Framework for Productive and Efficient Processing-in-Memory. 99-111
- Donghyeon Kim, Taehoon Kim, Inyong Hwang, Taehyeong Park, Hanjun Kim, Youngsok Kim, Yongjun Park: Virtual PIM: Resource-Aware Dynamic DPU Allocation and Workload Scheduling Framework for Multi-DPU PIM Architecture. 112-123
- Diya Joseph, Juan L. Aragón, Joan-Manuel Parcerisa, Antonio González: Boustrophedonic Frames: Quasi-Optimal L2 Caching for Textures in GPUs. 124-136
- Yue Jin, Chengying Huan, Heng Zhang, Yongchao Liu, Shuaiwen Leon Song, Rui Zhao, Yao Zhang, Changhua He, Wenguang Chen: G-Sparse: Compiler-Driven Acceleration for Generalized Sparse Computation for Graph Neural Networks on Modern GPUs. 137-149
- Giulia Gerometta, Alberto Zeni, Marco D. Santambrogio: TSUNAMI: A GPU Implementation of the WFA Algorithm. 150-161
- Mohammad Almasri, Yen-Hsiang Chang, Izzat El Hajj, Rakesh Nagi, Jinjun Xiong, Wen-mei W. Hwu: Parallelizing Maximal Clique Enumeration on GPUs. 162-175
- Jan van Lunteren: Accelerating Decision-Tree-Based Inference Through Adaptive Parallelization. 176-186
- Louis Narmour, Steven Derrien, Sanjay V. Rajopadhye: Automatic Algorithm-Based Fault Tolerance (AABFT) of Stencil Computations. 187-198
- Pablo Prieto, Pablo Abad Fidalgo, José-Ángel Gregorio, Valentin Puente: Performance Characterization of Popular DNN Models on Out-of-Order CPUs. 199-210
- Juelin Liu, Sandeep Polisetty, Hui Guan, Marco Serafini: GraphMini: Accelerating Graph Pattern Matching Using Auxiliary Graphs. 211-224
- Jiyoung An, Esmerald Aliaj, Sang-Woo Jun: Barad-dur: Near-Storage Accelerator for Training Large Graph Neural Networks. 225-237
- Yuan Li, Ahmed Louri, Avinash Karanth: A Silicon Photonic Multi-DNN Accelerator. 238-249
- Mahmut Taylan Kandemir, Gulsum Gudukbay Akbulut, Wonil Choi, Mustafa Karaköy: Architecture-Aware Currying. 250-264
- Zack McKevitt, Ashutosh Trivedi, Tamara Silbergleit Lehman: SpecCheck: A Tool for Systematic Identification of Vulnerable Transient Execution in gem5. 265-278
- Yaodong Sheng, Ahmed Hassan, Michael F. Spear: Separating Mechanism from Policy in STM. 279-296
- Hongwei Cui, Yujie Cui, Honglan Zhan, Shuhao Liang, Xianhua Liu, Chun Yang, Xu Cheng: MBAPIS: Multi-Level Behavior Analysis Guided Program Interval Selection for Microarchitecture Studies. 297-308
- Jae Seok Kwak, Myung Kuk Yoon, Ipoom Jeong, Seunghyun Jin, Won Woo Ro: INTERPRET: Inter-Warp Register Reuse for GPU Tensor Core. 309-319
- Tiago Santos, João Bispo, João M. P. Cardoso: A CPU-FPGA Holistic Source-To-Source Compilation Approach for Partitioning and Optimizing C/C++ Applications. 320-322
- Lucia Pons, Julio Sahuquillo, Timothy M. Jones: Dynamic Allocation of Processor Cores to Graph Applications on Commodity Servers. 323-324
- Bahareh Khabbazan, Marc Riera, Antonio González: QeiHaN: An Energy-Efficient DNN Accelerator that Leverages Log Quantization in NDP Architectures. 325-326
- Tayyeb Mahmood, Kashif Inayat, Jaeyong Chung: Quickloop: An Efficient, FPGA-Accelerated Exploration of Parameterized DNN Accelerators. 327-328
- Luís Miguel Sousa, João Bispo, Nuno Paulino: Retargeting Applications for Heterogeneous Systems with the Tribble Source-to-Source Framework. 329-331
- Raúl Taranco, José-María Arnau, Antonio González: SLIDEX: Sliding Window Extension for Image Processing. 332-334
- Gwangeun Byeon, Seungtae Lee, Seongwook Kim, Yongjun Kim, Prashant J. Nair, Seokin Hong: SparseFT: Sparsity-aware Fault Tolerance for Reliable CNN Inference on GPUs. 337-338