


default search action
ACM Transactions on Parallel Computing, Volume 11
Volume 11, Number 1, March 2024
- Anne Benoit
, Lucas Perotin
, Yves Robert
, Frédéric Vivien
:
Checkpointing Strategies to Tolerate Non-Memoryless Failures on HPC Platforms. 1:1-1:26 - Lucas Perotin
, Hongyang Sun
:
Improved Online Scheduling of Moldable Task Graphs under Common Speedup Models. 2:1-2:31 - Shengle Lin
, Wangdong Yang
, Yikun Hu
, Qinyun Cai
, Minlu Dai
, Haotian Wang
, Kenli Li
:
HPS Cholesky: Hierarchical Parallelized Supernodal Cholesky with Adaptive Parameters. 3:1-3:22 - Romolo Marotta
, Mauro Ianni
, Alessandro Pellegrini
, Francesco Quaglia
:
A Conflict-Resilient Lock-Free Linearizable Calendar Queue. 4:1-4:32 - Stefan K. Muller
, Jan Hoffmann
:
Modeling and Analyzing Evaluation Cost of CUDA Kernels. 5:1-5:53 - Qinyun Cai
, Guoqing Xiao
, Shengle Lin
, Wangdong Yang
, Keqin Li
, Kenli Li
:
ABSS: An Adaptive Batch-Stream Scheduling Module for Dynamic Task Parallelism on Chiplet-based Multi-Chip Systems. 6:1-6:24
Volume 11, Number 2, June 2024
- Qiang Fu, Yuede Ji, Thomas B. Rolinger, H. Howie Huang
:
TLPGNN: A Lightweight Two-level Parallelism Paradigm for Graph Neural Network Computation on Single and Multiple GPUs. 7 - Zixuan Li
, Yunchuan Qin
, Qi Xiao
, Wangdong Yang
, Kenli Li
:
cuFasterTucker: A Stochastic Optimization Strategy for Parallel Sparse FastTucker Decomposition on GPU Platform. 8 - Sébastien Darche
, Michel R. Dagenais
:
Low-Overhead Trace Collection and Profiling on GPU Compute Kernels. 9 - Ziyang Li
, Dongsheng Li
, Yingwen Chen
, Kai Chen
, Yiming Zhang
:
Decentralized Scheduling for Data-Parallel Tasks in the Cloud. 10 - Guoqing Xiao
, Tao Zhou
, Yuedan Chen
, Yikun Hu
, Kenli Li
:
Machine Learning-Based Kernel Selector for SpMV Optimization in Graph Analysis. 11 - Zixuan Li
, Yikun Hu
, Mengquan Li
, Wangdong Yang
, Kenli Li
:
cuFastTucker: A Novel Sparse FastTucker Decomposition For HHLST on Multi-GPUs. 12
Volume 11, Number 3, September 2024
- Yiqian Liu
, Noushin Azami
, Avery Vanausdal
, Martin Burtscher
:
Indigo3: A Parallel Graph Analytics Benchmark Suite for Exploring Implementation Styles and Common Bugs. 13:1-13:29 - Johan Bontes
, James Gain
:
Redzone stream compaction: removing k items from a list in parallel O(k) time. 14:1-14:16
Volume 11, Number 4, December 2024
- Cu Cui
:
Acceleration of Tensor-Product Operations with Tensor Cores. 15:1-15:24 - Wim H. Hesselink
, Peter A. Buhr
, Colby A. Parsons
:
First-Come-First-Served as a Separate Principle. 16:1-16:20 - Johannes Pahlke
, Ivo F. Sbalzarini
:
Proven Distributed Memory Parallelization of Particle Methods. 17:1-17:45 - Hermann Bogning Tepiele
, Vianney Kengne Tchendji
, Mathias Akong Onabid
, Jean Frédéric Myoupo
, Armel Nkonjoh Ngomade
:
Dominant Point-Based Sequential and Parallel Algorithms for the Multiple Sequential Substring Constrained-LCS Problem. 18:1-18:31

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.