IPDPS 2022: Lyon, France - Workshops
- IEEE International Parallel and Distributed Processing Symposium, IPDPS Workshops 2022, Lyon, France, May 30 - June 3, 2022. IEEE 2022, ISBN 978-1-6654-9747-3
- Anne Benoit, Laurent Lefèvre:
Message from the 2022 General Co-Chairs. xxviii-xxix
- Robin Abrahamse, Ákos Hadnagy, Zaid Al-Ars:
Memory-Disaggregated In-Memory Object Store Framework for Big Data Applications. 1-7
- Pascal Costanza, Ibrahim Hur, Timothy G. Mattson:
Towards a GraphBLAS Implementation for Go. 1-4
- Junqi Yin, Feiyi Wang, Mallikarjun Shankar:
Strategies for Integrating Deep Learning Surrogate Models with HPC Simulation Applications. 1-10
- Natsuki Hamada, Kazuhiro Saito, Hideyuki Kawashima:
Practical Effectiveness of Quantum Annealing for Shift Scheduling Problem. 1-4
- Stephanie Brink:
AI for Datacenter Optimization (ADOPT'22). 1
- Laurent White:
HCW 2022 Keynote Speaker: Heterogeneous Computing for Scientific Machine Learning. 5
- Shashank Adavally, Alex Weaver, Pranathi Vasireddy, Krishna Kavi, Gayatri Mehta, Nagendra Gulur:
Heterogeneous Architecture for Sparse Data Processing. 6-15
- Enrico Russo, Maurizio Palesi, Davide Patti, Habiba Lahdhiri, Salvatore Monteleone, Giuseppe Ascia, Vincenzo Catania:
Combined Application of Approximate Computing Techniques in DNN Hardware Accelerators. 16-23
- Chen-Chun Chen, Kawthar Shafie Khorassani, Quentin G. Anthony, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
Highly Efficient Alltoall and Alltoallv Communication Algorithms for GPU Systems. 24-33
- Ravi Reddy Manumachu, Alexey L. Lastovetsky:
On Energy Nonproportionality of CPUs and GPUs. 34-44
- Johannes Moe, Konstantin Pogorelov, Daniel Thilo Schroeder, Johannes Langguth:
Implementating Spatio-Temporal Graph Convolutional Networks on Graphcore IPUs. 45-54
- Giorgos Vasiliadis, Rafail Tsirbas, Sotiris Ioannidis:
The Best of Many Worlds: Scheduling Machine Learning Inference on CPU-GPU Integrated Architectures. 55-64
- Jürgen Becker, Lana Josipovic, Viktor K. Prasanna, Marco D. Santambrogio, Ramachandran Vaidyanathan:
29th Reconfigurable Architectures Workshop (RAW 2022). 65-66
- Gustavo Alonso:
RAW 2022 Keynote Speaker 1: Using FPGAs in datacenters and the cloud. 67
- Gustavo Alonso:
RAW 2022 Keynote Speaker 1: Using FPGAs in datacenters and the cloud. 68
- Daniele Paletti, Francesco Peverelli, Davide Conficconi:
Online Learning RTL Synthesis for Automated Design Space Exploration. 69-76
- Dana Diaconu, Lucian Petrica, Michaela Blott, Miriam Leeser:
Machine Learning Aided Hardware Resource Estimation for FPGA DNN Implementations. 77-83
- Lester Kalms, Tim Haering, Diana Goehringer:
DECISION: Distributing OpenVX Applications on CPUs, GPUs and FPGAs using OpenCL. 84-91
- Jonas Ney, Bilal Hammoud, Norbert Wehn:
A Hybrid Approach combining ANN-based and Conventional Demapping in Communication for Efficient FPGA-Implementation. 92-95
- Pascal Jungblut, Dieter Kranzlmüller:
Optimal Schedules for High-Level Programming Environments on FPGAs with Constraint Programming. 96-99
- Seung-Hun Chung, Tarek S. Abdelrahman:
Optimization of Compiler-Generated OpenCL CNN Kernels and Runtime for FPGAs. 100-103
- Raffaele Berzoini, Eleonora D'Arnese, Davide Conficconi:
On How to Push Efficient Medical Semantic Segmentation to the Edge: the SENECA approach. 104-111
- Lukas Weber, Johannes Wirth, Lukas Sommer, Andreas Koch:
Exploiting High-Bandwidth Memory for FPGA-Acceleration of Inference on Sum-Product Networks. 112-119
- Lennart Clausing, Marco Platzner:
ReconOS64: A Hardware Operating System for Modern Platform FPGAs with 64-Bit Support. 120-127
- Tze Hon Tan, Chia Yee Ooi, Muhammad N. Marsono:
An FPGA-based IP Core Subscription-Oriented Fog Computing Platform. 128-131
- Mingyuan Yang, Yemeng Zhang, Bohan Yang, Hanning Wang, Shouyi Yin, Shaojun Wei, Leibo Liu:
A SHA-512 Hardware Implementation Based on Block RAM Storage Structure. 132-135
- Beatrice Branchini, Sofia Breschi, Alberto Zeni, Marco D. Santambrogio:
Fast Genome Analysis Leveraging Exact String Matching. 136-139
- Christina Boucher:
Building scalable indexes that can be efficiently queried. 142
- Yatish Turakhia:
HiCOMB 2022 Invited Speaker: Pandemic-scale Phylogenetics. 143
- Yiqing Yan, Nimisha Chaturvedi, Raja Appuswamy:
Optimizing the Accuracy of Randomized Embedding for Sequence Alignment. 144-151
- Mario João Jr., Alexandre da Costa Sena, Vinod E. F. Rebello:
On Using Consistency Consistently in Multiple Sequence Alignments. 152-161
- Joël Lindegger, Damla Senol Cali, Mohammed Alser, Juan Gómez-Luna, Onur Mutlu:
Algorithmic Improvement and GPU Acceleration of the GenASM Algorithm. 162
- Safaa Diab, Amir Nassereldine, Mohammed Alser, Juan Gómez-Luna, Onur Mutlu, Izzat El Hajj:
High-throughput Pairwise Alignment with the Wavefront Algorithm using Processing-in-Memory. 163
- Haris Smajlovic, Ariya Shajii, Bonnie Berger, Hyunghoon Cho, Ibrahim Numanagic:
Sequre: a high-performance framework for rapid development of secure bioinformatics pipelines. 164-165
- Alvin Chon, Pawel Górecki, Oliver Eulenstein, Xiaoqiu Huang, Ali Jannesari:
Scalable and Extensible Robinson-Foulds for Comparative Phylogenetics. 166-175
- Narendra Chaudhary, Sanchit Misra, Dhiraj D. Kalamkar, Alexander Heinecke, Evangelos Georganas, Barukh Ziv, Menachem Adelman, Bharat Kaul:
Accelerating Deep Learning based Identification of Chromatin Accessibility from noisy ATAC-seq Data. 176-185
- Anoop Kumar, Vibha Balaji, M. A. Chandrashekar, Ambedkar Dukkipati, Sathish Vadhiyar:
Graph Convolutional Neural Networks for Alzheimer's Classification with Transfer Learning and HPC Methods. 186-195
- Reinout Corts, Niek Sterenborg, Nikolaos Alachiotis:
Accelerated LD-based selective sweep detection using GPUs and FPGAs. 196-205
- Mu Gao, Mark Coletti, Russell B. Davidson, Ryan Prout, Subil Abraham, Benjamín Hernández, Ada Sedova:
Proteome-scale Deployment of Protein Structure Prediction Workflows on the Summit Supercomputer. 206-215
- Pelin Icer Baykal, Niko Beerenwinkel, Serghei Mangul:
Reproducibility of Bioinformatics Tools. 216
- Varuni Sarwal, Serghei Mangul, David Koslicki:
TAMPA: interpretable analysis and visualization of metagenomics-based taxon abundance profiles. 217
- Tim Mattson:
GrAPL 2022 Keynote Speaker: GraphBLAS Beyond Simple Graphs. 220
- Ilya V. Afanasyev, Kazuhiko Komatsu, Dmitry I. Lichmanov, Vadim V. Voevodin, Hiroaki Kobayashi:
High-Performance GraphBLAS Backend Prototype for NEC SX-Aurora TSUBASA. 221-229
- Aristeidis Mastoras, Sotiris Anagnostidis, Albert-Jan Nicholas Yzelman:
Nonblocking execution in GraphBLAS. 230-233
- Benjamin Brock, Scott McMillan, Aydin Buluç, Timothy G. Mattson, José E. Moreira:
GraphBLAS: C++ Iterators for Sparse Matrices. 238-246
- Jeremy Kepner, Michael Jones, Daniel Andersen, Aydin Buluç, Chansup Byun, kc claffy, Timothy Davis, William Arcand, Jonathan Bernays, David Bestor, William Bergeron, Vijay Gadepally, Daniel Grant, Micheal Houle, Matthew Hubbell, Hayden Jananthan, Anna Klein, Chad R. Meiners, Lauren Milechin, Andrew Morris, Julie Mullen, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Doug Stetson, Charles Yee, Peter Michaleas:
Temporal Correlation of Internet Observatories and Outposts. 247-254
- Eugenio Angriman, Fabian Brandt-Tumescheit, Leon Franke, Alexander van der Grinten, Henning Meyerhenke:
Interactive Visualization of Protein RINs using NetworKit in the Cloud. 255-264
- Somesh Singh, Bora Uçar:
An Efficient Parallel Implementation of a Perfect Hashing Method for Hypergraphs. 265-274
- Xu T. Liu, Jesun Firoz, Assefaw H. Gebremedhin, Andrew Lumsdaine:
NWHy: A Framework for Hypergraph Analytics: Representations, Data structures, and Algorithms. 275-284
- Md Taufique Hussain, Guttu Sai Abhishek, Aydin Buluç, Ariful Azad:
Parallel Algorithms for Adding a Collection of Sparse Matrices. 285-294
- Le Chen, Quazi Ishtiaque Mahmud, Ali Jannesari:
Multi-View Learning for Parallelism Discovery of Sequential Programs. 295-303
- Jay A. Acosta, Tze Meng Low, Devangi N. Parikh:
Families of Butterfly Counting Algorithms for Bipartite Graphs. 304-313
- Muhammad Osama, Serban D. Porumbescu, John D. Owens:
Essentials of Parallel Graph Analytics. 314-317
- Rajendra K. Raj:
"Crosscutting Themes in Computer Science: Where Does PDC Education Fit?". 320
- Tia Newhall, Kevin C. Webb, Vasanta Chaganti, Andrew Danner:
Introducing Parallel Computing in a Second CS Course. 321-329
- Jérémy Fix, Stéphane Vialle, Rémi Hellequin, Claudine Mercier, Patrick P. Mercier, Jean-Baptiste Tavernier:
Feedback from a data center for education at CentraleSupélec engineering school. 330-337
- Joel Antonio Trejo-Sánchez, Francisco Javier Hernández-López, Miguel Ángel Uh Zapata, José Luis López-Martínez, Daniel Fajardo-Delgado, Julio Cesar Ramírez Pacheco:
Teaching High-Performance Computing in Developing Countries: A Case Study in Mexican Universities. 338-345
- Patrick Bell, Kae Suarez, Barbara Fossum, Dylan Chapp, Sanjukta Bhowmick, Michela Taufer:
A Research-Based Course Module to Study Non-determinism in High Performance Applications. 346-353
- Joel Fuentes, Daniel López, Sebastián González:
Teaching Heterogeneous Computing Using DPC++. 354-360
- H. Martin Bücker, Henri Casanova, Rafael Ferreira da Silva, Alice Lasserre, Derrick Luyen, Raymond Namyst, Johannes Schoder, Pierre-André Wacrenier, David P. Bunde:
Peachy Parallel Assignments (EduPar 2022). 361-368
- Lena Oden:
12th IEEE International Workshop on Accelerators and Hybrid Emerging Systems. 369-370
- Estela Suarez:
AsHES 2022 Keynote Speaker: The Modular Supercomputing Architecture (MSA). 371
- Alan Ayala, Stan Tomov, Miroslav Stoyanov, Azzam Haidar, Jack J. Dongarra:
Performance Analysis of Parallel FFT on Large Multi-GPU Systems. 372-381
- Tristan Laan, Ana Lucia Varbanescu:
Heterogeneous GPU and FPGA computing: a VexCL case-study. 382-390
- Alok Mishra, Smeet Chheda, Carlos Soto, Abid Muslim Malik, Meifeng Lin, Barbara M. Chapman:
COMPOFF: A Compiler Cost model using Machine Learning to predict the Cost of OpenMP Offloading. 391-400
- Raul Torres, Roger Ferrer, Xavier Teruel:
A Novel Set of Directives for Multi-device Programming with OpenMP. 401-410
- Thorsten Koch, Daniel Rehfeldt, Yuji Shinano:
APDCM 2022 Keynote Talk: Solving QUBOs on Digital and Quantum Computers. 413
- Daiki Okonogi, Satoru Jimbo, Kota Ando, Thiem Van Chu, Jaehoon Yu, Masato Motomura, Kazushi Kawamura:
APC-SCA: A Fully-Parallel Annealing Algorithm with Autonomous Pinning Effect Control. 414-420
- Ryota Yasudo, Koji Nakano, Yasuaki Ito, Yuya Kawamata, Ryota Katsuki, Shiro Ozaki, Takashi Yazane, Kenichiro Hamano:
Graph-theoretic Formulation of QUBO for Scalable Local Search on GPUs. 425-434
- Robert Basili, Wenyang Qian, Shuo Tang, Austin Castellino, Mary Eshaghian-Wilner, Ashfaq Khokhar, Glenn R. Luecke, James P. Vary:
Performance Evaluations of Noisy Approximate Quantum Fourier Arithmetic. 435-444
- Yoshiyuki Morie, Yasutaka Wada, Ryohei Kobayashi, Ryuichi Sakamoto:
Performance Evaluation of Data Transfer API for Rank Level Approximate Computing on HPC Systems. 445-448
- Shulei Xu, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
Arm meets Cloud: A Case Study of MPI Library Performance on AWS Arm-based HPC Cloud with Elastic Fabric Adapter. 449-456
- Osamu Ishimura, Yoshihide Yoshimoto:
Aspect-Oriented Programming based building block platform to construct Domain-Specific Language for HPC application. 457-466
- Sam White, Laxmikant V. Kalé:
Optimizing Non-commutative Allreduce Over Virtualized, Migratable MPI Ranks. 467-475
- Alexandre Denis, Emmanuel Jeannot, Philippe Swartvagher:
Modeling Memory Contention between Communications and Computations in Distributed HPC Systems. 476-485
- Nooshin Nokhanji, Paola Flocchini, Nicola Santoro:
Fully Dynamic Line Maintenance by Hybrid Programmable Matter. 486-495
- Zheming Jin, Jeffrey S. Vetter:
Integer Sum Reduction with OpenMP on an AMD MI100 GPU. 496-499
- Koji Nakano, Victor Poupet:
Optimal Triangulation on the High Bandwidth Memory Model. 500-507
- Kinan Al-Attar, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
Towards Java-based HPC using the MVAPICH2 Library: Early Experiences. 510-519
- Ioannis Vardas, Sascha Hunold, Jordy I. Ajanohoun, Jesper Larsson Träff:
mpisee: MPI Profiling for Communication and Communicator Structure. 520-529
- Simon Schwitanski, Felix Tomski, Joachim Protze, Christian Terboven, Matthias S. Müller:
An On-the-Fly Method to Exchange Vector Clocks in Distributed-Memory Programs. 530-540
- Tao Tao, David A. Plaisted:
Automatic Parallelization of Programs via Software Stream Rewriting. 541-551
- Charly Castes, Emmanuel Agullo, Olivier Aumage, Emmanuelle Saillard:
Decentralized in-order execution of a sequential task-based code for shared-memory architectures. 552-561
- Zheming Jin, Jeffrey S. Vetter:
Evaluating Unified Memory Performance in HIP. 562-568
- Jaemin Choi, David F. Richards, Laxmikant V. Kalé:
Improving Scalability with GPU-Aware Asynchronous Tasks. 569-578
- Shayan Manoochehri, Patrick Cristofaro, Dhrubajyoti Goswami:
A Customizable Lightweight STM for Irregular Algorithms on GPU. 579-587
- Tsung-Wei Huang, Yibo Lin:
Concurrent CPU-GPU Task Programming using Modern C++. 588-597
- Ang Li, Qiang Guan:
International Workshop on Quantum Classical Cooperative Computing (QCCC 2022). 598
- Nathan Wiebe:
QCCC 2022 Keynote Talk: Hybrid Quantum / Classical Algorithms for Machine Learning. 599
- Elisha Siddiqui Matekole, Yao-Lung L. Fang, Meifeng Lin:
Methods and Results for Quantum Optimal Pulse Control on Superconducting Qubit Systems. 600-606
- Avah Banerjee, Xin Liang, R. Tohid:
Locality-aware Qubit Routing for the Grid Architecture. 607-613
- Betis Baheri, Qiang Guan, Shuai Xu, Vipin Chaudhary:
SQCC: Smart Quantum Circuit Cutting. 614-615
- Samuel Alexander Stein, Nathan Wiebe, James A. Ang, Ang Li:
Improving Variational Quantum Algorithms performance through Weighted Quantum Ensembles. 616-617
- Samuel Alexander Stein, Nathan Wiebe, James A. Ang, Ang Li:
Benchmarking Quantum Processor Performance through Quantum Distance Metrics Over An Algorithm Suite. 618-624
- Artur Podobas, Kentaro Sano, Jason Anderson:
The First International Workshop on Coarse-Grained Reconfigurable Architectures for High-Performance Computing (CGRA4HPC). 625-626
- Raghu Prabhakar:
(CGRA4HPC) 2022 Invited Speaker: Pushing the Boundaries of HPC with the Integration of AI. 627
- Elliott Delaye:
CGRA4HPC 2022 Invited Speaker: Mapping ML to the AMD/Xilinx AIE-ML architecture. 628
- Martin Snelgrove:
CGRA4HPC 2022 Invited Speaker: Dual-scale reconfigurable arrays for ML Inference. 629
- Ilan Tayari:
CGRA4HPC 2022 Invited Speaker: Practical, scalable, and easy-to-use CGRA for HPC. 630
- Takuya Kojima, Boma A. Adhi, Carlos Cortes, Yiyu Tan, Kentaro Sano:
An Architecture-Independent CGRA Compiler enabling OpenMP Applications. 631-638
- Boma A. Adhi, Carlos Cortes, Yiyu Tan, Takuya Kojima, Artur Podobas, Kentaro Sano:
Exploration Framework for Synthesizable CGRAs Targeting HPC: Initial Design and Evaluation. 639-646
- Markus Weinhardt:
An Analysis of Mapping Polybench Kernels to HPC CGRAs. 647-654
- Omar Ragheb, Tianyi Yu, Rami Beidas, Jason Helge Anderson:
Elastic Multi-Context CGRAs. 655-662
- Sho Ko, Alexander Rucker, Yaqi Zhang, Paul Mure, Kunle Olukotun:
Accelerating SLIDE: Exploiting Sparsity on Accelerator Architectures. 663-670
- Hoai Luan Pham, Thi Hong Tran, Vu Trung Duong Le, Yasuhiko Nakashima:
A Coarse Grained Reconfigurable Architecture for SHA-2 Acceleration. 671-678
- Kevin J. M. Martin:
Twenty Years of Automated Methods for Mapping Applications on CGRA. 679-686
- Lidia Kidane, Paul Townend, Thijs Metsch, Erik Elmroth:
When and How to Retrain Machine Learning-based Cloud Management Systems. 688-698
- Sohei Koyama, Osamu Tatebe:
Scalable Data Parallel Distributed Training for Graph Neural Networks. 699-707
- Benny J. Tang, Qiqi Chen, Matthew L. Weiss, Nathan C. Frey, Joseph McDonald, David Bestor, Charles Yee, William Arcand, William Bergeron, Chansup Byun, Daniel Edelman, Michael Houle, Matthew Hubbell, Michael Jones, Jeremy Kepner, Anna Klein, Adam Michaleas, Peter Michaleas, Lauren Milechin, Julia S. Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Andrew Bowne, Lindsey McEvoy, Baolin Li, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi:
The MIT Supercloud Workload Classification Challenge. 708-714
- Dan Zhao, Nathan C. Frey, Vijay Gadepally, Siddharth Samsi:
Loss Curve Approximations for Fast Neural Architecture Ranking & Training Elasticity Estimation. 715-723
- Baolin Li, Vijay Gadepally, Siddharth Samsi, Devesh Tiwari:
Characterizing Multi-Instance GPU for Machine Learning Workloads. 724-731
- Nathan C. Frey, Dan Zhao, Simon Axelrod, Michael Jones, David Bestor, Vijay Gadepally, Rafael Gómez-Bombarelli, Siddharth Samsi:
Energy-aware neural architecture selection and hyperparameter optimization. 732-741
- Dan Zhao, Nathan C. Frey, Joseph McDonald, Matthew Hubbell, David Bestor, Michael Jones, Andrew Prout, Vijay Gadepally, Siddharth Samsi:
A Green(er) World for A.I. 742-750
- Georges Da Costa:
PDCO 2022 Keynote Talk: Performance and Energy models for modern HPC servers. 753
- Engelina L. Jenneskens, Rob H. Bisseling:
Exact k-way sparse matrix partitioning. 754-763