


International Journal of High Performance Computing Applications, Volume 39
Volume 39, Number 1, 2025
- Mark Gates, Ahmad Abdelfattah, Kadir Akbudak, Mohammed A. Al Farhan, Rabab Alomairy, Daniel Bielich, Treece Burgess, Sébastien Cayrols, Neil Lindquist, Dalal Sukkari, Asim YarKhan:
  Evolution of the SLATE linear algebra library. 3-17
- Lisa Claus, Pieter Ghysels, Wajih Halim Boukaram, Xiaoye Sherry Li:
  A graphics processing unit accelerated sparse direct solver and preconditioner with block low rank compression. 18-31
- James P. Ahrens, Marco Arienti, Utkarsh Ayachit, Janine Bennett, Roba Binyahib, Ayan Biswas, Peer-Timo Bremer, Eric Brugger, Roxana Bujack, Hamish A. Carr, Jieyang Chen, Hank Childs, Soumya Dutta, Abdelilah Essiari, Berk Geveci, Cyrus Harrison, Subhashis Hazarika, Megan Hickman Fulp, Petar Hristov, Xuan Huang, Joseph A. Insley, Yuya Kawakami, Chloe Keilers, James Kress, Matthew Larsen, Dan Lipsa, Meghanto Majumder, Nicole Marsaglia, Victor A. Mateevitsi, Valerio Pascucci, John Patchett, Saumil Patel, Steve Petruzza, David Pugmire, Silvio Rizzi, David H. Rogers, Oliver Rübel, Jorge Sebastian Salinas, Sudhanshu Sane, Sergei Shudler, Alexandra Stewart, Karen C. Tsai, Terece L. Turton, Will Usher, Zhe Wang, Gunther H. Weber, Corey Wetterer-Nelson, Jonathan Woodring, Abhishek Yenpure:
  The ECP ALPINE project: In situ and post hoc visualization infrastructure and analysis capabilities for exascale. 32-51
- Logan T. Ward, J. Gregory Pauloski, Valérie Hayot-Sasson, Yadu N. Babuji, Alexander Brace, Ryan Chard, Kyle Chard, Rajeev Thakur, Ian T. Foster:
  Employing artificial intelligence to steer exascale workflows with colmena. 52-64
- M. Scot Breitenfeld, Houjun Tang, Huihuo Zheng, Jordan Henderson, Suren Byna:
  HDF5 in the exascale era: Delivering efficient and scalable parallel I/O for exascale applications. 65-78
- Xingfu Wu, John R. Tramm, Jeffrey Larson, John-Luke Navarro, Prasanna Balaprakash, Brice Videau, Michael Kruse, Paul D. Hovland, Valerie Taylor, Mary W. Hall:
  Integrating ytopt and libEnsemble to autotune OpenMC. 79-103
- Peter Lindstrom, Jeffrey Hittinger, James Diffenderfer, Alyson Fox, Daniel Osei-Kuffuor, Jeffrey W. Banks:
  ZFP: A compressed array representation for numerical computations. 104-122
- Cody J. Balos, Marcus Day, Lucas Esclapez, Anne M. Felden, David J. Gardner, Malik Hassanaly, Daniel R. Reynolds, Jon S. Rood, Jean M. Sexton, Nicholas T. Wimer, Carol S. Woodward:
  SUNDIALS time integrators for exascale applications with many independent systems of ordinary differential equations. 123-146
- Aurelien Bouteiller, Thomas Hérault, Qinglei Cao, Joseph Schuchart, George Bosilca:
  PaRSEC: Scalability, flexibility, and hybrid architecture support for task-based applications in ECP. 147-166
- Andrey Prokopenko, Daniel Arndt, Damien Lebrun-Grandié, Bruno Turcksin, Nicholas Frontiere, J. D. Emberson, Michael Buehlmann:
  Advances in ArborX to support exascale applications. 167-176
- Stephen Hudson, Jeffrey Larson, John-Luke Navarro, Stefan M. Wild:
  Portable, heterogeneous ensemble workflows at scale using libEnsemble. 177-192
- Roxana Bujack, Maya B. Gokhale, Latchesar Ionkov, Keita Iwabuchi, Michael R. Jantz, Terry R. Jones, Sumathi Lakshmiranganatha, Michael K. Lang, Jason Lee, Matthew Ben Olson, Scott Pakin, Roger Pearce, Jonathan Pietarila Graham, Li Tang, Terece L. Turton, Sean Williams:
  The ECP SICM project: Managing complex memory hierarchies for exascale applications. 193-207
Volume 39, Number 2, 2025
- Lukas Spies, Luke N. Olson, Scott P. MacLachlan:
  Exploiting mesh structure to improve multigrid performance for saddle-point problems. 211-229
- Christie L. Alappat, Jonas Thies, Georg Hager, Holger Fehske, Gerhard Wellein:
  Algebraic temporal blocking for sparse iterative solvers on multi-core CPUs. 230-250
- Heike Jagode, Anthony Danalis, Giuseppe Congiu, Daniel Barry, Anthony Castaldo, Jack J. Dongarra:
  Advancements of PAPI for the exascale generation. 251-268
- Ivy Peng, Jacob Wahlgren, Karim Youssef, Keita Iwabuchi, Roger Pearce, Maya B. Gokhale:
  UMap: An application-oriented user level memory mapping library. 269-282
- Yanfei Guo, Ken Raffenetti, Hui Zhou, Pavan Balaji, Min Si, Abdelhalim Amer, Shintaro Iwasaki, Sangmin Seo, Giuseppe Congiu, Robert Latham, Lena Oden, Thomas Gillis, Rohit Zambre, Kaiming Ouyang, Charles Archer, Wesley Bland, Jithin Jose, Sayantan Sur, Hajime Fujita, Dmitry Durnov, Michael Chuvelev, Gengbin Zheng, Alex Brooks, Sagar Thapaliya, Taru Doodi, Maria Garazan, Steve Oyanagi, Marc Snir, Rajeev Thakur:
  Preparing MPICH for exascale. 283-305
- Richard Tran Mills, Mark F. Adams, Satish Balay, Jed Brown, Jacob Faibussowitsch, Toby Isaac, Matthew G. Knepley, Todd S. Munson, Hansol Suh, Stefano Zampini, Hong Zhang, Junchao Zhang:
  PETSc/TAO developments for GPU-based early exascale systems. 306-325
Volume 39, Number 3, 2025
- Martin Karp, Estela Suarez, Jan H. Meinke, Måns I. Andersson, Philipp Schlatter, Stefano Markidis, Niclas Jansson:
  Experience and analysis of scalable high-fidelity computational fluid dynamics on modular supercomputing architectures. 329-344
- Samuel Kemmler, Christoph Rettinger, Ulrich Rüde, Pablo Cuéllar, Harald Köstler:
  Efficiency and scalability of fully-resolved fluid-particle simulations on heterogeneous CPU-GPU architectures. 345-363
- Steven Dargaville, Richard P. Smedley-Stevenson, Paul N. Smith, Christopher C. Pain:
  Coarsening and parallelism with reduction multigrids for hyperbolic Boltzmann transport. 364-384
- Dane C. Lacey, Christie L. Alappat, Florian Lange, Georg Hager, Holger Fehske, Gerhard Wellein:
  Cache blocking of distributed-memory parallel matrix power kernels. 385-404
- Junsheng Zhou, Wangdong Yang, Fengkun Dong, Shengle Lin, Qinyun Cai, Kenli Li:
  NUMA-aware parallel sparse LU factorization for SPICE-based circuit simulators on ARM multi-core processors. 405-423
- Tianshi Xu, Ruipeng Li, Daniel Osei-Kuffuor:
  A two-level GPU-accelerated incomplete LU preconditioner for general sparse linear systems. 424-442
- Melven Röhrig-Zöllner, Manuel Joey Becklas, Jonas Thies, Achim Basermann:
  Performance of linear solvers in tensor-train format on current multicore architectures. 443-461
- Yuki Uchino, Katsuhisa Ozaki, Toshiyuki Imamura:
  Performance enhancement of the Ozaki Scheme on integer matrix multiplication unit. 462-476
- Corrigendum to large-scale direct numerical simulations of turbulence using GPUs and modern Fortran. 477

Volume 39, Number 4, 2025
- Gijs van den Oord, Victor Azizi, Mohamad Fathi, Stefan Hickel:
  Dynamic multi-level load balancing for scalable simulations of reacting multiphase flows. 519-531
- Luc Briand, Hervé Jourdren, Marc Pérache:
  Julia versus C++ Kokkos for performance portable Cartesian CFD solvers on heterogeneous architectures. 481-501
- Alessia Vignolo, Taylor J. Baird, Filippo Spiga, Claudia Canevari, Alessandro Coretti, Rodolphe Vuilleumier, Andrea Cavalli, Sara Bonella, Sergio Decherchi:
  A tale of two codes: CUDA vs OpenACC for mass-zero constrained dynamics. 502-518
- Kevin A. Huck, Sameer Shende, Allen D. Malony, Camille Coti, Wyatt Spear, Jordi Alcaraz, Dewi Yokelson, Srinivasan Ramesh, Mohammad Alaul Haque Monil, Chad Wood, Nicholas Chaimov, Cameron Durbin, Alister Johnson, Jacob Lambert, Izaak Beekman:
  Preparing the TAU performance system for exascale and beyond. 532-552
- Christopher Kelly, Wei Xu, Line C. Pouchard, Hubertus Van Dam, Tanzima Z. Islam, Shinjae Yoo, Kerstin Kleese van Dam:
  Performance analysis and data reduction for exascale scientific workflows. 553-578
- Aymen Alsaadi, Mihael Hategan-Marandiuc, Ketan Maheshwari, André Merzky, Mikhail Titov, Matteo Turilli, Andreas Wilke, Justin M. Wozniak, Kyle Chard, Rafael Ferreira da Silva, Shantenu Jha, Daniel E. Laney:
  Exascale workflow applications and middleware: An ExaWorks retrospective. 579-593
- Greg Eisenhauer, Norbert Podhorszki, Ana Gainaru, Scott Klasky, Junmin Gu, Vicente Bolea, Liz Dulac, Dmitry Ganyushin, William F. Godoy, Qing Liu, Caitlin Ross, Lipeng Wan, Scott Wittenburg, Kesheng Wu:
  HPC I/O innovations in the exascale era. 594-612
Volume 39, Number 5, 2025
- Pratik Nayak, Isha Aggarwal, Hartwig Anzt:
  Efficient solution of batched band linear systems on GPUs. 615-630
- Adrián Pérez Diéguez, Seth Ockerman, Tristan Aikman, Younghyun Cho, Yang Liu, Khaled Z. Ibrahim:
  Parallelizing autotuning for HPC applications: Unveiling the potential of the speculation strategy in Bayesian optimization. 631-654
- Stéphane Louise, Andrea Ajmar, Alexey Androsov, Lorenza Bovio, Franca Disabato, Tara Evaz Zadeh, Rubén Jesús García Hernández, Thierry Goubier, Sven Harig, Cedric Koch-Hofer, Jan Krenek, Marius Kriegerowski, Erwan Lenormand, Jan Martinovic, Tomás Martinovic, Francesca Perez, Karsten Prehn, Natalja Rakowsky, Danijel Schorlemmer:
  Modeling and implementing an earthquake and tsunami event-triggered, time-constrained impact assessment workflow. 655-677
- Arash Ghorbannia, Cyrus Tanade, Ayman Yousef, Nusrat Sadia Khan, Madhurima Vardhan, Jocelyn T. Chi, Sayan Roychowdhury, Arpita Das, Jane A. Leopold, Eric C. Chi, Amanda Randles:
  Simulation-based machine learning for real-time assessment of side-branch hemodynamics in coronary bifurcation lesions. 678-691
- Prasanna Balaprakash, Krishnan Raghavan, Franck Cappello, Ewa Deelman, Anirban Mandal, Hongwei Jin, Imtiaz Mahmud, Komal Thareja, Shixun Wu, Pawel Zuk, Mariam Kiran, Zizhong Chen, Sheng Di, Kesheng Wu:
  SWARM: Reimagining scientific workflow management systems in a distributed world. 692-712
- Sally Ellingson, Guillaume Pallez:
  Result-scalability: Following the evolution of selected social impact of HPC. 713-721
- Tony Hey:
  Feynman and computation: From Los Alamos to quantum computers. 722-735
Volume 39, Number 6, 2025
- Nicolas Nytko, Andrew Reisner, J. David Moulton, Luke N. Olson, Matthew West:
  Teaching an old dog new tricks: Porting legacy code to heterogeneous compute architectures with automated code translation. 739-749
- Jan Solanti, Michal Babej, Julius Ikkala, Pekka Jääskeläinen:
  PoCL-R: An open standard based heterogeneous offloading layer with server side scalability. 750-769
- Adam Bertsch, Michael R. Collette, Shawn A. Dawson, Simon D. Hammond, Ian Karlin, Michael Scott McKinley, Kevin T. Pedretti, Robert N. Rieben, Brian S. Ryujin, Arturo Vargas, Kenneth Weiss:
  Understanding power and energy utilization in large scale production physics simulation codes. 770-783
- Thiago Maltempi, Sandro Rigo, Márcio Machado Pereira, Hervé Yviquel, Gustavo Leite, Orlando Lee, Jessé Costa, Guido Araujo:
  Checkpointing fine-tuning for accelerating seismic applications in GPUs. 784-802
- Héctor Martínez, Sandra Catalán, Adrián Castelló, Enrique S. Quintana-Ortí:
  Characterization of quantized inference with transformer encoders on low power CPUs. 803-821















