International Journal of High Performance Computing Applications, Volume 33
Volume 33, Number 1, 2019
- Hugues Digonnet, Thierry Coupez, Patrice Laure, Luisa Silva:
Massively parallel anisotropic mesh adaptation.
- John Loffeld, Jeffrey Hittinger:
On the arithmetic intensity of high-order finite-volume discretizations for hyperbolic systems of conservation laws.
- Franz Pichler, Gundolf Haase:
Finite element method completely implemented for graphic processor units using parallel algorithm libraries.
- Muhammed Nufail Farooqi, Daulet Izbassarov, Metin Muradoglu, Didem Unat:
Communication analysis and optimization of 3D front tracking method for multiphase flow simulations.
- Daniel S. Abdi, Lucas C. Wilcox, Timothy C. Warburton, Francis X. Giraldo:
A GPU-accelerated continuous and discontinuous Galerkin non-hydrostatic atmospheric model.
- Masahiro Nakao, Hitoshi Murai, Hidetoshi Iwashita, Taisuke Boku, Mitsuhisa Sato:
Implementation and evaluation of the HPC challenge benchmark in the XcalableMP PGAS language.
- Enrico Calore, Alessandro Gabbana, Sebastiano Fabio Schifano, Raffaele Tripiccione:
Optimization of lattice Boltzmann simulations on heterogeneous computers.
- Jan Hückelheim, Paul D. Hovland, Michelle Mills Strout, Jens-Dominik Müller:
Reverse-mode algorithmic differentiation of an OpenMP-parallel compressible flow solver.
- Tadashi Yamazaki, Jun Igarashi, Junichiro Makino, Toshikazu Ebisuzaki:
Real-time simulation of a cat-scale artificial cerebellum on PEZY-SC processors.
- Bei Wang, Stéphane Ethier, William M. Tang, Khaled Z. Ibrahim, Kamesh Madduri, Samuel Williams, Leonid Oliker:
Modern gyrokinetic particle-in-cell simulation of fusion plasmas on top supercomputers.
- Linda Stals:
Algorithm-based fault recovery of adaptively refined parallel multilevel grids.
- Vladimir A. Mironov, Alexander A. Moskovsky, Michael D'Mello, Yuri Alexeev:
An efficient MPI/OpenMP parallelization of the Hartree-Fock-Roothaan method for the first generation of Intel® Xeon Phi™ processor architecture.
Volume 33, Number 2, March 2019
- Carlos Teijeiro, Thomas Hammerschmidt, Ralf Drautz, Godehard Sutmann:
Optimized parallel simulations of analytic bond-order potentials on hybrid shared/distributed memory with MPI and OpenMP.
- Daniel S. Abdi, Francis X. Giraldo, Emil M. Constantinescu, Lester E. Carr, Lucas C. Wilcox, Timothy C. Warburton:
Acceleration of the IMplicit-EXplicit nonhydrostatic unified model of the atmosphere on manycore processors.
- Katherine J. Evans, Richard K. Archibald, David J. Gardner, Matthew R. Norman, Mark A. Taylor, Carol S. Woodward, Patrick H. Worley:
Performance analysis of fully explicit and fully implicit solvers within a spectral element shallow-water atmosphere model.
- Dingwen Tao, Sheng Di, Hanqi Guo, Zizhong Chen, Franck Cappello:
Z-checker: A framework for assessing lossy compression of scientific data.
- Hasan Metin Aktulga, Chris Knight, Paul K. Coffman, Kurt A. O'Hearn, Tzu-Ray Shan, Wei Jiang:
Optimizing the performance of reactive molecular dynamics simulations for many-core architectures.
- Anshu Dubey, Petros Tzeferacos, Don Q. Lamb:
The dividends of investing in computational software design: A case study.
- Irina Demeshko, Jerry Watkins, Irina K. Tezaur, Oksana Guba, William F. Spotz, Andrew G. Salinger, Roger P. Pawlowski, Michael A. Heroux:
Toward performance portability of the Albany finite element analysis code using the Kokkos library.
- Elmar Peise, Paolo Bientinesi:
The ELAPS framework: Experimental Linear Algebra Performance Studies.
- Marc Casas, Wilfried N. Gansterer, Elias Wimmer:
Resilient gossip-inspired all-reduce algorithms for high-performance computing: Potential, limitations, and open questions.
- Pietro Cicotti, Manu Shantharam, Laura Carrington:
Reducing communication in parallel graph search algorithms with software caches.
- Jon C. Calhoun, Franck Cappello, Luke N. Olson, Marc Snir, William D. Gropp:
Exploring the feasibility of lossy compression for PDE simulations.
- Andreas Müller, Michal A. Kopera, Simone Marras, Lucas C. Wilcox, Tobin Isaac, Francis X. Giraldo:
Strong scaling for numerical weather prediction at petascale with the atmospheric model NUMA.
Volume 33, Number 3, May 2019
- Gabriele Mencagli, Felipe M. G. França, Cristiana Bentes Seidel, Leandro A. J. Marzulo, Maurício Lima Pilla:
Special issue on parallel applications for in-situ computing on the next-generation computing platforms.
- Joao Vicente Ferreira Lima, Issam Raïs, Laurent Lefèvre, Thierry Gautier:
Performance and energy analysis of OpenMP runtime systems with dense linear algebra algorithms.
- Jucele Franca de Alencar Vasconcellos, Edson Norberto Cáceres, Henrique Mongelli, Siang Wun Song, Frank Dehne, Jayme Luiz Szwarcfiter:
New BSP/CGM algorithms for spanning trees.
- Anderson Braga de Avila, Renata Hax Sander Reiser, Maurício Lima Pilla, Adenauer Corrêa Yamin:
Improving in situ GPU simulation of quantum computing in the D-GM environment.
- Matheus S. Serpa, Eduardo H. M. Cruz, Matthias Diener, Arthur M. Krause, Philippe O. A. Navaux, Jairo Panetta, Albert Farrés, Claudia Rosas, Mauricio Hanzich:
Optimization strategies for geophysics models on manycore systems.
- Roman Wyrzykowski, Ewa Deelman:
Guest editor's note: Special issue on challenges and solutions for porting applications to emerging high performance computing systems.
- Adrian Klusek, Marcin Los, Maciej Paszynski, Witold Dzwinel:
Efficient model of tumor dynamics simulated in multi-GPU environment.
- Vladimir V. Stegailov, Ekaterina Dlinnova, Timur Z. Ismagilov, Mikhail Khalilov, Nikolay Kondratyuk, Dmitry Makagon, Alexander Semenov, Alexei Simonov, Grigory S. Smirnov, Alexey Timofeev:
Angara interconnect makes GPU-based Desmos supercomputer an efficient tool for molecular dynamics calculations.
- Daniel Langr, Tomás Dytrych, Kristina D. Launey, Jerry P. Draayer:
Accelerating many-nucleon basis generation for high performance computing enabled ab initio nuclear structure studies.
- Lukasz Szustak, Pawel Bratek:
Performance portable parallel programming of heterogeneous stencils across shared-memory platforms with modern Intel processors.
- Christian Simmendinger, Roman Iakymchuk, Luis Cebamanos, Dana Akhmetova, Valeria Bartsch, Tiberiu Rotaru, Mirko Rahn, Erwin Laure, Stefano Markidis:
Interoperability strategies for GASPI and MPI in large-scale scientific applications.
Volume 33, Number 4, July 2019
- Nils Kohl, Johannes Hötzer, Florian Schornbaum, Martin Bauer, Christian Godenschwager, Harald Köstler, Britta Nestler, Ulrich Rüde:
A scalable and extensible checkpointing scheme for massively parallel simulations.
- Vaibhav Sundriyal, Kristopher Keipert, Masha Sosonkina, Mark S. Gordon:
Effect of frequency scaling granularity on energy-saving strategies.
- Karl-Robert Wichmann, Martin Kronbichler, Rainald Löhner, Wolfgang A. Wall:
Practical applicability of optimizations and performance models to complex stencil-based loop kernels in CFD.
- Samuel Elliott, Raghu Raj Prasanna Kumar, Natasha Flyer, Tuan Ta, Richard Loft:
Implementation of a scalable, performance portable shallow water equation solver using radial basis function-generated finite difference methods.
- Hector Emilio Barrios Molano, Kamy Sepehrnoori:
Development of a framework for parallel reservoir simulation.
- Domenico Rea, Giansimone Perrino, Diego di Bernardo, Livia Marcellino, Diego Romano:
A GPU algorithm for tracking yeast cells in phase-contrast microscopy images.
- Lubomir Riha, Michal Merta, Radim Vavrík, Tomás Brzobohatý, Alexandros Markopoulos, Ondrej Meca, Ondrej Vysocky, Tomás Kozubek, Vít Vondrák:
A massively parallel and memory-efficient FEM toolbox with a hybrid total FETI solver with accelerator support.
- Niclas Jansson, Rahul Bale, Keiji Onishi, Makoto Tsubokura:
CUBE: A scalable framework for large-scale industrial simulations.
- Thomas Heller, Bryce Adelstein-Lelbach, Kevin A. Huck, John Biddiscombe, Patricia Grubel, Alice E. Koniges, Matthias Kretz, Dominic Marcello, David Pfander, Adrian Serio, Frank J. Seinstra, Geoffrey C. Clayton, Dirk Pflüger, David Eder, Hartmut Kaiser:
Harnessing billions of tasks for a scalable portable hydrodynamic simulation of the merger of two stars.
- Andrea Borghesi, Andrea Bartolini, Michela Milano, Luca Benini:
Pricing schemes for energy-efficient HPC systems: Design and exploration.
- Kasia Swirydowicz, Noel Chalmers, Ali Karakus, Tim Warburton:
Acceleration of tensor-product operations for high-order finite element methods.
Volume 33, Number 5, September 2019
- Michael Mascagni:
CRE2017 Special Issue Introduction IJHPCA.
- Line Pouchard, Sterling Baldwin, Todd Elsethagen, Shantenu Jha, Bibi Raju, Eric G. Stephan, Li Tang, Kerstin Kleese van Dam:
Computational reproducibility of scientific workflows at extreme scales.
- Kento Sato, Ignacio Laguna, Gregory L. Lee, Martin Schulz, Christopher M. Chambreau, Simone Atzeni, Michael Bentley, Ganesh Gopalakrishnan, Zvonimir Rakamaric, Geof Sawaya, Joachim Protze, Dong H. Ahn:
Pruners.
- Salil Mahajan, Katherine J. Evans, Joseph H. Kennedy, Min Xu, Matthew R. Norman, Marcia L. Branstetter:
Ongoing solution reproducibility of earth system models as they progress toward exascale computing.
- Roman Iakymchuk, Stef Graillat, David Defour, Enrique S. Quintana-Ortí:
Hierarchical approach for deriving a reproducible unblocked LU factorization.
- Sergio Iserte, Héctor Martínez, Sergio Barrachina, Maribel Castillo, Rafael Mayo, Antonio J. Peña:
Dynamic reconfiguration of noniterative scientific applications: A case study with HPG aligner.
- Markus Huber, Ulrich Rüde, Barbara I. Wohlmuth:
Adaptive control in roll-forward recovery for extreme scale multigrid.
- Nikola Tchipev, Steffen Seckler, Matthias Heinen, Jadran Vrabec, Fabio Alexander Gratl, Martin Horsch, Martin Bernreuther, Colin W. Glass, Christoph Niethammer, Nicolay Hammer, Bernd Krischok, Michael Resch, Dieter Kranzlmüller, Hans Hasse, Hans-Joachim Bungartz, Philipp Neumann:
TweTriS: Twenty trillion-atom simulation.
- Stefan Lemvig Glimberg, Allan Peter Engsig-Karup, Luke N. Olson:
A massively scalable distributed multigrid framework for nonlinear marine hydrodynamics.
- Masahiro Nakao, Tetsuya Odajima, Hitoshi Murai, Akihiro Tabuchi, Norihisa Fujita, Toshihiro Hanawa, Taisuke Boku, Mitsuhisa Sato:
Evaluation of XcalableACC with tightly coupled accelerators/InfiniBand hybrid communication on accelerated cluster.
- Milos R. Ivanovic, Ana M. Kaplarevic-Malisic, Boban S. Stojanovic, Marina R. Svicevic, Srboljub M. Mijailovich:
Machine learned domain decomposition scheme applied to parallel multi-scale muscle simulation.
- Andrew C. Kirby, Michael J. Brazell, Zhi Yang, Rajib Roy, Behzad R. Ahrabi, Michael K. Stoellinger, Jayanarayanan Sitaraman, Dimitri J. Mavriplis:
Wind farm simulations using an overset hp-adaptive approach with blade-resolved turbine models.
- Katharina Kormann, Klaus Reuter, Markus Rampp:
A massively parallel semi-Lagrangian solver for the six-dimensional Vlasov-Poisson equation.
- David Strelák, Carlos Oscar Sánchez Sorzano, José María Carazo, Jiri Filipovic:
A GPU acceleration of 3-D Fourier reconstruction in cryo-EM.
- Pierre Fortin, Maxime Touche:
Dual tree traversal on integrated GPUs for astrophysical N-body simulations.
- Dominic Etienne Charrier, Benjamin Hazelwood, Ekaterina O. Tutlyaeva, Michael Bader, Michael Dumbser, Andrey Kudryavtsev, Alexander A. Moskovsky, Tobias Weinzierl:
Studies on the energy and deep memory behaviour of a cache-oblivious, task-based hyperbolic PDE solver.
- Jorge Ejarque, Marc Domínguez, Rosa M. Badia:
A hierarchic task-based programming model for distributed heterogeneous computing.
- Ibrahim Al-Kharusi, David W. Walker:
Locality properties of 3D data orderings with application to parallel molecular dynamics simulations.
- Mohammad Y. Al-Shorman, Majd M. Al-Kofahi:
Ultrasonic pulse propagation simulation using OpenCL for environment mapping and discovery.
- John M. Dennis, Brian Dobbins, Christopher Kerr, Youngsung Kim:
Optimizing the HOMME dynamical core for multicore platforms.
- Ichitaro Yamazaki, Akihiro Ida, Rio Yokota, Jack J. Dongarra:
Distributed-memory lattice H-matrix factorization.
Volume 33, Number 6, November 2019
- Jack J. Dongarra, Bernard Tourancheau:
Guest editors' note: Special issue on clusters, clouds, and data for scientific computing.
- Hartwig Anzt, Goran Flegar, Thomas Grützmacher, Enrique S. Quintana-Ortí:
Toward a modular precision ecosystem for high-performance computing.
- Mark Endrei, Chao Jin, Minh Ngoc Dinh, David Abramson, Heidi Poxon, Luiz DeRose, Bronis R. de Supinski:
Statistical and machine learning models for optimizing energy in parallel applications.
- Jungwon Kim, Jeffrey S. Vetter:
Implementing efficient data compression and encryption in a persistent key-value store for HPC.
- Heike Jagode, Anthony Danalis, Hartwig Anzt, Jack J. Dongarra:
PAPI software-defined events for in-depth performance analysis.
- Ewa Deelman, Anirban Mandal, Ming Jiang, Rizos Sakellariou:
The role of machine learning in scientific workflows.
- Ana Gainaru, Hongyang Sun, Guillaume Aupy, Yuankai Huo, Bennett A. Landman, Padma Raghavan:
On-the-fly scheduling versus reservation-based scheduling for unpredictable workflows.
- Daniel Balouek-Thomert, Eduard Gibert Renart, Ali Reza Zamani, Anthony Simonet, Manish Parashar:
Towards a computing continuum: Enabling edge-to-cloud integration for data-driven workflows.
- Dylan Chapp, Danny Rorabaugh, Kento Sato, Dong H. Ahn, Michela Taufer:
A three-phase workflow for general and expressive representations of nondeterminism in HPC applications.
- Guillaume Aupy, Brice Goglin, Valentin Honoré, Bruno Raffin:
Modeling high-throughput applications for in situ analytics.
- Franck Cappello, Sheng Di, Sihuan Li, Xin Liang, Ali Murat Gok, Dingwen Tao, Chun Hong Yoon, Xin-Chuan Wu, Yuri Alexeev, Frederic T. Chong:
Use cases of lossy compression for floating-point data in scientific data sets.
- Guillaume Aupy, Anne Benoit, Brice Goglin, Loïc Pottier, Yves Robert:
Co-scheduling HPC workloads on cache-partitioned CMP platforms.
- Alexandre Denis, Julien Jaeger, Emmanuel Jeannot, Marc Pérache, Hugo Taboada:
Study on progress threads placement and dedicated cores for overlapping MPI nonblocking collectives on manycore processor.
- Li Han, Valentin Le Fèvre, Louis-Claude Canon, Yves Robert, Frédéric Vivien:
A generic approach to scheduling and checkpointing workflows.
- Anand Venkat, Tharindu Rusira, Raj Barik, Mary W. Hall, Leonard Truong:
SWIRL: High-performance many-core CPU code generation for deep neural networks.
- Thiago S. F. X. Teixeira, William Gropp, David A. Padua:
Managing code transformations for better performance portability.