Hartwig Anzt
Person information
- affiliation: University of Tennessee, Knoxville, TN, USA
2020 – today
- 2024
- [j45]Hartwig Anzt, Axel Huebl, Xiaoye S. Li:
Then and Now: Improving Software Portability, Productivity, and 100× Performance. Comput. Sci. Eng. 26(1): 61-70 (2024) - [j44]Piotr Luszczek, Ahmad Abdelfattah, Hartwig Anzt, Atsushi Suzuki, Stanimire Tomov:
Batched sparse and mixed-precision linear algebra interface for efficient use of GPU hardware accelerators in scientific applications. Future Gener. Comput. Syst. 160: 359-374 (2024) - [c67]Pratik Nayak, Hartwig Anzt:
A Probabilistic Model for Asynchronous Iterative Methods. IPDPS (Workshops) 2024: 260-269 - [i16]Andrés E. Tomás, Enrique S. Quintana-Ortí, Hartwig Anzt:
Fast Truncated SVD of Sparse and Dense Matrices on Graphics Processors. CoRR abs/2403.06218 (2024)
- 2023
- [j43]Yu-Hsiang Tsai, Terry Cojean, Hartwig Anzt:
Providing performance portable numerics for Intel GPUs. Concurr. Comput. Pract. Exp. 35(20) (2023) - [j42]José Ignacio Aliaga, Hartwig Anzt, Enrique S. Quintana-Ortí, Andrés E. Tomás:
Sparse matrix-vector and matrix-multivector products for the truncated SVD on graphics processors. Concurr. Comput. Pract. Exp. 35(28) (2023) - [j41]Torsten Hoefler, Bjorn Stevens, Andreas F. Prein, Johanna Baehr, Thomas C. Schulthess, Thomas F. Stocker, John A. Taylor, Daniel Klocke, Pekka Manninen, Piers M. Forster, Tobias Kölling, Nicolas Gruber, Hartwig Anzt, Claudia Frauen, Florian Ziemen, Milan Klöwer, Karthik Kashinath, Christoph M. Schär, Oliver Fuhrer, Bryan N. Lawrence:
Earth Virtualization Engines: A Technical Perspective. Comput. Sci. Eng. 25(3): 50-59 (2023) - [j40]Yu-Hsiang Mike Tsai, Natalie Beams, Hartwig Anzt:
Three-precision algebraic multigrid on GPUs. Future Gener. Comput. Syst. 149: 280-293 (2023) - [j39]José Ignacio Aliaga, Hartwig Anzt, Thomas Grützmacher, Enrique S. Quintana-Ortí, Andrés E. Tomás:
Compressed basis GMRES on high-performance graphics processing units. Int. J. High Perform. Comput. Appl. 37(2): 82-100 (2023) - [j38]Andrés E. Tomás, Enrique S. Quintana-Ortí, Hartwig Anzt:
Fast truncated SVD of sparse and dense matrices on graphics processors. Int. J. High Perform. Comput. Appl. 37(3-4): 380-393 (2023) - [j37]Aditya Kashi, Pratik Nayak, Dhruva Kulkarni, Aaron Scheinberg, Paul Lin, Hartwig Anzt:
Integrating batched sparse iterative solvers for the collision operator in fusion plasma simulations on GPUs. J. Parallel Distributed Comput. 178: 69-81 (2023) - [j36]Thomas Grützmacher, Hartwig Anzt, Enrique S. Quintana-Ortí:
Using Ginkgo's memory accessor for improving the accuracy of memory-bound low precision BLAS. Softw. Pract. Exp. 53(1): 81-98 (2023) - [c66]Fritz Göbel, Terry Cojean, Hartwig Anzt:
BDDC Preconditioning on GPUs for Cardiac Simulations. Euro-Par Workshops 2023: 265-268 - [c65]Wissam M. Sid-Lakhdar, Sébastien Cayrols, Daniel Bielich, Ahmad Abdelfattah, Piotr Luszczek, Mark Gates, Stanimire Tomov, Hans Johansen, David B. Williams-Young, Timothy A. Davis, Jack J. Dongarra, Hartwig Anzt:
PAQR: Pivoting Avoiding QR factorization. IPDPS 2023: 322-332 - [c64]Pratik Nayak, Hartwig Anzt:
Utilizing batched solver ideas for efficient solution of non-batched linear systems. IPDPS Workshops 2023: 662-665 - [c63]Phuong Nguyen, Pratik Nayak, Hartwig Anzt:
Porting Batched Iterative Solvers onto Intel GPUs with SYCL. SC Workshops 2023: 1048-1058 - [c62]Ahmad Abdelfattah, Stanimire Tomov, Piotr Luszczek, Hartwig Anzt, Jack J. Dongarra:
GPU-based LU Factorization and Solve on Batches of Matrices with Band Structure. SC Workshops 2023: 1670-1679 - [c61]Dalal Sukkari, Mark Gates, Mohammed A. Al Farhan, Hartwig Anzt, Jack J. Dongarra:
Task-Based Polar Decomposition Using SLATE on Massively Parallel Systems with Hardware Accelerators. SC Workshops 2023: 1680-1687 - [c60]Tobias Ribizel, Hartwig Anzt:
Parallel Symbolic Cholesky Factorization. SC Workshops 2023: 1721-1727 - [c59]Vasileios Georgiou, Christos Boutsikas, Petros Drineas, Hartwig Anzt:
A Mixed Precision Randomized Preconditioner for the LSQR Solver on GPUs. ISC 2023: 164-181 - [i15]Kasia Swirydowicz, Nicholson Koukpaizan, Tobias Ribizel, Fritz Göbel, Shrirang Abhyankar, Hartwig Anzt, Slaven Peles:
GPU-Resident Sparse Direct Linear Solvers for Alternating Current Optimal Power Flow Analysis. CoRR abs/2306.14337 (2023) - [i14]Phuong Nguyen, Pratik Nayak, Hartwig Anzt:
Porting Batched Iterative Solvers onto Intel GPUs with SYCL. CoRR abs/2308.08417 (2023) - [i13]Torsten Hoefler, Bjorn Stevens, Andreas F. Prein, Johanna Baehr, Thomas C. Schulthess, Thomas F. Stocker, John A. Taylor, Daniel Klocke, Pekka Manninen, Piers M. Forster, Tobias Kölling, Nicolas Gruber, Hartwig Anzt, Claudia Frauen, Florian Ziemen, Milan Klöwer, Karthik Kashinath, Christoph M. Schär, Oliver Fuhrer, Bryan N. Lawrence:
Earth Virtualization Engines - A Technical Perspective. CoRR abs/2309.09002 (2023)
- 2022
- [j35]José Ignacio Aliaga, Hartwig Anzt, Thomas Grützmacher, Enrique S. Quintana-Ortí, Andrés E. Tomás:
Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units. Concurr. Comput. Pract. Exp. 34(14) (2022) - [j34]Emmanuel Agullo, Mirco Altenbernd, Hartwig Anzt, Leonardo Bautista-Gomez, Tommaso Benacchio, Luca Bonaventura, Hans-Joachim Bungartz, Sanjay Chatterjee, Florina M. Ciorba, Nathan DeBardeleben, Daniel Drzisga, Sebastian Eibl, Christian Engelmann, Wilfried N. Gansterer, Luc Giraud, Dominik Göddeke, Marco Heisig, Fabienne Jézéquel, Nils Kohl, Xiaoye Sherry Li, Romain Lion, Miriam Mehl, Paul Mycek, Michael Obersteiner, Enrique S. Quintana-Ortí, Francesco Rizzi, Ulrich Rüde, Martin Schulz, Fred Fung, Robert Speck, Linda Stals, Keita Teranishi, Samuel Thibault, Dominik Thönnes, Andreas Wagner, Barbara I. Wohlmuth:
Resiliency in numerical algorithm design for extreme scale simulations. Int. J. High Perform. Comput. Appl. 36(2): 251-285 (2022) - [j33]Terry Cojean, Yu-Hsiang Mike Tsai, Hartwig Anzt:
Ginkgo - A math library designed for platform portability. Parallel Comput. 111: 102902 (2022) - [j32]Hartwig Anzt, Terry Cojean, Goran Flegar, Fritz Göbel, Thomas Grützmacher, Pratik Nayak, Tobias Ribizel, Yuhsiang Mike Tsai, Enrique S. Quintana-Ortí:
Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing. ACM Trans. Math. Softw. 48(1): 2:1-2:33 (2022) - [c58]Apan Qasem, Hartwig Anzt, Eduard Ayguadé, Katharine J. Cahill, Ramon Canal, Jany Chan, Eric Fosler-Lussier, Fritz Göbel, Arpan Jain, Marcel Koch, Mateusz Kuzak, Josep Llosa, Raghu Machiraju, Xavier Martorell, Pratik Nayak, Shameema Oottikkal, Marcin Ostasz, Dhabaleswar K. Panda, Dirk Pleiter, Rajiv Ramnath, Maria-Ribera Sancho, Alessio Sclocco, Aamir Shafi, Hanno Spreeuw, Hari Subramoni, Karen Tomko:
Lightning Talks of EduHPC 2022. EduHPC@SC 2022: 42-49 - [c57]Aditya Kashi, Pratik Nayak, Dhruva Kulkarni, Aaron Scheinberg, Paul Lin, Hartwig Anzt:
Batched sparse iterative solvers on GPU for the collision operator for fusion plasma simulations. IPDPS 2022: 157-167 - [c56]Yu-Hsiang Mike Tsai, Natalie Beams, Hartwig Anzt:
Mixed Precision Algebraic Multigrid on GPUs. PPAM (1) 2022: 113-125 - [c55]Yannick Funk, Markus Götz, Hartwig Anzt:
Prediction of Optimal Solvers for Sparse Linear Systems Using Deep Learning. PP 2022: 14-24 - [c54]Yu-Hsiang Mike Tsai, Pratik Nayak, Edmond Chow, Hartwig Anzt:
Implementing Asynchronous Jacobi Iteration on GPUs. ScalAH@SC 2022: 1-9 - [c53]Isha Aggarwal, Pratik Nayak, Aditya Kashi, Hartwig Anzt:
Preconditioners for Batched Iterative Linear Solvers on GPUs. SMC 2022: 38-53 - [e3]Hartwig Anzt, Amanda Bienz, Piotr Luszczek, Marc Baboulin:
High Performance Computing. ISC High Performance 2022 International Workshops - Hamburg, Germany, May 29 - June 2, 2022, Revised Selected Papers. Lecture Notes in Computer Science 13387, Springer 2022, ISBN 978-3-031-23219-0
- 2021
- [j31]Pratik Nayak, Terry Cojean, Hartwig Anzt:
Evaluating asynchronous Schwarz solvers on GPUs. Int. J. High Perform. Comput. Appl. 35(3) (2021) - [j30]Ahmad Abdelfattah, Hartwig Anzt, Erik G. Boman, Erin C. Carson, Terry Cojean, Jack J. Dongarra, Alyson Fox, Mark Gates, Nicholas J. Higham, Xiaoye S. Li, Jennifer A. Loe, Piotr Luszczek, Srikara Pranesh, Siva Rajamanickam, Tobias Ribizel, Barry F. Smith, Kasia Swirydowicz, Stephen J. Thomas, Stanimire Tomov, Yaohung M. Tsai, Ulrike Meier Yang:
A survey of numerical linear algebra methods utilizing mixed-precision arithmetic. Int. J. High Perform. Comput. Appl. 35(4) (2021) - [j29]Hartwig Anzt, Eileen Kuehn, Goran Flegar:
Crediting pull requests to open source research software as an academic contribution. J. Comput. Sci. 49: 101278 (2021) - [j28]Goran Flegar, Hartwig Anzt, Terry Cojean, Enrique S. Quintana-Ortí:
Adaptive Precision Block-Jacobi for High Performance Preconditioning in the Ginkgo Linear Algebra Software. ACM Trans. Math. Softw. 47(2): 14:1-14:28 (2021) - [c52]Yuhsiang M. Tsai, Terry Cojean, Hartwig Anzt:
Porting Sparse Linear Algebra to Intel GPUs. Euro-Par Workshops 2021: 57-68 - [c51]Fritz Göbel, Thomas Grützmacher, Tobias Ribizel, Hartwig Anzt:
Mixed Precision Incomplete and Factorized Sparse Approximate Inverse Preconditioning on GPUs. Euro-Par 2021: 550-564 - [c50]Pratik Nayak, Fritz Göbel, Hartwig Anzt:
A Collaborative Peer Review Process for Grading Coding Assignments. ICCS (6) 2021: 654-660 - [c49]Isha Aggarwal, Aditya Kashi, Pratik Nayak, Cody J. Balos, Carol S. Woodward, Hartwig Anzt:
Batched Sparse Iterative Solvers for Computational Chemistry Simulations on GPUs. ScalA@SC 2021: 35-43 - [e2]Heike Jagode, Hartwig Anzt, Hatem Ltaief, Piotr Luszczek:
High Performance Computing - ISC High Performance Digital 2021 International Workshops, Frankfurt am Main, Germany, June 24 - July 2, 2021, Revised Selected Papers. Lecture Notes in Computer Science 12761, Springer 2021, ISBN 978-3-030-90538-5 - [i12]Daniel S. Katz, Morane Gruenpeter, Tom Honeyman, Lorraine J. Hwang, Mark D. Wilkinson, Vanessa V. Sochat, Hartwig Anzt, Carole A. Goble:
A Fresh Look at FAIR for Research Software. CoRR abs/2101.10883 (2021) - [i11]Yuhsiang M. Tsai, Terry Cojean, Hartwig Anzt:
Porting a sparse linear algebra math library to Intel GPUs. CoRR abs/2103.10116 (2021)
- 2020
- [j27]Thomas Grützmacher, Terry Cojean, Goran Flegar, Fritz Göbel, Hartwig Anzt:
A customized precision format based on mantissa segmentation for accelerating sparse linear algebra. Concurr. Comput. Pract. Exp. 32(15) (2020) - [j26]Hartwig Anzt, Terry Cojean, Yen-Chen Chen, Goran Flegar, Fritz Göbel, Thomas Grützmacher, Pratik Nayak, Tobias Ribizel, Yuhsiang M. Tsai:
Ginkgo: A high performance numerical linear algebra library. J. Open Source Softw. 5(52): 2260 (2020) - [j25]Tobias Ribizel, Hartwig Anzt:
Parallel selection on GPUs. Parallel Comput. 91 (2020) - [j24]Hartwig Anzt, Terry Cojean, Chen Yen-Chen, Jack J. Dongarra, Goran Flegar, Pratik Nayak, Stanimire Tomov, Yuhsiang M. Tsai, Weichung Wang:
Load-balancing Sparse Matrix Vector Product Kernels on GPUs. ACM Trans. Parallel Comput. 7(1): 2:1-2:26 (2020) - [j23]Thomas Grützmacher, Terry Cojean, Goran Flegar, Hartwig Anzt, Enrique S. Quintana-Ortí:
Acceleration of PageRank with Customized Precision Based on Mantissa Segmentation. ACM Trans. Parallel Comput. 7(1): 4:1-4:19 (2020) - [c48]José Ignacio Aliaga, Hartwig Anzt, Enrique S. Quintana-Ortí, Andrés E. Tomás, Yuhsiang M. Tsai:
Balanced and Compressed Coordinate Layout for the Sparse Matrix-Vector Product on GPUs. Euro-Par Workshops 2020: 83-95 - [c47]Yuhsiang M. Tsai, Terry Cojean, Tobias Ribizel, Hartwig Anzt:
Preparing Ginkgo for AMD GPUs - A Testimonial on Porting CUDA Code to HIP. Euro-Par Workshops 2020: 109-121 - [c46]Fritz Göbel, Hartwig Anzt, Terry Cojean, Goran Flegar, Enrique S. Quintana-Ortí:
Multiprecision Block-Jacobi for Iterative Triangular Solves. Euro-Par 2020: 546-560 - [c45]Piotr Luszczek, Yaohung M. Tsai, Neil Lindquist, Hartwig Anzt, Jack J. Dongarra:
Scalable Data Generation for Evaluating Mixed-Precision Solvers. HPEC 2020: 1-6 - [c44]Hartwig Anzt, Yuhsiang M. Tsai, Ahmad Abdelfattah, Terry Cojean, Jack J. Dongarra:
Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse and Batched Computations. PMBS@SC 2020: 26-38 - [c43]Pratik Nayak, Terry Cojean, Hartwig Anzt:
Two-stage Asynchronous Iterative Solvers for multi-GPU Clusters. ScalA@SC 2020: 9-18 - [c42]Yuhsiang M. Tsai, Terry Cojean, Hartwig Anzt:
Sparse Linear Algebra on AMD and NVIDIA GPUs - The Race Is On. ISC 2020: 309-327 - [e1]Heike Jagode, Hartwig Anzt, Guido Juckeland, Hatem Ltaief:
High Performance Computing - ISC High Performance 2020 International Workshops, Frankfurt, Germany, June 21-25, 2020, Revised Selected Papers. Lecture Notes in Computer Science 12321, Springer 2020, ISBN 978-3-030-59850-1 - [d1]Daniel S. Katz, Michelle Barker, Paula Andrea Martínez, Hartwig Anzt, Alejandra N. González-Beltrán, Tom Bakker:
The Research Software Alliance (ReSA) and the community landscape. Zenodo, 2020 - [i10]Pratik Nayak, Terry Cojean, Hartwig Anzt:
Evaluating Abstract Asynchronous Schwarz solvers. CoRR abs/2003.05361 (2020) - [i9]Hartwig Anzt, Felix Bach, Stephan Druskat, Frank Löffler, Axel Loewe, Bernhard Y. Renard, Gunnar Seemann, Alexander Struck, Elke Achhammer, Piush Aggarwal, Franziska Appel, Michael Bader, Lutz Brusch, Christian Busse, Gerasimos Chourdakis, Piotr Wojtek Dabrowski, Peter Ebert, Bernd Flemisch, Sven Friedl, Bernadette Fritzsch, Maximilian D. Funk, Volker Gast, Florian Goth, Jean-Noël Grad, Sibylle Hermann, Florian Hohmann, Stephan Janosch, Dominik Kutra, Jan Linxweiler, Thilo Muth, Wolfgang Peters-Kottig, Fabian Rack, Fabian H. C. Raters, Stephan Rave, Guido Reina, Malte Reißig, Timo Ropinski, Jörg Schaarschmidt, Heidi Seibold, Jan P. Thiele, Benjamin Uekermann, Stefan Unger, Rudolf Weeber:
An Environment for Sustainable Research Software in Germany and Beyond: Current State, Open Challenges, and Call for Action. CoRR abs/2005.01469 (2020) - [i8]Yuhsiang M. Tsai, Terry Cojean, Tobias Ribizel, Hartwig Anzt:
Preparing Ginkgo for AMD GPUs - A Testimonial on Porting CUDA Code to HIP. CoRR abs/2006.14290 (2020) - [i7]Hartwig Anzt, Terry Cojean, Goran Flegar, Fritz Göbel, Thomas Grützmacher, Pratik Nayak, Tobias Ribizel, Yu-Hsiang Tsai, Enrique S. Quintana-Ortí:
Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing. CoRR abs/2006.16852 (2020) - [i6]Ahmad Abdelfattah, Hartwig Anzt, Erik G. Boman, Erin C. Carson, Terry Cojean, Jack J. Dongarra, Mark Gates, Thomas Grützmacher, Nicholas J. Higham, Xiaoye Sherry Li, Neil Lindquist, Yang Liu, Jennifer A. Loe, Piotr Luszczek, Pratik Nayak, Srikara Pranesh, Sivasankaran Rajamanickam, Tobias Ribizel, Barry Smith, Kasia Swirydowicz, Stephen J. Thomas, Stanimire Tomov, Yaohung M. Tsai, Ichitaro Yamazaki, Ulrike Meier Yang:
A Survey of Numerical Methods Utilizing Mixed Precision Arithmetic. CoRR abs/2007.06674 (2020) - [i5]Yuhsiang Mike Tsai, Terry Cojean, Hartwig Anzt:
Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse Linear Algebra Computations. CoRR abs/2008.08478 (2020) - [i4]José Ignacio Aliaga, Hartwig Anzt, Thomas Grützmacher, Enrique S. Quintana-Ortí, Andrés E. Tomás:
Compressed Basis GMRES on High Performance GPUs. CoRR abs/2009.12101 (2020) - [i3]Emmanuel Agullo, Mirco Altenbernd, Hartwig Anzt, Leonardo Bautista-Gomez, Tommaso Benacchio, Luca Bonaventura, Hans-Joachim Bungartz, Sanjay Chatterjee, Florina M. Ciorba, Nathan DeBardeleben, Daniel Drzisga, Sebastian Eibl, Christian Engelmann, Wilfried N. Gansterer, Luc Giraud, Dominik Göddeke, Marco Heisig, Fabienne Jézéquel, Nils Kohl, Xiaoye Sherry Li, Romain Lion, Miriam Mehl, Paul Mycek, Michael Obersteiner, Enrique S. Quintana-Ortí, Francesco Rizzi, Ulrich Rüde, Martin Schulz, Fred Fung, Robert Speck, Linda Stals, Keita Teranishi, Samuel Thibault, Dominik Thönnes, Andreas Wagner, Barbara I. Wohlmuth:
Resiliency in Numerical Algorithm Design for Extreme Scale Simulations. CoRR abs/2010.13342 (2020) - [i2]Terry Cojean, Yu-Hsiang Mike Tsai, Hartwig Anzt:
Ginkgo - A Math Library designed for Platform Portability. CoRR abs/2011.08879 (2020) - [i1]Hartwig Anzt, Felix Bach, Stephan Druskat, Frank Löffler, Axel Loewe, Bernhard Y. Renard, Gunnar Seemann, Alexander Struck, Elke Achhammer, Piush Aggarwal, Franziska Appel, Michael Bader, Lutz Brusch, Christian Busse, Gerasimos Chourdakis, Piotr Wojciech Dabrowski, Peter Ebert, Bernd Flemisch, Sven Friedl, Bernadette Fritzsch, Maximilian D. Funk, Volker Gast, Florian Goth, Jean-Noël Grad, Sibylle Hermann, Florian Hohmann, Stephan Janosch, Dominik Kutra, Jan Linxweiler, Thilo Muth, Wolfgang Peters-Kottig, Fabian Rack, Fabian H. C. Raters, Stephan Rave, Guido Reina, Malte Reißig, Timo Ropinski, Jörg Schaarschmidt, Heidi Seibold, Jan P. Thiele, Benjamin Uekermann, Stefan Unger, Rudolf Weeber:
An environment for sustainable research software in Germany and beyond: current state, open challenges, and call for action. F1000Research 9: 295 (2020)
2010 – 2019
- 2019
- [j22]Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Nicholas J. Higham, Enrique S. Quintana-Ortí:
Adaptive precision in block-Jacobi preconditioning for iterative sparse linear system solvers. Concurr. Comput. Pract. Exp. 31(6) (2019) - [j21]Hartwig Anzt, Goran Flegar, Thomas Grützmacher, Enrique S. Quintana-Ortí:
Toward a modular precision ecosystem for high-performance computing. Int. J. High Perform. Comput. Appl. 33(6) (2019) - [j20]Heike Jagode, Anthony Danalis, Hartwig Anzt, Jack J. Dongarra:
PAPI software-defined events for in-depth performance analysis. Int. J. High Perform. Comput. Appl. 33(6) (2019) - [j19]Hartwig Anzt, Jack J. Dongarra, Enrique S. Quintana-Ortí:
Fine-grained bit-flip protection for relaxation methods. J. Comput. Sci. 36 (2019) - [j18]Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí:
Variable-size batched Gauss-Jordan elimination for block-Jacobi preconditioning on graphics processors. Parallel Comput. 81: 131-146 (2019) - [c41]Hartwig Anzt, Tobias Ribizel, Goran Flegar, Edmond Chow, Jack J. Dongarra:
ParILUT - A Parallel Threshold ILU for GPUs. IPDPS 2019: 231-241 - [c40]Tobias Ribizel, Hartwig Anzt:
Approximate and Exact Selection on GPUs. IPDPS Workshops 2019: 471-478 - [c39]Hartwig Anzt, Goran Flegar:
Are we Doing the Right Thing? - A Critical Analysis of the Academic HPC Community. IPDPS Workshops 2019: 739-745 - [c38]Hartwig Anzt, Yen-Chen Chen, Terry Cojean, Jack J. Dongarra, Goran Flegar, Pratik Nayak, Enrique S. Quintana-Ortí, Yuhsiang M. Tsai, Weichung Wang:
Towards Continuous Benchmarking: An Automated Performance Evaluation Framework for High Performance Software. PASC 2019: 9:1-9:11
- 2018
- [j17]Hartwig Anzt, Moritz Kreutzer, Eduardo Ponce, Gregory D. Peterson, Gerhard Wellein, Jack J. Dongarra:
Optimization and performance evaluation of the IDR iterative Krylov solver on GPUs. Int. J. High Perform. Comput. Appl. 32(2): 220-230 (2018) - [j16]Edmond Chow, Hartwig Anzt, Jennifer A. Scott, Jack J. Dongarra:
Using Jacobi iterations and blocking for solving sparse triangular systems in incomplete factorization preconditioning. J. Parallel Distributed Comput. 119: 219-230 (2018) - [j15]Hartwig Anzt, Thomas K. Huckle, Jürgen Bräckle, Jack J. Dongarra:
Incomplete Sparse Approximate Inverses for Parallel Preconditioning. Parallel Comput. 71: 1-22 (2018) - [j14]Hartwig Anzt, Edmond Chow, Jack J. Dongarra:
ParILUT - A New Parallel Threshold ILU Factorization. SIAM J. Sci. Comput. 40(4): C503-C519 (2018) - [c37]Thomas Grützmacher, Hartwig Anzt:
A Modular Precision Format for Decoupling Arithmetic Format and Storage Format. Euro-Par Workshops 2018: 434-443 - [c36]Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Thomas Grützmacher:
Variable-Size Batched Condition Number Calculation on GPUs. SBAC-PAD 2018: 132-139 - [c35]Hartwig Anzt, Jack J. Dongarra:
A Jaccard Weights Kernel Leveraging Independent Thread Scheduling on GPUs. SBAC-PAD 2018: 229-232 - [c34]Thomas Grützmacher, Hartwig Anzt, Florian Scheidegger, Enrique S. Quintana-Ortí:
High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation. IA3@SC 2018: 61-68 - [c33]Hartwig Anzt, Goran Flegar, Vedran Novakovic, Enrique S. Quintana-Ortí, Andrés E. Tomás:
Residual Replacement in Mixed-Precision Iterative Refinement for Sparse Linear Systems. ISC Workshops 2018: 554-561
- 2017
- [j13]Jack J. Dongarra, Stanimire Tomov, Piotr Luszczek, Jakub Kurzak, Mark Gates, Ichitaro Yamazaki, Hartwig Anzt, Azzam Haidar, Ahmad Abdelfattah:
With Extreme Computing, the Rules Have Changed. Comput. Sci. Eng. 19(3): 52-62 (2017) - [j12]Hartwig Anzt, Stanimire Tomov, Jack J. Dongarra:
On the performance and energy efficiency of sparse linear algebra on GPUs. Int. J. High Perform. Comput. Appl. 31(5): 375-390 (2017) - [j11]Hartwig Anzt, Mark Gates, Jack J. Dongarra, Moritz Kreutzer, Gerhard Wellein, Martin Koehler:
Preconditioned Krylov solvers on GPUs. Parallel Comput. 68: 32-44 (2017) - [c32]Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí, Andrés E. Tomás:
Variable-Size Batched Gauss-Huard for Block-Jacobi Preconditioning. ICCS 2017: 1783-1792 - [c31]Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí:
Variable-Size Batched LU for Small Matrices and Its Integration into Block-Jacobi Preconditioning. ICPP 2017: 91-100 - [c30]Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí:
Batched Gauss-Jordan Elimination for Block-Jacobi Preconditioner Generation on GPUs. PMAM@PPoPP 2017: 1-10 - [c29]Goran Flegar, Hartwig Anzt:
Overcoming Load Imbalance for Irregular Sparse Matrices. IA3@SC 2017: 2:1-2:8 - [c28]Hartwig Anzt, Gary Collins, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí:
Flexible batched sparse matrix-vector product on GPUs. ScalA@SC 2017: 3:1-3:8 - [p2]Hartwig Anzt, Jack J. Dongarra, Mark Gates, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov, Ichitaro Yamazaki:
Bringing High Performance Computing to Big Data Algorithms. Handbook of Big Data Technologies 2017: 777-806
- 2016
- [j10]Ahmad Abdelfattah, Hartwig Anzt, Jack J. Dongarra, Mark Gates, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov, Ichitaro Yamazaki, Asim YarKhan:
Linear algebra software for large-scale accelerated multicore computing. Acta Numer. 25: 1-160 (2016) - [j9]Hartwig Anzt, Edmond Chow, Jens Saak, Jack J. Dongarra:
Updating incomplete factorization preconditioners for model order reduction. Numer. Algorithms 73(3): 611-630 (2016) - [j8]Jakub Kurzak, Hartwig Anzt, Mark Gates, Jack J. Dongarra:
Implementation and Tuning of Batched Cholesky Factorization and Solve for NVIDIA GPUs. IEEE Trans. Parallel Distributed Syst. 27(7): 2036-2048 (2016) - [c27]Chris J. Newburn, Gaurav Bansal, Michael Wood, Luis Crivelli, Judit Planas, Alejandro Duran, Paulo Souza, Leonardo Borges, Piotr Luszczek, Stanimire Tomov, Jack J. Dongarra, Hartwig Anzt, Mark Gates, Azzam Haidar, Yulu Jia, Khairul Kabir, Ichitaro Yamazaki, Jesús Labarta:
Heterogeneous Streaming. IPDPS Workshops 2016: 611-620 - [c26]Hartwig Anzt, Jack J. Dongarra, Moritz Kreutzer, Gerhard Wellein, Martin Koehler:
Efficiency of General Krylov Methods on GPUs - An Experimental Study. IPDPS Workshops 2016: 683-691 - [c25]Hartwig Anzt, Edmond Chow, Thomas Huckle, Jack J. Dongarra:
Batched Generation of Incomplete Sparse Approximate Inverses on GPUs. ScalA@SC 2016: 49-56 - [c24]Hartwig Anzt, Marc Baboulin, Jack J. Dongarra, Yvan Fournier, Frank Hülsemann, Amal Khabou, Yushan Wang:
Accelerating the Conjugate Gradient Algorithm with GPUs in CFD Simulations. VECPAR 2016: 35-43 - [p1]