default search action
Kurt B. Ferreira
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [c69]Nicholas H. Bacon, Patrick G. Bridges, Scott Levy, Kurt B. Ferreira, Amanda Bienz:
Evaluating the Viability of LogGP for Modeling MPI Performance with Non-contiguous Datatypes on Modern Architectures. EuroMPI 2023: 8:1-8:10 - [c68]Kurt B. Ferreira, Scott Levy:
Using Benford's Law to Identify Unusual Failure Regions. SC Workshops 2023: 516-519 - 2022
- [c67]Kurt B. Ferreira, Scott Levy, Joshua Hemmert, Kevin T. Pedretti:
Understanding Memory Failures on a Petascale Arm System. HPDC 2022: 84-96 - 2021
- [j12]Kurt B. Ferreira, Scott Levy:
Evaluating MPI resource usage summary statistics. Parallel Comput. 108: 102825 (2021) - [c66]Kurt B. Ferreira, Scott Levy, Victor Kuhns, Nathan DeBardeleben, Sean Blanchard:
Understanding the Effects of DRAM Correctable Error Logging at Scale. CLUSTER 2021: 421-432 - [c65]Kurt B. Ferreira, Scott Levy:
Characterizing Memory Failures Using Benford's Law. Euro-Par Workshops 2021: 310-321 - 2020
- [j11]Kurt B. Ferreira, Ryan E. Grant, Michael J. Levenhagen, Scott Levy, Taylor L. Groves:
Hardware MPI message matching: Insights into MPI matching behavior to inform design. Concurr. Comput. Pract. Exp. 32(3) (2020) - [j10]Scott Levy, Kurt B. Ferreira, Patrick M. Widener:
The unexpected virtue of almost: Exploiting MPI collective operations to approximately coordinate checkpoints. Concurr. Comput. Pract. Exp. 32(3) (2020) - [c64]Kurt B. Ferreira, Scott Levy:
Evaluating MPI Message Size Summary Statistics. EuroMPI 2020: 61-70 - [c63]Ron Brightwell, Kurt B. Ferreira, Ryan E. Grant, Scott Levy, Jay F. Lofstead, Stephen L. Olivier, Kevin T. Pedretti, Andrew J. Younge, Ann C. Gentile, Jim M. Brandt:
ALAMO: Autonomous Lightweight Allocation, Management, and Optimization. SMC 2020: 408-422
2010 – 2019
- 2019
- [j9]Thomas Hérault, Yves Robert, Aurélien Bouteiller, Dorian C. Arnold, Kurt B. Ferreira, George Bosilca, Jack J. Dongarra:
Checkpointing Strategies for Shared High-Performance Computing Platforms. Int. J. Netw. Comput. 9(1): 28-52 (2019) - [j8]Scott Levy, Kurt B. Ferreira, Whit Schonbein, Ryan E. Grant, Matthew G. F. Dosanjh:
Using simulation to examine the effect of MPI message matching costs on application performance. Parallel Comput. 84: 63-74 (2019) - [c62]Scott Levy, Kurt B. Ferreira:
Space-Efficient Reed-Solomon Encoding to Detect and Correct Pointer Corruption. Euro-Par Workshops 2019: 657-668 - [c61]Scott Levy, Kurt B. Ferreira:
Evaluating tradeoffs between MPI message matching offload hardware capacity and performance. EuroMPI 2019: 12:1-12:11 - [p1]Ron Brightwell, Kurt B. Ferreira, Arthur B. Maccabe, Kevin T. Pedretti, Rolf Riesen:
Sandia Line of LWKs. Operating Systems for Supercomputers and High Performance Computing 2019: 23-46 - 2018
- [j7]Kurt B. Ferreira, Scott Levy, Kevin T. Pedretti, Ryan E. Grant:
Characterizing MPI matching via trace-based simulation. Parallel Comput. 77: 57-83 (2018) - [c60]Elisabeth Baseman, Nathan DeBardeleben, Sean Blanchard, Juston S. Moore, Olena Tkachenko, Kurt B. Ferreira, Taniya Siddiqua, Vilas Sridharan:
Physics-Informed Machine Learning for DRAM Error Modeling. DFT 2018: 1-6 - [c59]Scott Levy, Kevin T. Pedretti, Kurt B. Ferreira:
Open Science on Trinity's Knights Landing Partition: An Analysis of User Job Data. ICPP Workshops 2018: 42:1-42:9 - [c58]Thomas Hérault, Yves Robert, Aurélien Bouteiller, Dorian C. Arnold, Kurt B. Ferreira, George Bosilca, Jack J. Dongarra:
Optimal Cooperative Checkpointing for Shared High-Performance Computing Platforms. IPDPS Workshops 2018: 803-812 - [c57]Scott Levy, Kurt B. Ferreira:
Using Simulation to Examine the Effect of MPI Message Matching Costs on Application Performance. EuroMPI 2018: 16:1-16:11 - [c56]Scott Levy, Kurt B. Ferreira, Nathan DeBardeleben, Taniya Siddiqua, Vilas Sridharan, Elisabeth Baseman:
Lessons learned from memory errors observed over the lifetime of Cielo. SC 2018: 43:1-43:12 - 2017
- [c55]Scott Levy, Kurt B. Ferreira, Patrick G. Bridges:
Evaluating the Viability of Using Compression to Mitigate Silent Corruption of Read-Mostly Application Data. CLUSTER 2017: 603-607 - [c54]Taniya Siddiqua, Vilas Sridharan, Steven E. Raasch, Nathan DeBardeleben, Kurt B. Ferreira, Scott Levy, Elisabeth Baseman, Qiang Guan:
Lifetime memory reliability data from the field. DFT 2017: 1-6 - [c53]Elisabeth Baseman, Nathan DeBardeleben, Kurt B. Ferreira, Vilas Sridharan, Taniya Siddiqua, Olena Tkachenko:
Automating DRAM Fault Mitigation By Learning From Experience. DSN Workshops 2017: 137-140 - [c52]Patrick M. Widener, Kurt B. Ferreira, Scott Levy:
It's Not the Heat, It's the Humidity: Scheduling Resilience Activity at Scale. Euro-Par Workshops 2017: 581-592 - [c51]Kurt B. Ferreira, Scott Levy, Kevin T. Pedretti, Ryan E. Grant:
Characterizing MPI matching via trace-based simulation. EuroMPI/USA 2017: 8:1-8:11 - 2016
- [j6]Patrick M. Widener, Scott Levy, Kurt B. Ferreira, Torsten Hoefler:
On noise and the performance benefit of nonblocking collectives. Int. J. High Perform. Comput. Appl. 30(1): 121-133 (2016) - [c50]Oscar H. Mondragon, Patrick G. Bridges, Scott Levy, Kurt B. Ferreira, Patrick M. Widener:
Scheduling In-Situ Analytics in Next-Generation Applications. CCGrid 2016: 102-105 - [c49]Elisabeth Baseman, Nathan DeBardeleben, Kurt B. Ferreira, Scott Levy, Steven Raasch, Vilas Sridharan, Taniya Siddiqua, Qiang Guan:
Improving DRAM Fault Characterization through Machine Learning. DSN Workshops 2016: 250-253 - [c48]David Fiala, Frank Mueller, Kurt B. Ferreira:
FlipSphere: A Software-Based DRAM Error Detection and Correction Library for HPC. DS-RT 2016: 19-28 - [c47]Patrick M. Widener, Kurt B. Ferreira, Scott Levy:
Horseshoes and Hand Grenades: The Case for Approximate Coordination in Local Checkpointing Protocols. Euro-Par Workshops 2016: 623-634 - [c46]Scott Levy, Kurt B. Ferreira:
An Examination of the Impact of Failure Distribution on Coordinated Checkpoint/Restart. FTXS@HPDC 2016: 35-42 - [c45]David Fiala, Frank Mueller, Kurt B. Ferreira, Christian Engelmann:
Mini-Ckpts: Surviving OS Failures in Persistent Memory. ICS 2016: 7:1-7:14 - [c44]Scott Levy, Kurt B. Ferreira, Patrick M. Widener, Patrick G. Bridges, Oscar H. Mondragon:
How I Learned to Stop Worrying and Love In Situ Analytics: Leveraging Latent Synchronization in MPI Collective Algorithms. EuroMPI 2016: 140-153 - [c43]Scott Levy, Kurt B. Ferreira, Patrick G. Bridges:
Improving application resilience to memory errors with lightweight compression. SC 2016: 323-334 - [c42]Oscar H. Mondragon, Patrick G. Bridges, Scott Levy, Kurt B. Ferreira, Patrick M. Widener:
Understanding performance interference in next-generation HPC systems. SC 2016: 384-395 - 2015
- [j5]Scott Levy, Kurt B. Ferreira, Patrick G. Bridges, Aidan P. Thompson, Christian R. Trott:
A study of the viability of exploiting memory content similarity to improve resilience to memory errors. Int. J. High Perform. Comput. Appl. 29(1): 5-20 (2015) - [j4]Dewan Ibtesham, Kurt B. Ferreira, Dorian C. Arnold:
A checkpoint compression study for high-performance computing systems. Int. J. High Perform. Comput. Appl. 29(4): 387-402 (2015) - [c41]Vilas Sridharan, Nathan DeBardeleben, Sean Blanchard, Kurt B. Ferreira, Jon Stearley, John Shalf, Sudhanva Gurumurthi:
Memory Errors in Modern Systems: The Good, The Bad, and The Ugly. ASPLOS 2015: 297-310 - [c40]Patrick M. Widener, Kurt B. Ferreira, Scott Levy, Nathan Fabian:
Canaries in a Coal Mine: Using Application-Level Checkpoints to Detect Memory Failures. Euro-Par Workshops 2015: 669-681 - [c39]Alireza Goudarzi, Dorian C. Arnold, Darko Stefanovic, Kurt B. Ferreira, Guy Feldman:
A Principled Approach to HPC Event Monitoring. FTXS@HPDC 2015: 3-10 - [c38]Rolf Riesen, Arthur Barney Maccabe, Balazs Gerofi, David N. Lombard, John Jack Lange, Kevin T. Pedretti, Kurt B. Ferreira, Mike Lang, Pardo Keppel, Robert W. Wisniewski, Ron Brightwell, Todd Inglett, Yoonho Park, Yutaka Ishikawa:
What is a Lightweight Kernel? ROSS@HPDC 2015: 9:1-9:8 - [c37]Kevin T. Pedretti, Stephen L. Olivier, Kurt B. Ferreira, Galen M. Shipman, Wei Shu:
Early experiences with node-level power capping on the Cray XC40 platform. E2SC@SC 2015: 1:1-1:10 - 2014
- [j3]Kurt B. Ferreira, Rolf Riesen, Patrick G. Bridges, Dorian C. Arnold, Ron Brightwell:
Accelerating incremental checkpointing for extreme-scale computing. Future Gener. Comput. Syst. 30: 66-77 (2014) - [c36]Dewan Ibtesham, David Debonis, Dorian C. Arnold, Kurt B. Ferreira:
Coarse-Grained Energy Modeling of Rollback/Recovery Mechanisms. DSN 2014: 708-713 - [c35]Scott Levy, Kurt B. Ferreira, Patrick G. Bridges:
Characterizing the Impact of Rollback Avoidance at Extreme-Scale: A Modeling Approach. ICPP 2014: 401-410 - [c34]Bryan N. Mills, Taieb Znati, Rami G. Melhem, Kurt B. Ferreira, Ryan E. Grant:
Energy Consumption of Resilience Mechanisms in Large Scale Systems. PDP 2014: 528-535 - [c33]Patrick M. Widener, Kurt B. Ferreira, Scott Levy, Torsten Hoefler:
Exploring the effect of noise on the performance benefit of nonblocking allreduce. EuroMPI/ASIA 2014: 77 - [c32]Kurt B. Ferreira, Patrick M. Widener, Scott Levy, Dorian C. Arnold, Torsten Hoefler:
Understanding the Effects of Communication and Coordination on Checkpointing at Scale. SC 2014: 883-894 - 2013
- [b1]James H. Laros III, Kevin T. Pedretti, Suzanne M. Kelly, Wei Shu, Kurt B. Ferreira, John Van Dyke, Courtenay T. Vaughan:
Energy-Efficient High Performance Computing - Measurement and Tuning. Springer Briefs in Computer Science, Springer 2013, ISBN 978-1-4471-4491-5, pp. I-XIV, 1-67 - [j2]Kurt B. Ferreira, Patrick G. Bridges, Ron Brightwell, Kevin T. Pedretti:
The impact of system design parameters on application noise sensitivity. Clust. Comput. 16(1): 117-129 (2013) - [c31]Patrick M. Widener, Kurt B. Ferreira, Scott Levy, Patrick G. Bridges, Dorian C. Arnold, Ron Brightwell:
Asking the Right Questions: Benchmarking Fault-Tolerant Extreme-Scale Systems. Euro-Par Workshops 2013: 717-726 - [c30]Scott Levy, Matthew G. F. Dosanjh, Patrick G. Bridges, Kurt B. Ferreira:
Using unreliable virtual hardware to inject errors in extreme-scale systems. FTXS 2013: 21-26 - [c29]Scott Levy, Patrick G. Bridges, Kurt B. Ferreira, Aidan P. Thompson, Christian R. Trott:
Evaluating the feasibility of using memory content similarity to improve system resilience. ROSS@ICS 2013: 7:1-7:8 - [c28]Bryan N. Mills, Ryan E. Grant, Kurt B. Ferreira, Rolf Riesen:
Evaluating energy savings for checkpoint/restart. E2SC@SC 2013: 6:1-6:8 - [c27]Scott Levy, Bryan Topp, Kurt B. Ferreira, Dorian C. Arnold, Torsten Hoefler, Patrick M. Widener:
Using Simulation to Evaluate the Performance of Resilience Strategies at Scale. PMBS@SC 2013: 91-114 - 2012
- [c26]Jon Stearley, Kurt B. Ferreira, David J. Robinson, Jim Laros, Kevin T. Pedretti, Dorian C. Arnold, Patrick G. Bridges, Rolf Riesen:
Does partial replication pay off? DSN Workshops 2012: 1-6 - [c25]Kurt B. Ferreira, Rolf Riesen, Dorian C. Arnold, Dewan Ibtesham, Ron Brightwell:
The Viability of Using Compression to Decrease Message Log Sizes. Euro-Par Workshops 2012: 484-493 - [c24]James Elliott, Kishor Kharbas, David Fiala, Frank Mueller, Kurt B. Ferreira, Christian Engelmann:
Combining Partial Redundancy and Checkpointing for HPC. ICDCS 2012: 615-626 - [c23]Dewan Ibtesham, Dorian C. Arnold, Patrick G. Bridges, Kurt B. Ferreira, Ron Brightwell:
On the Viability of Compression for Reducing the Overheads of Checkpoint/Restart-Based Fault Tolerance. ICPP 2012: 148-157 - [c22]Kurt B. Ferreira, Kevin T. Pedretti, Ron Brightwell, Patrick G. Bridges, David Fiala, Frank Mueller:
Evaluating operating system vulnerability to memory errors. ROSS@ICS 2012: 11:1-11:8 - [c21]Rolf Riesen, Kurt B. Ferreira, Dilma Da Silva, Pierre Lemarinier, Dorian C. Arnold, Patrick G. Bridges:
Alleviating scalability issues of checkpointing protocols. SC 2012: 18 - [c20]David Fiala, Frank Mueller, Christian Engelmann, Rolf Riesen, Kurt B. Ferreira, Ron Brightwell:
Detection and correction of silent data corruption for large-scale high-performance computing. SC 2012: 78 - [c19]Dewan Ibtesham, Dorian C. Arnold, Kurt B. Ferreira, Ronald Brightwell:
Abstract: Comparing GPU and Increment-Based Checkpoint Compression. SC Companion 2012: 1505-1506 - [c18]Dewan Ibtesham, Dorian C. Arnold, Kurt B. Ferreira, Ronald Brightwell:
Poster: Comparing GPU and Increment-Based Checkpoint Compression. SC Companion 2012: 1507 - [c17]Arun Rodrigues, Elliott Cooper-Balis, Keren Bergman, Kurt B. Ferreira, David P. Bunde, K. Scott Hemmert:
Improvements to the structural simulation toolkit. SimuTools 2012: 190-195 - [i1]Patrick G. Bridges, Kurt B. Ferreira, Michael A. Heroux, Mark Hoemmen:
Fault-tolerant linear solvers via selective reliability. CoRR abs/1206.1390 (2012) - 2011
- [c16]Rolf Riesen, Kurt B. Ferreira, Maria Ruiz Varela, Michela Taufer, Arun Rodrigues:
Simulating Application Resilience at Exascale. Euro-Par Workshops (2) 2011: 221-230 - [c15]Patrick G. Bridges, Mark Hoemmen, Kurt B. Ferreira, Michael A. Heroux, Philip Soltero, Ron Brightwell:
Cooperative Application/OS DRAM Fault Recovery. Euro-Par Workshops (2) 2011: 241-250 - [c14]David Fiala, Kurt B. Ferreira, Frank Mueller, Christian Engelmann:
A Tunable, Software-Based DRAM Error Detection and Correction Library for HPC. Euro-Par Workshops (2) 2011: 251-261 - [c13]Dewan Ibtesham, Dorian C. Arnold, Kurt B. Ferreira, Patrick G. Bridges:
On the Viability of Checkpoint Compression for Extreme Scale Fault Tolerance. Euro-Par Workshops (2) 2011: 302-311 - [c12]Edgar A. León, Rolf Riesen, Kurt B. Ferreira, Arthur B. Maccabe:
Cache injection for parallel applications. HPDC 2011: 15-26 - [c11]Kurt B. Ferreira, Rolf Riesen, Ron Brightwell, Patrick G. Bridges, Dorian C. Arnold:
libhashckpt: Hash-Based Incremental Checkpointing Using GPU's. EuroMPI 2011: 272-281 - [c10]Kurt B. Ferreira, Jon Stearley, James H. Laros III, Ron A. Oldfield, Kevin T. Pedretti, Ron Brightwell, Rolf Riesen, Patrick G. Bridges, Dorian C. Arnold:
Evaluating the viability of process replication reliability for exascale systems. SC 2011: 44:1-44:12 - [c9]David Fiala, Frank Mueller, Christian Engelmann, Rolf Riesen, Kurt B. Ferreira:
Poster: detection and correction of silent data corruption for large-scale high-performance computing. SC Companion 2011: 47-48 - [c8]David Fiala, Kurt B. Ferreira, Frank Mueller, Christian Engelmann:
Poster: a tunable, software-based DRAM error detection and correction library for HPC. SC Companion 2011: 49-50 - 2010
- [c7]Kurt B. Ferreira, Patrick G. Bridges, Ron Brightwell, Kevin T. Pedretti:
The Impact of System Design Parameters on Application Noise Sensitivity. CLUSTER 2010: 146-155 - [c6]Rolf Riesen, Kurt B. Ferreira, Jon Stearley:
See applications run and throughput jump: The case for redundant computing in HPC. DSN Workshops 2010: 29-34 - [c5]Ron Brightwell, Kurt B. Ferreira, Rolf Riesen:
Transparent Redundant Computing with MPI. EuroMPI 2010: 208-218
2000 – 2009
- 2009
- [j1]Rolf Riesen, Ron Brightwell, Patrick G. Bridges, Trammell Hudson, Arthur B. Maccabe, Patrick M. Widener, Kurt B. Ferreira:
Designing and implementing lightweight kernels for capability computing. Concurr. Comput. Pract. Exp. 21(6): 793-817 (2009) - [c4]James H. Laros III, Kevin T. Pedretti, Suzanne M. Kelly, John P. Vandyke, Kurt B. Ferreira, Courtenay T. Vaughan, Mark Swan:
Topics on measuring real power usage on high performance computing platforms. CLUSTER 2009: 1-8 - 2008
- [c3]Ron Brightwell, Kevin T. Pedretti, Kurt B. Ferreira:
Instrumentation and Analysis of MPI Queue Times on the SeaStar High-Performance Network. ICCCN 2008: 590-596 - [c2]Kurt B. Ferreira, Patrick G. Bridges, Ron Brightwell:
Characterizing application sensitivity to OS interference using kernel-level noise injection. SC 2008: 19 - 2007
- [c1]Edgar A. León, Kurt B. Ferreira, Arthur B. Maccabe:
Reducing the Impact of the MemoryWall for I/O Using Cache Injection. Hot Interconnects 2007: 143-150
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-05-08 21:53 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint