


EuroMPI 2016: Edinburgh, United Kingdom
- Jack J. Dongarra, Daniel J. Holmes, Antonia B. K. Collis, Jesper Larsson Träff, Lorna Smith:

Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, Edinburgh, United Kingdom, September 25-28, 2016. ACM 2016, ISBN 978-1-4503-4234-6
Overall Winner
- Hoang-Vu Dang, Marc Snir, William Gropp:

Towards millions of communicating threads. 1-14
Runner-up Winners
- A. A. Awan, Khaled Hamidouche, Akshay Venkatesh, Dhabaleswar K. Panda:

Efficient Large Message Broadcast using NCCL and CUDA-Aware MPI for Deep Learning. 15-22
- Martin Ruefenacht, Mark Bull, Stephen Booth:

Generalisation of Recursive Doubling for AllReduce. 23-31
Scalability and the Road to Exascale
- Sameer Kumar, Philip Heidelberger, Craig B. Stunkel:

Space Performance Tradeoffs in Compressing MPI Group Data Structures. 32-40
- William Gropp, Luke N. Olson, Philipp Samfass:

Modeling MPI Communication Performance on SMP Nodes: Is it Time to Retire the Ping Pong Test. 41-50
- Jean-Baptiste Besnard, Julien Adam, Sameer Shende, Marc Pérache, Patrick Carribault, Julien Jaeger:

Introducing Task-Containers as an Alternative to Runtime-Stacking. 51-63
Fault tolerance
- Federico Reghenzani, Gianmario Pozzi, Giuseppe Massari, Simone Libutti, William Fornaciari:

The MIG Framework: Enabling Transparent Process Migration in Open MPI. 64-73
- Pierre Lemarinier, Khalid Hasanov, Srikumar Venugopal, Kostas Katrinis:

Architecting Malleable MPI Applications for Priority-driven Adaptive Scheduling. 74-81
- Isaías A. Comprés Ureña, Ao Mo-Hellenbrand, Michael Gerndt, Hans-Joachim Bungartz:

Infrastructure and API Extensions for Elastic Execution of MPI Applications. 82-97
Challenges and Extensions
- Jesper Larsson Träff:

A Library for Advanced Datatype Programming. 98-107
- Alexandra Carpen-Amarie, Sascha Hunold, Jesper Larsson Träff:

On the Expected and Observed Communication Performance with MPI Derived Datatypes. 108-120
- Daniel J. Holmes, Kathryn M. Mohror, Ryan E. Grant, Anthony Skjellum, Martin Schulz, Wesley Bland, Jeffrey M. Squyres:

MPI Sessions: Leveraging Runtime Infrastructure to Increase Scalability of Applications at Exascale. 121-129
Parallel Applications using MPI
- António Esteves, Alfredo Moura:

Distributed Memory Implementation Strategies for the kinetic Monte Carlo Algorithm. 130-139
- Scott Levy, Kurt B. Ferreira, Patrick M. Widener, Patrick G. Bridges, Oscar H. Mondragon:

How I Learned to Stop Worrying and Love In Situ Analytics: Leveraging Latent Synchronization in MPI Collective Algorithms. 140-153
- Matthias Lieber, Kerstin Gößner, Wolfgang E. Nagel:

The Potential of Diffusive Load Balancing at Large Scale. 154-157
Single-sided RDMA
- Sameer Kumar, Robert Blackmore, Sameh Sharkawi, K. A. Nysal Jan, Amith R. Mamidala, T. J. Chris Ward:

Optimization of Message Passing Services on POWER8 InfiniBand Clusters. 158-166
- Ana Gainaru, Richard L. Graham, Artem Y. Polyakov, Gilad Shainer:

Using InfiniBand Hardware Gather-Scatter Capabilities to Optimize MPI All-to-All. 167-179
- Balazs Gerofi, Masamichi Takagi, Yutaka Ishikawa:

Revisiting RDMA Buffer Registration in the Context of Lightweight Multi-kernels. 180-183
- Nathan T. Hjelm:

An Evaluation of the One-Sided Performance in Open MPI. 184-187
Tools
- Tobias Hilbrich, Matthias Weber, Joachim Protze, Bronis R. de Supinski, Wolfgang E. Nagel:

Runtime Correctness Analysis of MPI-3 Nonblocking Collectives. 188-197
- Alessandro Fanfarillo, Jeff R. Hammond:

CAF Events Implementation Using MPI-3 Capabilities. 198-207
- Søren Rasmussen, Martin Schulz, Kathryn M. Mohror:

Allowing MPI tools builders to forget about Fortran. 208-211
Posters
- Fabio Affinito, Carlo Cavazzoni:

FFT data distribution in plane-waves DFT codes. A case study from Quantum ESPRESSO. 212
- Alexey Malhanov, Ariel J. Biller, Michael Chuvelev:

Optimizing PARSEC for Knights Landing. 213-214
- Keiichiro Fukazawa, Toshiya Takami, Takeshi Soga, Yoshiyuki Morie, Takeshi Nanri:

Effective Calculation with Halo communication using Halo Functions. 215-216
- Alice Koniges, Brandon Cook, Jack Deslippe, Thorsten Kurth, Hongzhang Shan:

MPI usage at NERSC: Present and Future. 217
- Takayuki Umeda, Keiichiro Fukazawa:

Performance comparison of Eulerian kinetic Vlasov code between flat-MPI parallelism and hybrid parallelism on Fujitsu FX100 supercomputer. 218-221
