


Остановите войну!
for scientists:


default search action
Dhabaleswar K. Panda 0001
Dhabaleswar K. D. K. Panda – Dhabaleswar Kumar Panda 0001
Person information

- affiliation: Ohio State University, Columbus, USA
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [j62]Kaushik Kandadi Suresh, Kawthar Shafie Khorassani, Chen-Chun Chen, Bharath Ramesh, Mustafa Abduljabbar, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
Network-Assisted Noncontiguous Transfers for GPU-Aware MPI Libraries. IEEE Micro 43(2): 131-139 (2023) - [i13]Hyunho Ahn, Tian Chen, Nawras Alnaasan, Aamir Shafi, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. Panda:
Performance Characterization of using Quantization for DNN Inference on Edge Devices: Extended Version. CoRR abs/2303.05016 (2023) - [i12]Quentin Anthony, Ammar Ahmad Awan, Jeff Rasley, Yuxiong He, Aamir Shafi, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. Panda:
MCR-DL: Mix-and-Match Communication Runtime for Deep Learning. CoRR abs/2303.08374 (2023) - 2022
- [j61]Arpan Jain
, Nawras Alnaasan
, Aamir Shafi
, Hari Subramoni
, Dhabaleswar K. Panda:
Optimizing Distributed DNN Training Using CPUs and BlueField-2 DPUs. IEEE Micro 42(2): 53-60 (2022) - [c471]Kinan Al-Attar, Aamir Shafi, Mustafa Abduljabbar
, Hari Subramoni, Dhabaleswar K. Panda:
Spark Meets MPI: Towards High-Performance Communication Framework for Spark using MPI. CLUSTER 2022: 71-81 - [c470]Apan Qasem, Hartwig Anzt, Eduard Ayguadé, Katharine Cahill, Ramon Canal, Jany Chan
, Eric Fosler-Lussier, Fritz Göbel, Arpan Jain, Marcel Koch, Mateusz Kuzak, Josep Llosa, Raghu Machiraju, Xavier Martorell, Pratik Nayak, Shameema Oottikkal, Marcin Ostasz, Dhabaleswar K. Panda, Dirk Pleiter, Rajiv Ramnath, Maria-Ribera Sancho, Alessio Sclocco, Aamir Shafi, Hanno Spreeuw, Hari Subramoni, Karen Tomko:
Lightning Talks of EduHPC 2022. EduHPC@SC 2022: 42-49 - [c469]Kaushik Kandadi Suresh, Kawthar Shafie Khorassani, Chen-Chun Chen, Bharath Ramesh, Mustafa Abduljabbar
, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
Network Assisted Non-Contiguous Transfers for GPU-Aware MPI Libraries. HOTI 2022: 13-20 - [c468]Tu Tran, Benjamin Michalowicz, Bharath Ramesh, Hari Subramoni, Aamir Shafi, Dhabaleswar K. Panda:
Designing Hierarchical Multi-HCA Aware Allgather in MPI. ICPP Workshops 2022: 28:1-28:10 - [c467]Chen-Chun Chen, Kawthar Shafie Khorassani, Quentin G. Anthony, Aamir Shafi
, Hari Subramoni, Dhabaleswar K. Panda:
Highly Efficient Alltoall and Alltoallv Communication Algorithms for GPU Systems. IPDPS Workshops 2022: 24-33 - [c466]Shulei Xu, Aamir Shafi
, Hari Subramoni, Dhabaleswar K. Panda:
Arm meets Cloud: A Case Study of MPI Library Performance on AWS Arm-based HPC Cloud with Elastic Fabric Adapter. IPDPS Workshops 2022: 449-456 - [c465]Kinan Al-Attar, Aamir Shafi
, Hari Subramoni, Dhabaleswar K. Panda:
Towards Java-based HPC using the MVAPICH2 Library: Early Experiences. IPDPS Workshops 2022: 510-519 - [c464]Nawras Alnaasan, Arpan Jain, Aamir Shafi
, Hari Subramoni, Dhabaleswar K. Panda:
OMB-Py: Python Micro-Benchmarks for Evaluating Performance of MPI Libraries on HPC Systems. IPDPS Workshops 2022: 870-879 - [c463]Qinghua Zhou, Pouya Kousha, Quentin Anthony, Kawthar Shafie Khorassani, Aamir Shafi
, Hari Subramoni
, Dhabaleswar K. Panda:
Accelerating MPI All-to-All Communication with Online Compression on Modern GPU Clusters. ISC 2022: 3-25 - [c462]Pouya Kousha, Arpan Jain, Ayyappa Kolli, Prasanna Sainath, Hari Subramoni
, Aamir Shafi
, Dhabaleswar K. Panda:
"Hey CAI" - Conversational AI Enabled User Interface for HPC Tools. ISC 2022: 87-108 - [c461]Arpan Jain, Aamir Shafi
, Quentin Anthony, Pouya Kousha, Hari Subramoni, Dhabaleswar K. Panda:
Hy-Fi: Hybrid Five-Dimensional Parallel DNN Training on High-Performance GPU Clusters. ISC 2022: 109-130 - [c460]Kawthar Shafie Khorassani, Chen-Chun Chen, Bharath Ramesh, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
High Performance MPI over the Slingshot Interconnect: Early Experiences. PEARC 2022: 15:1-15:7 - 2021
- [j60]Dhabaleswar Kumar Panda
, Hari Subramoni
, Ching-Hsiang Chu
, Mohammadreza Bayatpour:
The MVAPICH project: Transforming research into high-performance MPI library for HPC community. J. Comput. Sci. 52: 101208 (2021) - [c459]Kawthar Shafie Khorassani, Ching-Hsiang Chu, Quentin G. Anthony, Hari Subramoni, Dhabaleswar K. Panda:
Adaptive and Hierarchical Large Message All-to-all Communication Algorithms for Large-scale Dense GPU Systems. CCGRID 2021: 113-122 - [c458]Aamir Shafi
, Jahanzeb Maqbool Hashmi, Hari Subramoni, Dhabaleswar K. D. K. Panda:
Efficient MPI-based Communication for GPU-Accelerated Dask Applications. CCGRID 2021: 277-286 - [c457]Bharath Ramesh, Jahanzeb Maqbool Hashmi, Shulei Xu, Aamir Shafi
, Seyedeh Mahdieh Ghazimirsaeed, Mohammadreza Bayatpour, Hari Subramoni, Dhabaleswar K. Panda:
Towards Architecture-aware Hierarchical Communication Trees on Modern HPC Systems. HiPC 2021: 272-281 - [c456]Yuntian He, Saket Gurukar, Pouya Kousha, Hari Subramoni, Dhabaleswar K. Panda, Srinivasan Parthasarathy:
DistMILE: A Distributed Multi-Level Framework for Scalable Graph Embedding. HiPC 2021: 282-291 - [c455]Kaushik Kandadi Suresh, Bharath Ramesh, Chen-Chun Chen, Seyedeh Mahdieh Ghazimirsaeed, Mohammadreza Bayatpour, Aamir Shafi
, Hari Subramoni, Dhabaleswar K. Panda:
Layout-aware Hardware-assisted Designs for Derived Data Types in MPI. HiPC 2021: 302-311 - [c454]Nick Sarkauskas, Mohammadreza Bayatpour, Tu Tran, Bharath Ramesh, Hari Subramoni, Dhabaleswar K. Panda:
Large-Message Nonblocking MPI_Iallgather and MPI Ibcast Offload via BlueField-2 DPU. HiPC 2021: 388-393 - [c453]Arpan Jain, Nawras Alnaasan, Aamir Shafi
, Hari Subramoni, Dhabaleswar K. Panda:
Accelerating CPU-based Distributed DNN Training on Modern HPC Clusters using BlueField-2 DPUs. HOTI 2021: 17-24 - [c452]Q. Zhou, C. Chu, N. S. Kumar, Pouya Kousha, Seyedeh Mahdieh Ghazimirsaeed, Hari Subramoni
, Dhabaleswar K. Panda:
Designing High-Performance MPI Libraries with On-the-fly Compression for Modern GPU Clusters*. IPDPS 2021: 444-453 - [c451]Arpan Jain, Tim Moon, Tom Benson, Hari Subramoni, Sam Adé Jacobs, Dhabaleswar K. Panda, Brian Van Essen:
SUPER: SUb-Graph Parallelism for TransformERs. IPDPS 2021: 629-638 - [c450]Quentin Anthony, Lang Xu, Hari Subramoni, Dhabaleswar K. D. K. Panda:
Scaling Single-Image Super-Resolution Training on Modern HPC Clusters: Early Experiences. IPDPS Workshops 2021: 923-932 - [c449]Mohammadreza Bayatpour, Nick Sarkauskas, Hari Subramoni, Jahanzeb Maqbool Hashmi, Dhabaleswar K. Panda:
BluesMPI: Efficient MPI Non-blocking Alltoall Offloading Designs on Modern BlueField Smart NICs. ISC 2021: 18-37 - [c448]Kawthar Shafie Khorassani, Jahanzeb Maqbool Hashmi, Ching-Hsiang Chu
, Chen-Chun Chen, Hari Subramoni, Dhabaleswar K. Panda:
Designing a ROCm-Aware MPI Library for AMD GPUs: Early Experiences. ISC 2021: 118-136 - [c447]Pouya Kousha, Kamal Raj Sankarapandian Dayala Ganesh Ram, Mansa Kedia, Hari Subramoni
, Arpan Jain, Aamir Shafi
, Dhabaleswar K. Panda, Trey Dockendorf, Heechang Na, Karen Tomko:
INAM: Cross-stack Profiling and Analysis of Communication in MPI-based Applications. PEARC 2021: 14:1-14:11 - [i11]Aamir Shafi, Jahanzeb Maqbool Hashmi, Hari Subramoni, Dhabaleswar K. Panda:
Efficient MPI-based Communication for GPU-Accelerated Dask Applications. CoRR abs/2101.08878 (2021) - [i10]Pouya Kousha, Quentin Anthony, Hari Subramoni, Dhabaleswar K. Panda:
Cross-layer Visualization and Profiling of Network and I/O Communication for HPC Clusters. CoRR abs/2109.08329 (2021) - [i9]Nawras Alnaasan, Arpan Jain, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
OMB-Py: Python Micro-Benchmarks for Evaluating Performance of MPI Libraries on HPC Systems. CoRR abs/2110.10659 (2021) - 2020
- [j59]Sourav Chakraborty, Ignacio Laguna
, Murali Emani, Kathryn Mohror, Dhabaleswar K. Panda, Martin Schulz, Hari Subramoni:
EReinit: Scalable and efficient fault-tolerance for bulk-synchronous MPI applications. Concurr. Comput. Pract. Exp. 32(3) (2020) - [j58]Jahanzeb Maqbool Hashmi, Ching-Hsiang Chu
, Sourav Chakraborty, Mohammadreza Bayatpour, Hari Subramoni, Dhabaleswar K. Panda:
FALCON-X: Zero-copy MPI derived datatype processing on modern CPU and GPU architectures. J. Parallel Distributed Comput. 144: 1-13 (2020) - [j57]Ammar Ahmad Awan, Arpan Jain, Ching-Hsiang Chu
, Hari Subramoni, Dhabaleswar K. Panda:
Communication Profiling and Characterization of Deep-Learning Workloads on Clusters With High-Performance Interconnects. IEEE Micro 40(1): 35-43 (2020) - [c446]Mohammadreza Bayatpour, Seyedeh Mahdieh Ghazimirsaeed, Shulei Xu, Hari Subramoni, Dhabaleswar K. Panda:
Design and Characterization of InfiniBand Hardware Tag Matching in MPI. CCGRID 2020: 101-110 - [c445]Ching-Hsiang Chu
, Kawthar Shafie Khorassani, Qinghua Zhou, Hari Subramoni, Dhabaleswar K. Panda:
Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters. CLUSTER 2020: 130-141 - [c444]Aamir Shafi
, Jahanzeb Maqbool Hashmi, Hari Subramoni, Dhabaleswar K. Panda:
Blink: Towards Efficient RDMA-based Communication Coroutines for Parallel Python Applications. HiPC 2020: 111-120 - [c443]Ching-Hsiang Chu, Pouya Kousha, Ammar Ahmad Awan, Kawthar Shafie Khorassani, Hari Subramoni, Dhabaleswar K. D. K. Panda:
NV-group: link-efficient reduction for distributed deep learning on modern dense GPU systems. ICS 2020: 6:1-6:12 - [c442]Jahanzeb Maqbool Hashmi, Shulei Xu, Bharath Ramesh, Mohammadreza Bayatpour, Hari Subramoni, Dhabaleswar K. D. K. Panda:
Machine-agnostic and Communication-aware Designs for MPI on Emerging Architectures. IPDPS 2020: 32-41 - [c441]Amit Ruhela
, Shulei Xu, Karthik Vadambacheri Manian
, Hari Subramoni, Dhabaleswar K. Panda:
Analyzing and Understanding the Impact of Interconnect Performance on HPC, Big Data, and Deep Learning Applications: A Case Study with InfiniBand EDR and HDR. IPDPS Workshops 2020: 869-878 - [c440]Kaushik Kandadi Suresh, Bharath Ramesh, Seyedeh Mahdieh Ghazimirsaeed, Mohammadreza Bayatpour, Jahanzeb Maqbool Hashmi, Hari Subramoni, Dhabaleswar K. D. K. Panda:
Performance Characterization of Network Mechanisms for Non-Contiguous Data Transfers in MPI. IPDPS Workshops 2020: 896-905 - [c439]Quentin Anthony, Ammar Ahmad Awan, Arpan Jain, Hari Subramoni, Dhabaleswar K. D. K. Panda:
Efficient Training of Semantic Image Segmentation on Summit using Horovod and MVAPICH2-GDR. IPDPS Workshops 2020: 1015-1023 - [c438]Bharath Ramesh, Kaushik Kandadi Suresh, Nick Sarkauskas, Mohammadreza Bayatpour, Jahanzeb Maqbool Hashmi, Hari Subramoni, Dhabaleswar K. Panda:
Scalable MPI Collectives using SHARP: Large Scale Performance Evaluation on the TACC Frontera System. ExaMPI@SC 2020: 11-20 - [c437]Seyedeh Mahdieh Ghazimirsaeed, Quentin Anthony, Aamir Shafi
, Hari Subramoni, Dhabaleswar K. D. K. Panda:
Accelerating GPU-based Machine Learning in Python using MPI Library: A Case Study with MVAPICH2-GDR. MLHPC/AI4S@SC 2020: 17-28 - [c436]Shulei Xu, Seyedeh Mahdieh Ghazimirsaeed, Jahanzeb Maqbool Hashmi, Hari Subramoni, Dhabaleswar K. Panda:
MPI Meets Cloud: Case Study with Amazon EC2 and Microsoft Azure. IPDRM@SC 2020: 41-48 - [c435]Arpan Jain, Ammar Ahmad Awan, Asmaa M. Aljuhani, Jahanzeb Maqbool Hashmi, Quentin G. Anthony, Hari Subramoni, Dhabaleswar K. Panda, Raghu Machiraju, Anil Parwani:
GEMS: GPU-enabled memory-aware model-parallelism system for distributed DNN training. SC 2020: 45 - [c434]Samuel Khuvis
, Karen Tomko, Jahanzeb Maqbool Hashmi, Dhabaleswar K. Panda:
Exploring Hybrid MPI+Kokkos Tasks Programming Model. PAW-ATM@SC 2020: 66-73 - [c433]Ammar Ahmad Awan, Arpan Jain, Quentin Anthony, Hari Subramoni, Dhabaleswar K. Panda:
HyPar-Flow: Exploiting MPI and Keras for Scalable Hybrid-Parallel DNN Training with TensorFlow. ISC 2020: 83-103 - [c432]Mohammadreza Bayatpour, Jahanzeb Maqbool Hashmi, Sourav Chakraborty, Kaushik Kandadi Suresh, Seyedeh Mahdieh Ghazimirsaeed, Bharath Ramesh, Hari Subramoni, Dhabaleswar K. Panda:
Communication-Aware Hardware-Assisted MPI Overlap Engine. ISC 2020: 517-535 - [c431]Dan Stanzione
, John West
, R. Todd Evans
, Tommy Minyard, Omar Ghattas, Dhabaleswar K. Panda:
Frontera: The Evolution of Leadership Computing at the National Science Foundation. PEARC 2020: 106-111 - [c430]Pouya Kousha, Kamal Raj S. D., Hari Subramoni
, Dhabaleswar K. Panda, Heechang Na, Trey Dockendorf, Karen Tomko
:
Accelerated Real-time Network Monitoring and Profiling at Scale using OSU INAM. PEARC 2020: 215-223 - [e7]Dhabaleswar K. Panda:
Supercomputing Frontiers - 6th Asian Conference, SCFA 2020, Singapore, February 24-27, 2020, Proceedings. Lecture Notes in Computer Science 12082, Springer 2020, ISBN 978-3-030-48841-3 [contents] - [i8]Ritu Arora, Xiaosong Li, Bonnie Hurwitz, Daniel Fay, Dhabaleswar K. Panda, Edward F. Valeev, Shaowen Wang, Shirley Moore, Sunita Chandrasekaran, Ting Cao, Holly Bik, Matthew Curry, Tanzima Z. Islam:
Future Directions of the Cyberinfrastructure for Sustained Scientific Innovation (CSSI) Program. CoRR abs/2010.15584 (2020)
2010 – 2019
- 2019
- [j56]Depai Qian, Dhabaleswar K. Panda:
CCF THPC inaugural issue editorial. CCF Trans. High Perform. Comput. 1(1): 1-2 (2019) - [j55]Amit Ruhela
, Hari Subramoni, Sourav Chakraborty, Mohammadreza Bayatpour, Pouya Kousha, Dhabaleswar K. Panda:
Efficient design for MPI asynchronous progress without dedicated resources. Parallel Comput. 85: 13-26 (2019) - [j54]Ammar Ahmad Awan
, Karthik Vadambacheri Manian
, Ching-Hsiang Chu
, Hari Subramoni, Dhabaleswar K. Panda:
Optimized large-message broadcast for deep learning workloads: MPI, MPI+NCCL, or NCCL2? Parallel Comput. 85: 141-152 (2019) - [j53]Ching-Hsiang Chu
, Xiaoyi Lu
, Ammar Ahmad Awan
, Hari Subramoni
, Bracy Elton
, Dhabaleswar K. Panda:
Exploiting Hardware Multicast and GPUDirect RDMA for Efficient Broadcast. IEEE Trans. Parallel Distributed Syst. 30(3): 575-588 (2019) - [c429]Karthik Vadambacheri Manian
, A. A. Ammar, Amit Ruhela
, Ching-Hsiang Chu, Hari Subramoni, Dhabaleswar K. Panda:
Characterizing CUDA Unified Memory (UM)-Aware MPI Designs on Modern GPU Architectures. GPGPU@ASPLOS 2019: 43-52 - [c428]Jahanzeb Maqbool Hashmi, Sourav Chakraborty, Mohammadreza Bayatpour, Hari Subramoni, Dhabaleswar K. Panda:
Design and Characterization of Shared Address Space MPI Collectives on Modern Architectures. CCGRID 2019: 410-419 - [c427]Ammar Ahmad Awan, Jeroen Bédorf, Ching-Hsiang Chu, Hari Subramoni, Dhabaleswar K. Panda:
Scalable Distributed DNN Training using TensorFlow and CUDA-Aware MPI: Characterization, Designs, and Performance Evaluation. CCGRID 2019: 498-507 - [c426]Arpan Jain, Ammar Ahmad Awan, Quentin Anthony, Hari Subramoni, Dhabaleswar K. Panda:
Performance Characterization of DNN Training using TensorFlow and PyTorch on Modern Clusters. CLUSTER 2019: 1-11 - [c425]Pouya Kousha, Bharath Ramesh, Kaushik Kandadi Suresh, Ching-Hsiang Chu, Arpan Jain, Nick Sarkauskas, Hari Subramoni
, Dhabaleswar K. Panda:
Designing a Profiling and Visualization Tool for Scalable and In-depth Analysis of High-Performance GPU Clusters. HiPC 2019: 93-102 - [c424]Dipti Shankar, Xiaoyi Lu, Dhabaleswar K. Panda:
SCOR-KV: SIMD-Aware Client-Centric and Optimistic RDMA-Based Key-Value Store for Emerging CPU Architectures. HiPC 2019: 257-266 - [c423]Ching-Hsiang Chu, Jahanzeb Maqbool Hashmi, Kawthar Shafie Khorassani, Hari Subramoni, Dhabaleswar K. Panda:
High-Performance Adaptive MPI Derived Datatype Communication for Modern Multi-GPU Systems. HiPC 2019: 267-276 - [c422]Sourav Chakraborty, Shulei Xu, Hari Subramoni, Dhabaleswar K. Panda:
Designing Scalable and High-Performance MPI Libraries on Amazon Elastic Fabric Adapter. Hot Interconnects 2019: 40-44 - [c421]Ammar Ahmad Awan, Arpan Jain, Ching-Hsiang Chu, Hari Subramoni, Dhabaleswar K. Panda:
Communication Profiling and Characterization of Deep Learning Workloads on Clusters with High-Performance Interconnects. Hot Interconnects 2019: 49-53 - [c420]Haiyang Shi, Xiaoyi Lu, Dipti Shankar, Dhabaleswar K. Panda:
UMR-EC: A Unified and Multi-Rail Erasure Coding Library for High-Performance Distributed Storage Systems. HPDC 2019: 219-230 - [c419]Dipti Shankar, Xiaoyi Lu, Dhabaleswar K. D. K. Panda:
SimdHT-Bench: Characterizing SIMD-Aware Hash Table Designs on Emerging CPU Architectures. IISWC 2019: 178-188 - [c418]Jie Zhang, Xiaoyi Lu, Ching-Hsiang Chu, Dhabaleswar K. Panda:
C-GDR: High-Performance Container-Aware GPUDirect MPI Communication Schemes on RDMA Networks. IPDPS 2019: 242-251 - [c417]Jahanzeb Maqbool Hashmi, Sourav Chakraborty, Mohammadreza Bayatpour, Hari Subramoni, Dhabaleswar K. Panda:
FALCON: Efficient Designs for Zero-Copy MPI Datatype Processing on Emerging Architectures. IPDPS 2019: 355-364 - [c416]Xiaoyi Lu, Jianfeng Zhan, Dhabaleswar K. Panda:
Introduction to HPBDC 2019. IPDPS Workshops 2019: 394 - [c415]Dhabaleswar K. Panda, Ammar Ahmad Awan, Hari Subramoni:
High performance distributed deep learning: a beginner's guide. PPoPP 2019: 452-454 - [c414]Amit Ruhela
, Bharath Ramesh, Sourav Chakraborty, Hari Subramoni, Jahanzeb Maqbool Hashmi, Dhabaleswar K. Panda:
Leveraging Network-level parallelism with Multiple Process-Endpoints for MPI Broadcast. IPDRM@SC 2019: 34-41 - [c413]Shulei Xu, Jahanzeb Maqbool Hashmi, Sourav Chakraborty, Hari Subramoni, Dhabaleswar K. Panda:
Design and Evaluation of Shared Memory CommunicationBenchmarks on Emerging Architectures using MVAPICH2. IPDRM@SC 2019: 42-49 - [c412]Arpan Jain, Ammar Ahmad Awan, Hari Subramoni, Dhabaleswar K. Panda:
Scaling TensorFlow, PyTorch, and MXNet using MVAPICH2 for High-Performance Deep Learning on Frontera. DLS@SC 2019: 76-83 - [c411]Kawthar Shafie Khorassani, Ching-Hsiang Chu
, Hari Subramoni, Dhabaleswar K. Panda:
Performance Evaluation of MPI Libraries on GPU-Enabled OpenPOWER Architectures: Early Experiences. ISC Workshops 2019: 361-378 - [i7]Ammar Ahmad Awan, Arpan Jain, Quentin Anthony, Hari Subramoni, Dhabaleswar K. Panda:
HyPar-Flow: Exploiting MPI and Keras for Scalable Hybrid-Parallel DNN Training using TensorFlow. CoRR abs/1911.05146 (2019) - 2018
- [j52]Md. Wasi-ur-Rahman, Nusrat Sharmin Islam, Xiaoyi Lu, Dipti Shankar, Dhabaleswar K. Panda:
MR-Advisor: A comprehensive tuning, profiling, and prediction tool for MapReduce execution frameworks on HPC clusters. J. Parallel Distributed Comput. 120: 237-250 (2018) - [j51]Dhabaleswar K. Panda, Xiaoyi Lu
, Hari Subramoni:
Networking and communication challenges for post-exascale systems. Frontiers Inf. Technol. Electron. Eng. 19(10): 1230-1235 (2018) - [j50]Srinivasan Ramesh, Aurèle Mahéo, Sameer Shende, Allen D. Malony, Hari Subramoni, Amit Ruhela
, Dhabaleswar K. Panda:
MPI performance engineering with the MPI tool interface: The integration of MVAPICH and TAU. Parallel Comput. 77: 19-37 (2018) - [j49]Xiaoyi Lu
, Haiyang Shi, Rajarshi Biswas, M. Haseeb Javed
, Dhabaleswar K. Panda:
DLoBD: A Comprehensive Study of Deep Learning over Big Data Stacks on HPC Clusters. IEEE Trans. Multi Scale Comput. Syst. 4(4): 635-648 (2018) - [c410]Haiyang Shi, Xiaoyi Lu, Dhabaleswar K. Panda:
EC-Bench: Benchmarking Onload and Offload Erasure Coders on Modern Hardware Architectures. Bench 2018: 215-230 - [c409]Xiaoyi Lu, Dipti Shankar, Haiyang Shi, Dhabaleswar K. Panda:
Spark-uDAPL: Cost-Saving Big Data Analytics on Microsoft Azure Cloud with RDMA Networks*. IEEE BigData 2018: 321-326 - [c408]Haiyang Shi, Xiaoyi Lu, Dipti Shankar, Dhabaleswar K. Panda:
High-Performance Multi-Rail Erasure Coding Library over Modern Data Center Architectures: Early Experiences. SoCC 2018: 530-531 - [c407]Mohammadreza Bayatpour, Jahanzeb Maqbool Hashmi, Sourav Chakraborty, Hari Subramoni, Pouya Kousha, Dhabaleswar K. Panda:
SALaR: Scalable and Adaptive Designs for Large Message Reduction Collectives. CLUSTER 2018: 12-23 - [c406]M. Haseeb Javed, Xiaoyi Lu, Dhabaleswar K. Panda:
Cutting the Tail: Designing High Performance Message Brokers to Reduce Tail Latencies in Stream Processing. CLUSTER 2018: 223-233 - [c405]Rajarshi Biswas
, Xiaoyi Lu, Dhabaleswar K. Panda:
Accelerating TensorFlow with Adaptive RDMA-Based gRPC. HiPC 2018: 2-11 - [c404]Ammar Ahmad Awan, Ching-Hsiang Chu, Hari Subramoni, Xiaoyi Lu, Dhabaleswar K. Panda:
OC-DNN: Exploiting Advanced Unified Memory Capabilities in CUDA 9 and Volta GPUs for Out-of-Core DNN Training. HiPC 2018: 143-152 - [c403]Xiaoyi Lu, Jianfeng Zhan, Dhabaleswar K. Panda:
Introduction to HPBDC 2018. IPDPS Workshops 2018: 446 - [c402]Jahanzeb Maqbool Hashmi, Sourav Chakraborty, Mohammadreza Bayatpour, Hari Subramoni, Dhabaleswar K. Panda:
Designing Efficient Shared Address Space Reduction Collectives for Multi-/Many-cores. IPDPS 2018: 1020-1029 - [c401]Ammar Ahmad Awan, Ching-Hsiang Chu, Hari Subramoni, Dhabaleswar K. Panda:
Optimized Broadcast for Deep Learning Workloads on Dense-GPU InfiniBand Clusters: MPI or NCCL? EuroMPI 2018: 2:1-2:9 - [c400]Mingzhe Li, Xiaoyi Lu, Hari Subramoni, Dhabaleswar K. Panda:
Multi-Threading and Lock-Free MPI RMA Based Graph Processing on KNL and POWER Architectures. EuroMPI 2018: 4:1-4:10 - [c399]Amit Ruhela
, Hari Subramoni
, Sourav Chakraborty, Mohammadreza Bayatpour, Pouya Kousha, Dhabaleswar K. Panda:
Efficient Asynchronous Communication Progress for MPI without Dedicated Resources. EuroMPI 2018: 14:1-14:11 - [c398]Sourav Chakraborty, Mohammadreza Bayatpour, Jahanzeb Maqbool Hashmi, Hari Subramoni, Dhabaleswar K. Panda:
Cooperative rendezvous protocols for improved performance and overlap. SC 2018: 28:1-28:13 - [c397]Shashank Gugnani, Xiaoyi Lu, Dhabaleswar K. Panda:
Analyzing, Modeling, and Provisioning QoS for NVMe SSDs. UCC 2018: 247-256 - [e6]Esam El-Araby, Dhabaleswar K. Panda, Sandra Gesing, Amy W. Apon, Volodymyr V. Kindratenko, Massimo Cafaro, Alfredo Cuzzocrea:
18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2018, Washington, DC, USA, May 1-4, 2018. IEEE Computer Society 2018, ISBN 978-1-5386-5815-4 [contents] - [i6]Rajarshi Biswas, Xiaoyi Lu, Dhabaleswar K. Panda:
Designing a Micro-Benchmark Suite to Evaluate gRPC for TensorFlow: Early Experiences. CoRR abs/1804.01138 (2018) - [i5]Ammar Ahmad Awan, Jeroen Bédorf, Ching-Hsiang Chu, Hari Subramoni, Dhabaleswar K. Panda:
Scalable Distributed DNN Training using TensorFlow and CUDA-Aware MPI: Characterization, Designs, and Performance Evaluation. CoRR abs/1810.11112 (2018) - 2017
- [j48]Xiaoyi Lu, Dipti Shankar, Dhabaleswar K. Panda:
Scalable and Distributed Key-Value Store-based Data Management Using RDMA-Memcached. IEEE Data Eng. Bull. 40(1): 50-61 (2017) - [j47]Md. Wasi-ur-Rahman, Nusrat Sharmin Islam, Xiaoyi Lu, Dhabaleswar K. Panda:
A Comprehensive Study of MapReduce Over Lustre for Intermediate Data Placement and Shuffle Strategies on HPC Clusters. IEEE Trans. Parallel Distributed Syst. 28(3): 633-646 (2017) - [c396]M. Haseeb Javed, Xiaoyi Lu, Dhabaleswar K. Panda:
Characterization of Big Data Stream Processing Pipeline: A Case Study using Flink and Kafka. BDCAT 2017: 1-10 - [c395]Shashank Gugnani, Xiaoyi Lu, Houliang Qi, Li Zha, Dhabaleswar K. Panda:
Characterizing and accelerating indexing techniques on distributed ordered tables. IEEE BigData 2017: 173-182 - [c394]Xiaoyi Lu, Haiyang Shi, Dipti Shankar, Dhabaleswar K. Panda:
Performance characterization and acceleration of big data workloads on OpenPOWER system. IEEE BigData 2017: 213-222 - [c393]Md. Wasi-ur-Rahman, Nusrat Sharmin Islam, Xiaoyi Lu, Dhabaleswar K. Panda:
NVMD: Non-volatile memory assisted design for accelerating MapReduce and DAG execution frameworks on HPC systems. IEEE BigData 2017: 369-374 - [c392]Shashank Gugnani, Xiaoyi Lu, Dhabaleswar K. Panda:
Swift-X: Accelerating OpenStack Swift with RDMA for Building an Efficient HPC Cloud. CCGrid 2017: 238-247 - [c391]Sourav Chakraborty, Hari Subramoni, Dhabaleswar K. Panda:
Contention-Aware Kernel-Assisted MPI Collectives for Multi-/Many-Core Systems. CLUSTER 2017: 13-24 - [c390]Hari Subramoni, Xiaoyi Lu, Dhabaleswar K. Panda:
A Scalable Network-Based Performance Analysis Tool for MPI on Large-Scale HPC Systems. CLUSTER 2017: 354-358 - [c389]