


default search action
IPDPS 2016: Chicago, IL, USA - Workshops
- 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPS Workshops 2016, Chicago, IL, USA, May 23-27, 2016. IEEE Computer Society 2016, ISBN 978-1-5090-3682-0
Workshop 1-HCW - Heterogeneity in Computing Workshop
- Denis Trystram, Erik Saule:
HCW Introduction. 1-2 - Behrooz A. Shirazi:
Message from the HCW Steering Committee Chair. 3 - Denis Trystram:
Message from the HCW General Chair. 4 - Erik Saule:
Message from the HCW Program Committee Chair. 5 - Mahmut T. Kandemir:
HCW 2016 Keynote Talk. 6
Session 1: Heterogeneity in the Cloud
- Julio Proaño
, Carmen Carrión
, María Blanca Caminero
:
Towards a Green, QoS-Enabled Heterogeneous Cloud Infrastructure. 7-16 - Rekha Singhal, Abhishek Verma:
Predicting Job Completion Time in Heterogeneous MapReduce Environments. 17-27 - Fouad Hanna, Loris Marchal
, Jean-Marc Nicod, Laurent Philippe, Veronika Rehn-Sonigo, Hala Sabbah:
Minimizing Rental Cost for Multiple Recipe Applications in the Cloud. 28-37
Session 2: Heterogeneity in Single Node Systems
- Saeid Barati, Hank Hoffmann:
Providing Fairness in Heterogeneous Multicores with a Predictive, Adaptive Scheduler. 38-49 - Jeremy Bottleson, SungYe Kim, Jeff Andrews, Preeti Bindu, Deepak N. Murthy, Jingyi Jin:
clCaffe: OpenCL Accelerated Caffe for Convolutional Neural Networks. 50-57 - Bahareh Goodarzi, Martin Burtscher, Dhrubajyoti Goswami:
Parallel Graph Partitioning on a CPU-GPU Architecture. 58-66
Session 3: Heterogeneity and Energy
- Dylan Machovec, Bhavesh Khemka, Sudeep Pasricha, Anthony A. Maciejewski
, Howard Jay Siegel, Gregory A. Koenig, Michael Wright, Marcia Hilton, Rajendra Rambharos, Neena Imam:
Dynamic Resource Management for Parallel Tasks in an Oversubscribed Energy-Constrained Heterogeneous Environment. 67-78 - JeeWhan Choi, Richard W. Vuduc
:
Analyzing the Energy Efficiency of the Fast Multipole Method Using a DVFS-Aware Energy Model. 79-88 - John E. Stone
, Michael J. Hallock
, James C. Phillips
, Joseph R. Peterson, Zaida Luthey-Schulten, Klaus Schulten:
Evaluation of Emerging Energy-Efficient Heterogeneous Computing Platforms for Biomolecular and Cellular Simulation Workloads. 89-100
Workshop 2-RAW - Reconfigurable Architectures Workshop
- Marco D. Santambrogio, Ramachandran Vaidyanathan, Diana Goehringer, Steven J. E. Wilton:
RAW Introduction and Committees. 101-102 - H. Peter Hofstee, Patrick Lysaght, Dirk van den Heuvel:
RAW 2016 Keynotes. 103-104
Session 1: Application Mapping and Design Space Exploration
- Lester Kalms, Diana Göhringer:
Clustering and Mapping Algorithm for Application Distribution on a Scalable FPGA Cluster. 105-113 - Syed Waqar Nabi, Wim Vanderbauwhede:
A Fast and Accurate Cost Model for FPGA Design Space Exploration in HPC Applications. 114-123 - Hyunsuk Nam, Roman Lysecky:
Latency, Power, and Security Optimization in Distributed Reconfigurable Embedded Systems. 124-131
Session 2: Applications
- Daniel Llamocca
, Daniel N. Aloi:
A Reconfigurable Fixed-Point Architecture for Adaptive Beamforming. 132-138 - Aaron Mills, Phillip H. Jones, Joseph Zambreno:
Parameterizable FPGA-Based Kalman Filter Coprocessor Using Piecewise Affine Modeling. 139-147 - Chi Zhang, Ren Chen, Viktor K. Prasanna:
High Throughput Large Scale Sorting on a CPU-FPGA Heterogeneous Platform. 148-155 - Juan Andrés Pérez-Celis, José Martínez-Carranza
, Alicia Morales-Reyes
, Claudia Feregrino Uribe, René Cumplido:
An FPGA Architecture to Accelerate the Burrows Wheeler Transform by Using a Linear Sorter. 156-161
Session 3: Processor Architectures
- Mohamed El-Hadedy, Hristina Mihajloska, Danilo Gligoroski, Amit Kulkarni, Dirk Stroobandt, Kevin Skadron
:
A 16-Bit Reconfigurable Encryption Processor for p-Cipher. 162-171 - Stephan Nolting, Guillermo Payá-Vayá, Florian Giesemann, Holger Blume
, Sebastian Niemann, Christian Müller-Schloer:
Dynamic Self-Reconfiguration of a MIPS-Based Soft-Processor Architecture. 172-180 - Steffen Vaas, Marc Reichenbach
, Dietmar Fey:
An Application-Specific Instruction Set Processor for Power Quality Monitoring. 181-188
Session 4: Scheduler and Runtime Systems
- Andrea Purgato, Davide Tantillo, Marco Rabozzi, Donatella Sciuto, Marco D. Santambrogio:
Resource-Efficient Scheduling for Partially-Reconfigurable FPGA-Based Systems. 189-197 - Tajas Ruschke, Lukas Johannes Jung, Dennis Wolf, Christian Hochberger:
Scheduler for Inhomogeneous and Irregular CGRAs with Support for Complex Control Flow. 198-207 - Jens Rettkowski, Philipp Wehner, Evgheni Cutiscev, Diana Göhringer:
LinROS: A Linux-Based Runtime System for Reconfigurable MPSoCs. 208-216
Session 5: High Level Synthesis and Object-Oriented Programming
- Emanuele Del Sozzo
, Andrea Solazzo, Antonio Miele
, Marco D. Santambrogio:
On the Automation of High Level Synthesis of Convolutional Neural Networks. 217-224 - Gianluca C. Durelli, Fabrizio Spada, Christian Pilato
, Marco D. Santambrogio:
Scala-Based Domain-Specific Language for Creating Accelerator-Based SoCs. 225-232 - Hongyuan Ding, Sen Ma, Miaoqing Huang, David Andrews
:
OOGen: An Automated Generation Tool for Custom MPSoC Architectures Based on Object-Oriented Programming Methods. 233-240
Short Papers
- Benedikt Janßen, Moataz Naserddin, Michael Hübner:
A Hardware/Software Co-Design Approach for Control Applications with Static Real-Time Reallocation. 241-246 - Giulia Guidi, Enrico Reggiani, Lorenzo Di Tucci, Gianluca Durelli, Michaela Blott, Marco D. Santambrogio:
On How to Improve FPGA-Based Systems Design Productivity via SDAccel. 247-252 - Jones Yudi Mori
, André Werner, Florian Fricke, Michael Hübner:
A Rapid Prototyping Method to Reduce the Design Time in Commercial High-Level Synthesis Tools. 253-258 - Salma Hesham, Diana Göhringer
, Mohamed A. Abd El Ghany
:
ARTNoCs: An Evaluation Framework for Hardware Architectures of Real-Time NoCs. 259-264 - Amit Kulkarni, Elias Vansteenkiste, Dirk Stroobandt, Andreas Brokalakis, Antonis Nikitakis:
A Fully Parameterized Virtual Coarse Grained Reconfigurable Array for High Performance Computing Applications. 265-270 - Anita Tino, Kaamran Raahemifar:
Assessing Multi-task Placement Algorithms in RCUs. 271-276 - Alexandra Kourfali
, Dirk Stroobandt:
Efficient Hardware Debugging Using Parameterized FPGA Reconfiguration. 277-282 - Fynn Schwiegelshohn, Florian Kastner, Michael Hübner:
Enabling Dynamic Reconfiguration of Numerical Methods for the Robotic Motion Control Task. 283-288 - Martín Letras
, Raudel Hernández-León, René Cumplido:
Hardware Architectures for Frequent Itemset Mining Based on Equivalence Classes Partitioning. 289-294 - Fabiola Casasopra, Gea Bianchi, Gianluca C. Durelli, Marco D. Santambrogio:
Parallel Protein Identification Using an FPGA-Based Solution. 295-299 - Nikolaos Stekas, Dirk van den Heuvel:
Face Recognition Using Local Binary Patterns Histograms (LBPH) on an FPGA-Based System on Chip (SoC). 300-304
Workshop 3-HIPS - High-Level Parallel Programming Models and Supportive Environments
- David Böhme
, Xu Liu:
HIPS Introduction and Committees. 305-306 - Tim Mattson:
HIPS 2016 Keynote. 307
Session 1: Debugging and Optimization
- Faheem Ullah, Thomas R. Gross:
Detecting Anomalies in Concurrent Programs Based on Dynamic Control Flow Changes. 308-317 - Marc Sergent, David Goudin, Samuel Thibault, Olivier Aumage:
Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System. 318-327 - Shingo Okuno
, Tasuku Hiraishi, Hiroshi Nakashima, Masahiro Yasugi, Jun Sese
:
Reducing Redundant Search in Parallel Graph Mining Using Exceptions. 328-337
Session 2: Heterogeneous Computing
- Matt Martineau, Simon McIntosh-Smith
, Wayne P. Gaudin:
Evaluating OpenMP 4.0's Effectiveness as a Heterogeneous Parallel Programming Model. 338-347 - Ebad Salehi, Ahmad Lashgar
, Amirali Baniasadi:
Employing Compression Solutions under OpenACC. 348-356 - Craig Edward Rasmussen, Matthew J. Sottile
, Søren Rasmussen, Daniel Nagle, William Dumas:
CAFe: Coarray Fortran Extensions for Heterogeneous Computing. 357-365
Session 3: Parallel Algorithms and Systems
- Peter Mills, Clinton Jeffery:
Embedding Concurrent Generators. 366-375 - Josef Weidendorfer, Jens Breitbart:
The Case for Binary Rewriting at Runtime for Efficient Implementation of High-Level Programming Models in HPC. 376-385 - Seyed Hessam Mirsadeghi, Ahmad Afsahi:
PTRAM: A Parallel Topology-and Routing-Aware Mapping Framework for Large-Scale HPC Systems. 386-396 - Joshua Dennis Booth, Kyungjoo Kim, Sivasankaran Rajamanickam:
A Comparison of High-Level Programming Choices for Incomplete Sparse Factorization Across Different Architectures. 397-406
Workshop 4-HiCOMB - High Performance Computational Biology
- Srinivas Aluru, David A. Bader
, Ananth Kalyanaraman, Jaroslaw Zola:
HiCOMB Introduction and Committees. 407
Session I
- Constantin Scholl, Kassian Kobert, Tomás Flouri
, Alexandros Stamatakis
:
The Divisible Load Balance Problem with Shared Cost and Its Application to Phylogenetic Inference. 408-417 - Nikolaos Alachiotis, Doru-Thom Popovici, Tze Meng Low:
Efficient Computation of Linkage Disequilibria as Dense Linear Algebra Operations. 418-427 - Michael J. Hallock
, Zaida Luthey-Schulten:
Improving Reaction Kernel Performance in Lattice Microbes: Particle-Wise Propensities and Run-Time Generated Code. 428-434
Session II
- Amir Bahmani, Alexander B. Sibley, Mahmoud Parsian, Kouros Owzar, Frank Mueller:
SparkScore: Leveraging Apache Spark for Distributed Genomic Inference. 435-442 - Shayan Shams, Nayong Kim, Xiandong Meng, Ming Tai Ha, Shantenu Jha
, Zhong Wang
, Joohyun Kim:
A Scalable Pipeline for Transcriptome Profiling Tasks with On-Demand Computing Clouds. 443-452 - Vipin Sachdeva
, Srinivas Aluru, David A. Bader
:
A Memory and Time Scalable Parallelization of the Reptile Error-Correction Code. 453-462
Session III
- Nuttiiya Seekhao, Caroline Shung, Joseph F. JáJá, Luc Mongeau
, Nicole Y. K. Li-Jessen
:
Real-Time Agent-Based Modeling Simulation with in-Situ Visualization of Complex Biological Systems: A Case Study on Vocal Fold Inflammation and Healing. 463-472 - M. Ali Mirzaei, Francesco Crescioli, Sebastien Viret, William Tromeur, Giovanni Calderini, Giovanni Marchiori, Guillaume Baulieu, Geoffrey Galbit:
A Novel Associative Memory Based Architecture for Sequence Alignment. 473-478
Workshop 5-APDCM - Advances in Parallel and Distributed Computational Models
- Oscar H. Ibarra, Koji Nakano, Akihiro Fujiwara, Susumu Matsumae
:
APDCM Introduction and Committees. 479
Session 1: Graph Algorithms
- Jie Wu:
Stable Matching Beyond Bipartite Graphs. 480-488 - Paula Aguilera, Dong Ping Zhang, Nam Sung Kim, Nuwan Jayasena:
Fine-Grained Task Migration for Graph Algorithms Using Processing in Memory. 489-498
Session 2: Wireless Networks and Distributed Computing
- Wei Chen, Liang Hong, Sachin Shetty
, Dan Chia-Tien Lo, Reginald Cooper:
Cross-Layered Security Approach with Compromised Nodes Detection in Cooperative Sensor Networks. 499-508 - Hideharu Kojima
, Yuta Nagashima, Tatsuhiro Tsuchiya
:
Model Checking Techniques for State Space Reduction in MANET Protocol Verification. 509-516 - Feng Luo, Pradip K. Srimani:
New Biology Inspired Anonymous Distributed Algorithms to Compute Dominating and Total Dominating Sets in Network Graphs. 517-524
Session 3: Distributed Computing and Models
- Ta Yuan Hsu, Ajay D. Kshemkalyani:
Performance of Causal Consistency Algorithms for Partially Replicated Systems. 525-534 - Hassan Nawaz, Gideon Juve, Rafael Ferreira da Silva
, Ewa Deelman:
Performance Analysis of an I/O-Intensive Workflow Executing on Google Cloud and Amazon Web Services. 535-544 - Travis S. Humble, Alexander J. McCaskey, Jonathan Schrock, Hadayat Seddiqi, Keith A. Britt, Neena Imam:
Performance Models for Split-Execution Computing Systems. 545-554 - Ernesto Gomez, Keith E. Schubert
, Zongqi Ritchie Cai:
A Model for Entropy of Parallel Execution. 555-560
Session 4: Parallel Computing
- James Alexander Edwards, Uzi Vishkin:
FFT on XMT: Case Study of a Bandwidth-Intensive Regular Algorithm on a Highly-Parallel Many Core. 561-569 - Makoto Nakayama, Kenichi Yamazaki, Satoshi Tanaka:
Parallelization of Recursive Preorder Traversal Based on Building and Winding Call Stacks. 570-579 - P. B. Jayaraj
, K. Rahamathulla, G. Gopakumar
:
A GPU Based Maximum Common Subgraph Algorithm for Drug Discovery Applications. 580-588 - Toru Fujita, Koji Nakano
, Yasuaki Ito:
Bitwise Parallel Bulk Computation on the GPU, with Application to the CKY Parsing for Context-Free Grammars. 589-598 - Xin Zhou, Yasuaki Ito, Koji Nakano
:
An Efficient Implementation of LZW Decompression in the FPGA. 599-607
Workshop 6-ASHES - Accelerators and Hybrid Exascale Systems
- James Dinan:
AsHES Introduction and Committees. 608-609 - Wen-mei W. Hwu:
AsHES 2016 Keynote. 610
Session 1: Programming Models and Tools
- Chris J. Newburn, Gaurav Bansal, Michael Wood, Luis Crivelli, Judit Planas
, Alejandro Duran, Paulo Souza, Leonardo Borges, Piotr Luszczek, Stanimire Tomov
, Jack J. Dongarra, Hartwig Anzt
, Mark Gates
, Azzam Haidar, Yulu Jia, Khairul Kabir, Ichitaro Yamazaki, Jesús Labarta:
Heterogeneous Streaming. 611-620 - John D. Leidel, Yong Chen
:
HMC-Sim-2.0: A Simulation Platform for Exploring Custom Memory Cube Operations. 621-630 - Erik Zenker, Benjamin Worpitz, René Widera, Axel Huebl
, Guido Juckeland
, Andreas Knüpfer
, Wolfgang E. Nagel, Michael Bussmann
:
Alpaka - An Abstraction Library for Parallel Kernel Acceleration. 631-640 - Souley Madougou, Ana Lucia Varbanescu, Cees de Laat, Rob van Nieuwpoort
:
A Tool for Bottleneck Analysis and Performance Prediction for GPU-Accelerated Applications. 641-652
Session 2: Algorithms and Applications
- Yulu Jia, Piotr Luszczek, Jack J. Dongarra:
Hessenberg Reduction with Transient Error Resilience on GPU-Based Hybrid Architectures. 653-662 - Ryan Eberhardt, Mark Hoemmen:
Optimization of Block Sparse Matrix-Vector Multiplication on Shared-Memory Parallel Architectures. 663-672 - Joshua Dennis Booth, Sivasankaran Rajamanickam, Heidi Thornquist:
Basker: A Threaded Sparse LU Factorization Utilizing Hierarchical Parallelism and Data Layouts. 673-682 - Hartwig Anzt
, Jack J. Dongarra, Moritz Kreutzer, Gerhard Wellein, Martin Koehler
:
Efficiency of General Krylov Methods on GPUs - An Experimental Study. 683-691
Session 3: Workload Scheduling
- Luis Costero
, Francisco D. Igual
, Katzalin Olcoz
, Sandra Catalán
, Rafael Rodríguez-Sánchez
, Enrique S. Quintana-Ortí
:
Refactoring Conventional Task Schedulers to Exploit Asymmetric ARM big.LITTLE Architectures in Dense Linear Algebra. 692-701 - Valeria Cardellini
, Alessandro Fanfarillo
, Salvatore Filippone
:
Heterogeneous CAF-Based Load Balancing on Intel Xeon Phi. 702-711 - Iman Faraji, Seyed Hessam Mirsadeghi, Ahmad Afsahi:
Topology-Aware GPU Selection on Multi-GPU Nodes. 712-720
Workshop 7-PCO - Parallel Computing and Optimization
- Didier El Baz
, Bora Uçar
:
PCO Introduction and Committees. 721
Session I: Parallel Computing and Optimization
- Kevin Ryan, Deepak Rajan, Shabbir Ahmed
:
Scenario Decomposition for 0-1 Stochastic Programs: Improvements and Asynchronous Implementation. 722-729 - Lluís-Miquel Munguía, Geoffrey Oxberry, Deepak Rajan:
PIPS-SBB: A Parallel Distributed-Memory Branch-and-Bound Algorithm for Stochastic Mixed-Integer Programs. 730-739 - Adam Polak
:
Counting Triangles in Large Graphs on GPU. 740-746 - Adel Dabah, Ahcène Bendjoudi, Didier El Baz
, Abdelhakim AitZai:
GPU-Based Two Level Parallel B&B for the Blocking Job Shop Scheduling Problem. 747-755
Session II: Parallel Algorithms for Scheduling problems GPU-Based Two Level Parallel B&B for the Blocking Job Shop Scheduling
- Yumei Huo, Jun Xiong Huang:
Parallel Ant Colony Optimization for Flow Shop Scheduling Subject to Limited Machine Availability. 756-765 - Abhishek Awasthi, Jörg Lässig, Jens Leuschner, Thomas Weise:
GPGPU-Based Parallel Algorithms for Scheduling Against Due Date. 766-775 - Ali Al Buhussain, Robson Eduardo De Grande
, Azzedine Boukerche:
Performance Analysis of Bio-Inspired Scheduling Algorithms for Cloud Environments. 776-785
Session III: Parallel Heuristics and Metaheuristics
- José-Matías Cutillas-Lozano, Domingo Giménez, Luis-Pedro García:
Optimizing Metaheuristics and Hyperheuristics through Multi-level Parallelism on a Many-Core System. 786-795 - Didier El Baz
, Mhand Hifi, Lei Wu, Xiaochuan Shi:
A Parallel Ant Colony Optimization for the Maximum-Weight Clique Problem. 796-800 - Giovanni Cammarata, Antonella Di Stefano, Giovanni Morana, Daniele Zito
:
Evaluating the Performance of A4SDN on Various Network Topologies. 801-808 - Ania Kaci, Huy-Nam Nguyen, Amir Nakib
, Patrick Siarry:
Hybrid Heuristics for Mapping Task Problem on Large Scale Heterogeneous Platforms. 809-816 - Karl-Eduard Berger, François Galea, Bertrand Le Cun, Renaud Sirdey
:
A Semi-Greedy Heuristic for the Mapping of Large Task Graphs. 817-824
Session IV: Combinatorial Scientific Computing
- Yu Jin, Joseph F. JáJá:
A High Performance Implementation of Spectral Clustering on CPU-GPU Platforms. 825-834 - Ning Hao, AmirReza Oghbaee, Mohammad Rostami, Nate Derbinsky, José Bento:
Testing Fine-Grained Parallelism for the ADMM on a Factor-Graph. 835-844 - Pingfan Li, Xuhao Chen, Zhe Quan, Jianbin Fang
, Huayou Su, Tao Tang, Canqun Yang:
High Performance Parallel Graph Coloring on GPGPUs. 845-854
Workshop 8-GABB - Graph Algorithms Building Blocks
- Tim Mattson:
GABB Introduction and Committees. 855 - David A. Bader
:
GABB 2016 Keynote. 856 - Mark Tullsen, Matthew J. Sottile
:
Array Types for a Graph Processing Language. 857-866 - Jiahao Chen, Weijian Zhang:
The Right Way to Search Evolving Graphs. 867-876 - E. Jason Riedy:
Updating PageRank for Streaming Graphs. 877-884 - Sriram Srinivasan, Sanjukta Bhowmick, Sajal K. Das
:
Application of Graph Sparsification in Developing Parallel Algorithms for Updating Connected Components. 885-891 - Keita Iwabuchi, Scott Sallinen, Roger A. Pearce, Brian Van Essen, Maya B. Gokhale, Satoshi Matsuoka:
Towards a Distributed Large-Scale Dynamic Graph Data Store. 892-901 - Brendan Gavin, Vijay Gadepally, Jeremy Kepner:
Enforced Sparse Non-negative Matrix Factorization. 902-911 - Peter Zhang, Marcin Zalewski, Andrew Lumsdaine
, Samantha Misurda, Scott McMillan:
GBTL-CUDA: Graph Algorithms and Primitives for GPUs. 912-920 - Peter M. Kogge:
Jaccard Coefficients as a Potential Graph Benchmark. 921-928 - Patrick Dreher, Chansup Byun, Chris Hill, Vijay Gadepally, Bradley C. Kuszmaul, Jeremy Kepner:
PageRank Pipeline Benchmark: Proposal for a Holistic System Benchmark for Big-Data Platforms. 929-937
Workshop 9-EduPar - NSF/TCPP Workshop on Parallel and Distributed Computing Education
- Ramachandran Vaidyanathan, Sushil K. Prasad
, Satish Puri
:
EduPar Introduction and Committees. 938-940 - Randal E. Bryant:
EduPar 2016 Keynote. 941
Session 1: Programming Framework and Tools
- Abdul Dakkak, Carl Pearson, Wen-mei W. Hwu:
WebGPU: A Scalable Online Development Platform for GPU Programming Courses. 942-949 - Annette C. Feng, Wu-chun Feng:
Parallel Programming with Pictures in a Snap! 950-957 - José R. Ortiz-Ubarri, Rafael A. Arce-Nazario
, Edusmildo Orozco:
Modules to Teach Parallel and Distributed Computing Using MPI for Python and Disco. 958-962 - Yinong Chen
, Gennaro De Luca:
VIPLE: Visual IoT/Robotics Programming Language Environment for Computer Science Education. 963-971
Session 2: Instruction Techniques and Experiences
- Joel C. Adams
, Patrick A. Crain, Christopher P. Dilley:
Seeing Multithreaded Behavior Using TSGL. 972-977 - Barry Wilkinson, Clayton Ferner:
The Suzaku Pattern Programming Framework. 978-986 - Shirley Moore, Steven R. Dunlop:
A Flipped Classroom Approach to Teaching Concurrency and Parallelism. 987-995 - Javier Cuenca
, Domingo Giménez:
A Parallel Programming Course Based on an Execution Time-Energy Consumption Optimization Problem. 996-1003
Workshop 10-HPDAV - High Performance Data Analysis and Visualization
- Wes Bethel:
HPDAV Introduction and Committees. 1004-1005 - Jim Jeffers:
HPDAV 2016 Keynote. 1006
Full Papers Session I
- David Pugmire, James Kress
, Jong Youl Choi, Scott Klasky, Tahsin M. Kurç, Michael Churchill
, Matthew Wolf, Greg Eisenhauer, Hank Childs, Kesheng Wu
, Alexander Sim
, Junmin Gu, Jonathan Low:
Visualization and Analysis for Near-Real-Time Decision Making in Distributed Workflows. 1007-1013 - John E. Stone
, Peter Messmer, Robert Sisneros, Klaus Schulten:
High Performance Molecular Visualization: In-Situ and Parallel Rendering with EGL. 1014-1023 - Miyuru Dayarathna, Isuru Herath, Yasima Dewmini, Gayan Mettananda, Sameera Nandasiri, Sanath Jayasena
, Toyotaro Suzumura:
Introducing Acacia-RDF: An X10-Based Scalable Distributed RDF Graph Database Engine. 1024-1032
Short Papers Session
- Philippe P. Pébay
, Janine C. Bennett, David S. Hollman, Sean Treichler, Patrick S. McCormick, Christine Sweeney
, Hemanth Kolla, Alex Aiken:
Towards Asynchronous Many-Task in Situ Data Analysis Using Legion. 1033-1037 - Silvio Rizzi, Mark Hereld, Joseph A. Insley, Preeti Malakar, Michael E. Papka
, Thomas D. Uram, Venkatram Vishwanath:
Coupling LAMMPS and the vl3 Framework for Co-Visualization of Atomistic Simulations. 1038-1042 - Krishna Bharadwaj, Samuel Flores, Joshua Rodriguez, Lance Long
, G. Elisabeta Marai:
Developing a Scalable SNMP Monitor. 1043-1047
Full Papers Session II
- John E. Stone
, William R. Sherman, Klaus Schulten:
Immersive Molecular Visualization with Omnidirectional Stereoscopic Ray Tracing and Remote Rendering. 1048-1057 - Robert Sisneros, David Pugmire:
Tuned to Terrible: A Study of Parallel Particle Advection State of the Practice. 1058-1067
Workshop 11-VarSys - Variability in Parallel and Distributed Systems
- Kirk W. Cameron
, Todd Gamblin, Dimitrios S. Nikolopoulos
:
VarSys Introduction. 1068 - Allan Porterfield, Sridutt Bhalachandra, Wei Wang, Rob Fowler:
Variability: A Tuning Headache. 1069-1072 - Bilge Acun, Laxmikant V. Kalé:
Mitigating Processor Variation through Dynamic Load Balancing. 1073-1076 - Ivo Jimenez
, Carlos Maltzahn, Jay F. Lofstead
, Adam Moody, Kathryn M. Mohror
, Remzi H. Arpaci-Dusseau, Andrea C. Arpaci-Dusseau:
Characterizing and Reducing Cross-Platform Performance Variability Using OS-Level Virtualization. 1077-1080 - Ali Anwar
, Yue Cheng, Ali Raza Butt
:
Towards Managing Variability in the Cloud. 1081-1084 - Jin-Seong Kim, Jae J. Jang, Im Young Jung:
Near Real-Time Tracking of IoT Device Users. 1085-1088
Workshop 12-HPPAC - High-Performance, Power-Aware Computing
- Barry Rountree, Shuaiwen Leon Song:
HPPAC Introduction and Committees. 1089
Lightning Talks A
- Chung-Hsing Hsu, Wu-chun Feng:
The Right Metric for Efficient Supercomputing: A Ten-Year Retrospective. 1090-1093 - Ryan E. Grant, Michael J. Levenhagen, Stephen L. Olivier
, David Debonis
, Kevin T. Pedretti, James H. Laros III:
Overcoming Challenges in Scalable Power Monitoring with the Power API. 1094-1097 - Shirley Moore:
Achieving Safety for Power Shifting in Overprovisioned High Performance Computing Systems. 1098-1101 - Rogelio Long, Shirley Moore:
POSITION PAPER: Countering the Noise-Induced Critical Path Problem. 1102-1105
Lightning Talks B
- Natalie J. Bates, Chung-Hsing Hsu, Neena Imam, Torsten Wilde, Dale Sartor:
Re-Examining HPC Energy Efficiency Dashboard Elements. 1106-1109 - Neha Gholkar, Frank Mueller, Barry Rountree:
A Power-Aware Cost Model for HPC Procurement. 1110-1113 - Christopher Eibel, Timo Hönig, Wolfgang Schröder-Preikschat:
Energy Claims at Scale: Decreasing the Energy Demand of HPC Workloads at OS Level. 1114-1117 - Daniel A. Ellsworth, Tapasya Patki
, Swann Perarnau, Sangmin Seo, Abdelhalim Amer, Judicael A. Zounmevo, Rinku Gupta
, Kazutomo Yoshii, Henry Hoffmann, Allen D. Malony, Martin Schulz
, Peter H. Beckman:
Systemwide Power Management with Argo. 1118-1121
Regular Papers A
- Scott Walker, Marty McFadden:
Best Practices for Scalable Power Measurement and Control. 1122-1131 - Aniruddha Marathe
, Hormozd Gahvari, Jae-Seung Yeom
, Abhinav Bhatele:
LibPowerMon: A Lightweight Profiling Framework to Profile Program Context and System-Level Metrics. 1132-1141 - Matthias Maiterth
, Martin Schulz
, Dieter Kranzlmüller, Barry Rountree:
Power Balancing in an Emulated Exascale Environment. 1142-1149 - Sand Luz Correa, Mariam Umar, Kirk W. Cameron
:
Combining Power and Performance Modeling for Application Analysis: A Case Study Using Aspen. 1150-1159 - Ryan S. Luley, Qinru Qiu:
Effective Utilization of CUDA Hyper-Q for Improved Power and Performance Efficiency. 1160-1169
Regular Papers B
- Nidhi Tiwari, Umesh Bellur
, Santonu Sarkar
, Maria Indrawan:
Identification of critical parameters for MapReduce energy efficiency using statistical Design of Experiments. 1170-1179 - Xingfu Wu
, Valerie E. Taylor:
Utilizing Hardware Performance Counters to Model and Optimize the Energy and Performance of Large Scale Scientific Applications on Power-Aware Supercomputers. 1180-1189 - Jared Coplin, Martin Burtscher:
Energy, Power, and Performance Characterization of GPGPU Benchmark Programs. 1190-1199
Workshop 13-PDSEC - Parallel and Distributed Scientific and Engineering Computing
- Peter Strazdins, Raphaël Couturier
, Keita Teranishi, Alan Gray, Thomas Rauber, Gudula Rünger, Laurence T. Yang:
PDSEC Introduction and Committees. 1200-1201
Session 1: Application and Task Parallelism
- Yuta Hirokawa, Taisuke Boku, Shunsuke A. Sato
, Kazuhiro Yabana:
Electron Dynamics Simulation with Time-Dependent Density Functional Theory on Large Scale Symmetric Mode Xeon Phi Cluster. 1202-1211 - Jean Marie Couteyen Carpaye, Jean Roman, Pierre Brenner:
Towards an Efficient Task-Based Parallelization over a Runtime System of an Explicit Finite-Volume CFD Code with Adaptive Time Stepping. 1212-1221 - Alan Humphrey, Daniel Sunderland, Todd Harman, Martin Berzins
:
Radiative Heat Transfer Calculation on 16384 GPUs Using a Reverse Monte Carlo Ray Tracing Approach with Adaptive Mesh Refinement. 1222-1231
Session 2: Resilience
- Peter E. Strazdins, Md. Mohsin Ali
, Bert J. Debusschere
:
Application Fault Tolerance for Shrinking Resources via the Sparse Grid Combination Technique. 1232-1238 - Anne Benoit
, Aurélien Cavelan, Yves Robert
, Hongyang Sun:
Two-Level Checkpointing and Verifications for Linear Task Graphs. 1239-1248
Session 3: Performance
- Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra:
On the Development of Variable Size Batched Computation for Heterogeneous Parallel Architectures. 1249-1258 - André Merzky, Shantenu Jha
:
Synapse: Synthetic Application Profiler and Emulator. 1259-1268
Workshop 14-DPDNS - Dependable Parallel, Distributed and Network-Centric Systems
- Dimiter Avresky, Erik Maehle, Roberto Palmieri
:
DPDNS Introduction and Committees. 1269 - Shlomi Dolev
:
DPDNS 2016 Keynote. 1270
Session 1: Distributed Services
- Brendan Benshoof, Andrew Rosen, Anu G. Bourgeois, Robert W. Harrison
:
Distributed Decentralized Domain Name Service. 1279-1287 - Kaliappa Ravindran:
Management Software for Protocol-level Adaptations in Dependable Network Services. 1288-1297
Session 2: Cloud and Fault Tolerance
- Soham Sinha
, Di Niu, Zhi Wang, Paul Lu:
Mitigating Routing Inefficiencies to Cloud-Storage Providers: A Case Study. 1298-1306 - Roberto Palmieri
:
Leaderless Consensus: The State of the Art. 1307-1310 - Alessandro Pellegrini
, Pierangelo di Sanzo
, Dimiter R. Avresky:
Proactive Cloud Management for Highly Heterogeneous Multi-cloud Infrastructures. 1311-1318
Session 3: Multicore Computing
- Vishal Chandra Sharma, Ganesh Gopalakrishnan, Sriram Krishnamoorthy
:
Towards Resiliency Evaluation of Vector Programs. 1319-1328 - Gilles Bizot, Dimiter Avresky, Fabien Chaix:
Analysis of Adaptive Mapping of Parallelized Application on Multicore System. 1329-1338
Workshop 15-LSPP - Large-Scale Parallel Processing
- Kevin J. Barker
, Christopher D. Carothers, Eric Van Hensbergen:
LSPP Introduction and Committees. 1339 - Michael E. Papka:
LSPP 2016 Keynote. 1340
Session 1: Making Efficient Use of Advanced Architectures
- Zhaokui Li, Jianbin Fang
, Tao Tang, Xuhao Chen, Cheng Chen, Canqun Yang:
Evaluating the Performance Impact of Multiple Streams on the MIC-Based Heterogeneous Platform. 1341-1350 - Max Plauth
, Wieland Hagen, Frank Feinbube, Felix Eberhardt, Lena Feinbube, Andreas Polze:
Parallel Implementation Strategies for Hierarchical Non-uniform Memory Access Systems by Example of the Scale-Invariant Feature Transform Algorithm. 1351-1359 - Ryan D. Friese
:
Efficient Genetic Algorithm Encoding for Large-Scale Multi-objective Resource Allocation. 1360-1369
Session 2: Workflow Modeling and Optimization and Modeling at Scale
- Anirban Mandal
, Paul Ruth
, Ilya Baldin
, Dariusz Król, Gideon Juve, Rajiv Mayani, Rafael Ferreira da Silva
, Ewa Deelman, Jeremy S. Meredith, Jeffrey S. Vetter, Vickie E. Lynch
, Benjamin Mayer, James Wynne
, Mark P. Blanco, Christopher D. Carothers, Justin M. LaPre, Brian Tierney:
Toward an End-to-End Framework for Modeling, Monitoring and Anomaly Detection for Scientific Workflows. 1370-1379 - Kevin J. Barker
, Darren J. Kerbyson:
Modeling the Performance and Energy Impact of Dynamic Power Steering. 1380-1389
Workshop 16-ParLearning - Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics
- Charalampos Chelmis, Sutanay Choudhury, Arindam Pal, Anand V. Panangadan, Weiqin Tong, Yinglong Xia:
ParLearning Introduction and Committees. 1390-1391 - Peter M. Kogge:
ParLearning 2016 Keynote. 1392
Session I
- Dianwei Han, Ankit Agrawal
, Wei-keng Liao
, Alok N. Choudhary:
A Novel Scalable DBSCAN Algorithm with Spark. 1393-1402 - Alex Gittens, Jey Kottalam, Jiyan Yang, Michael F. Ringenburg
, Jatin Chhugani, Evan Racah, Mohitdeep Singh, Yushu Yao, Curt Fischer, Oliver Rübel, Benjamin P. Bowen, Norman G. Lewis
, Michael W. Mahoney, Venkat Krishnamurthy, Prabhat:
A Multi-Platform Evaluation of the Randomized CX Low-Rank Matrix Factorization in Spark. 1403-1412 - Orhan Kislal, Mahmut T. Kandemir, Jagadish Kotra:
Cache-Aware Approximate Computing for Decision Tree Learning. 1413-1422 - Vasileios Zois, Anand V. Panangadan, Viktor K. Prasanna:
Accelerating Support Count for Association Rule Mining on GPUs. 1423-1432
Session II
- Andrew Wylie, Wei Shi
, Jean-Pierre Corriveau, Yang Wang:
A Scheduling Algorithm for Hadoop MapReduce Workflows with Budget Constraints in the Heterogeneous Cloud. 1433-1442 - Yanik Ngoko, Denis Trystram, Valentin Reis, Christophe Cérin:
An Automatic Tuning System for Solving NP-Hard Problems in Clouds. 1443-1452 - Daniel G. Chavarría-Miranda, Vito Giovanni Castellana, Alessandro Morari, David Haglin, John Feo:
GraQL: A Query Language for High-Performance Attributed Graph Databases. 1453-1462 - Ismail El-Helw, Rutger F. H. Hofman, Wenzhe Li, Sungjin Ahn, Max Welling, Henri E. Bal:
Scalable Overlapping Community Detection. 1463-1472
Session III
- Xiang-You Peng, Yu-Bo Yang, Chang-Dong Wang, Dong Huang
, Jian-Huang Lai:
An Efficient Parallel Nonlinear Clustering Algorithm Using MapReduce. 1473-1476 - Wenhua Yu, Lei Zhao
, Xiangyu He, Jiacheng Zhou, Tong Cheng, Chengzhao Xue, Fan Yang:
A New Evaluation System for Scholars and Majors Based on Big-Data Techniques. 1477-1480 - Sarwar Morshed, Juwel Rana, Marcelo Milrad
:
Open Source Initiatives and Frameworks Addressing Distributed Real-Time Data Analytics. 1481-1484
Workshop 17-JSSPP - Job Scheduling Strategies for Parallel Processing
- Walfredo Cirne, Narayan Desai:
JSSPP Introduction and Committees. 1485
Workshop 18-iWAPT - International Workshop on Automatic Performance Tuning
- Weichung Wang:
iWAPT Introduction and Committees. 1486-1487
Session 1
- Takahiro Katagiri, Masaharu Matsumoto, Satoshi Ohshima
:
Auto-Tuning of Hybrid MPI/OpenMP Execution with Code Selection by ppOpen-AT. 1488-1495 - Satoshi Ohshima
, Takahiro Katagiri, Masaharu Matsumoto:
Utilization and Expansion of ppOpen-AT for OpenACC. 1496-1505
Session 2
- Lars Kirkholt Melhus, Rune Erlend Jensen:
Measurement Bias from Address Aliasing. 1506-1515 - Hiroko Midorikawa:
Blk-Tune: Blocking Parameter Auto-Tuning to Minimize Input-Output Traffic for Flash-Based Out-of-Core Stencil Computations. 1516-1526 - Rong Gu, Zhiqiang Liu, Chunfeng Yuan, Yihua Huang:
A Time-Cost Based Automatic Scheduling Framework for Matrix Computation on Various Distributed Computing Platforms. 1527-1534
Session 3
- Amit Roy, Prasanna Balaprakash
, Paul D. Hovland
, Stefan M. Wild
:
Exploiting Performance Portability in Search Algorithms for Autotuning. 1535-1544 - Piotr Luszczek, Mark Gates
, Jakub Kurzak, Anthony Danalis, Jack J. Dongarra:
Search Space Generation and Pruning System for Autotuners. 1545-1554
Workshop 19-CHIUW - Chapel Implementers and Users Workshop
- Tom MacDonald, Greg Titus:
CHIUW Introduction and Committees. 1555-1556 - Nikhil Padmanabhan:
CHIUW 2016 Keynote. 1557
Session 1: Benchmarking and Optimization
- Richard B. Johnson, Jeffrey K. Hollingsworth:
Optimizing Chapel for Single-Node Environments. 1558-1567 - Engin Kayraklioglu, Olivier Serres, Ahmad Anbar, Hashem Elezabi, Tarek A. El-Ghazawi:
PGAS Access Overhead Characterization in Chapel. 1568-1577
Session 2: Chapel Improvement
- Philip A. Nelson, Greg Titus:
Chplvis: A Communication and Task Visualization Tool for Chapel. 1578-1585 - Konstantina Panagiotopoulou, Hans-Wolfgang Loidl:
Transparently Resilient Task Parallelism for Chapel. 1586-1595
Workshop 20-HPBDC - High-Performance Big Data Computing
- Dhabaleswar K. Panda, Jianfeng Zhan, Xiaoyi Lu:
HPBDC Introduction and Committees. 1596
Session I: High-Performance Big Data Applications and Systems
- Andrew J. Younge, Christopher Reidy, Robert Henschel, Geoffrey C. Fox:
Evaluation of SMP Shared Memory Machines for Use with In-Memory and OpenMP Big Data Applications. 1597-1606 - André Luckow
, Ioannis Paraskevakos, George Chantzialexiou, Shantenu Jha
:
Hadoop on HPC: Integrating Hadoop and Pilot-Based Dynamic Resource Management. 1607-1616 - Ruijian Wang, Chao Wang, Li Zha:
PACM: A Prediction-Based Auto-Adaptive Compression Model for HDFS. 1617-1626
Session II: High-Performance Streaming Systems
- Milinda Pathirage, Julian Hyde, Yi Pan, Beth Plale
:
SamzaSQL: Scalable Fast Data Management with Streaming SQL. 1627-1636 - Supun Kamburugamuve, Saliya Ekanayake, Milinda Pathirage, Geoffrey C. Fox:
Towards High Performance Processing of Streaming Data in Large Data Centers. 1637-1644 - Yining Zhao, Haili Xiao:
Extracting Log Patterns from System Logs in LARGE. 1645-1652
Session III (Short Papers): Performance Studies of Big Data Systems and Applications
- Saba Sehrish, Jim Kowalkowski, Marc F. Paterno:
Exploring the Performance of Spark for a Scientific Use Case. 1653-1659 - Rui Zhang, Hongzhi Wang, Renu Tewari, Gero Schmidt, Deepika Kakrania:
Big Data for Medical Image Analysis: A Performance Study. 1660-1664
Workshop 21-HPCMASPA - Monitoring and Analysis for High Performance Computing Systems Plus Applications
- Benjamin A. Allan, Jim M. Brandt, Ann C. Gentile, Cory Lueninghoener, Nichamon Naksinehaboon, Boyana Norris, Narate Taerat:
HPCMASPA Introduction and Committees. 1665-1666 - William T. C. Kramer:
HPCMASPA 2016 Keynote. 1667
Session 1: Instrumentation and Metrics
- Christian Iwainsky
, Christian H. Bischof:
Calltree-Controlled Instrumentation for Low-Overhead Survey Measurements. 1668-1677 - Mohammed Tanash
, Nasim Ghazanfari, Omar Aaziz, Jonathan Cook:
Automatically Instrumenting Scientific Applications to Produce Heartbeat Events. 1678-1686 - Anthony M. Agelastos:
Defining Metrics to Distill Large-Scale HPC Platform and Application Performance Data into Actionable Quantities. 1687-1691
Session 2: Monitoring Systems
- Patricia Grubel
, Hartmut Kaiser
, Kevin A. Huck
, Jeanine E. Cook:
Using Intrinsic Performance Counters to Assess Efficiency in Task-Based Parallel Applications. 1692-1701 - R. Todd Evans, James C. Browne, William L. Barth:
Understanding Application and System Performance Through System-Wide Monitoring. 1702-1710 - Jim M. Brandt, Ann C. Gentile, Michael T. Showerman, Jeremy Enos
, Joshi Fullop, Gregory H. Bauer
:
Large-Scale Persistent Numerical Data Source Monitoring System Experiences. 1711-1720 - Sam Sanchez, Amanda Bonnie, Graham van Heule, Conor Robinson, Adam DeConinck, Kathleen Kelly, Quellyn Snead, Jim M. Brandt:
Design and Implementation of a Scalable HPC Monitoring System. 1721-1725
Workshop 22-IPDRM - Emerging Parallel and Distributed Runtime Systems and Middleware
- Shuaiwen Leon Song, Todd Gamblin:
IPDRM Introduction and Committees. 1726 - Henry Hoffmann:
IPDRM 2016 Keynote. 1727
Session 1
- Simon Pickartz, Carsten Clauss, Stefan Lankes
, Stephan Krempel, Thomas Moschny, Antonello Monti:
Non-intrusive Migration of MPI Processes in OS-Bypass Networks. 1728-1735 - Ezra Kissel, Martin Swany
:
Photon: Remote Memory Access Middleware for High-Performance Runtime Systems. 1736-1743 - Joshua Suetterlein
, Joshua Landwehr, Andrès Márquez
, Joseph B. Manzano
, Guang R. Gao:
Asynchronous Runtimes in Action: An Introspective Framework for a Next Gen Runtime. 1744-1751
Session 2
- Alireza Haghdoost, David H. C. Du:
OWBP: Flash-Aware Offline Write Buffer Policy. 1752-1758 - Seyed Hessam Mirsadeghi, Ahmad Afsahi:
Topology-Aware Rank Reordering for MPI Collectives. 1759-1768 - Anshuman Goswami, Jeffrey S. Young
, Karsten Schwan, Naila Farooqui, Ada Gavrilovska, Matthew Wolf, Greg Eisenhauer:
GPUShare: Fair-Sharing Middleware for GPU Clouds. 1769-1776
Session 3
- Jie Zhang, Xiaoyi Lu, Dhabaleswar K. Panda:
Performance Characterization of Hypervisor-and Container-Based Virtualization for HPC on SR-IOV Enabled InfiniBand Clusters. 1777-1784 - Heng Zhang, Chunliang Hao, Yanjun Wu, Mingshu Li:
Macaca: A Scalable and Energy-Efficient Platform for Coupling Cloud Computing with Distributed Embedded Computing. 1785-1788 - Sanket Chintapalli, Derek Dagit, Bobby Evans, Reza Farivar, Thomas Graves, Mark Holderbaugh, Zhuo Liu, Kyle Nusbaum, Kishorkumar Patil, Boyang Peng, Paul Poulosky:
Benchmarking Streaming Computation Engines: Storm, Flink and Spark Streaming. 1789-1792
Workshop 23-ParSocial - Parallel and Distributed Processing for Computational Social Systems
- Eunice E. Santos, John Korah:
ParSocial Introduction and Committees. 1793-1794 - George Cybenko:
ParSocial 2016 Keynote. 1795
Paper Session 1
- Chao Huang, Jermaine Marshall, Dong Wang, Mianxiong Dong:
Towards Reliable Social Sensing in Cyber-Physical-Social Systems. 1796-1802 - Gennaro Cordasco
, Carmine Spagnuolo
, Vittorio Scarano
:
Toward the New Version of D-MASON: Efficiency, Effectiveness and Correctness in Parallel and Distributed Agent-Based Simulations. 1803-1812 - Bhavani Thuraisingham, Murat Kantarcioglu, Latifur Khan
, Barbara Carminati
, Elena Ferrari
, Leila Bahri
:
Emergency-Driven Assured Information Sharing in Secure Online Social Networks: A Position Paper. 1813-1820
Paper Session 2
- Eunice E. Santos, John Korah, Vairavan Murugappan
, Suresh Subramanian
:
Efficient Anytime Anywhere Algorithms for Closeness Centrality in Large and Dynamic Graphs. 1821-1830 - Thanh Hong Nguyen, Arunesh Sinha
, Milind Tambe:
Addressing Behavioral Uncertainty in Security Games: An Efficient Robust Strategic Solution for Defender Patrols. 1831-1838
Workshop 24-Roundtable I - PDC in Core Undergraduate Education
- Dick Brown, Suzanne J. Matthews
:
Workshop 24-Roundtable I Introduction. 1839

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.