


default search action
22nd PPOPP 2017: Austin, TX, USA
- Vivek Sarkar, Lawrence Rauchwerger:

Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Austin, TX, USA, February 4-8, 2017. ACM 2017, ISBN 978-1-4503-4493-7
Keynote
- Guy L. Steele Jr.:

It's Time for a New Old Language. 1
Session 1: GPU I
- Guoyang Chen, Yue Zhao, Xipeng Shen

, Huiyang Zhou
:
EffiSha: A Software Framework for Enabling Effficient Preemptive Scheduling of GPU. 3-16 - Nachshon Cohen, Arie Tal, Erez Petrank:

Layout Lock: A Scalable Locking Paradigm for Concurrent Data Layout Modifications. 17-29 - Xiuxia Zhang, Guangming Tan, Shuangbai Xue, Jiajia Li

, Keren Zhou
, Mingyu Chen:
Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning. 31-43
Session 2: Concurrency
- Peizhao Ou, Brian Demsky:

Checking Concurrent Data Structures Under the C/C++11 Memory Model. 45-59 - Milind Chabbi, Abdelhalim Amer, Shasha Wen, Xu Liu:

An Efficient Abortable-locking Protocol for Multi-level NUMA Systems. 61-74 - Umut A. Acar, Naama Ben-David, Mike Rainey:

Contention in Structured Concurrency: Provably Efficient Dynamic Non-Zero Indicators for Nested Parallelism. 75-88 - Kento Sato, Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Martin Schulz

, Christopher M. Chambreau:
Noise Injection Techniques to Expose Subtle and Unintended Message Races. 89-101
Session 3: Tools
- Hao Luo, Pengcheng Li, Chen Ding:

Thread Data Sharing in Cache: Theory and Measurement. 103-115 - Bin Ren, Sriram Krishnamoorthy

, Kunal Agrawal, Milind Kulkarni:
Exploiting Vector and Multicore Parallelism for Recursive, Data- and Task-Parallel Programs. 117-130 - Sergei Shudler

, Alexandru Calotoiu, Torsten Hoefler, Felix Wolf:
Isoefficiency in Practice: Configuring and Understanding the Performance of Task-based Applications. 131-143 - Robert Utterback, Kunal Agrawal, I-Ting Angelina Lee, Milind Kulkarni:

Processor-Oblivious Record and Replay. 145-161
Session 4: GPU II
- Nirmal Prajapati

, Waruna Ranasinghe, Sanjay V. Rajopadhye, Rumen Andonov, Hristo N. Djidjev
, Tobias Grosser
:
Simple, Accurate, Analytical Time Modeling and Optimal Tile Size Selection for GPGPU Stencils. 163-177 - Peng Jiang, Gagan Agrawal:

Combining SIMD and Many/Multi-core Parallelism for Finite State Machines with Enumerative Speculation. 179-191 - Ammar Ahmad Awan, Khaled Hamidouche, Jahanzeb Maqbool Hashmi, Dhabaleswar K. Panda:

S-Caffe: Co-designing MPI Runtimes and Caffe for Scalable Deep Learning on Modern GPU Clusters. 193-205 - Amit Sabne, Xiao Wang

, Sherman J. Kisner, Charles A. Bouman, Anand Raghunathan
, Samuel P. Midkiff
:
Model-based Iterative CT Image Reconstruction on GPUs. 207-220
Session 5: Best Papers
- Tsung Tai Yeh, Amit Sabne, Putt Sakdhnagool, Rudolf Eigenmann, Timothy G. Rogers

:
Pagoda: Fine-Grained GPU Resource Virtualization for Narrow Tasks. 221-234 - Tal Ben-Nun, Michael Sutton, Sreepathi Pai

, Keshav Pingali:
Groute: An Asynchronous Multi-GPU Programming Model for Irregular Computations. 235-248 - Tao B. Schardl, William S. Moses

, Charles E. Leiserson:
Tapir: Embedding Fork-Join Parallelism into LLVM's Intermediate Representation. 249-265 - Alexander Matveev, Yaron Meirovitch, Hayk Saribekyan

, Wiktor Jakubiuk, Tim Kaler, Gergely Ódor
, David M. Budden, Aleksandar Zlateski, Nir Shavit:
A Multicore Path to Connectomics-on-Demand. 267-281
Session 6: Languages & Compilers
- Michael Vollmer, Ryan G. Scott, Madanlal Musuvathi, Ryan R. Newton:

SC-Haskell: Sequential Consistency in Languages That Minimize Mutable Shared Heap. 283-298 - Martin Bättig

, Thomas R. Gross:
Synchronized-by-Default Concurrency for Shared-Memory Systems. 299-312 - Rubens E. A. Moreira, Caroline Collange, Fernando Magno Quintão Pereira:

Function Call Re-Vectorization. 313-326 - Samyam Rajbhandari, Fabrice Rastello, Karol Kowalski, Sriram Krishnamoorthy

, P. Sadayappan:
Optimizing the Four-Index Integral Transform Using Data Movement Lower Bounds Analysis. 327-340
Session 7: Data Analytics
- Guy L. Steele Jr., Jean-Baptiste Tristan:

Using Butterfly-Patterned Partial Sums to Draw from Discrete Distributions. 341-355 - Dmitry Basin, Edward Bortnikov, Anastasia Braginsky, Guy Golan-Gueta, Eshcar Hillel, Idit Keidar, Moshe Sulamy:

KiWi: A Key-Value Map for Scalable Real-Time Analytics. 357-369 - Lin Jiang, Zhijia Zhao:

Grammar-aware Parallelization for Scalable XPath Querying. 371-383 - Xin Wang, Weihua Zhang, Zhaoguo Wang, Ziyun Wei, Haibo Chen, Wenyun Zhao:

Eunomia: Scaling Concurrent Search Trees under Contention Using HTM. 385-399
Session 8: Fault Tolerance
- Xiongchao Tang, Jidong Zhai, Bowen Yu, Wenguang Chen, Weimin Zheng:

Self-Checkpoint: An In-Memory Checkpoint Method Using Less Space and Its Practice on Fault-Tolerant HPL. 401-413 - Panruo Wu

, Nathan DeBardeleben, Qiang Guan, Sean Blanchard, Jieyang Chen, Dingwen Tao
, Xin Liang, Kaiming Ouyang, Zizhong Chen
:
Silent Data Corruption Resilient Two-sided Matrix Factorizations. 415-427
Session 9: Posters
- Maya Arbel-Raviv, Trevor Brown:

POSTER: Reuse, don't Recycle: Transforming Algorithms that Throw Away Descriptors. 429-430 - Vignesh Balaji

, Dhruva Tirumala, Brandon Lucia:
POSTER: An Architecture and Programming Model for Accelerating Parallel Commutative Computations via Privatization. 431-432 - Arnamoy Bhattacharyya, Mike Dai Wang, Mihai Burcea, Yi Ding, Allen Deng, Sai Varikooty, Shafaaf Hossain, Cristiana Amza:

POSTER: HythTM: Extending the Applicability of Intel TSX Hardware Transactional Support. 433-434 - Rezaul Chowdhury, Pramod Ganapathi

, Yuan Tang, Jesmin Jahan Tithi:
POSTER: Provably Efficient Scheduling of Cache-Oblivious Wavefront Algorithms. 435-436 - Nachshon Cohen, Maurice Herlihy, Erez Petrank, Elias Wald:

POSTER: State Teleportation via Hardware Transactional Memory. 437-438 - Dong Dai, Wei Zhang, Yong Chen

:
POSTER: IOGP: An Incremental Online Graph Partitioning for Large-Scale Distributed Graph Databases. 439-440 - Jesun Sahariar Firoz, Thejaka Amila Kanewala, Marcin Zalewski, Martina Barnas, Andrew Lumsdaine

:
POSTER: Distributed Control: The Benefits of Eliminating Global Synchronization via Effective Scheduling. 441-442 - Gangwon Jo, Jaehoon Jung, Jiyoung Park, Jaejin Lee:

POSTER: MAPA: An Automatic Memory Access Pattern Analyzer for GPU Applications. 443-444 - Shigang Li, Yunquan Zhang, Torsten Hoefler:

POSTER: Cache-Oblivious MPI All-to-All Communications on Many-Core Architectures. 445-446 - Harshitha Menon, Kavitha Chandrasekar

, Laxmikant V. Kalé:
POSTER: Automated Load Balancer Selection Based on Application Characteristics. 447-448 - Nurit Moscovici, Nachshon Cohen, Erez Petrank:

POSTER: A GPU-Friendly Skiplist Algorithm. 449-450 - Pedro Ramalhete, Andreia Correia:

POSTER: Poor Man's URCU. 451-452 - Pedro Ramalhete, Andreia Correia:

POSTER: A Wait-Free Queue with Wait-Free Memory Reclamation. 453-454 - Yuan Tang, Ronghui You:

POSTER: STAR (Space-Time Adaptive and Reductive) Algorithms for Real-World Space-Time Optimality. 455-456 - Mingyu Wu, Haibing Guan, Binyu Zang, Haibo Chen:

POSTER: Recovering Performance for Vector-based Machine Learning on Managed Runtime. 457-458 - Minjia Zhang, Swarnendu Biswas, Michael D. Bond:

POSTER: On the Problem of Consistency Exceptions in the Context of Strong Memory Models. 459-460 - Yue Zhao, Chunhua Liao

, Xipeng Shen
:
POSTER: An Infrastructure for HPC Knowledge Sharing and Reuse. 461-462

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














