:facetid:toc:\"db/conf/ppopp/ppopp2017.bht\"OK:facetid:toc:db/conf/ppopp/ppopp2017.bhtShigang Li 0002Yunquan ZhangTorsten HoeflerPOSTER: Cache-Oblivious MPI All-to-All Communications on Many-Core Architectures.PPoPP445-4462017Conference and Workshop Papersclosedconf/ppopp/0002ZH1710.1145/3018743.3019025https://doi.org/10.1145/3018743.3019025https://dblp.org/rec/conf/ppopp/0002ZH17URL#3004526Umut A. AcarNaama Ben-DavidMike RaineyContention in Structured Concurrency: Provably Efficient Dynamic Non-Zero Indicators for Nested Parallelism.PPoPP75-882017Conference and Workshop Papersclosedconf/ppopp/AcarBR1710.1145/3018743.3018762https://doi.org/10.1145/3018743.3018762https://dblp.org/rec/conf/ppopp/AcarBR17URL#3004527Maya Arbel-RavivTrevor Brown 0001POSTER: Reuse, don't Recycle: Transforming Algorithms that Throw Away Descriptors.PPoPP429-4302017Conference and Workshop Papersclosedconf/ppopp/Arbel-RavivB1710.1145/3018743.3019035https://doi.org/10.1145/3018743.3019035https://dblp.org/rec/conf/ppopp/Arbel-RavivB17URL#3004531Ammar Ahmad AwanKhaled HamidoucheJahanzeb Maqbool HashmiDhabaleswar K. Panda 0001S-Caffe: Co-designing MPI Runtimes and Caffe for Scalable Deep Learning on Modern GPU Clusters.PPoPP193-2052017Conference and Workshop Papersclosedconf/ppopp/AwanHHP1710.1145/3018743.3018769https://doi.org/10.1145/3018743.3018769https://dblp.org/rec/conf/ppopp/AwanHHP17URL#3004532Vignesh BalajiDhruva TirumalaBrandon LuciaPOSTER: An Architecture and Programming Model for Accelerating Parallel Commutative Computations via Privatization.PPoPP431-4322017Conference and Workshop Papersclosedconf/ppopp/BalajiTL1710.1145/3018743.3019030https://doi.org/10.1145/3018743.3019030https://dblp.org/rec/conf/ppopp/BalajiTL17URL#3004533Dmitry BasinEdward BortnikovAnastasia BraginskyGuy Golan-GuetaEshcar HillelIdit KeidarMoshe SulamyKiWi: A Key-Value Map for Scalable Real-Time Analytics.PPoPP357-3692017Conference and Workshop Papersclosedconf/ppopp/BasinBBGHKS1710.1145/3018743.3018761https://doi.org/10.1145/3018743.3018761https://dblp.org/rec/conf/ppopp/BasinBBGHKS17URL#3004534Martin BättigThomas R. GrossSynchronized-by-Default Concurrency for Shared-Memory Systems.PPoPP299-3122017Conference and Workshop Papersclosedconf/ppopp/BattigG1710.1145/3018743.3018747https://doi.org/10.1145/3018743.3018747https://dblp.org/rec/conf/ppopp/BattigG17URL#3004535Tal Ben-NunMichael Sutton 0001Sreepathi PaiKeshav PingaliGroute: An Asynchronous Multi-GPU Programming Model for Irregular Computations.PPoPP235-2482017Conference and Workshop Papersclosedconf/ppopp/Ben-NunSPP1710.1145/3018743.3018756https://doi.org/10.1145/3018743.3018756https://dblp.org/rec/conf/ppopp/Ben-NunSPP17URL#3004536Arnamoy BhattacharyyaMike Dai WangMihai BurceaYi DingAllen DengSai VarikootyShafaaf HossainCristiana AmzaPOSTER: HythTM: Extending the Applicability of Intel TSX Hardware Transactional Support.PPoPP433-4342017Conference and Workshop Papersclosedconf/ppopp/BhattacharyyaWB1710.1145/3018743.3019027https://doi.org/10.1145/3018743.3019027https://dblp.org/rec/conf/ppopp/BhattacharyyaWB17URL#3004537Milind ChabbiAbdelhalim AmerShasha WenXu Liu 0001An Efficient Abortable-locking Protocol for Multi-level NUMA Systems.PPoPP61-742017Conference and Workshop Papersclosedconf/ppopp/ChabbiAWL1710.1145/3018743.3018768https://doi.org/10.1145/3018743.3018768https://dblp.org/rec/conf/ppopp/ChabbiAWL17URL#3004539Guoyang ChenYue Zhao 0011Xipeng ShenHuiyang ZhouEffiSha: A Software Framework for Enabling Effficient Preemptive Scheduling of GPU.PPoPP3-162017Conference and Workshop Papersclosedconf/ppopp/ChenZSZ1710.1145/3018743.3018748https://doi.org/10.1145/3018743.3018748https://dblp.org/rec/conf/ppopp/ChenZSZ17URL#3004540Rezaul ChowdhuryPramod GanapathiYuan TangJesmin Jahan TithiPOSTER: Provably Efficient Scheduling of Cache-Oblivious Wavefront Algorithms.PPoPP435-4362017Conference and Workshop Papersclosedconf/ppopp/ChowdhuryGTT1710.1145/3018743.3019031https://doi.org/10.1145/3018743.3019031https://dblp.org/rec/conf/ppopp/ChowdhuryGTT17URL#3004541Nachshon CohenMaurice HerlihyErez PetrankElias WaldPOSTER: State Teleportation via Hardware Transactional Memory.PPoPP437-4382017Conference and Workshop Papersclosedconf/ppopp/CohenHPW1710.1145/3018743.3019026https://doi.org/10.1145/3018743.3019026https://dblp.org/rec/conf/ppopp/CohenHPW17URL#3004542Nachshon CohenArie TalErez PetrankLayout Lock: A Scalable Locking Paradigm for Concurrent Data Layout Modifications.PPoPP17-292017Conference and Workshop Papersclosedconf/ppopp/CohenTP1710.1145/3018743.3018753https://doi.org/10.1145/3018743.3018753https://dblp.org/rec/conf/ppopp/CohenTP17URL#3004543Dong Dai 0001Wei Zhang 0097Yong Chen 0001POSTER: IOGP: An Incremental Online Graph Partitioning for Large-Scale Distributed Graph Databases.PPoPP439-4402017Conference and Workshop Papersclosedconf/ppopp/DaiZC1710.1145/3018743.3019037https://doi.org/10.1145/3018743.3019037https://dblp.org/rec/conf/ppopp/DaiZC17URL#3004544Jesun Sahariar FirozThejaka Amila KanewalaMarcin ZalewskiMartina BarnasAndrew LumsdainePOSTER: Distributed Control: The Benefits of Eliminating Global Synchronization via Effective Scheduling.PPoPP441-4422017Conference and Workshop Papersclosedconf/ppopp/FirozKZBL1710.1145/3018743.3019036https://doi.org/10.1145/3018743.3019036https://dblp.org/rec/conf/ppopp/FirozKZBL17URL#3004546Peng Jiang 0004Gagan AgrawalCombining SIMD and Many/Multi-core Parallelism for Finite State Machines with Enumerative Speculation.PPoPP179-1912017Conference and Workshop Papersclosedconf/ppopp/JiangA1710.1145/3018743.3018760https://doi.org/10.1145/3018743.3018760https://dblp.org/rec/conf/ppopp/JiangA17URL#3004549Lin JiangZhijia Zhao 0001Grammar-aware Parallelization for Scalable XPath Querying.PPoPP371-3832017Conference and Workshop Papersclosedconf/ppopp/JiangZ1710.1145/3018743.3018772https://doi.org/10.1145/3018743.3018772https://dblp.org/rec/conf/ppopp/JiangZ17URL#3004550Gangwon JoJaehoon JungJiyoung ParkJaejin LeePOSTER: MAPA: An Automatic Memory Access Pattern Analyzer for GPU Applications.PPoPP443-4442017Conference and Workshop Papersclosedconf/ppopp/JoJPL1710.1145/3018743.3019034https://doi.org/10.1145/3018743.3019034https://dblp.org/rec/conf/ppopp/JoJPL17URL#3004551Hao Luo 0007Pengcheng Li 0001Chen Ding 0001Thread Data Sharing in Cache: Theory and Measurement.PPoPP103-1152017Conference and Workshop Papersclosedconf/ppopp/LuoLD1710.1145/3018743.3018759https://doi.org/10.1145/3018743.3018759https://dblp.org/rec/conf/ppopp/LuoLD17URL#3004556Alexander MatveevYaron MeirovitchHayk SaribekyanWiktor JakubiukTim KalerGergely ÓdorDavid M. BuddenAleksandar ZlateskiNir ShavitA Multicore Path to Connectomics-on-Demand.PPoPP267-2812017Conference and Workshop Papersclosedconf/ppopp/MatveevMSJKOBZS1710.1145/3018743.3018766https://doi.org/10.1145/3018743.3018766https://dblp.org/rec/conf/ppopp/MatveevMSJKOBZS17URL#3004557Harshitha MenonKavitha ChandrasekarLaxmikant V. KaléPOSTER: Automated Load Balancer Selection Based on Application Characteristics.PPoPP447-4482017Conference and Workshop Papersclosedconf/ppopp/MenonCK1710.1145/3018743.3019033https://doi.org/10.1145/3018743.3019033https://dblp.org/rec/conf/ppopp/MenonCK17URL#3004558Rubens E. A. MoreiraCaroline CollangeFernando Magno Quintão PereiraFunction Call Re-Vectorization.PPoPP313-3262017Conference and Workshop Papersclosedconf/ppopp/MoreiraCP1710.1145/3018743.3018751https://doi.org/10.1145/3018743.3018751https://dblp.org/rec/conf/ppopp/MoreiraCP17URL#3004559Nurit MoscoviciNachshon CohenErez PetrankPOSTER: A GPU-Friendly Skiplist Algorithm.PPoPP449-4502017Conference and Workshop Papersclosedconf/ppopp/MoscoviciCP1710.1145/3018743.3019032https://doi.org/10.1145/3018743.3019032https://dblp.org/rec/conf/ppopp/MoscoviciCP17URL#3004560Peizhao OuBrian DemskyChecking Concurrent Data Structures Under the C/C++11 Memory Model.PPoPP45-592017Conference and Workshop Papersclosedconf/ppopp/OuD1710.1145/3018743.3018749https://doi.org/10.1145/3018743.3018749https://dblp.org/rec/conf/ppopp/OuD17URL#3004562Nirmal PrajapatiWaruna RanasingheSanjay V. RajopadhyeRumen AndonovHristo N. DjidjevTobias GrosserSimple, Accurate, Analytical Time Modeling and Optimal Tile Size Selection for GPGPU Stencils.PPoPP163-1772017Conference and Workshop Papersclosedconf/ppopp/PrajapatiRRADG1710.1145/3018743.3018744https://doi.org/10.1145/3018743.3018744https://dblp.org/rec/conf/ppopp/PrajapatiRRADG17URL#3004563Samyam RajbhandariFabrice RastelloKarol KowalskiSriram KrishnamoorthyP. SadayappanOptimizing the Four-Index Integral Transform Using Data Movement Lower Bounds Analysis.PPoPP327-3402017Conference and Workshop Papersclosedconf/ppopp/RajbhandariRKKS1710.1145/3018743.3018771https://doi.org/10.1145/3018743.3018771https://dblp.org/rec/conf/ppopp/RajbhandariRKKS17URL#3004564Pedro RamalheteAndreia CorreiaPOSTER: Poor Man's URCU.PPoPP451-4522017Conference and Workshop Papersclosedconf/ppopp/RamalheteC1710.1145/3018743.3019021https://doi.org/10.1145/3018743.3019021https://dblp.org/rec/conf/ppopp/RamalheteC17URL#3004565Pedro RamalheteAndreia CorreiaPOSTER: A Wait-Free Queue with Wait-Free Memory Reclamation.PPoPP453-4542017Conference and Workshop Papersclosedconf/ppopp/RamalheteC17a10.1145/3018743.3019022https://doi.org/10.1145/3018743.3019022https://dblp.org/rec/conf/ppopp/RamalheteC17aURL#3004566Bin RenSriram KrishnamoorthyKunal AgrawalMilind Kulkarni 0001Exploiting Vector and Multicore Parallelism for Recursive, Data- and Task-Parallel Programs.PPoPP117-1302017Conference and Workshop Papersclosedconf/ppopp/RenKAK1710.1145/3018743.3018763https://doi.org/10.1145/3018743.3018763https://dblp.org/rec/conf/ppopp/RenKAK17URL#3004567Amit SabneXiao Wang 0004Sherman J. KisnerCharles A. BoumanAnand RaghunathanSamuel P. MidkiffModel-based Iterative CT Image Reconstruction on GPUs.PPoPP207-2202017Conference and Workshop Papersclosedconf/ppopp/SabneWKBRM1710.1145/3018743.3018765https://doi.org/10.1145/3018743.3018765https://dblp.org/rec/conf/ppopp/SabneWKBRM17URL#3004568Kento SatoDong H. AhnIgnacio LagunaGregory L. LeeMartin Schulz 0001Christopher M. ChambreauNoise Injection Techniques to Expose Subtle and Unintended Message Races.PPoPP89-1012017Conference and Workshop Papersclosedconf/ppopp/SatoALLSC1710.1145/3018743.3018767https://doi.org/10.1145/3018743.3018767https://dblp.org/rec/conf/ppopp/SatoALLSC17URL#3004569Tao B. SchardlWilliam S. MosesCharles E. LeisersonTapir: Embedding Fork-Join Parallelism into LLVM's Intermediate Representation.PPoPP249-2652017Conference and Workshop Papersclosedconf/ppopp/SchardlML1710.1145/3018743.3018758https://doi.org/10.1145/3018743.3018758https://dblp.org/rec/conf/ppopp/SchardlML17URL#3004570Sergei ShudlerAlexandru CalotoiuTorsten HoeflerFelix Wolf 0001Isoefficiency in Practice: Configuring and Understanding the Performance of Task-based Applications.PPoPP131-1432017Conference and Workshop Papersclosedconf/ppopp/ShudlerCHW1710.1145/3018743.3018770https://doi.org/10.1145/3018743.3018770https://dblp.org/rec/conf/ppopp/ShudlerCHW17URL#3004572Guy L. Steele Jr.It's Time for a New Old Language.PPoPP12017Conference and Workshop Papersclosedconf/ppopp/Steele1710.1145/3018743.3018773https://doi.org/10.1145/3018743.3018773https://dblp.org/rec/conf/ppopp/Steele17URL#3004573Guy L. Steele Jr.Jean-Baptiste TristanUsing Butterfly-Patterned Partial Sums to Draw from Discrete Distributions.PPoPP341-3552017Conference and Workshop Papersclosedconf/ppopp/SteeleT1710.1145/3018743.3018757https://doi.org/10.1145/3018743.3018757https://dblp.org/rec/conf/ppopp/SteeleT17URL#3004574Yuan TangRonghui YouPOSTER: STAR (Space-Time Adaptive and Reductive) Algorithms for Real-World Space-Time Optimality.PPoPP455-4562017Conference and Workshop Papersclosedconf/ppopp/TangY1710.1145/3018743.3019029https://doi.org/10.1145/3018743.3019029https://dblp.org/rec/conf/ppopp/TangY17URL#3004575Xiongchao TangJidong ZhaiBowen Yu 0003Wenguang ChenWeimin ZhengSelf-Checkpoint: An In-Memory Checkpoint Method Using Less Space and Its Practice on Fault-Tolerant HPL.PPoPP401-4132017Conference and Workshop Papersclosedconf/ppopp/TangZYCZ1710.1145/3018743.3018745https://doi.org/10.1145/3018743.3018745https://dblp.org/rec/conf/ppopp/TangZYCZ17URL#3004576Robert UtterbackKunal AgrawalI-Ting Angelina LeeMilind Kulkarni 0001Processor-Oblivious Record and Replay.PPoPP145-1612017Conference and Workshop Papersclosedconf/ppopp/UtterbackALK1710.1145/3018743.3018764https://doi.org/10.1145/3018743.3018764https://dblp.org/rec/conf/ppopp/UtterbackALK17URL#3004577Michael Vollmer 0003Ryan G. ScottMadanlal MusuvathiRyan R. NewtonSC-Haskell: Sequential Consistency in Languages That Minimize Mutable Shared Heap.PPoPP283-2982017Conference and Workshop Papersclosedconf/ppopp/VollmerSMN1710.1145/3018743.3018746https://doi.org/10.1145/3018743.3018746https://dblp.org/rec/conf/ppopp/VollmerSMN17URL#3004579Xin WangWeihua ZhangZhaoguo WangZiyun WeiHaibo Chen 0001Wenyun ZhaoEunomia: Scaling Concurrent Search Trees under Contention Using HTM.PPoPP385-3992017Conference and Workshop Papersclosedconf/ppopp/WangZWWCZ1710.1145/3018743.3018752https://doi.org/10.1145/3018743.3018752https://dblp.org/rec/conf/ppopp/WangZWWCZ17URL#3004580Panruo WuNathan DeBardelebenQiang GuanSean BlanchardJieyang ChenDingwen TaoXin Liang 0001Kaiming OuyangZizhong ChenSilent Data Corruption Resilient Two-sided Matrix Factorizations.PPoPP415-4272017Conference and Workshop Papersclosedconf/ppopp/WuDGBCTLOC1710.1145/3018743.3018750https://doi.org/10.1145/3018743.3018750https://dblp.org/rec/conf/ppopp/WuDGBCTLOC17URL#3004582Mingyu Wu 0001Haibing GuanBinyu ZangHaibo Chen 0001POSTER: Recovering Performance for Vector-based Machine Learning on Managed Runtime.PPoPP457-4582017Conference and Workshop Papersclosedconf/ppopp/WuGZC1710.1145/3018743.3019039https://doi.org/10.1145/3018743.3019039https://dblp.org/rec/conf/ppopp/WuGZC17URL#3004583Tsung Tai YehAmit SabnePutt SakdhnagoolRudolf EigenmannTimothy G. RogersPagoda: Fine-Grained GPU Resource Virtualization for Narrow Tasks.PPoPP221-2342017Conference and Workshop Papersclosedconf/ppopp/YehSSER1710.1145/3018743.3018754https://doi.org/10.1145/3018743.3018754https://dblp.org/rec/conf/ppopp/YehSSER17URL#3004584Minjia ZhangSwarnendu BiswasMichael D. BondPOSTER: On the Problem of Consistency Exceptions in the Context of Strong Memory Models.PPoPP459-4602017Conference and Workshop Papersclosedconf/ppopp/ZhangBB1710.1145/3018743.3019024https://doi.org/10.1145/3018743.3019024https://dblp.org/rec/conf/ppopp/ZhangBB17URL#3004586Xiuxia ZhangGuangming TanShuangbai XueJiajia Li 0001Keren Zhou 0001Mingyu Chen 0001Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning.PPoPP31-432017Conference and Workshop Papersclosedconf/ppopp/ZhangTXLZC1710.1145/3018743.3018755https://doi.org/10.1145/3018743.3018755https://dblp.org/rec/conf/ppopp/ZhangTXLZC17URL#3004587Yue Zhao 0011Chunhua LiaoXipeng ShenPOSTER: An Infrastructure for HPC Knowledge Sharing and Reuse.PPoPP461-4622017Conference and Workshop Papersclosedconf/ppopp/ZhaoLS1710.1145/3018743.3019023https://doi.org/10.1145/3018743.3019023https://dblp.org/rec/conf/ppopp/ZhaoLS17URL#3004588Vivek SarkarLawrence RauchwergerProceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Austin, TX, USA, February 4-8, 2017PPoPPACM2017Editorshipconf/ppopp/201710.1145/3018743https://doi.org/10.1145/3018743https://dblp.org/rec/conf/ppopp/2017URL#3037986