


default search action
Parallel Computing, Volume 40
Volume 40, Number 1, January 2014
- Leonid Yavits, Amir Morad, Ran Ginosar:

The effect of communication and synchronization on Amdahl's law in multicore systems. 1-16 - Lois Curfman McInnes, Barry Smith, Hong Zhang, Richard Tran Mills:

Hierarchical Krylov and nested Krylov methods for extreme-scale computing. 17-31
Volume 40, Number 2, February 2014
- Pavan Balaji, Zhiyi Huang:

Special issue on programming models and applications for multicores and manycores - Guest Editors' Introduction. 33-34 - Mark Utting

, Min-Hsien Weng
, John G. Cleary:
The JStar language philosophy. 35-50 - Weihua Sheng, Stefan Schürmans, Maximilian Odendahl, Mark Bertsch, Vitaliy Volevach, Rainer Leupers, Gerd Ascheid:

A compiler infrastructure for embedded heterogeneous MPSoCs. 51-68 - Vikas, Nasser Giacaman, Oliver Sinnen

:
Multiprocessing with GUI-awareness using OpenMP-like directives in Java. 69-89 - Oded Green, Yitzhak Birk

:
Scheduling directives: Accelerating shared-memory many-core processor execution. 90-106 - Zhenning Wang, Long Zheng, Quan Chen, Minyi Guo:

CPU + GPU scheduling with asymptotic profiling. 107-115 - Yu Liu, Kento Emoto, Zhenjiang Hu:

A Generate-Test-Aggregate parallel programming library for systematic parallel programming. 116-135 - Zhijun Hao, Chenning Xie, Haibo Chen, Binyu Zang:

X10-FT: Transparent fault tolerance for APGAS language and runtime. 136-156
Volume 40, Numbers 3-4, March 2014
- Mohammad Reza Selim, Mohammed Ziaur Rahman:

Carrying on the legacy of imperative languages in the future parallel computing era. 1-33 - Jean-Yves L'Excellent, Wissam M. Sid-Lakhdar:

A study of shared-memory parallelism in a multifrontal solver. 34-46
Volume 40, Numbers 5-6, May 2014
- Urban Borstnik, Joost VandeVondele

, Valéry Weber, Jürg Hutter:
Sparse matrix multiplication: The distributed block-compressed sparse row library. 47-58 - Yuki Sugimoto, Fumihiko Ino, Kenichi Hagihara:

Improving cache locality for GPU-based volume rendering. 59-69 - Ray-Bing Chen

, Yaohung M. Tsai, Weichung Wang
:
Adaptive block size for dense QR factorization in hybrid CPU-GPU systems via statistical modeling. 70-85 - Michael J. Hallock

, John E. Stone
, Elijah Roberts, Corey Fry, Zaida Luthey-Schulten
:
Simulation of reaction diffusion processes over biologically relevant size and time scales using multi-GPU workstations. 86-99 - Ivan Teixido

, Francesc Sebé
, Josep Conde
, Francesc Solsona
:
MPI-based implementation of an enhanced algorithm to solve the LPN problem in a memory-constrained environment. 100-112 - Alberto F. Martín

, Ruymán Reyes
, Rosa M. Badia
, Enrique S. Quintana-Ortí
:
Leveraging task-parallelism in message-passing dense matrix factorizations using SMPSs. 113-128 - Jose Antonio Pascual

, José Miguel-Alonso
, José Antonio Lozano:
Application-aware metrics for partition selection in cube-shaped topologies. 129-139 - Robert Hallberg

, Alistair Adcroft
:
An order-invariant real-to-integer conversion sum. 140-143 - Oscar Peredo

, Julián M. Ortiz
, José R. Herrero, Cristóbal Samaniego
:
Tuning and hybrid parallelization of a genetic-based multi-point statistics simulation code. 144-158
Volume 40, Number 7, July 2014
- Costas Bekas, Ananth Grama, Yousef Saad

, Olaf Schenk
:
Parallel matrix algorithms. 159-160 - Robert Andrew, Nicholas J. Dingle:

Implementing QR factorization updating algorithms on GPUs. 161-172 - Yiannis Cotronis, Elias Konstantinidis, Maria A. Louka

, Nikolaos M. Missirlis
:
A comparison of CPU and GPU implementations for solving the Convection Diffusion equation using the local Modified SOR method. 173-185 - Thomas Auckenthaler, Thomas Huckle, Roland Wittmann:

A blocked QR-decomposition for the parallel symmetric eigenvalue problem. 186-194 - Hasan Metin Aktulga

, Lin Lin, Christopher Haine, Esmond G. Ng, Chao Yang:
Parallel eigenvalue calculation based on multiple shift-invert Lanczos and contour integral based spectral projection method. 195-212 - Marc Baboulin, Dulceneia Becker, George Bosilca, Anthony Danalis, Jack J. Dongarra:

An efficient distributed randomized algorithm for solving large dense symmetric indefinite linear systems. 213-223 - Pieter Ghysels, Wim Vanroose

:
Hiding global synchronization latency in the preconditioned Conjugate Gradient algorithm. 224-238 - Erhan Turan

, Peter Arbenz
:
Large scale micro finite element analysis of 3D bone poroelasticity. 239-250 - Michele Martone:

Efficient multithreaded untransposed, transposed or symmetric sparse matrix-vector multiplication with the Recursive Sparse Blocks format. 251-270 - Lars Karlsson, Bo Kågström, Eddie Wadbro

:
Fine-grained bulge-chasing kernels for strongly scalable parallel QR algorithms. 271-288 - Johannes Langguth, Ariful Azad, Mahantesh Halappanavar, Fredrik Manne:

On parallel push-relabel based algorithms for bipartite maximum matching. 289-308 - Jesús Cámara

, Javier Cuenca
, Luis-Pedro García, Domingo Giménez:
Auto-tuned nested parallelism: A way to reduce the execution time of scientific software in NUMA systems. 309-327 - Emanuel H. Rubensson, Elias Rudberg

:
Chunks and Tasks: A programming model for parallelization of dynamic algorithms. 328-343
Volume 40, Number 8, August 2014
- María Botón-Fernández, Miguel A. Vega-Rodríguez

, Francisco Prieto Castrillo
:
Self-adaptivity for grid applications. An Efficient Resources Selection model based on evolutionary computation algorithms. 345-361 - Chihiro Kodama

, Masaaki Terai, Akira T. Noda, Yohei Yamada, Masaki Satoh
, Tatsuya Seiki
, Shin-ichi Iga, Hisashi Yashiro
, Hirofumi Tomita, Kazuo Minami:
Scalable rank-mapping algorithm for an icosahedral grid system on the massive parallel computer with a 3-D torus network. 362-373 - Julio Sánchez-Curto

, Pedro Chamorro-Posada
, Graham S. McDonald
:
Efficient parallel implementation of the nonparaxial beam propagation method. 394-407 - Jie Chen, Tom L. H. Li, Mihai Anitescu

:
A parallel linear solver for multilevel Toeplitz systems with possibly several right-hand sides. 408-424 - Roman Wyrzykowski

, Lukasz Szustak
, Krzysztof Rojek
:
Parallelization of 2D MPDATA EULAG algorithm on hybrid architectures with GPU accelerators. 425-447
Volume 40, Number 9, October 2014
- João Andrade, Gabriel Falcão Paiva Fernandes

, Vítor Manuel Mendes da Silva:
Optimized Fast Walsh-Hadamard Transform on GPUs for non-binary LDPC decoding. 449-453
- Ehsan Totoni, Michael T. Heath, Laxmikant V. Kalé:

Structure-adaptive parallel solution of sparse triangular linear systems. 454-470 - Diego Arroyuelo, Carolina Bonacic, Veronica Gil-Costa

, Mauricio Marín
, Gonzalo Navarro:
Distributed text search using suffix arrays. 471-495 - Yingchong Situ, Chandra S. Martha, Matthew E. Louis, Zhiyuan Li, Ahmed H. Sameh, Gregory A. Blaisdell, Anastasios S. Lyrintzis

:
Petascale large eddy simulation of jet engine noise based on the truncated SPIKE algorithm. 496-511
- Lucas Mello Schnorr, Philippe Olivier Alexandre Navaux:

Best of SBAC-PAD 2012. 512-513 - Luiz E. Ramos, Ricardo Bianchini:

Robust performance in hybrid-memory cooperative caches. 514-525 - Joefon Jann, R. Sarma Burugula, Ching-Farn Eric Wu, Kaoutar El Maghraoui

:
Towards an immortal operating system in virtual environments. 526-535 - Esteban Meneses

, Osman Sarood, Laxmikant V. Kalé:
Energy profile of rollback-recovery strategies in high performance computing. 536-547 - Teo Milanez, Caroline Collange, Fernando Magno Quintão Pereira, Wagner Meira Jr., Renato Ferreira:

Thread scheduling and memory coalescing for dynamic vectorization of SPMD workloads. 548-558
Volume 40, Number 10, December 2014
- Li Tan, Shashank Kothapalli, Longxiang Chen, Omar Hussaini, Ryan Bissiri, Zizhong Chen

:
A survey of power and energy efficient techniques for high performance numerical linear algebra operations. 559-573
- Antonio J. Peña

, Carlos Reaño
, Federico Silla, Rafael Mayo
, Enrique S. Quintana-Ortí
, José Duato
:
A complete and efficient CUDA-sharing solution for HPC clusters. 574-588 - George Teodoro, Tony Pan

, Tahsin M. Kurç, Jun Kong, Lee Cooper, Scott Klasky, Joel H. Saltz:
Region templates: Data representation and management for high-throughput image analysis. 589-610 - Yizhuo Wang, Yang Zhang, Yan Su, Xiaojun Wang, Xu Chen, Weixing Ji, Feng Shi:

An adaptive and hierarchical task scheduling scheme for multi-core clusters. 611-627 - Andrew White, Soo-Young Lee:

Derivation of optimal input parameters for minimizing execution time of matrix-based computations on a GPU. 628-645 - Nicholas Horelik, Andrew R. Siegel, Benoit Forget, Kord Smith:

Monte Carlo domain decomposition for robust nuclear reactor analysis. 646-660 - Leandro A. J. Marzulo, Tiago A. O. Alves, Felipe M. G. França

, Vítor Santos Costa
:
Couillard: Parallel programming via coarse-grained Data-flow Compilation. 661-680
- Philip C. Roth, Yong Chen

:
Guest Editors' introduction to the special issue on "DISCS-2013". 681 - Jesse Weaver, Vito Giovanni Castellana, Alessandro Morari, Antonino Tumeo

, Sumit Purohit, Alan R. Chappell
, David Haglin, Oreste Villa, Sutanay Choudhury, Karen Schuchardt, John Feo:
Toward a data scalable solution for facilitating discovery of science resources. 682-696 - Jiangling Yin, Junyao Zhang, Jun Wang

, Wu-chun Feng:
SDAFT: A novel scalable data access framework for parallel BLAST. 697-709 - Yong Li, Dan Feng, Zhan Shi:

Heterogeneous-aware cache partitioning: Improving the fairness of shared storage cache. 710-721 - Joong-Yeon Cho, Hyun-Wook Jin, Min Lee, Karsten Schwan:

Dynamic core affinity for high-performance file upload on Hadoop Distributed File System. 722-737 - Peter Coetzee, Matthew Leeke, Stephen A. Jarvis

:
Towards unified secure on- and off-line analytics at scale. 738-753 - Dominique LaSalle, George Karypis

:
MPI for Big Data: New tricks for an old dog. 754-767 - Lan Vu, Gita Alaghband:

Novel parallel method for association rule mining on multi-core shared memory systems. 768-785

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














