


default search action
PACT 1996: Boston, MA, USA
- Proceedings of the Fifth International Conference on Parallel Architectures and Compilation Techniques, PACT'96, Boston, MA, USA, October 20-23, 1996. IEEE Computer Society 1996, ISBN 0-8186-7632-9

- Stephen Jenks, Jean-Luc Gaudiot:

Nomadic Threads: a migrating multithreaded approach to remote memory accesses in multiprocessors. 2-11 - Laurie J. Hendren, Xinan Tang, Yingchun Zhu, Guang R. Gao, Xun Xue, Haiying Cai, Pierre Ouellet:

Compiling C for the EARTH multithreaded architecture. 12-23 - Michael Bekerman, Avi Mendelson, Gad Sheaffer:

Performance and hardware complexity tradeoffs in designing multithreaded architectures. 24-34 - Jenn-Yuan Tsai, Pen-Chung Yew:

The superthreaded architecture: thread pipelining with run-time data dependence checking and control speculation. 35-46 - Po-Yung Chang, Marius Evers, Yale N. Patt:

Improving branch prediction accuracy by reducing pattern history table interference. 48-57 - Stéphan Jourdan, Tse-Hao Hsing, Jared Stark, Yale N. Patt:

The effects of mispredicted-path execution on branch prediction structures. 58-67 - Rafael H. Saavedra, Daeyeon Park:

Improving the effectiveness of software prefetching with adaptive executions. 68-78 - Josep Llosa, Antonio González

, Eduard Ayguadé, Mateo Valero:
Swing module scheduling: a lifetime-sensitive approach. 80-86 - Steven Novack, Alexandru Nicolau:

An efficient, global resource-directed approach to exploiting instruction-level parallelism. 87-96 - P. Tinumalai, Boris Beylin, Krishna Subramanian:

The design of a modulo scheduler for a superscalar RISC processor. 97-109 - Marc Pouzet:

Using the parallel complexity of programs to improve compaction. 111-115 - Kazuaki Okamoto, Shuichi Sakai, Hiroshi Matsuoka, Takashi Yokota, Hideo Hirono:

Multithread execution mechanisms on RICA-1 for massively parallel computation. 116-121 - Wen-Yen Lin, Jean-Luc Gaudiot:

I-Structure Software Cache: a split-phase transaction runtime cache system. 122-126 - Claude G. Diderich, Marc Gengler:

A heuristic approach for finding a solution to the constant-degree parallelism alignment problem. 127-132 - Andrew Sohn, Jui Ku, Yuetsu Kodama, Mitsuhisa Sato, Hirofumi Sakane, Hayato Yamana, Shuichi Sakai, Yoshinori Yamaguchi:

Identifying the capability of overlapping computation with communication. 133-138 - Shigeru Kusakabe, Taku Nagai, Kentaro Inenaga, Makoto Amamiya:

Address generation of dataflow fine-grain parallel data-structures on a distributed-memory computer. 139-143 - Alexander Goikhman, Jacob Katzenelson:

Elastic-plastic flow simulation using the Supercomputer Toolkit. 144-149 - Dmitry Arapov, Alexey Ya. Kalinov, Alexey L. Lastovetsky

:
Managing the computing space in the mpC compiler. 150-155 - Sekhar Darbha, Santosh Pande

:
A robust compile time method for scheduling task parallelism on distributed memory machines. 156-162 - M. Loikkanen, Nader Bagherzadeh:

A fine-grain multithreading superscalar architecture. 163-168 - Sébastien Hily, André Seznec:

Branch prediction and simultaneous multithreading. 169-173 - Mayez A. Al-Mouhamed, Lubomir F. Bic, Husam Abu-Haimed:

A compiler transformation to improve memory access time in SIMD systems. 174-178 - Steven Wallace, Nader Bagherzadeh:

A scalable register file architecture for dynamically scheduled processors. 179-184 - Lorenz Huelsbergen:

Dynamic parallelization of modifications to directed acyclic graphs. 186-197 - Andrew Shaw, Arvind, R. Paul Johnson:

Performance tuning scientific codes for dataflow execution. 198-207 - Danny Krizanc, Anton Saarimaki:

Bulk Synchronous Parallel: practical experience with a model for parallel computing. 208-217 - David Abramson, Rok Sosic, Greg Watson:

Implementation techniques for a parallel relative debugger. 218-226 - Shin-Ming Liu, Raymond Lo, Fred C. Chow:

Loop induction variable canonicalization in parallelizing compilers. 228-237 - Steve Carr

:
Combining optimization for cache and instruction-level parallelism. 238-247 - Michael F. P. O'Boyle, Andy Nisbet, Rupert W. Ford:

A compiler algorithm to reduce invalidation latency in virtual shared memory systems. 248-257 - Daeyeon Park, Rafael H. Saavedra:

Adaptive granularity: transparent integration of fine- and coarse-grain communication. 260-268 - Chris J. Newburn, John Paul Shen:

Automatic partitioning of signal processing programs for symmetric multiprocessors. 269-280 - Alain Darte, Frédéric Vivien:

Optimal fine and medium grain parallelism detection in polyhedral reduced dependence graphs. 281-291 - Thomas Rauber, Gudula Rünger:

The compiler TwoL for the design of parallel implementations. 292-301

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














