


12th IPPS / 9. SPDP 1998: Orlando, Florida, USA
- 12th International Parallel Processing Symposium / 9th Symposium on Parallel and Distributed Processing (IPPS/SPDP '98), March 30 - April 3, 1998, Orlando, Florida, USA, Proceedings. IEEE Computer Society 1998, ISBN 0-8186-8403-8

Session 1: Communication
- Jyh-Jong Tsay, Wen-Tsong Wang: Nearly Optimal Algorithms for Broadcast on d-Dimensional All-Port and Wormhole-Routed Torus. 2-9
- Songluan Cang, Jie Wu: Minimizing Total Communication Distance of Time-Step Optimal Broadcast in Mesh Networks. 10-17
- Vivek Garg, David E. Schimmel: Hiding Communication Latency in Data Parallel Applications. 18-23
- Erik D. Demaine: Protocols for Non-Deterministic Communication over Synchronous Channels. 24-30
- Koji Nakano, Stephan Olariu, James L. Schwing: Broadcast-Efficient Algorithms on the Coarse-Grain Broadcast Communication Model with Few Channels. 31-35
- Y. Charlie Hu: Optimal All-to-Some Personalized Communication on Hypercubes. 36-40
Session 2: Compilers I
- Bo Lu, John M. Mellor-Crummey: Compiler-Optimization of Implicit Reductions for Distributed Memory Multiprocessors. 42-51
- Gerardo Bandera, Pablo P. Trabado, Emilio L. Zapata: Local Enumeration Techniques for Sparse Algorithms. 52-56
- Yi Tian, Edwin Hsing-Mean Sha, Chantana Chantrapornchai, Peter M. Kogge: Optimizing Data Scheduling on Processor-in-Memory Arrays. 57-61
- Gwan-Hwan Hwang, Jenq Kuen Lee: An Expression-Rewriting Framework to Generic Communication Sets for HPF Programs with Block-Cyclic Distribution. 62-68
- Mahmut T. Kandemir, Prithviraj Banerjee, Alok N. Choudhary, J. Ramanujam, U. Nagaraj Shenoy: A Generalized Framework for Global Communication Optimization. 69-73
- Dhruva R. Chakrabarti, Prithviraj Banerjee, Antonio Lain: Evaluation of Compiler and Runtime Library Approaches for Supporting Parallel Regular Applications. 74-79
Session 3: Mathematical Applications
- Michael J. Quinn, Alexey G. Malishevsky, Nagajagadeswar Seelam, Yan Zhao: Preliminary Results from a Parallel MATLAB Compiler. 81-87
- Dolors Royo, Antonio González, Miguel Valero-García: Jacobi Orderings for Multi-Port Hypercubes. 88-97
- Paul D. Hovland, Christian H. Bischof: Automatic Differentiation for Message-Passing Parallel Programs. 98-104
- Peter R. Cappello, Ömer Egecioglu: Processor Lower Bound Formulas for Array Computations and Parametric Diophantine Systems. 105-109
- John A. Gunnels, Calvin Lin, Greg Morrow, Robert A. van de Geijn: A Flexible Class of Parallel Matrix Multiplication Algorithms. 110-116
- Peter Sulatycke, Kanad Ghose: Caching-Efficient Multithreaded Fast Multiplication of Sparse Matrices. 117-123
Session 4: Networks
- Yuanyuan Yang, Jianchao Wang, Yi Pan: Permutation Capability of Optical Multistage Interconnect Networks. 125-133
- Rajeev Sivaram, Craig B. Stunkel, Dhabaleswar K. Panda: HIPIQS: A High-Performance Switch Architecture Using Input Queuing. 134-143
- Claudson F. Bornstein, Ami Litman, Bruce M. Maggs, Ramesh K. Sitaraman, Tal Yatzkar: On the Bisection Width and Expansion of Butterfly Networks. 144-150
- David Coudert, Afonso Ferreira, Xavier Muñoz: Multiprocessor Architectures Using Multi-Hop Multi-OPS Lightwave Networks and Distributed Control. 151-155
- Charles A. Salisbury, Rami G. Melhem: Distributed Dynamic Control of Circuit-Switched Banyan Networks. 156-161
- Raymond Hoare, Henry G. Dietz: A Case for Aggregate Networks. 162-166
Session 5: Compilers II
- Ramaswamy Govindarajan, N. S. S. Narasimha Rao, Erik R. Altman, Guang R. Gao: An Enhanced Co-Scheduling Method Using Reduced MS-State Diagrams. 168-175
- Dragan Milicev, Zoran Jovanovic: Predicated Software Pipelining Technique for Loops with Conditions. 176-180
- Weng-Long Chang, Chih-Ping Chu, Jesse Wu: The Generalized Lambda Test. 181-186
- Yunheung Paek, David A. Padua: Experimental Study of Compiler Techniques for NUMA Machines. 187-193
- Amod K. Dani, V. Janaki Ramanan, Ramaswamy Govindarajan: Register-Sensitive Software Pipelining. 194-198
- M. Srinivas, Alexandru Nicolau: Analyzing the Individual/Combined Effects of Speculative and Guarded Execution on a Superscalar Architecture. 199-208
Session 6: Signal and Image Processing
- Frank Munz, T. Stephan, Ursula Maier, Thomas Ludwig, Arndt Bode, Sibylle Ziegler, Stephan G. Nekolla, Peter Bartenstein, Markus Schwaiger: NOW Parallel Reconstruction of Functional Images. 210-214
- Neelima Gupta, Sandeep Sen: An Improved Output-Size Sensitive Parallel Algorithm for Hidden-Surface Removal for Terrains. 215-219
- Alok N. Choudhary, Wei-keng Liao, Donald D. Weiner, Pramod K. Varshney, Richard W. Linderman, Mark H. Linderman: Design, Implementation and Evaluation of Parallel Pipelined STAP on Parallel Computers. 220-225
- Mikael Taveniku, Anders Ahlander, Magnus Jonsson, Bertil Svensson: The VEGA Moderately Parallel MIMD, Moderately Parallel SIMD, Architectures for High Performance Array Signal Processing. 226-232
- Christoph Giess, Achim Mayer, Harald Evers, Hans-Peter Meinzer: Medical Image Processing and Visualization on Heterogeneous Clusters of Symmetric Multiprocessors Using MPI and POSIX Threads. 233-237
- Radovan Sernec, Matej Zajc, Jurij F. Tasic: A Quantitative Code Analysis of Scientific Systolic Programs: DSP vs. Matrix Algorithms. 238-242
Session 7: Collective Communication
- Ran Libeskind-Hadas, Dominic Mazzoni, Ranjith Rajagopalan: Tree-Based Multicasting in Wormhole-Routed Irregular Topologies. 244-249
- Thomas Hopfner, Franz Fischer, Georg Färber: NoWait-RPC: Extending ONC RPC to a Fully Compatible Message Passing System. 250-254
- Jin-Soo Kim, Soonhoi Ha, Chu Shik Jhon: Efficient Barrier Synchronization Mechanism for BSP Model on Message Passing Architectures. 255-259
- Gautam Shah, Jarek Nieplocha, Jamshed H. Mirza, Chulho Kim, Robert J. Harrison, Rama Govindaraju, Kevin J. Gildea, Paul DiNicola, Carl A. Bender: Performance and Experience with LAPI - a New High-Performance Communication Library for the IBM RS/6000 SP. 260-266
- Fabrizio Petrini: Total Exchange on k-ary n-cubes with Adaptive Routing. 267-271
- Steven Lumetta, David E. Culler: Managing Concurrent Access for Shared Memory Active Messages. 272-278
Session 8: Memory Hierarchy and I/O
- Jaechun No, Sung-Soon Park, Jesús Carretero, Alok N. Choudhary, Pang Chen: Design and Implementation of a Parallel I/O Runtime System for Irregular Applications. 280-284
- Ian Parsons, Jonathan Schaeffer, Duane Szafron, Ronald C. Unrau: Using PI/OT to Support Complex Parallel I/O. 285-291
- Chidamber Kulkarni, Francky Catthoor, Hugo De Man: Code Transformations for Low Power Caching in Embedded Multimedia Processors. 292-297
- Ibraheem Al-Furaih, Sanjay Ranka: Memory Hierarchy Management for Iterative Graph Structures. 298-302
- Jang Sun Lee, Sung Hoon Ko, Sanjay Ranka, Byung Eui Min: High-Performance External Computations Using User-Controllable I/O. 303-307
- Hiroshi Tezuka, Francis O'Carroll, Atsushi Hori, Yutaka Ishikawa: Pin-Down Cache: A Virtual Memory Management Technique for Zero-Copy Communication. 308-314
Session 9: Algorithms I
- Graham M. Megson, I. M. Bland: Synthesis of a Systolic Array Genetic Algorithm. 316-320
- Seungjo Bae, Dongmin Kim, Sanjay Ranka: Vector Prefix and Reduction Computation on Coarse-Grained, Distributed-Memory Parallel Machines. 321-325
- Yuji Shinano, Tetsuya Fujie, Yoshiko Ikebe, Ryuichi Hirabayashi: Solving the Maximum Clique Problem Using PUBB. 326-332
- Rong Lin, Koji Nakano, Stephan Olariu, Maria Cristina Pinotti, James L. Schwing, Albert Y. Zomaya: A Scalable VLSI Architecture for Binary Prefix Sums. 333-337
- Bojana Obrenic: Emulating Direct Products by Index-Shuffle Graphs. 338-344
- Lee Wang, Anthony A. Maciejewski, Howard Jay Siegel, Vwani P. Roychowdhury: A Comparative Study of Five Parallel Genetic Algorithms Using the Traveling Salesman Problem. 345-349
Session 10: Routing
- Yuanyuan Yang, Jianchao Wang: A New Self-Routing Multicast Network. 351-357
- Ran Libeskind-Hadas, Dominic Mazzoni, Ranjith Rajagopalan: Optimal Contention-Free Unicast-Based Multicasting in Switch-Based Networks of Workstations. 358-364
- Weifa Liang, Hong Shen: Multicasting and Broadcasting in Large WDM Networks. 365-369
- Shan-Chyun Ku, Biing-Feng Wang: Optimally Locating a Structured Facility of a Specified Length in a Weighted Tree Network. 370-374
- Andrea Pietracaprina: Deterministic Routing of h-relations on the Multibutterfly. 375-379
- Costas Busch, Marios Mavronicolas: An Efficient Counting Network. 380-384
Session 11: Operating Systems and Scheduling
- Marcio Merino Fernandes, Josep Llosa, Nigel P. Topham: Partitioned Schedules for Clustered VLIW Architectures. 386-391
- Kelvin K. Yue, David J. Lilja: Dynamic Processor Allocation with the Solaris Operating System. 392-397
- Shivakant Mishra, Rongguang Yang: Thread-Based vs Event-Based Implementation of a Group Communication Service. 398-402
- Sivarama P. Dandamudi, Hai Yu: Performance Sensitivity of Space Sharing Processor Scheduling in Distributed-Memory Multicomputers. 403-409
- Boris Weissman, Benedict Gomes, Jürgen Quittek, Michael Holtkamp: Efficient Fine-Grain Thread Migration with Active Threads. 410-414
- Miquel A. Senar, Ana Ripoll, Ana Cortés, Emilio Luque: Clustering and Reassignment-Based Mapping Strategy for Message Passing Architectures. 415-421
Session 12: Algorithms II
- Keqin Li: Asymptotically Optimal Randomized Tree Embedding in Static Networks. 423-430
- Bader Almohammad, Bella Bose: Resource Placement in 2D Tori. 431-438
- Tatsuya Hayashi, Koji Nakano, Stephan Olariu: An O((log log n)^2) Time Convex Hull Algorithm on Reconfigurable Meshes. 439-446
- Vincenzo Auletta, Sajal K. Das, Amelia De Vivo, Maria Cristina Pinotti, Vittorio Scarano: Toward a Universal Mapping Algorithm for Accessing Trees in Parallel Memory Systems. 447-454
- Marius Zimand: Sharing Random Bits with No Process Coordination. 455-459
- M. Cemil Azizoglu, Ömer Egecioglu: Lower Bounds on Communication Loads and Optimal Placements in Torus Networks. 460-464
Session 13: Multiprocessor Performance Evaluation
- Laxmi N. Bhuyan, Hu-Jun Wang, Ravi R. Iyer, Akhilesh Kumar: Impact of Switch Design on the Application Performance of Cache-Coherent Multiprocessors. 466-474
- Hongzhang Shan, Jaswinder Pal Singh: Parallel Tree Building on a Range of Shared Address Space Multiprocessors: Algorithms and Application Performance. 475-484
- Gheith A. Abandah, Edward S. Davidson: Configuration Independent Analysis for Characterizing Shared-Memory Applications. 485-491
- Ben H. H. Juurlink: Experimental Validation of Parallel Computation Models on the Intel Paragon. 492-497
- Lars Lundberg, Håkan Lennerstad: Comparing the Optimal Performance of Different MIMD Multiprocessor Architectures. 498-502
- Ashwini K. Nanda, Yiming Hu, Moriyoshi Ohara, Caroline Benveniste, Mark Giampapa, Maged M. Michael: The Design of COMPASS: An Execution Driven Simulator for Commercial Applications Running on Shared Memory Multiprocessors. 503-509
Session 14: Scheduling
- Sylvain Lauzac, Rami G. Melhem, Daniel Mossé: An Efficient RMS Admission Control and Its Application to Multiprocessor Scheduling. 511-518
- Arnold L. Rosenberg: Guidelines for Data-Parallel Cycle-Stealing in Networks of Workstations. 519-523
- Michel Cosnard, Emmanuel Jeannot, Laurence Rougeot: Low Memory Cost Dynamic Scheduling of Large Coarse Grain Task Graphs. 524-530
- Yu-Kwong Kwok, Ishfaq Ahmad: Benchmarking the Task Graph Scheduling Algorithms. 531-537
- Benjamin S. Macey, Albert Y. Zomaya: A Performance Evaluation of CP List Scheduling Heuristics for Communication Intensive Task Graphs. 538-541
- Dror G. Feitelson, Ahuva Mu'alem Weil: Utilization and Predictability in Scheduling the IBM SP2 with Backfilling. 542-546
Session 15: Databases and Sorting
- Sanjay Goil, Alok N. Choudhary: High Performance Data Mining Using Data Cubes on Parallel Computers. 548-555
- Khaled Alsabti, Sanjay Ranka, Vineet Singh: An Efficient Parallel Algorithms for High Dimensional Similarity Join. 556-560
- David R. Helman, Joseph F. JáJá: Sorting on Clusters of SMPs. 561-567
- Ju-wook Jang: An AT^2 Optimal Mapping of Sorting onto the Mesh Connected Array without Comparators. 568-572
- Mahesh V. Joshi, George Karypis, Vipin Kumar: ScalParC: A New Scalable and Efficient Parallel Classification Algorithm for Mining Large Datasets. 573-579
- Kothuri Venkata Ravi Kanth, David Serena, Ambuj K. Singh: Improved Concurrency Control Techniques For Multi-Dimensional Index Structures. 580-586
Industrial Track: Session I Environments, Tools, and Evaluation Methods
- Brian Q. Brode, Chris R. Warber: DEEP: A Development Environment for Parallel Programs. 588-593
- Milissa M. Benincasa, Richard Besler, Diane Brassaw, Ralph Kohler Jr.: Rapid Development of Real-Time Systems Using RTExpress. 594-599
- Marc E. Campbell: Evaluating ASIC, DSP, and RISC Architectures for Embedded Applications. 600-603
- Vladimir Shurbanov, Dimiter R. Avresky, Robert W. Horst: The Effect of the Router Arbitration Policy on the Scalability of ServerNet. 604-609
Industrial Track: Session II Reconfigurable Systems
- Bradley K. Fross, Dennis M. Hawver, James B. Peterson: WILDFIRE Heterogeneous Adaptive Parallel Processing Systems. 611-615
- Don Davis, Jonathan Harris: ACEcard: A High-Performance Architecture for Run-Time Reconfiguration. 616-619
- John Schwel: A Hardware/Software Co-Design System Using Configurable Computing Technology. 620-625
Session 16: Performance Prediction and Evaluation
- Venkata Krishnan, Josep Torrellas: A Clustered Approach to Multithreaded Processors. 627-634
- Federico Bassetti, Kei Davis, Daniel J. Quinlan: C++ Expression Templates Performance Issues in Scientific Computing. 635-639
- Benjamin Bishop, Robert Michael Owens, Mary Jane Irwin: Aggressive Dynamic Execution of Multimedia Kernel Traces. 640-646
- Jennifer M. Schopf, Francine Berman: Performance Prediction in Production Environments. 647-653
- Radu Rugina, Klaus E. Schauser: Predicting the Running Times of Parallel Programs by Simulation. 654-660
Session 17: Software Distributed Shared Memory
- Hwansoo Han, Chau-Wen Tseng: Compile-Time Synchronization Optimization for Software DSMs. 662-669
- Taesoon Park, Heon Young Yeom: An Efficient Logging Scheme for Lazy Release Consistent Distributed Shared Memory Systems. 670-674
- Peter J. Keleher: Update Protocols and Iterative Scientific Applications. 675-681
- Alex Gontmakher, Assaf Schuster: Characterization for Java Memory Behavior. 682-686
- Bryan Roger Buck, Peter J. Keleher: Locality and Performance of Page- and Object-Based DSMs. 687-693
- Peter Frey, Radharamanan Radhakrishnan: Optimistic Synchronization of Mixed-Mode Simulators. 694-699
Session 18: Scientific Simulation
- Jaspal Subhlok, Peter Steenkiste, James M. Stichnoth, Peter Lieu: Airshed Pollution Modeling: A Case Study in Application Development in an HPF Environment. 701-710
- Alex Rhomberg, Rolf Enzler, Markus Thaler, Gerhard Tröster: Design of a FEM Computation Engine for Real-Time Laparoscopic Surgery Simulation. 711-715
- Mark Bernd Kulaczewski, Howard Jay Siegel: SIMD and Mixed-Mode Implementations of a Visual Tracking Algorithm. 716-720
- John B. Pormann: The Implicit Pipeline Method. 721-725
- Timothy D. Davis, Edward W. Davis: Rendering Computer Animations on a Network of Workstations. 726-730
Session 19: Fault Tolerance
- Wei Shi, Pradip K. Srimani: Hyper Butterfly Network: A Scalable Optimally Fault Tolerant Architecture. 732-736
- K. Mahesh, G. Manimaran, C. Siva Ram Murthy, Arun K. Somani: Scheduling Algorithms Exploiting Spare Capacity and Tasks' Laxities for Fault Detection and Location in Real-Time Multiprocessor Systems. 737-741
- Behrooz Parhami, Chi-Hsiang Yeh: The Robust-Algorithm Approach to Fault Tolerance on Processor Arrays: Fault Models, Fault Diameter, and Basic Algorithms. 742-746
- Paul S. LeMahieu, Vasken Bohossian, Jehoshua Bruck: Fault-Tolerant Switched Local Area Networks. 747-751
Session 20: Performance and Debugging Tools
- Michael A. Frumkin, Robert Hood, Luis Lopez: Trace-Driven Debugging of Message Passing Programs. 753-762
- Ashis Tarafdar, Vijay K. Garg: Predicate Control for Active Debugging of Distributed Programs. 763-769
- Magnus Broberg, Lars Lundberg, Håkan Grahn: VPPB - A Visualization and Performance Prediction Tool for Multithreaded Solaris Programs. 770-776
- T. J. Godin, Michael J. Quinn, Cherri M. Pancake: Parallel Performance Visualization Using Moments of Utilization Data. 777-782
Session 21: Distributed Systems
- Henri E. Bal, Aske Plaat, Mirjam G. Bakker, Peter Dozy, Rutger F. H. Hofman: Optimizing Parallel Applications for Wide-Area Clusters. 784-790
- Frank Mueller: Prioritized Token-Based Mutual Exclusion for Distributed Systems. 791-795
- Nihar R. Mahapatra, Shantanu Dutt: Adaptive Quality Equalizing: High-Performance Load Balancing for Parallel Branch-and-Bound Across Applications and Computing Systems. 796-800
- Kasidit Chanchio, Xian-He Sun: Memory Space Representation for Heterogeneous Network Process Migration. 801-805















