default search action
21st ICS 2007: Seattle, Washington, USA
- Burton J. Smith:
Proceedings of the 21th Annual International Conference on Supercomputing, ICS 2007, Seattle, Washington, USA, June 17-21, 2007. ACM 2007, ISBN 978-1-59593-768-1 - Avi Mendelson:
Current trends in computer architectures: multi-cores, many-cores and special-cores. 1 - Craig B. Stunkel:
Harnessing massive parallelism in the era of parallelism for the masses. 2
Algorithms and applications I
- José E. Moreira, Maged M. Michael, Dilma Da Silva, Doron Shiloach, Parijat Dube, Li Zhang:
Scalability of the Nutch search engine. 3-12 - Cristian Coarfa, John M. Mellor-Crummey, Nathan Froyd, Yuri Dotsenko:
Scalability analysis of SPMD codes using expectations. 13-22
Runtime systems
- Arun Babu Nagarajan, Frank Mueller, Christian Engelmann, Stephen L. Scott:
Proactive fault tolerance for HPC with Xen virtualization. 23-32 - Manolis Marazakis, Vassilis Papaefstathiou, Angelos Bilas:
Optimization and bottleneck analysis of network block I/O in commodity storage systems. 33-42 - Shahaan Ayyub, David Abramson:
GridRod: a dynamic runtime scheduler for grid workflows. 43-52
Workload characterization
- Dror G. Feitelson:
Locality of sampling and diversity in parallel system workloads. 53-63 - Hui Li, Michael Muskulus, Lex Wolters:
Modeling correlated workloads by combining model based clustering and a localized sampling algorithm. 64-72 - Razvan Cheveresan, Matthew Ramsay, Chris Feucht, Ilya Sharapov:
Characteristics of workloads used in high performance and technical computing. 73-82
Algorithms and applications II
- Mehmet Belgin, Calvin J. Ribbens, Godmar Back:
An operation stacking framework for large ensemble computations. 83-92 - Mattan Erez, Jung Ho Ahn, Jayanth Gummaraju, Mendel Rosenblum, William J. Dally:
Executing irregular scientific applications on stream architectures. 93-104 - J. V. Sumanth, David R. Swanson, Hong Jiang:
A symmetric transformation for 3-body potential molecular dynamics using force-decomposition in a heterogeneous distributed environment. 105-115 - Peter Gottschling, David S. Wise, Michael D. Adams:
Representation-transparent matrix algorithms with scalable performance. 116-125
Architecture -- processor
- Jung Ho Ahn, Mattan Erez, William J. Dally:
Tradeoff between data-, instruction-, and thread-level parallelism in stream processors. 126-137 - Joseph J. Sharkey, Dmitry V. Ponomarev:
An L2-miss-driven early register deallocation for SMT processors. 138-147 - Perry H. Wang, Jamison D. Collins, Gautham N. Chinya, Bernard Lint, Asit Mallick, Koichi Yamada, Hong Wang:
Sequencer virtualization. 148-157
Message passing systems
- Wei-Yu Chen, Dan Bonachea, Costin Iancu, Katherine A. Yelick:
Automatic nonblocking communication for partitioned global address space programs. 158-167 - Ahmad Faraj, Pitch Patarasuk, Xin Yuan:
A study of process arrival patterns for MPI collective operations. 168-179 - Matthew J. Koop, Sayantan Sur, Qi Gao, Dhabaleswar K. Panda:
High performance MPI design using unreliable datagram for ultra-scale InfiniBand clusters. 180-189
Architecture -- memory hierarchy
- Ali-Reza Adl-Tabatabai, Anwar M. Ghuloum, Shobhit O. Kanaujia:
Compression in cache design. 190-201 - Jean Christophe Beyler, Philippe Clauss:
Performance driven data cache prefetching in a dynamic software optimization system. 202-209 - Tor M. Aamodt, Paul Chow:
Optimization of data prefetch helper threads with path-expression based statistical modeling. 210-221 - Prateek Pujara, Aneesh Aggarwal:
Increasing cache capacity through word filtering. 222-231
Architecture -- multiprocessor systems
- Zhen Fang, Lixin Zhang, John B. Carter, Ali Ibrahim, Michael A. Parker:
Active memory operations. 232-241 - Jichuan Chang, Gurindar S. Sohi:
Cooperative cache partitioning for chip multiprocessors. 242-252 - Shorin Kyo, Takuya Koga, Hanno Lieske, Shouhei Nomoto, Shin'ichiro Okazaki:
A low-cost mixed-mode parallel processor architecture for embedded systems. 253-262
Application optimization
- Silvius Vasile Rus, Maikel Pennings, Lawrence Rauchwerger:
Sensitivity analysis for automatic parallelization on multi-cores. 263-273 - Mohamed Khamiss Hussein, Kenneth R. Mayes, Mikel Luján, John R. Gurd:
Adaptive performance control for distributed scientific coupled models. 274-283 - Paolo D'Alberto, Alexandru Nicolau:
Adaptive Strassen's matrix multiplication. 284-292 - Ayaz Ali, S. Lennart Johnsson, Jaspal Subhlok:
Scheduling FFT computation on SMP and multicore systems. 293-301
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.