


default search action
ISPASS 2024: Indianapolis, IN, USA
- IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2024, Indianapolis, IN, USA, May 5-7, 2024. IEEE 2024, ISBN 979-8-3503-7638-8

- Wim Heirman, Stijn Eyerman:

Message from the Program Chairs; ISPASS 2024. xii-xiii - Timothy Rogers:

Message from the General Chair; ISPASS 2024. xi - Erick Carvajal Barboza, Mahesh Ketkar, Paul Gratz, Jiang Hu:

Aiding Microprocessor Performance Validation with Machine Learning. 1-9 - Tanner Andrulis, Joel S. Emer, Vivienne Sze:

CiMLoop: A Flexible, Accurate, and Fast Compute-In-Memory Modeling Tool. 10-23 - Mohammadreza Rezvani, Ali Jahanshahi, Daniel Wong

:
Characterizing In-Kernel Observability of Latency-Sensitive Request-Level Metrics with eBPF. 24-35 - Xinyu Li, Yanzhi Lan, Gen Niu, Feng Xue, Fuxin Zhang:

BTBench: A Benchmark for Comprehensive Binary Translation Performance Evaluation. 36-47 - Marcelo Orenes-Vera, Esin Tureci, Margaret Martonosi, David Wentzlaff:

MuchiSim: A Simulation Framework for Design Exploration of Multi-Chip Manycore Systems. 48-60 - Negar Neda, Austin Ebel, Benedict Reynwar, Brandon Reagen

:
CiFlow: Dataflow Analysis and Optimization of Key Switching for Homomorphic Encryption. 61-72 - Victor Kariofillis, Natalie Enright Jerger:

Workload Characterization of Commercial Mobile Benchmark Suites. 73-84 - Yonghong Yan, Kewei Yan, Anjia Wang

:
RTune: Towards Automated and Coordinated Optimization of Computing and Computational Objectives of Parallel Iterative Applications. 85-95 - Abhishek Tyagi, Reiley Jeyapaul, Chuteng Zhou, Paul N. Whatmough, Yuhao Zhu:

Characterizing Soft-Error Resiliency in Arm's Ethos-U55 Embedded Machine Learning Accelerator. 96-108 - Md Sami Ul Islam Sami, Jingbo Zhou, Sujan Kumar Saha, Fahim Rahman, Farimah Farahmandi, Mark Tehranipoor:

SAP: Silicon Authentication Platform for System-on-Chip Supply Chain Vulnerabilities. 109-119 - Odysseas Chatzopoulos, Maria Trakosa, George Papadimitriou, Wing Shek Wong, Dimitris Gizopoulos:

SimPoint-Based Microarchitectural Hotspot & Energy-Efficiency Analysis of RISC-V OoO CPUs. 120-131 - Gabin Schieffer, Daniel Araújo de Medeiros, Jennifer Faj, Aniruddha Marathe, Ivy Peng:

On the Rise of AMD Matrix Cores: Performance, Power Efficiency, and Programmability. 132-143 - Puru Sharma

, Gary Goh Yipeng, Bin Gao, Longshen Ou, Dehui Lin, Deepak Sharma, Djordje Jevdjic
:
DNA Storage Toolkit: A Modular End-to-End DNA Data Storage Codec and Simulator. 144-155 - Davit Grigoryan, Yuan-Hsi Chou

, Tor M. Aamodt:
Zatel: Sample Complexity-Aware Scale-Model Simulation for Ray Tracing. 156-166 - Panagiotis Strikos, Ahsen Ejaz, Ioannis Sourdis:

BZSim: Fast, Large-Scale Microarchitectural Simulation with Detailed Interconnect Modeling. 167-178 - Johnson Umeike

, Siddharth Agarwal
, Nikita Lazarev, Mohammad Alian:
Userspace Networking in gem5. 179-191 - Kavya Sreedhar, Jason Clemons, Rangharajan Venkatesan, Stephen W. Keckler, Mark Horowitz:

Vision Transformer Computation and Resilience for Dynamic Inference. 192-204 - William Won

, Saeed Rashidi, Sudarshan Srinivasan, Tushar Krishna:
LIBRA: Enabling Workload-Aware Multi-Dimensional Network Topology Optimization for Distributed Training of Large AI Models. 205-216 - Kailash Gogineni, Sai Santosh Dayapule, Juan Gómez-Luna, Karthikeya Gogineni, Peng Wei, Tian Lan, Mohammad Sadrosadati, Onur Mutlu

, Guru Venkataramani:
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems. 217-229 - Sanyam Mehta, Anna Yue:

Forward to the Past: An Alternative to Hybrid CPU Design. 230-240 - Bagus Hanindhito

, Bhavesh Patel, Lizy K. John:
Bandwidth Characterization of DeepSpeed on Distributed Large Language Model Training. 241-256 - Alicia Golden, Samuel Hsia, Fei Sun, Bilge Acun, Basil Hosmer, Yejin Lee, Zachary DeVito, Jeff Johnson, Gu-Yeon Wei, David Brooks, Carole-Jean Wu:

Generative AI Beyond LLMs: System Implications of Multi-Modal Generation. 257-267 - Zishen Wan, Che-Kai Liu, Hanchen Yang, Ritik Raj

, Chaojian Li, Haoran You, Yonggan Fu, Cheng Wan, Ananda Samajdar, Yingyan Celine Lin, Tushar Krishna, Arijit Raychowdhury:
Towards Cognitive AI Systems: Workload and Characterization of Neuro-Symbolic AI. 268-279 - Chandra Irugalbandara, Ashish Mahendra, Roland Daynauth

, Tharuka Kasthuri Arachchige, Jayanaka L. Dantanarayana, Krisztián Flautner, Lingjia Tang, Yiping Kang, Jason Mars:
Scaling Down to Scale Up: A Cost-Benefit Analysis of Replacing OpenAI's LLM with Open Source SLMs in Production. 280-291 - Divya Kiran Kadiyala

, Saeed Rashidi, Taekyung Heo, Abhimanyu Bambhaniya
, Tushar Krishna, Alexandros Daglis:
Leveraging Memory Expansion to Accelerate Large-Scale DL Training. 292-294 - Pranab Dash, Y. Charlie Hu, Abhilash Jindal:

APGPM: Automated PMC-Based Power Modeling Methodology for Modern Mobile GPUs. 295-297 - Umer Shahid, Ayesha Ahmad, Shanzay Wasim:

Gem5-Based Evaluation of CVA6 SoC: Insights into the Architectural Design. 298-300 - Abenezer Wudenhe, Yu-Chia Liu, Chris Chen, Hung-Wei Tseng:

Accel-Bench: Exploring the Potential of Programming Using Hardware-Accelerated Functions. 301-303 - Debpratim Adak, Hyokeun Lee, Ben Feinberg, Gwendolyn Voskuilen, Clayton Hughes, Huiyang Zhou

, Amro Awad
:
SEFsim: A Statistically-Guided Fast DRAM Simulator. 304-306 - Tanner Andrulis, Gohar Irfan Chaudhry, Vinith M. Suriyakumar, Joel S. Emer, Vivienne Sze:

Architecture-Level Modeling of Photonic Deep Neural Network Accelerators. 307-309 - Joshua Suetterlein

, Stephen J. Young, Jesun Firoz, Joseph B. Manzano, Ryan D. Friese, Nathan R. Tallent, Kevin J. Barker, Timothy Stavenger:
Automatic Extraction of Network Configurations for Realistic Simulation and Validation. 310-312 - Kaifeng Xu, Georgios Tziantzioulis, David Wentzlaff:

MindPalace: A Framework for Studying Microarchitecture Design of Function-as-a-Service. 313-315 - Nikitha Karman, Kevin Wei, Dylan Scott, Natheesan Ratnasegar, Oguzhan Canpolat, Hieu Mai, Michael Ferdman:

Infrastructure for Exploring SIMT Architecture in General-Purpose Processors. 316-318 - Adrian Zhao

, Louis Zhang, Sankeerth Durvasula, Fan Chen, Nilesh Jain, Selvakumar Panneer, Nandita Vijaykumar:
Distributed Training of Neural Radiance Fields: A Performance Characterization. 319-321 - Shubhendra Pal Singhal, Akihiro Hayashi, Vivek Sarkar:

Bottleneck Scenarios in Use of the Conveyors Message Aggregation Library. 322-324 - Andreas Abel, Yuying Li, Richard O'Grady, Chris Kennelly, Darryl Gove:

A Profiling-Based Benchmark Suite for Warehouse-Scale Computers. 325-327 - Yuxin Qin, Dejice Jacob, Jeremy Singer:

Characterizing Dynamic Memory Behavior in WebAssembly Workloads. 328-330 - Lishan Yang, George Papadimitriou, Dimitrios Sartzetakis, Adwait Jog, Evgenia Smirni, Dimitris Gizopoulos:

Probing Weaknesses in GPU Reliability Assessment: A Cross-Layer Approach. 331-333

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














