


default search action
32nd ASAP 2021: Virtual Event, USA
- 32nd IEEE International Conference on Application-specific Systems, Architectures and Processors, ASAP 2021, Virtual Conference, USA, July 7-9, 2021. IEEE 2021, ISBN 978-1-6654-2701-2

- Luca Bertaccini, Luca Benini, Francesco Conti:

To Buffer, or Not to Buffer? A Case Study on FFT Accelerators for Ultra-Low-Power Multicore Clusters. 1-8 - Yu Wang, Peng Li:

Algorithm and Hardware Co-Design for FPGA Acceleration of Hamiltonian Monte Carlo Based No-U-Turn Sampler. 9-16 - Shihao Song, Twisha Titirsha, Anup Das

:
Improving Inference Lifetime of Neuromorphic Systems via Intelligent Synapse Mapping. 17-24 - Ben Marshall, Daniel Page, Thinh Hung Pham

:
A lightweight ISE for ChaCha on RISC-V. 25-32 - Dong Wen, Jingfei Jiang, Jinwei Xu, Kang Wang, Tao Xiao, Yang Zhao, Yong Dou:

RFC-HyPGCN: A Runtime Sparse Feature Compress Accelerator for Skeleton-Based GCNs Action Recognition Model with Hybrid Pruning. 33-40 - Tomohiro Ueno

, Atsushi Koshiba, Kentaro Sano:
Virtual Circuit-Switching Network with Flexible Topology for High-Performance FPGA Cluster. 41-48 - Vimal Chacko, Jason Helge Anderson:

Power, Performance and Area Consequences of Multi-Context Support in CGRAs. 49-52 - Lukas Sommer, Michael Halkenhäuser

, Cristian Axenie
, Andreas Koch:
SPNC: Accelerating Sum-Product Network Inference on CPUs and GPUs. 53-56 - Grant Brown, Valerio Tenace, Pierre-Emmanuel Gaillardon:

NEMO-CNN: An Efficient Near-Memory Accelerator for Convolutional Neural Networks. 57-60 - Yu Qian, Baolei Cheng, Jianxi Fan, Yifeng Wang

, Ruofan Jiang:
Edge-disjoint spanning trees in the line graph of hypercubes. 61-64 - Shihang Wang, Jianghan Zhu, Qi Wang, Can He, Terry Tao Ye

:
Customized Instruction on RISC-V for Winograd-Based Convolution Acceleration. 65-68 - Yanpeng Cao

, Chengcheng Wang, Changjun Song, Yongming Tang, He Li:
Real-Time Super-Resolution System of 4K-Video Based on Deep Learning. 69-76 - Mingshuo Liu, Shiyi Luo, Kevin Han, Bo Yuan, Ronald F. DeMara, Yu Bai:

An Efficient Real-Time Object Detection Framework on Resource-Constricted Hardware Devices via Software and Hardware Co-design. 77-84 - Hongwu Peng

, Shanglin Zhou, Scott Weitze, Jiaxin Li, Sahidul Islam
, Tong Geng, Ang Li, Wei Zhang, Minghu Song, Mimi Xie, Hang Liu, Caiwen Ding:
Binary Complex Neural Network Acceleration on FPGA : (Invited Paper). 85-92 - Stylianos I. Venieris, Ioannis Panopoulos, Ilias Leontiadis, Iakovos S. Venieris:

How to Reach Real-Time AI on Consumer Devices? Solutions for Programmable and Custom Architectures. 93-100 - Yuanjia Xu, Heng Wu, Wenbo Zhang, Chen Yang, Yuewen Wu, Heran Gao, Tao Wang:

Talos: A Weighted Speedup-Aware Device Placement of Deep Learning Models. 101-108 - Masashi Ogaki, Yukinori Sato

:
Hodgkin-Huxley-Based Neural Simulation with Networks Connecting to Near-Neighbor Neurons. 109-116 - Zhiqiang Que, Erwei Wang, Umar Marikar, Eric A. Moreno

, Jennifer Ngadiuba, Hamza Javed, Bartlomiej Borzyszkowski
, Thea Aarrestad, Vladimir Loncar, Sioni Summers, Maurizio Pierini, Peter Y. K. Cheung, Wayne Luk:
Accelerating Recurrent Neural Networks for Gravitational Wave Experiments. 117-124 - Krishna Teja Chitty-Venkata

, Arun K. Somani:
Array-Aware Neural Architecture Search. 125-132 - Hyunmin Jeong, Deming Chen:

TwinDNN: A Tale of Two Deep Neural Networks. 133-140 - Liping Zhang, Qin Lu:

Image caption generation method based on an interaction mechanism and scene concept selection module. 141-148 - Cheng Tan

, Nicolas Bohm Agostini
, Jeff Zhang, Marco Minutoli
, Vito Giovanni Castellana, Chenhao Xie
, Tong Geng, Ang Li, Kevin J. Barker
, Antonino Tumeo:
OpenCGRA: Democratizing Coarse-Grained Reconfigurable Arrays. 149-155 - Jason Helge Anderson, Rami Beidas, Vimal Chacko, Hsuan Hsiao, Xiaoyi Ling, Omar Ragheb, Xinyuan Wang, Tianyi Yu:

CGRA-ME: An Open-Source Framework for CGRA Architecture and CAD Research : (Invited Paper). 156-162 - Rogério Paludo, Leonel Sousa:

Number Theoretic Transform Architecture suitable to Lattice-based Fully-Homomorphic Encryption. 163-170 - Sukarn Agarwal

, Shounak Chakraborty
:
ABACa: Access Based Allocation on Set Wise Multi-Retention in STT-RAM Last Level Cache. 171-174 - Xiao Hu, Minghao Li

, Jing Tian, Zhongfeng Wang:
DARM: A Low-Complexity and Fast Modular Multiplier for Lattice-Based Cryptography. 175-178 - Thinh Hung Pham

, Ben Marshall, Alexander Fell
, Siew-Kei Lam, Daniel Page:
XDIVINSA: eXtended DIVersifying INStruction Agent to Mitigate Power Side-Channel Leakage. 179-186 - Jishnu Banerjee, Sahidul Islam

, Wei Wei
, Chen Pan, Dakai Zhu, Mimi Xie:
Memory-aware Efficient Deep Learning Mechanism for IoT Devices. 187-194 - Ben Li, Jingweijia Tan, Kaige Yan:

AERO: Towards Energy-Efficient Autonomous Flight in MAVs Using Approximate Execution. 196-202 - Mingjian Sun, Yuan Li, Song Chen, Yi Kang:

A Low Power Branch Prediction for Deep Learning on RISC-V Processor. 203-206 - Huanwen Zhang, Yan Wang, Jianxi Fan, Ruyan Guo:

Parallel Construction of Independent Spanning Trees on Folded Crossed Cubes. 207-210 - Sizhe Zhang

, Ruixuan Wang, Jeff Jun Zhang, Abbas Rahimi
, Xun Jiao:
Assessing Robustness of Hyperdimensional Computing Against Errors in Associative Memory : (Invited Paper). 211-217 - Jeff Jun Zhang, Nicolas Bohm Agostini

, Shihao Song, Cheng Tan
, Ankur Limaye
, Vinay Amatya, Joseph B. Manzano
, Marco Minutoli
, Vito Giovanni Castellana, Antonino Tumeo, Gu-Yeon Wei, David Brooks:
Towards Automatic and Agile AI/ML Accelerator Design with End-to-End Synthesis. 218-225 - Yoshiki Fujiwara, Shinya Takamaeda-Yamazaki:

ASBNN: Acceleration of Bayesian Convolutional Neural Networks by Algorithm-hardware Co-design. 226-233 - Yuchen Qiu, Chao Xiao, LingHui Peng, Junhui Wang, Ziyang Kang

, Shiming Li, Lei Wang:
A Novel Ring-based Small-World NoC for Neuromorphic Processor. 234-241 - Xinyuan Wang, Tianyi Yu, Hsuan Hsiao, Jason Helge Anderson:

Double-Pumping the Interconnect for Area Reduction in Coarse-Grained Reconfigurable Arrays. 242-249 - Murad Qasaimeh, Joseph Zambreno, Phillip H. Jones:

An Efficient Hardware Architecture for Sparse Convolution using Linear Feedback Shift Registers. 250-257 - Xinheng Liu, Yao Chen

, Cong Hao, Ashutosh Dhar, Deming Chen:
WinoCNN: Kernel Sharing Winograd Systolic Array for Efficient Convolutional Neural Network Acceleration on FPGAs. 258-265 - En-Yu Yang, Tianyu Jia, David Brooks, Gu-Yeon Wei:

FlexACC: A Programmable Accelerator with Application-Specific ISA for Flexible Deep Neural Network Inference. 266-273

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














