Sai Qian Zhang
2020 – today
- 2024
- [j3] Xiaoyu Sun, Xiaochen Peng, Sai Qian Zhang, Jorge Gomez, Win-San Khwa, Syed Shakib Sarwar, Ziyun Li, Weidong Cao, Zhao Wang, Chiao Liu, Meng-Fan Chang, Barbara De Salvo, Kerem Akarvardar, H.-S. Philip Wong:
Estimating Power, Performance, and Area for On-Sensor Deployment of AR/VR Workloads Using an Analytical Framework. ACM Trans. Design Autom. Electr. Syst. 29(6): 1-27 (2024)
- [c33] Chao Gao, Sai Qian Zhang:
DLoRA: Distributed Parameter-Efficient Fine-Tuning Solution for Large Language Model. EMNLP (Findings) 2024: 13703-13714
- [c32] Sai Qian Zhang, Thierry Tambe, Nestor Cuevas, Gu-Yeon Wei, David Brooks:
CAMEL: Co-Designing AI Models and eDRAMs for Efficient On-Device Learning. HPCA 2024: 861-875
- [c31] Jieyu Lin, Minghao Li, Sai Qian Zhang, Alberto Leon-Garcia:
Murmuration: On-the-fly DNN Adaptation for SLO-Aware Distributed Inference in Dynamic Edge Environments. ICPP 2024: 792-801
- [c30] Tianhua Xia, Sai Qian Zhang:
Hyft: A Reconfigurable Softmax Accelerator with Hybrid Numeric Format for both Training and Inference. ISLPED 2024: 1-6
- [c29] Sai Qian Zhang, Thierry Tambe, Gu-Yeon Wei, David Brooks:
JointNF: Enhancing DNN Performance through Adaptive N:M Pruning across both Weight and Activation. ISLPED 2024: 1-6
- [c28] Jieyu Lin, Sai Qian Zhang, Alberto Leon-Garcia:
sLLM: Accelerating LLM Inference using Semantic Load Balancing with Shared Memory Data Structures. ISQED 2024: 1-6
- [c27] Sai Qian Zhang, Jieyu Lin, Qi Zhang, Yu-Jia Chen:
Learning Client Selection Strategy for Federated Learning across Heterogeneous Mobile Devices. ISQED 2024: 1-7
- [c26] Andre Nakkab, Sai Qian Zhang, Ramesh Karri, Siddharth Garg:
Rome was Not Built in a Single Step: Hierarchical Prompting for LLM-based Chip Design. MLCAD 2024: 26:1-26:11
- [c25] Wenshuo Peng, Kaipeng Zhang, Sai Qian Zhang:
T3M: Text Guided 3D Human Motion Synthesis from Speech. NAACL-HLT (Findings) 2024: 1168-1177
- [i26] Zeyu Han, Chao Gao, Jinyang Liu, Jeff Zhang, Sai Qian Zhang:
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey. CoRR abs/2403.14608 (2024)
- [i25] Chao Gao, Sai Qian Zhang:
DLoRA: Distributed Parameter-Efficient Fine-Tuning Solution for Large Language Model. CoRR abs/2404.05182 (2024)
- [i24] Wenxuan Liu, Sai Qian Zhang:
HQ-DiT: Efficient Diffusion Transformer with FP4 Hybrid Quantization. CoRR abs/2405.19751 (2024)
- [i23] Andre Nakkab, Sai Qian Zhang, Ramesh Karri, Siddharth Garg:
Rome was Not Built in a Single Step: Hierarchical Prompting for LLM-based Chip Design. CoRR abs/2407.18276 (2024)
- [i22] Wenshuo Peng, Kaipeng Zhang, Sai Qian Zhang:
T3M: Text Guided 3D Human Motion Synthesis from Speech. CoRR abs/2408.12885 (2024)
- [i21] Zhenyuan Dong, Sai Qian Zhang:
DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing. CoRR abs/2409.07756 (2024)
- [i20] Yiwei Zhao, Ziyun Li, Win-San Khwa, Xiaoyu Sun, Sai Qian Zhang, Syed Shakib Sarwar, Kleber Hugo Stangherlin, Yi-Lun Lu, Jorge Tomás Gómez, Jae-Sun Seo, Phillip B. Gibbons, Barbara De Salvo, Chiao Liu:
Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices. CoRR abs/2410.08326 (2024)
- [i19] Maximilian Augustin, Syed Shakib Sarwar, Mostafa Elhoushi, Sai Qian Zhang, Yuecheng Li, Barbara De Salvo:
PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context. CoRR abs/2410.17661 (2024)
- [i18] He-Yen Hsieh, Ziyun Li, Sai Qian Zhang, Wei-Te Mark Ting, Kao-Den Chang, Barbara De Salvo, Chiao Liu, H. T. Kung:
GazeGen: Gaze-Driven User Interaction for Visual Content Generation. CoRR abs/2411.04335 (2024)
- [i17] Jingyang Xiang, Sai Qian Zhang:
DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation. CoRR abs/2412.00648 (2024)
- [i16] Sai Qian Zhang, Ziyun Li, Chuan Guo, Saeed Mahloujifar, Deeksha Dangwal, G. Edward Suh, Barbara De Salvo, Chiao Liu:
Unlocking Visual Secrets: Inverting Features with Diffusion Priors for Image Reconstruction. CoRR abs/2412.10448 (2024)
- [i15] Wenxuan Liu, Monde Duinkharjav, Qi Sun, Sai Qian Zhang:
FovealNet: Advancing AI-Driven Gaze Tracking Solutions for Optimized Foveated Rendering System Performance in Virtual Reality. CoRR abs/2412.10456 (2024)
- 2023
- [i14] Sai Qian Zhang, Thierry Tambe, Nestor Cuevas, Gu-Yeon Wei, David Brooks:
CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning. CoRR abs/2305.03148 (2023)
- [i13] Tianhua Xia, Sai Qian Zhang:
Softmax Acceleration with Adaptive Numeric Format for both Training and Inference. CoRR abs/2311.13290 (2023)
- [i12] Yixuan Luo, Mengye Ren, Sai Qian Zhang:
BIM: Block-Wise Self-Supervised Learning with Masked Image Modeling. CoRR abs/2311.17218 (2023)
- 2022
- [c24] Sai Qian Zhang, Jieyu Lin, Qi Zhang:
A Multi-Agent Reinforcement Learning Approach for Efficient Client Selection in Federated Learning. AAAI 2022: 9091-9099
- [c23] Xin Dong, Sai Qian Zhang, Ang Li, H. T. Kung:
SphereFed: Hyperspherical Federated Learning. ECCV (26) 2022: 165-184
- [c22] Sai Qian Zhang, Bradley McDanel, H. T. Kung:
FAST: DNN Training Under Variable Precision Block Floating Point with Stochastic Rounding. HPCA 2022: 846-860
- [i11] Sai Qian Zhang, Jieyu Lin, Qi Zhang:
A Multi-agent Reinforcement Learning Approach for Efficient Client Selection in Federated Learning. CoRR abs/2201.02932 (2022)
- [i10] Xin Dong, Sai Qian Zhang, Ang Li, H. T. Kung:
SphereFed: Hyperspherical Federated Learning. CoRR abs/2207.09413 (2022)
- 2021
- [c21] Sai Qian Zhang, Bradley McDanel, H. T. Kung, Xin Dong:
Training for multi-resolution inference using reusable quantization terms. ASPLOS 2021: 845-860
- [c20] Bradley McDanel, Sai Qian Zhang, H. T. Kung:
Saturation RRAM Leveraging Bit-Level Sparsity Resulting from Term Quantization. ISCAS 2021: 1-5
- [i9] Sai Qian Zhang, Bradley McDanel, H. T. Kung:
FAST: DNN Training Under Variable Precision Block Floating Point with Stochastic Rounding. CoRR abs/2110.15456 (2021)
- 2020
- [c19] Yuhang Li, Xin Dong, Sai Qian Zhang, Haoli Bai, Yuanpeng Chen, Wei Wang:
RTN: Reparameterized Ternary Network. AAAI 2020: 4780-4787
- [c18] Sai Qian Zhang, Jieyu Lin, Qi Zhang:
Adaptive Distributed Convolutional Neural Network Inference at the Network Edge with ADCNN. ICPP 2020: 10:1-10:11
- [c17] Sai Qian Zhang, Qi Zhang, Jieyu Lin:
Succinct and Robust Multi-Agent Communication With Temporal Message Control. NeurIPS 2020
- [c16] Hsiang-Tsung Kung, Bradley McDanel, Sai Qian Zhang:
Term quantization: furthering quantization at run time. SC 2020: 96
- [c15] Jieyu Lin, Kristina Dzeparoska, Sai Qian Zhang, Alberto Leon-Garcia, Nicolas Papernot:
On the Robustness of Cooperative Multi-Agent Reinforcement Learning. SP (Workshops) 2020: 62-68
- [i8] Jieyu Lin, Kristina Dzeparoska, Sai Qian Zhang, Alberto Leon-Garcia, Nicolas Papernot:
On the Robustness of Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2003.03722 (2020)
- [i7] H. T. Kung, Bradley McDanel, Sai Qian Zhang:
Term Revealing: Furthering Quantization at Run Time on Quantized DNNs. CoRR abs/2007.06389 (2020)
- [i6] Sai Qian Zhang, Jieyu Lin, Qi Zhang:
Succinct and Robust Multi-Agent Communication With Temporal Message Control. CoRR abs/2010.14391 (2020)
2010 – 2019
- 2019
- [c14] H. T. Kung, Bradley McDanel, Sai Qian Zhang, Xin Dong, Chih-Chiang Chen:
Maestro: A Memory-on-Logic Architecture for Coordinated Parallel Use of Many Systolic Arrays. ASAP 2019: 42-50
- [c13] H. T. Kung, Bradley McDanel, Sai Qian Zhang:
Packing Sparse Convolutional Neural Networks for Efficient Systolic Array Implementations: Column Combining Under Joint Optimization. ASPLOS 2019: 821-834
- [c12] Bradley McDanel, Sai Qian Zhang, H. T. Kung, Xin Dong:
Full-stack optimization for accelerating CNNs using powers-of-two weights with FPGA validation. ICS 2019: 449-460
- [c11] H. T. Kung, Bradley McDanel, Sai Qian Zhang, C. T. Wang, Jin Cai, C. Y. Chen, Victor C. Y. Chang, M. F. Chen, Jack Yuan-Chen Sun, Douglas Yu:
Systolic Building Block for Logic-on-Logic 3D-IC Implementations of Convolutional Neural Networks. ISCAS 2019: 1-5
- [c10] Sai Qian Zhang, Qi Zhang, Jieyu Lin:
Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control. NeurIPS 2019: 3230-3239
- [i5] Bradley McDanel, Sai Qian Zhang, H. T. Kung, Xin Dong:
Full-stack Optimization for Accelerating CNNs with FPGA Validation. CoRR abs/1905.00462 (2019)
- [i4] Sai Qian Zhang, Qi Zhang, Jieyu Lin:
Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control. CoRR abs/1909.02682 (2019)
- [i3] Yuhang Li, Xin Dong, Sai Qian Zhang, Haoli Bai, Yuanpeng Chen, Wei Wang:
RTN: Reparameterized Ternary Network. CoRR abs/1912.02057 (2019)
- 2018
- [c9] H. T. Kung, Bradley McDanel, Sai Qian Zhang:
Adaptive Tiling: Applying Fixed-size Systolic Arrays To Sparse Convolutional Neural Networks. ICPR 2018: 1006-1011
- [c8] H. T. Kung, Bradley McDanel, Sai Qian Zhang:
Mapping Systolic Arrays onto 3D Circuit Structures: Accelerating Convolutional Neural Network Inference. SiPS 2018: 330-336
- [c7] Sai Qian Zhang, Feng Xue, Nageen Himayat, Shilpa Talwar, H. T. Kung:
A Machine Learning Assisted Cell Selection Method for Drones in Cellular Networks. SPAWC 2018: 1-5
- [i2] Sai Qian Zhang, H. T. Kung, Youngjune Gwon:
InferBeam: A Fast Beam Alignment Protocol for Millimeter-wave Networking. CoRR abs/1802.03373 (2018)
- [i1] H. T. Kung, Bradley McDanel, Sai Qian Zhang:
Packing Sparse Convolutional Neural Networks for Efficient Systolic Array Implementations: Column Combining Under Joint Optimization. CoRR abs/1811.04770 (2018)
- 2017
- [j2] Sai Qian Zhang, Qi Zhang, Ali Tizghadam, Byungchul Park, Hadi Bannazadeh, Raouf Boutaba, Alberto Leon-Garcia:
TCAM space-efficient routing in a software defined network. Comput. Networks 125: 26-40 (2017)
- 2016
- [c6] Sai Qian Zhang, Ali Tizghadam, Byungchul Park, Hadi Bannazadeh, Alberto Leon-Garcia:
Joint NFV placement and routing for multicast service on SDN. NOMS 2016: 333-341
- [c5] Sai Qian Zhang, Qi Zhang, Ali Tizghadam, Byungchul Park, Hadi Bannazadeh, Raouf Boutaba, Alberto Leon-Garcia:
Sector: TCAM Space Aware Routing on SDN. ITC 2016: 216-224
- 2015
- [j1] Sai Qian Zhang, Qi Zhang, Hadi Bannazadeh, Alberto Leon-Garcia:
Routing Algorithms for Network Function Virtualization Enabled Multicast Topology on SDN. IEEE Trans. Netw. Serv. Manag. 12(4): 580-594 (2015)
- [c4] Sai Qian Zhang, Qi Zhang, Hadi Bannazadeh, Alberto Leon-Garcia:
Network Function Virtualization enabled multicast routing on SDN. ICC 2015: 5595-5601
- [c3] Qi Zhang, Sai Qian Zhang, Alberto Leon-Garcia, Raouf Boutaba:
Aurora: Adaptive Block Replication in Distributed File Systems. ICDCS 2015: 442-451
- [c2] Sai Qian Zhang, Pouya Yasrebi, Ali Tizghadam, Hadi Bannazadeh, Alberto Leon-Garcia:
Fast Network Flow Resumption for Live Virtual Machine Migration on SDN. ICNP 2015: 446-452
- [c1] Qi Zhang, Sai Qian Zhang, Jieyu Lin, Hadi Bannazadeh, Alberto Leon-Garcia:
Kaleidoscope: Real-time content delivery in software defined infrastructures. IM 2015: 686-692
last updated on 2025-01-22 21:31 CET by the dblp team
all metadata released as open data under CC0 1.0 license