default search action
Ningyi Xu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j14]Wenjie Li, Aokun Hu, Ningyi Xu, Guanghui He:
CoDA: A Co-Design Framework for Versatile and Efficient Attention Accelerators. IEEE Trans. Computers 73(8): 1924-1938 (2024) - [j13]Wenjie Li, Aokun Hu, Ningyi Xu, Guanghui He:
A Precision-Scalable Deep Neural Network Accelerator With Activation Sparsity Exploitation. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 43(1): 263-276 (2024) - [j12]Jinming Zhang, Xi Fan, Yaoyao Ye, Xuyan Wang, Guojie Xiong, Xianglun Leng, Ningyi Xu, Yong Lian, Guanghui He:
INDM: Chiplet-Based Interconnect Network and Dataflow Mapping for DNN Accelerators. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 43(4): 1107-1120 (2024) - [j11]Wenjie Li, Aokun Hu, Ningyi Xu, Guanghui He:
Quantization and Hardware Architecture Co-Design for Matrix-Vector Multiplications of Large Language Models. IEEE Trans. Circuits Syst. I Regul. Pap. 71(6): 2858-2871 (2024) - [c41]Yijia Zhang, Sicheng Zhang, Shijie Cao, Dayou Du, Jianyu Wei, Ting Cao, Ningyi Xu:
AFPQ: Asymmetric Floating Point Quantization for LLMs. ACL (Findings) 2024: 28-36 - [c40]Dayou Du, Yijia Zhang, Shijie Cao, Jiaqi Guo, Ting Cao, Xiaowen Chu, Ningyi Xu:
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation. ACL (1) 2024: 102-116 - [i8]Dayou Du, Yijia Zhang, Shijie Cao, Jiaqi Guo, Ting Cao, Xiaowen Chu, Ningyi Xu:
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation. CoRR abs/2402.10631 (2024) - [i7]Zihao Liu, Xiaoyu Zhang, Guangwei Liu, Ji Zhao, Ningyi Xu:
Leveraging Enhanced Queries of Point Sets for Vectorized Map Construction. CoRR abs/2402.17430 (2024) - [i6]Shaoting Zhu, Derun Li, Yong Liu, Ningyi Xu, Hang Zhao:
Cross Anything: General Quadruped Robot Navigation through Complex Terrains. CoRR abs/2407.16412 (2024) - 2023
- [j10]Ye Liu, Fei Wu, Neng Zhao, Qirong Zhang, Wenqiang Wang, Yutong Yang, Xiangting Li, Sixu Li, Zili Huang, Shuang Hao, Guangbin Ou, Liang Zhou, Liang Chang, Shuisheng Lin, Ningyi Xu, Jun Zhou:
NVP: A Flexible and Efficient Processor Architecture for Accelerating Diverse Computer Vision Tasks including DNN. IEEE Trans. Circuits Syst. II Express Briefs 70(1): 271-275 (2023) - [j9]Wenjie Li, Aokun Hu, Gang Wang, Ningyi Xu, Guanghui He:
Low-Complexity Precision-Scalable Multiply-Accumulate Unit Architectures for Deep Neural Network Accelerators. IEEE Trans. Circuits Syst. II Express Briefs 70(4): 1610-1614 (2023) - [c39]Dongxu Lyu, Zhenyu Li, Yuzhou Chen, Ningyi Xu, Guanghui He:
FLNA: An Energy-Efficient Point Cloud Feature Learning Accelerator with Dataflow Decoupling. DAC 2023: 1-6 - [c38]Zhican Wang, Gang Wang, Honglan Jiang, Ningyi Xu, Guanghui He:
COSA:Co-Operative Systolic Arrays for Multi-head Attention Mechanism in Neural Network using Hybrid Data Reuse and Fusion Methodologies. DAC 2023: 1-6 - [c37]Yijia Zhang, Yibo Han, Shijie Cao, Guohao Dai, Youshan Miao, Ting Cao, Fan Yang, Ningyi Xu:
Adam Accumulation to Reduce Memory Footprints of Both Activations and Gradients for Large-Scale DNN Training. ECAI 2023: 3058-3065 - [c36]Yaoxiu Lian, Xinhao Yang, Ke Hong, Yu Wang, Guohao Dai, Ningyi Xu:
A Point Transformer Accelerator with Fine-Grained Pipelines and Distribution-Aware Dynamic FPS. ICCAD 2023: 1-9 - [c35]Dongxu Lyu, Zhenyu Li, Yuzhou Chen, Jinming Zhang, Ningyi Xu, Guanghui He:
SpOctA: A 3D Sparse Convolution Accelerator with Octree-Encoding-Based Map Search and Inherent Sparsity-Aware Processing. ICCAD 2023: 1-9 - [c34]Ke Hong, Zhongming Yu, Guohao Dai, Xinhao Yang, Yaoxiu Lian, Zehao Liu, Ningyi Xu, Yuhan Dong, Yu Wang:
Exploiting Hardware Utilization and Adaptive Dataflow for Efficient Sparse Convolution in 3D Point Clouds. MLSys 2023 - [c33]Weijie Luo, Zihao Liu, Guohao Dai, Ningyi Xu:
History-Detr: Optimize Query Initialization Strategy by Using Historical Information and Kinematics. MMAsia 2023: 4:1-4:7 - [i5]Yijia Zhang, Lingran Zhao, Shijie Cao, Wenqiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu:
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models. CoRR abs/2305.12356 (2023) - [i4]Yijia Zhang, Yibo Han, Shijie Cao, Guohao Dai, Youshan Miao, Ting Cao, Fan Yang, Ningyi Xu:
Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training. CoRR abs/2305.19982 (2023) - [i3]Dongxu Lyu, Zhenyu Li, Yuzhou Chen, Jinming Zhang, Ningyi Xu, Guanghui He:
SpOctA: A 3D Sparse Convolution Accelerator with Octree-Encoding-Based Map Search and Inherent Sparsity-Aware Processing. CoRR abs/2308.09249 (2023) - [i2]Qiao Sun, Shiduo Zhang, Danjiao Ma, Jingzhe Shi, Derun Li, Simian Luo, Yu Wang, Ningyi Xu, Guangzhi Cao, Hang Zhao:
Large Trajectory Models are Scalable Motion Predictors and Planners. CoRR abs/2310.19620 (2023) - [i1]Yijia Zhang, Sicheng Zhang, Shijie Cao, Dayou Du, Jianyu Wei, Ting Cao, Ningyi Xu:
AFPQ: Asymmetric Floating Point Quantization for LLMs. CoRR abs/2311.01792 (2023) - 2022
- [j8]Wenjie Li, Ningyi Xu, Runsheng Wang, Guanghui He:
Efficient Compression Methods for Wire-Spread-Based Stochastic Computing Deep Neural Networks. IEEE Trans. Circuits Syst. II Express Briefs 69(11): 4538-4542 (2022) - 2021
- [c32]Xi Fan, Xuyan Wang, Yaoyao Ye, Xianglun Leng, Ningyi Xu, Guanghui He:
CCASM: A Computation- and Communication-Aware Scheduling and Mapping Algorithm for NoC-Based DNN Accelerators. ASICON 2021: 1-4 - [c31]Jinming Zhang, Lifu Cheng, Cen Li, Yongfu Li, Guanghui He, Ningyi Xu, Yong Lian:
A Low-Latency FPGA Implementation for Real-Time Object Detection. ISCAS 2021: 1-5 - 2020
- [j7]Mingxuan Li, Yue Wang, Yonghui Liu, Ningyi Xu, Sirui Shu, Wanjun Lei:
Enhanced Power Decoupling Strategy for Virtual Synchronous Generator. IEEE Access 8: 73601-73613 (2020) - [j6]Yijin Guan, Guangyu Sun, Zhihang Yuan, Xingchen Li, Ningyi Xu, Shu Chen, Jason Cong, Yuan Xie:
Crane: Mitigating Accelerator Under-utilization Caused by Sparsity Irregularities in CNNs. IEEE Trans. Computers 69(7): 931-943 (2020)
2010 – 2019
- 2019
- [j5]Shijie Cao, Lanshun Nie, De-chen Zhan, Wenqiang Wang, Ningyi Xu, Ramashis Das, Ming Wu, Lintao Zhang, Derek Chiou:
FlexSaaS: A Reconfigurable Accelerator for Web Search Selection. ACM Trans. Reconfigurable Technol. Syst. 12(1): 5:1-5:20 (2019) - 2017
- [c30]Yijin Guan, Ningyi Xu, Chen Zhang, Zhihang Yuan, Jason Cong:
Using Data Compression for Optimizing FPGA-Based Convolutional Neural Network Accelerators. APPT 2017: 14-26 - [c29]Jiansong Zhang, Yongqiang Xiong, Ningyi Xu, Ran Shu, Bojie Li, Peng Cheng, Guo Chen, Thomas Moscibroda:
The Feniks FPGA Operating System for Cloud Computing. APSys 2017: 22:1-22:7 - [c28]Yijin Guan, Hao Liang, Ningyi Xu, Wenqiang Wang, Shaoshuai Shi, Xi Chen, Guangyu Sun, Wei Zhang, Jason Cong:
FP-DNN: An Automated Framework for Mapping Deep Neural Networks onto FPGAs with RTL-HLS Hybrid Templates. FCCM 2017: 152-159 - [c27]Guohao Dai, Tianhao Huang, Yuze Chi, Ningyi Xu, Yu Wang, Huazhong Yang:
ForeGraph: Exploring Large-scale Graph Processing on Multi-FPGA Architecture. FPGA 2017: 217-226 - [c26]Xi Chen, Xiaolin Hu, Hucheng Zhou, Ningyi Xu:
FxpNet: Training a deep convolutional neural network in fixed-point representation. IJCNN 2017: 2494-2501 - 2016
- [c25]Jiantao Qiu, Jie Wang, Song Yao, Kaiyuan Guo, Boxun Li, Erjin Zhou, Jincheng Yu, Tianqi Tang, Ningyi Xu, Sen Song, Yu Wang, Huazhong Yang:
Going Deeper with Embedded FPGA Platform for Convolutional Neural Network. FPGA 2016: 26-35 - [c24]Bojie Li, Kun Tan, Layong Larry Luo, Yanqing Peng, Renqian Luo, Ningyi Xu, Yongqiang Xiong, Peng Cheng:
ClickNP: Highly flexible and High-performance Network Processing with Reconfigurable Hardware. SIGCOMM 2016: 1-14 - 2015
- [j4]Wenqiang Wang, Jing Yan, Ningyi Xu, Yu Wang, Feng-Hsiung Hsu:
Real-Time High-Quality Stereo Vision System in FPGA. IEEE Trans. Circuits Syst. Video Technol. 25(10): 1696-1708 (2015) - 2014
- [c23]Yu Wang, Boxun Li, Rong Luo, Yiran Chen, Ningyi Xu, Huazhong Yang:
Energy efficient neural networks for big data analytics. DATE 2014: 1-2 - [c22]Boxun Li, Erjin Zhou, Bo Huang, Jiayi Duan, Yu Wang, Ningyi Xu, Jiaxing Zhang, Huazhong Yang:
Large scale recurrent neural network on GPU. IJCNN 2014: 4062-4069 - 2013
- [c21]Wenqiang Wang, Jing Yan, Ning-Yi Xu, Yu Wang, Feng-Hsiung Hsu:
Real-time high-quality stereo vision system in FPGA. FPT 2013: 358-361 - 2012
- [c20]Jing Yan, Zhanxiang Zhao, Ning-Yi Xu, Xi Jin, Lin-Tao Zhang, Feng-Hsiung Hsu:
Efficient Query Processing for Web Search Engine with FPGAs. FCCM 2012: 97-100 - [c19]Huiling Chen, Guoqing Zhao, Ningyi Xu:
The Analysis of Research Hotspots and Fronts of Knowledge Visualization Based on CiteSpace II. ICHL 2012: 57-68 - [c18]Ningyi Xu, Guoqing Zhao, Huiling Chen, Leisi Pei:
The Colored Concept Map and Its Application in Learning Assistance Program. ICHL 2012: 198-209 - [c17]Mo Xu, Xiaorui Zhang, Yu Wang, Ling Ren, Ziyu Wen, Yi Xu, Gaolang Gong, Ningyi Xu, Huazhong Yang:
Probabilistic Brain Fiber Tractography on GPUs. IPDPS Workshops 2012: 742-751 - 2011
- [j3]Jing Yan, Ning-Yi Xu, Xiongfei Cai, Rui Gao, Yu Wang, Rong Luo, Feng-Hsiung Hsu:
An FPGA-based accelerator for LambdaRank in Web search engines. ACM Trans. Reconfigurable Technol. Syst. 4(3): 25:1-25:19 (2011) - [c16]Tianji Wu, Di Wu, Yu Wang, Xiaorui Zhang, Hong Luo, Ningyi Xu, Huazhong Yang:
Gemma in April: A matrix-like parallel programming architecture on OpenCL. DATE 2011: 703-708 - [c15]Yu Wang, Mo Xu, Ling Ren, Xiaorui Zhang, Di Wu, Yong He, Ningyi Xu, Huazhong Yang:
A heterogeneous accelerator platform for multi-subject voxel-based brain network analysis. ICCAD 2011: 339-344 - 2010
- [c14]Yi Shan, Bo Wang, Jing Yan, Yu Wang, Ningyi Xu, Huazhong Yang:
FPMR: MapReduce framework on FPGA. FPGA 2010: 93-102 - [c13]Jing Yan, Ningyi Xu, Xiongfei Cai, Rui Gao, Yu Wang, Rong Luo, Feng-Hsiung Hsu:
LambdaRank acceleration for relevance ranking in web search engines (abstract only). FPGA 2010: 285 - [c12]Jing Yan, Ningyi Xu, Zenglin Xia, Rong Luo, Feng-Hsiung Hsu:
A compression method for inverted index and its FPGA-based decompression solution. FPT 2010: 261-264 - [c11]Di Wu, Tianji Wu, Yi Shan, Yu Wang, Yong He, Ningyi Xu, Huazhong Yang:
Making Human Connectome Faster: GPU Acceleration of Brain Network Analysis. ICPADS 2010: 593-600 - [c10]Tianji Wu, Bo Wang, Yi Shan, Feng Yan, Yu Wang, Ningyi Xu:
Efficient PageRank and SpMV Computation on AMD GPUs. ICPP 2010: 81-89 - [c9]Yi Shan, Tianji Wu, Yu Wang, Bo Wang, Zilong Wang, Ningyi Xu, Huazhong Yang:
FPGA and GPU implementation of large scale SpMV. SASP 2010: 64-70
2000 – 2009
- 2009
- [j2]Ningyi Xu, Xiongfei Cai, Rui Gao, Lei Zhang, Feng-Hsiung Hsu:
FPGA Acceleration of RankBoost in Web Search Engines. ACM Trans. Reconfigurable Technol. Syst. 1(4): 19:1-19:19 (2009) - [c8]Jing Yan, Rong Luo, Rui Gao, Ningyi Xu:
An Efficient Lossless Compression Method for Internet Search Data in Hardware Accelerators. CSIE (3) 2009: 453-457 - [c7]Jing Yan, Ningyi Xu, Xiongfei Cai, Rui Gao, Yu Wang, Rong Luo, Feng-Hsiung Hsu:
FPGA-based acceleration of neural network for ranking in web search engine with a streaming architecture. FPL 2009: 662-665 - [c6]Bo Wang, Tianji Wu, Feng Yan, Ruirui Li, Ningyi Xu, Yu Wang:
RankBoost Acceleration on both NVIDIA CUDA and ATI Stream Platforms. ICPADS 2009: 284-291 - [c5]Ji-Yong Shin, Zenglin Xia, Ning-Yi Xu, Rui Gao, Xiongfei Cai, Seungryoul Maeng, Feng-Hsiung Hsu:
FTL design exploration in reconfigurable high-performance SSD for server applications. ICS 2009: 338-349 - [c4]Feng Yan, Ningyi Xu, Yuan (Alan) Qi:
Parallel Inference for Latent Dirichlet Allocation on Graphics Processing Units. NIPS 2009: 2134-2142 - 2008
- [c3]Zhijun Li, Ning-Yi Xu, Feng-Hsiung Hsu, Xiongfei Cai, Rui Gao, Zenglin Xia:
Distributed RankBoost Acceleration Using FPGA and MPI for Web Relevance Ranking. ICPADS 2008: 35-42 - 2007
- [c2]Ning-Yi Xu, Xiongfei Cai, Rui Gao, Lei Zhang, Feng-Hsiung Hsu:
FPGA-based Accelerator Design for RankBoost in Web Search Engines. FPT 2007: 33-40 - 2006
- [j1]Guanghui He, Ningyi Xu, Wei Yu, Zucheng Zhou:
A single receiving chip for DVB data broadcasting system. IEEE Trans. Consumer Electron. 52(3): 1084-1091 (2006) - 2005
- [c1]Ningyi Xu, Shaohua Li, Wei Yu, Guanghui He, Hao Zhang, Fei Luo, Zucheng Zhou:
The design and implementation of a DVB receiving chip with PCI interface. ASP-DAC 2005: 15-16
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-26 01:54 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint