


Ke Hong
2020 – today
- 2025
[j4]Guohao Dai, Ke Hong, Qiuli Mao, Xiuhong Li, Jiaming Xu, Haofeng Huang, Hongtu Xia, Xuefei Ning, Shengen Yan, Yun Liang, Yu Wang:
FlashDecoding++Next: High Throughput LLM Inference With Latency and Memory Optimization. IEEE Trans. Computers 74(10): 3263-3276 (2025)
[j3]Yaoxiu Lian, Xinhao Yang, Ke Hong, Yu Wang, Ningyi Xu, Guohao Dai:
A Point Transformer Accelerator With Distribution-Aware Heuristic Distance Calculation. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 44(2): 751-764 (2025)
[c12]Shiyao Li, Yingchun Hu, Xuefei Ning, Xihui Liu, Ke Hong, Xiaotao Jia, Xiuhong Li, Yaqi Yan, Pei Ran, Guohao Dai, Shengen Yan, Huazhong Yang, Yu Wang:
MBQ: Modality-Balanced Quantization for Large Vision-Language Models. CVPR 2025: 4167-4177
[i9]Ke Hong, Xiuhong Li, Minxu Liu, Qiuli Mao, Tianqi Wu, Zixiao Huang, Lufang Chen, Zhong Wang, Yichong Zhang, Zhenhua Zhu, Guohao Dai, Yu Wang:
FlashOverlap: A Lightweight Design for Efficiently Overlapping Communication and Computation. CoRR abs/2504.19519 (2025)
[i8]Ke Hong, Lufang Chen, Zhong Wang, Xiuhong Li, Qiuli Mao, Jianping Ma, Chao Xiong, Guanyu Wu, Buhe Han, Guohao Dai, Yu Wang:
semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage. CoRR abs/2504.19867 (2025)
[i7]Tianchen Zhao, Ke Hong, Xinhao Yang, Xuefeng Xiao, Huixia Li, Feng Ling, Ruiqi Xie, Siqi Chen, Hongyu Zhu, Yichong Zhang, Yu Wang:
PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models. CoRR abs/2506.16054 (2025)
[i6]Yida Wang, Ke Hong, Xiuhong Li, Yuanchao Xu, Wenxun Wang, Guohao Dai, Yu Wang:
TASP: Topology-aware Sequence Parallelism. CoRR abs/2509.26541 (2025)
- 2024
[c11]Kai Zhong, Zhenhua Zhu, Guohao Dai, Hongyi Wang, Xinhao Yang, Haoyu Zhang, Jin Si, Qiuli Mao, Shulin Zeng, Ke Hong, Genghan Zhang, Huazhong Yang, Yu Wang:
FEASTA: A Flexible and Efficient Accelerator for Sparse Tensor Algebra in Machine Learning. ASPLOS (3) 2024: 349-366
[c10]Ke Hong, Guohao Dai, Jiaming Xu, Qiuli Mao, Xiuhong Li, Jun Liu, Kangdi Chen, Yuhan Dong, Yu Wang:
FlashDecoding++: Faster Large Language Model Inference with Asynchronization, Flat GEMM Optimization, and Heuristics. MLSys 2024
[i5]Zixuan Zhou, Xuefei Ning, Ke Hong, Tianyu Fu, Jiaming Xu, Shiyao Li, Yuming Lou, Luning Wang, Zhihang Yuan, Xiuhong Li, Shengen Yan, Guohao Dai, Xiao-Ping Zhang, Yuhan Dong, Yu Wang:
A Survey on Efficient Inference for Large Language Models. CoRR abs/2404.14294 (2024)
[i4]Shiyao Li, Yingchun Hu, Xuefei Ning, Xihui Liu, Ke Hong, Xiaotao Jia, Xiuhong Li, Yaqi Yan, Pei Ran, Guohao Dai, Shengen Yan, Huazhong Yang, Yu Wang:
MBQ: Modality-Balanced Quantization for Large Vision-Language Models. CoRR abs/2412.19509 (2024)
- 2023
[j2]Yaofeng Tu, Jiahao Niu, Dezheng Wang, Hong Gao, Jin Xu, Ke Hong, Fang Yang:
BDMasker: Dynamic Data Protection System for Open Big Data Environment. Int. J. Softw. Informatics 13(1): 87-115 (2023)
[c9]Haotian Tang, Shang Yang, Zhijian Liu, Ke Hong, Zhongming Yu, Xiuyu Li, Guohao Dai, Yu Wang, Song Han:
TorchSparse++: Efficient Point Cloud Engine. CVPR Workshops 2023: 202-209
[c8]Xinhao Yang, Tianyu Fu, Guohao Dai, Shulin Zeng, Kai Zhong, Ke Hong, Yu Wang:
An Efficient Accelerator for Point-based and Voxel-based Point Cloud Neural Networks. DAC 2023: 1-6
[c7]Yaoxiu Lian, Xinhao Yang, Ke Hong, Yu Wang, Guohao Dai, Ningyi Xu:
A Point Transformer Accelerator with Fine-Grained Pipelines and Distribution-Aware Dynamic FPS. ICCAD 2023: 1-9
[c6]Tianchen Zhao, Xuefei Ning, Ke Hong, Zhongyuan Qiu, Pu Lu, Yali Zhao, Linfeng Zhang, Lipu Zhou, Guohao Dai, Huazhong Yang, Yu Wang:
Ada3D : Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection. ICCV 2023: 17682-17692
[c5]Haotian Tang, Shang Yang, Zhijian Liu, Ke Hong, Zhongming Yu, Xiuyu Li, Guohao Dai, Yu Wang, Song Han:
TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs. MICRO 2023: 225-239
[c4]Ke Hong, Zhongming Yu, Guohao Dai, Xinhao Yang, Yaoxiu Lian, Zehao Liu, Ningyi Xu, Yuhan Dong, Yu Wang:
Exploiting Hardware Utilization and Adaptive Dataflow for Efficient Sparse Convolution in 3D Point Clouds. MLSys 2023
[i3]Tianchen Zhao, Xuefei Ning, Ke Hong, Zhongyuan Qiu, Pu Lu, Yali Zhao, Linfeng Zhang, Lipu Zhou, Guohao Dai, Huazhong Yang, Yu Wang:
Ada3D : Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection. CoRR abs/2307.08209 (2023)
[i2]Ke Hong, Guohao Dai, Jiaming Xu, Qiuli Mao, Xiuhong Li, Jun Liu, Kangdi Chen, Yuhan Dong, Yu Wang:
FlashDecoding++: Faster Large Language Model Inference on GPUs. CoRR abs/2311.01282 (2023)
[i1]Haotian Tang, Shang Yang, Zhijian Liu, Ke Hong, Zhongming Yu, Xiuyu Li, Guohao Dai, Yu Wang, Song Han:
TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs. CoRR abs/2311.12862 (2023)
- 2022
[j1]Ke Hong, Tianyu Wang, Junchen Liu, Yu Wang, Yuan Shen:
A Learning-Based AoA Estimation Method for Device-Free Localization. IEEE Commun. Lett. 26(6): 1264-1267 (2022)
2010 – 2019
- 2019
[b1]Ke Hong:
Performance, Security, and Safety Requirements Testing for Smart Systems Through Systematic Software Analysis. University of Michigan, USA, 2019
- 2013
[c3]Zhiqiang Ma, Ke Hong, Lin Gu:
VOLUME: Enable Large-Scale In-Memory Computation on Commodity Clusters. CloudCom (1) 2013: 56-63
[c2]Ke Hong, Shuo Yang, Zhiqiang Ma, Lin Gu:
A Synergy of the Wireless Sensor Network and the Data Center System. MASS 2013: 263-271
- 2012
[c1]Shuo Yang, Ke Hong, Lin Gu:
Poster Abstract: Involving a Sensor Network System in Core Datacenter Management Functions. ICCPS 2012: 235
last updated on 2025-11-22 05:08 CET by the dblp team
all metadata released as open data under CC0 1.0 license