Jidong Zhai
2020 – today
- 2025
- [j55]Yuyang Jin, Haojie Wang, Xiongchao Tang, Zhenhua Guo, Yaqian Zhao, Torsten Hoefler, Tao Liu, Xu Liu, Jidong Zhai:
Leveraging Graph Analysis to Pinpoint Root Causes of Scalability Issues for Parallel Applications. IEEE Trans. Parallel Distributed Syst. 36(2): 308-325 (2025) - [j54]Da-Ming Zhao, Jiantao Zhou, Jidong Zhai, Keqin Li:
A Reinforcement Learning Based Framework for Holistic Energy Optimization of Sustainable Cloud Data Centers. IEEE Trans. Serv. Comput. 18(1): 15-28 (2025) - 2024
- [j53]Jianbin Fang, Jidong Zhai, Zheng Wang:
Editorial for the special issue on programming models and system software for High-Performance Computing (HPC) environments. CCF Trans. High Perform. Comput. 6(3): 241-242 (2024) - [j52]Kezhao Huang, Haitian Jiang, Minjie Wang, Guangxuan Xiao, David Wipf, Xiang Song, Quan Gan, Zengfeng Huang, Jidong Zhai, Zheng Zhang:
FreshGNN: Reducing Memory Access via Stable Historical Embeddings for Graph Neural Network Training. Proc. VLDB Endow. 17(6): 1473-1486 (2024) - [j51]Weitao Wan, Feng Zhang, Chenyang Zhang, Mingde Zhang, Jidong Zhai, Yunpeng Chai, Huanchen Zhang, Wei Lu, Yuxing Chen, Haixiang Li, Anqun Pan, Xiaoyong Du:
Compressed Data Direct Computing for Databases. IEEE Trans. Knowl. Data Eng. 36(5): 1902-1918 (2024) - [j50]Jiesong Liu, Feng Zhang, Lv Lu, Chang Qi, Xiaoguang Guo, Dong Deng, Guoliang Li, Huanchen Zhang, Jidong Zhai, Hechen Zhang, Yuxing Chen, Anqun Pan, Xiaoyong Du:
G-Learned Index: Enabling Efficient Learned Index on GPU. IEEE Trans. Parallel Distributed Syst. 35(6): 795-812 (2024) - [j49]Yuyang Jin, Haojie Wang, Runxin Zhong, Chen Zhang, Xia Liao, Feng Zhang, Jidong Zhai:
Graph-Centric Performance Analysis for Large-Scale Parallel Applications. IEEE Trans. Parallel Distributed Syst. 35(7): 1221-1238 (2024) - [j48]Yuyang Jin, Runxin Zhong, Saiqin Long, Jidong Zhai:
Efficient Inference for Pruned CNN Models on Mobile Devices With Holistic Sparsity Alignment. IEEE Trans. Parallel Distributed Syst. 35(11): 2208-2223 (2024) - [j47]Liang Wang, Jinzhe Yang, Jidong Zhai, Guangwen Yang:
Optimizing I/O Performance Through Effective vCPU Scheduling Interference Management. IEEE Trans. Parallel Distributed Syst. 35(12): 2315-2330 (2024) - [j46]Zhenhua Guo, Yinan Tang, Jidong Zhai, Tongtong Yuan, Jian Jin, Li Wang, Yaqian Zhao, Rengang Li:
A Survey on Performance Modeling and Prediction for Distributed DNN Training. IEEE Trans. Parallel Distributed Syst. 35(12): 2463-2478 (2024) - [c75]Muyan Hu, Ashwin Venkatram, Shreyashri Biswas, Balamurugan Marimuthu, Bohan Hou, Gabriele Oliaro, Haojie Wang, Liyan Zheng, Xupeng Miao, Jidong Zhai, Zhihao Jia:
Optimal Kernel Orchestration for Tensor Programs with Korch. ASPLOS (3) 2024: 755-769 - [c74]Kezhao Huang, Jidong Zhai, Liyan Zheng, Haojie Wang, Yuyang Jin, Qihao Zhang, Runqing Zhang, Zhen Zheng, Youngmin Yi, Xipeng Shen:
WiseGraph: Optimizing GNN with Joint Workload Partition of Graph and Operations. EuroSys 2024: 1-17 - [c73]Yanliang Zhou, Feng Zhang, Tuo Lin, Yuanjie Huang, Saiqin Long, Jidong Zhai, Xiaoyong Du:
F-TADOC: FPGA-Based Text Analytics Directly on Compression with HLS. ICDE 2024: 3739-3752 - [c72]Jiaao He, Shengqi Chen, Jidong Zhai:
POSTER: Pattern-Aware Sparse Communication for Scalable Recommendation Model Training. PPoPP 2024: 466-468 - [c71]Yidong Chen, Chen Zhang, Rongchao Dong, Haoyuan Zhang, Yonghua Zhang, Zhonghua Lu, Jidong Zhai:
MixQ: Taming Dynamic Outliers in Mixed-Precision Quantization by Online Prediction. SC 2024: 74 - [c70]Kinman Lei, Yuyang Jin, Mingshu Zhai, Kezhao Huang, Haoxing Ye, Jidong Zhai:
PUZZLE: Efficiently Aligning Large Language Models through Light-Weight Context Switch. USENIX ATC 2024: 127-140 - [c69]Chen Zhang, Rongchao Dong, Haojie Wang, Runxin Zhong, Jike Chen, Jidong Zhai:
MAGPY: Compiling Eager Mode DNN Programs by Monitoring Execution States. USENIX ATC 2024: 683-698 - [i15]Jiaao He, Jidong Zhai:
FastDecode: High-Throughput GPU-Efficient LLM Serving using Heterogeneous Pipelines. CoRR abs/2403.11421 (2024) - [i14]Muyan Hu, Ashwin Venkatram, Shreyashri Biswas, Balamurugan Marimuthu, Bohan Hou, Gabriele Oliaro, Haojie Wang, Liyan Zheng, Xupeng Miao, Jidong Zhai:
Optimal Kernel Orchestration for Tensor Programs with Korch. CoRR abs/2406.09465 (2024) - 2023
- [j45]Zixuan Ma, Yuyang Jin, Shizhi Tang, Haojie Wang, Wei-Cheng Xue, Jidong Zhai, Wei-Min Zheng:
Unified Programming Models for Heterogeneous High-Performance Computers. J. Comput. Sci. Technol. 38(1): 211-218 (2023) - [j44]Zheng Chen, Feng Zhang, Jiawei Guan, Jidong Zhai, Xipeng Shen, Huanchen Zhang, Wentong Shu, Xiaoyong Du:
CompressGraph: Efficient Parallel Graph Analytics with Rule-Based Compression. Proc. ACM Manag. Data 1(1): 4:1-4:31 (2023) - [j43]Zhen Zheng, Zaifeng Pan, Dalin Wang, Kai Zhu, Wenyi Zhao, Tianyou Guo, Xiafei Qiu, Minmin Sun, Junjie Bai, Feng Zhang, Xiaoyong Du, Jidong Zhai, Wei Lin:
BladeDISC: Optimizing Dynamic Shape Machine Learning Workloads via Compiler Approach. Proc. ACM Manag. Data 1(3): 206:1-206:29 (2023) - [j42]Sunita Chandrasekaran, Min Si, Jidong Zhai, Lena Oden:
Special issue on new trends in high-performance computing: Software systems and applications. Softw. Pract. Exp. 53(1): 3-5 (2023) - [j41]Haojie Wang, Jidong Zhai, Mingyu Gao, Feng Zhang, Tuowei Wang, Zixuan Ma, Shizhi Tang, Liyan Zheng, Wen Wang, Kaiyuan Rong, Yuanyong Chen, Zhihao Jia:
Optimizing DNNs With Partially Equivalent Transformations and Automated Corrections. IEEE Trans. Computers 72(12): 3546-3560 (2023) - [j40]Juncheng Cao, Kaiyuan Rong, Mingshu Zhai, Zeyu Song, Yanyu Ren, Yuxi Zhu, Wentao Han, Jidong Zhai:
Critique of "A Parallel Framework for Constraint-Based Bayesian Network Learning via Markov Blanket Discovery" by SCC Team From Tsinghua University. IEEE Trans. Parallel Distributed Syst. 34(6): 1723-1726 (2023) - [j39]Yihua Hu, Feng Zhang, Yifei Xia, Zhiming Yao, Letian Zeng, Haipeng Ding, Zhewei Wei, Xiao Zhang, Jidong Zhai, Xiaoyong Du, Siqi Ma:
Enabling Efficient Random Access to Hierarchically Compressed Text Data on Diverse GPU Platforms. IEEE Trans. Parallel Distributed Syst. 34(10): 2699-2717 (2023) - [c68]Lunyiu Nie, Jiuding Sun, Yanlin Wang, Lun Du, Shi Han, Dongmei Zhang, Lei Hou, Juanzi Li, Jidong Zhai:
Unveiling the Black Box of PLMs with Semantic Anchors: Towards Interpretable Neural Semantic Parsing. AAAI 2023: 13400-13408 - [c67]Qianjin Du, Shiji Zhou, Xiaohui Kuang, Gang Zhao, Jidong Zhai:
Joint Geometrical and Statistical Domain Adaptation for Cross-domain Code Vulnerability Detection. EMNLP 2023: 12791-12800 - [c66]Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Zhiyuan Liu, Peng Zhang, Yuxiao Dong, Jie Tang:
GLM-130B: An Open Bilingual Pre-trained Model. ICLR 2023 - [c65]Chen Zhang, Lingxiao Ma, Jilong Xue, Yining Shi, Ziming Miao, Fan Yang, Jidong Zhai, Zhi Yang, Mao Yang:
Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning. OSDI 2023: 681-699 - [c64]Liyan Zheng, Haojie Wang, Jidong Zhai, Muyan Hu, Zixuan Ma, Tuowei Wang, Shuhong Huang, Xupeng Miao, Shizhi Tang, Kezhao Huang, Zhihao Jia:
EINNET: Optimizing Tensor Programs with Derivation-Based Transformations. OSDI 2023: 739-755 - [c63]Tianhui Shi, Jidong Zhai, Haojie Wang, Qiqian Chen, Mingshu Zhai, Zixu Hao, Haoyu Yang, Wenguang Chen:
GraphSet: High Performance Graph Mining through Equivalent Set Transformations. SC 2023: 32:1-32:14 - [c62]Mingshu Zhai, Jiaao He, Zixuan Ma, Zan Zong, Runqing Zhang, Jidong Zhai:
SmartMoE: Efficiently Training Sparsely-Activated Models through Combining Offline and Online Parallelization. USENIX ATC 2023: 961-975 - [i13]Kezhao Huang, Haitian Jiang, Minjie Wang, Guangxuan Xiao, David Wipf, Xiang Song, Quan Gan, Zengfeng Huang, Jidong Zhai, Zheng Zhang:
ReFresh: Reducing Memory Access from Exploiting Stable Historical Embeddings for Graph Neural Network Training. CoRR abs/2301.07482 (2023) - [i12]Zixuan Ma, Haojie Wang, Jingze Xing, Liyan Zheng, Chen Zhang, Huanqi Cao, Kezhao Huang, Shizhi Tang, Penghan Wang, Jidong Zhai:
PowerFusion: A Tensor Compiler with Explicit Data Movement Description and Instruction-level Graph IR. CoRR abs/2307.04995 (2023) - 2022
- [j38]Wei Liu, Jiangming Jin, Hao Wu, Yifan Gong, Ziyue Jiang, Jidong Zhai:
Zoro: A robotic middleware combining high performance and high reliability. J. Parallel Distributed Comput. 166: 126-138 (2022) - [j37]Feng Zhang, Yani Liu, Ningxuan Feng, Cheng Yang, Jidong Zhai, Shuhao Zhang, Bingsheng He, Jiazao Lin, Xiao Zhang, Xiaoyong Du:
Periodic Weather-Aware LSTM With Event Mechanism for Parking Behavior Prediction. IEEE Trans. Knowl. Data Eng. 34(12): 5896-5909 (2022) - [j36]Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du:
POCLib: A High-Performance Framework for Enabling Near Orthogonal Processing on Compression. IEEE Trans. Parallel Distributed Syst. 33(2): 459-475 (2022) - [j35]Zaifeng Pan, Feng Zhang, Yanliang Zhou, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du:
Exploring Data Analytics Without Decompression on Embedded GPU Systems. IEEE Trans. Parallel Distributed Syst. 33(7): 1553-1568 (2022) - [j34]Runxin Zhong, Jiajie Chen, Chen Zhang, Mingshu Zhai, Zeyu Song, Yutian Wang, Wentao Han, Lin Gan, Jidong Zhai:
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Tsinghua University. IEEE Trans. Parallel Distributed Syst. 33(9): 2050-2053 (2022) - [j33]Jiesong Liu, Feng Zhang, Hourun Li, Dalin Wang, Weitao Wan, Xiaokun Fang, Jidong Zhai, Xiaoyong Du:
Exploring Query Processing on CPU-GPU Integrated Edge Device. IEEE Trans. Parallel Distributed Syst. 33(10): 4057-4070 (2022) - [j32]Jidong Zhai, Liyan Zheng, Feng Zhang, Xiongchao Tang, Haojie Wang, Teng Yu, Yuyang Jin, Shuaiwen Leon Song, Wenguang Chen:
Detecting Performance Variance for Parallel Applications Without Source Code. IEEE Trans. Parallel Distributed Syst. 33(10): 4239-4255 (2022) - [j31]Jidong Zhai, Min Si, Antonio J. Peña:
Guest Editorial. IEEE Trans. Parallel Distributed Syst. 33(11): 2644-2647 (2022) - [j30]Jidong Zhai, Liyan Zheng, Jinghan Sun, Feng Zhang, Xiongchao Tang, Xuehai Qian, Bingsheng He, Wei Xue, Wenguang Chen, Weimin Zheng:
Leveraging Code Snippets to Detect Variations in the Performance of HPC Systems. IEEE Trans. Parallel Distributed Syst. 33(12): 3558-3574 (2022) - [j29]Qingyu Xu, Feng Zhang, Mingde Zhang, Jidong Zhai, Bingsheng He, Cheng Yang, Shuhao Zhang, Jiazao Lin, Haidi Liu, Xiaoyong Du:
Payment behavior prediction on shared parking lots with TR-GCN. VLDB J. 31(5): 1035-1058 (2022) - [c61]Zhen Zheng, Xuanda Yang, Pengzhan Zhao, Guoping Long, Kai Zhu, Feiwen Zhu, Wenyi Zhao, Xiaoyong Liu, Jun Yang, Jidong Zhai, Shuaiwen Leon Song, Wei Lin:
AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures. ASPLOS 2022: 359-373 - [c60]Lei Xie, Jidong Zhai, Zhenxing Zhang, Jonathan Allcock, Shengyu Zhang, Yicong Zheng:
Suppressing ZZ crosstalk of Quantum computers through pulse and scheduling co-optimization. ASPLOS 2022: 499-513 - [c59]Lunyiu Nie, Shulin Cao, Jiaxin Shi, Jiuding Sun, Qi Tian, Lei Hou, Juanzi Li, Jidong Zhai:
GraphQ IR: Unifying the Semantic Parsing of Graph Query Languages with One Intermediate Representation. EMNLP 2022: 5848-5865 - [c58]Yunquan Zhang, Jidong Zhai, Rajiv Ranjan:
Message from the High Performance Computing and Communications 2022 Program Chairs. HPCC/DSS/SmartCity/DependSys 2022: lv - [c57]Zixuan Ma, Haojie Wang, Guanyu Feng, Chen Zhang, Lei Xie, Jiaao He, Shengqi Chen, Jidong Zhai:
Efficiently emulating high-bitwidth computation with low-bitwidth hardware. ICS 2022: 5:1-5:12 - [c56]Shizhi Tang, Jidong Zhai, Haojie Wang, Lin Jiang, Liyan Zheng, Zhenhao Yuan, Chen Zhang:
FreeTensor: a free-form DSL with holistic optimizations for irregular tensor programs. PLDI 2022: 872-887 - [c55]Jiaao He, Jidong Zhai, Tiago Antunes, Haojie Wang, Fuwen Luo, Shangfeng Shi, Qin Li:
FasterMoE: modeling and optimizing training of large-scale dynamic pre-trained models. PPoPP 2022: 120-134 - [c54]Liyan Zheng, Jidong Zhai, Xiongchao Tang, Haojie Wang, Teng Yu, Yuyang Jin, Shuaiwen Leon Song, Wenguang Chen:
Vapro: performance variance detection and diagnosis for production-run parallel applications. PPoPP 2022: 150-162 - [c53]Yuyang Jin, Haojie Wang, Runxin Zhong, Chen Zhang, Jidong Zhai:
PerFlow: a domain specific framework for automatic performance analysis of parallel applications. PPoPP 2022: 177-191 - [c52]Zixuan Ma, Jiaao He, Jiezhong Qiu, Huanqi Cao, Yuanwei Wang, Zhenbo Sun, Liyan Zheng, Haojie Wang, Shizhi Tang, Tianyu Zheng, Junyang Lin, Guanyu Feng, Zeqiang Huang, Jie Gao, Aohan Zeng, Jianwei Zhang, Runxin Zhong, Tianhui Shi, Sha Liu, Weimin Zheng, Jie Tang, Hongxia Yang, Xin Liu, Jidong Zhai, Wenguang Chen:
BaGuaLu: targeting brain scale pretrained models with over 37 million cores. PPoPP 2022: 192-204 - [c51]Chen Zhang, Haojie Wang, Zixuan Ma, Lei Xie, Zeyu Song, Jidong Zhai:
UniQ: A Unified Programming Model for Efficient Quantum Circuit Simulation. SC 2022: 49:1-49:16 - [c50]Feng Zhang, Weitao Wan, Chenyang Zhang, Jidong Zhai, Yunpeng Chai, Haixiang Li, Xiaoyong Du:
CompressDB: Enabling Efficient Compressed Data Direct Processing for Various Databases. SIGMOD Conference 2022: 1655-1669 - [i11]Lunyiu Nie, Shulin Cao, Jiaxin Shi, Qi Tian, Lei Hou, Juanzi Li, Jidong Zhai:
GraphQ IR: Unifying Semantic Parsing of Graph Query Language with Intermediate Representation. CoRR abs/2205.12078 (2022) - [i10]Liyan Zheng, Haojie Wang, Jidong Zhai, Muyan Hu, Zixuan Ma, Tuowei Wang, Shizhi Tang, Lei Xie, Kezhao Huang, Zhihao Jia:
OLLIE: Derivation-based Tensor Program Optimizer. CoRR abs/2208.02025 (2022) - [i9]Lunyiu Nie, Jiuding Sun, Yanlin Wang, Lun Du, Shi Han, Dongmei Zhang, Lei Hou, Juanzi Li, Jidong Zhai:
Guiding the PLMs with Semantic Anchors as Intermediate Supervision: Towards Interpretable Semantic Parsing. CoRR abs/2210.01425 (2022) - [i8]Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Peng Zhang, Yuxiao Dong, Jie Tang:
GLM-130B: An Open Bilingual Pre-trained Model. CoRR abs/2210.02414 (2022) - 2021
- [j28]Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, Wenguang Chen:
AIPerf: Automated machine learning as an AI-HPC benchmark. Big Data Min. Anal. 4(3): 208-220 (2021) - [j27]Xian-He Sun, Dong Li, Wen-Guang Chen, Tao Li, Jiwu Shu, Bo Wu, Jin Xiong, Jingling Xue, Feng Zhang, Jidong Zhai, Zhijia Zhao:
Preface. J. Comput. Sci. Technol. 36(1): 1-3 (2021) - [j26]Xiongchao Tang, Chen Zhang, Jidong Zhai, Xuehai Qian, Wenguang Chen, Yong Jiang:
A Fast Lock for Explicit Message Passing Architectures. IEEE Trans. Computers 70(10): 1555-1568 (2021) - [j25]Feng Zhang, Jidong Zhai, Bo Wu, Bingsheng He, Wenguang Chen, Xiaoyong Du:
Automatic Irregularity-Aware Fine-Grained Workload Partitioning on Integrated Architectures. IEEE Trans. Knowl. Data Eng. 33(3): 867-881 (2021) - [j24]Teng Yu, Runxin Zhong, Vladimir Janjic, Pavlos Petoumenos, Jidong Zhai, Hugh Leather, John Thomson:
Collaborative Heterogeneity-Aware OS Scheduler for Asymmetric Multicore Processors. IEEE Trans. Parallel Distributed Syst. 32(5): 1224-1237 (2021) - [j23]Pavan Balaji, Jidong Zhai, Min Si:
Guest Editorial. IEEE Trans. Parallel Distributed Syst. 32(7): 1511-1512 (2021) - [j22]Feng Zhang, Zheng Chen, Chenyang Zhang, Amelie Chi Zhou, Jidong Zhai, Xiaoyong Du:
An Efficient Parallel Secure Machine Learning Framework on GPUs. IEEE Trans. Parallel Distributed Syst. 32(9): 2262-2276 (2021) - [j21]Chen Zhang, Chenggang Zhao, Jiaao He, Shengqi Chen, Liyan Zheng, Kezhao Huang, Wentao Han, Jidong Zhai:
Critique of "Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility" by SCC Team From Tsinghua University. IEEE Trans. Parallel Distributed Syst. 32(11): 2631-2634 (2021) - [j20]Feng Zhang, Jidong Zhai, Xipeng Shen, Dalin Wang, Zheng Chen, Onur Mutlu, Wenguang Chen, Xiaoyong Du:
TADOC: Text analytics directly on compression. VLDB J. 30(2): 163-188 (2021) - [c49]Hao Wu, Jiangming Jin, Jidong Zhai, Yifan Gong, Wei Liu:
Accelerating GPU Message Communication for Autonomous Navigation Systems. CLUSTER 2021: 181-191 - [c48]Lei Xie, Jidong Zhai, Weimin Zheng:
Mitigating Crosstalk in Quantum Computers through Commutativity-Based Instruction Reordering. DAC 2021: 445-450 - [c47]Feng Zhang, Zaifeng Pan, Yanliang Zhou, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du:
G-TADOC: Enabling Efficient GPU-Based Text Analytics without Decompression. ICDE 2021: 1679-1690 - [c46]Chen Zhang, Zeyu Song, Haojie Wang, Kaiyuan Rong, Jidong Zhai:
HyQuas: hybrid partitioner based quantum circuit simulation system on GPU. ICS 2021: 443-454 - [c45]Haojie Wang, Jidong Zhai, Mingyu Gao, Zixuan Ma, Shizhi Tang, Liyan Zheng, Yuanzhi Li, Kaiyuan Rong, Yuanyong Chen, Zhihao Jia:
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections. OSDI 2021: 37-54 - [c44]Kezhao Huang, Jidong Zhai, Zhen Zheng, Youngmin Yi, Xipeng Shen:
Understanding and bridging the gaps in current GNN performance optimizations. PPoPP 2021: 119-132 - [i7]Jiaao He, Jiezhong Qiu, Aohan Zeng, Zhilin Yang, Jidong Zhai, Jie Tang:
FastMoE: A Fast Mixture-of-Expert Training System. CoRR abs/2103.13262 (2021) - [i6]Feng Zhang, Zaifeng Pan, Yanliang Zhou, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du:
G-TADOC: Enabling Efficient GPU-Based Text Analytics without Decompression. CoRR abs/2106.06889 (2021) - 2020
- [j19]Ziyue Jiang, Yifan Gong, Jidong Zhai, Yu-Ping Wang, Wei Liu, Hao Wu, Jiangming Jin:
Message Passing Optimization in Robot Operating System. Int. J. Parallel Program. 48(1): 119-136 (2020) - [c43]Chanyoung Oh, Zhen Zheng, Xipeng Shen, Jidong Zhai, Youngmin Yi:
GOPipe: A Granularity-Oblivious Programming Framework for Pipelined Stencil Executions on GPU. PACT 2020: 43-54 - [c42]Lei Xie, Jidong Zhai, Baodong Wu, Yuanbo Wang, Xingcheng Zhang, Peng Sun, Shengen Yan:
Elan: Towards Generic and Efficient Elastic Training for Deep Learning. ICDCS 2020: 78-88 - [c41]Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du:
Enabling Efficient Random Access to Hierarchically-Compressed Data. ICDE 2020: 1069-1080 - [c40]Zheng Chen, Feng Zhang, Amelie Chi Zhou, Jidong Zhai, Chenyang Zhang, Xiaoyong Du:
ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs. ICPP 2020: 22:1-22:11 - [c39]Wei Liu, Yifan Gong, Hao Wu, Jidong Zhai, Jiangming Jin:
Memory-Centric Communication Mechanism for Real-time Autonomous Navigation Applications. ICPP 2020: 33:1-33:11 - [c38]Xiaoyang Wang, Zhe Zhou, Ping Han, Tong Meng, Guangyu Sun, Jidong Zhai:
Edge-Stream: a Stream Processing Approach for Distributed Applications on a Hierarchical Edge-computing System. SEC 2020: 14-27 - [c37]Feng Zhang, Ningxuan Feng, Yani Liu, Cheng Yang, Jidong Zhai, Shuhao Zhang, Bingsheng He, Jiazao Lin, Xiaoyong Du:
PewLSTM: Periodic LSTM with Weather-Aware Gating Mechanism for Parking Behavior Prediction. IJCAI 2020: 4424-4430 - [c36]Qingyu Xu, Feng Zhang, Mingde Zhang, Jidong Zhai, Jiazao Lin, Haidi Liu, Xiaoyong Du:
Payment Behavior Prediction and Statistical Analysis for Shared Parking Lots. NPC 2020: 288-293 - [c35]Yuyang Jin, Haojie Wang, Xiongchao Tang, Torsten Hoefler, Xu Liu, Jidong Zhai:
Identifying scalability bottlenecks for large-scale parallel programs with graph analysis. PPoPP 2020: 409-410 - [c34]Yuyang Jin, Haojie Wang, Teng Yu, Xiongchao Tang, Torsten Hoefler, Xu Liu, Jidong Zhai:
ScalAna: automating scaling loss detection with graph analysis. SC 2020: 28 - [c33]Tianhui Shi, Mingshu Zhai, Yi Xu, Jidong Zhai:
GraphPi: high performance graph pattern matching through effective redundancy elimination. SC 2020: 100 - [i5]Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, Wenguang Chen:
AIPerf: Automated machine learning as an AI-HPC benchmark. CoRR abs/2008.07141 (2020) - [i4]Yuyang Jin, Haojie Wang, Teng Yu, Xiongchao Tang, Torsten Hoefler, Xu Liu, Jidong Zhai:
ScalAna: Automating Scaling Loss Detection with Graph Analysis. CoRR abs/2009.01692 (2020) - [i3]Feng Zhang, Jidong Zhai, Xipeng Shen, Dalin Wang, Zheng Chen, Onur Mutlu, Wenguang Chen, Xiaoyong Du:
TADOC: Text Analytics Directly on Compression. CoRR abs/2009.09442 (2020) - [i2]Tianhui Shi, Mingshu Zhai, Yi Xu, Jidong Zhai:
GraphPi: High Performance Graph Pattern Matching through Effective Redundancy Elimination. CoRR abs/2009.10955 (2020)
2010 – 2019
- 2019
- [j18]Feng Zhang, Weifeng Liu, Ningxuan Feng, Jidong Zhai, Xiaoyong Du:
Performance evaluation and analysis of sparse matrix and graph kernels on heterogeneous processors. CCF Trans. High Perform. Comput. 1(2): 131-143 (2019) - [j17]Feng Zhang, Jidong Zhai, Marc Snir, Hai Jin, Hironori Kasahara, Mateo Valero:
Guest Editorial: Special Issue on Network and Parallel Computing for Emerging Architectures and Applications. Int. J. Parallel Program. 47(3): 343-344 (2019) - [j16]Jiaao He, Chenggang Zhao, Jiping Yu, Xinjian Yu, Liyan Zheng, Chenyao Lou, Shizhi Tang, Wentao Han, Jidong Zhai:
Student Cluster Competition 2018, Team Tsinghua University: Reproducing performance of multi-physics simulations of the Tsunamigenic 2004 Sumatra megathrust earthquake on the Intel Skylake Architecture. Parallel Comput. 90 (2019) - [j15]Amelie Chi Zhou, Yao Xiao, Yifan Gong, Bingsheng He, Jidong Zhai, Rui Mao:
Privacy Regulation Aware Process Mapping in Geo-Distributed Cloud Data Centers. IEEE Trans. Parallel Distributed Syst. 30(8): 1872-1888 (2019) - [c32]Zhen Zheng, Chanyoung Oh, Jidong Zhai, Xipeng Shen, Youngmin Yi, Wenguang Chen:
HiWayLib: A Software Framework for Enabling High Performance Communications for Heterogeneous Pipeline Computations. ASPLOS 2019: 153-166 - [c31]Xiongchao Tang, Jidong Zhai, Xuehai Qian, Wenguang Chen:
pLock: A Fast Lock for Architectures with Explicit Inter-core Message Passing. ASPLOS 2019: 765-778 - [c30]Xu Ji, Bin Yang, Tianyu Zhang, Xiaosong Ma, Xiupeng Zhu, Xiyang Wang, Nosayba El-Sayed, Jidong Zhai, Weiguo Liu, Wei Xue:
Automatic, Application-Aware I/O Forwarding Resource Allocation. FAST 2019: 265-279 - [c29]Ningxuan Feng, Feng Zhang, Jiazao Lin, Jidong Zhai, Xiaoyong Du:
Statistical Analysis and Prediction of Parking Behavior. NPC 2019: 93-104 - [c28]Bin Yang, Xu Ji, Xiaosong Ma, Xiyang Wang, Tianyu Zhang, Xiupeng Zhu, Nosayba El-Sayed, Haidong Lan, Yibo Yang, Jidong Zhai, Weiguo Liu, Wei Xue:
End-to-end I/O Monitoring on a Leading Supercomputer. NSDI 2019: 379-394 - [c27]Chanyoung Oh, Zhen Zheng, Xipeng Shen, Jidong Zhai, Youngmin Yi:
GOPipe: a granularity-oblivious programming framework for pipelined stencil executions on GPU. PPoPP 2019: 431-432 - [c26]