default search action
CCF Transactions on High Performance Computing, Volume 5
Volume 5, Number 1, March 2023
- Jiachang Sun, Huiyuan Li, Wenjing Ma:
Editorial for the special issue on new algorithms and software for E-scale high performance computing. 1-2 - Chaofeng Hou, Aiqi Zhu, Shuai Zhang, Mingcan Zhao, Yanhao Ye, Ji Xu, Wei Ge:
Atomistic simulation of low-dimensional nanostructures toward extreme-scale supercomputing. 3-11 - Lian Duan, Chuanfu Xiao, Min Li, Mingshuo Ding, Chao Yang:
a-Tucker: fast input-adaptive and matricization-free Tucker decomposition of higher-order tensors on GPUs. 12-25 - Xinming Qin, Junshi Chen, Zhaolong Luo, Lingyun Wan, Jielan Li, Shizhe Jiao, Zhenlin Zhang, Qingcai Jiang, Wei Hu, Hong An, Jinlong Yang:
High performance computing for first-principles Kohn-Sham density functional theory towards exascale supercomputers. 26-42 - Kan Liu, Xinliang Wang, Wei Xue:
Model guided algorithm optimization for tridiagonal solver on many-core architectures. 43-55 - Fangfang Liu, Wenjing Ma, Yuwen Zhao, Daokun Chen, Yi Hu, Qinglin Lu, Wanwang Yin, Xinhui Yuan, Lijuan Jiang, Hao Yan, Min Li, Hongsen Wang, Xinyu Wang, Chao Yang:
xMath2.0: a high-performance extended math library for SW26010-Pro many-core processor. 56-71 - Xiaowen Xu, Xiaoqiang Yue, Runzhang Mao, Yuntong Deng, Silu Huang, Haifeng Zou, Xiao Liu, Shaoliang Hu, Chunsheng Feng, Shi Shu, Zeyao Mo:
JXPAMG: a parallel algebraic multigrid solver for extreme-scale numerical simulations. 72-83 - Qiao Sun, Wenjing Ma, Jiachang Sun, Huiyuan Li:
Evolving the HPL benchmark towards multi-GPGPU clusters. 84-96 - Fangfang Liu, Wenjing Ma, Yuwen Zhao, Daokun Chen, Yi Hu, Qinglin Lu, Wanwang Yin, Xinhui Yuan, Lijuan Jiang, Hao Yan, Min Li, Hongsen Wang, Xinyu Wang, Chao Yang:
Publisher Correction: xMath2.0: a high-performance extended math library for SW26010-Pro many-core processor. 97
Volume 5, Number 2, June 2023
- Weifeng Liu, Guangming Tan, Xiaowen Xu:
Editorial for the special issue on architecture, algorithms and applications of high performance sparse matrix computations. 99-101 - Y. R. Annie Bessant, J. Grace Jency, K. Martin Sagayam, A. Amir Anton Jone, Digvijay Pandey, Binay Kumar Pandey:
Improved parallel matrix multiplication using Strassen and Urdhvatiryagbhyam method. 102-115 - Shengguo Li, Xia Liao, Yutong Lu, José E. Román, Xiaoqiang Yue:
A parallel structured banded DC algorithm for symmetric eigenvalue problems. 116-128 - Zhengyang Lu, Weifeng Liu:
TileSpTRSV: a tiled algorithm for parallel sparse triangular solve on GPUs. 129-143 - Li Zhao, Shizhe Li, Chen-Song Zhang, Chunsheng Feng, Shi Shu:
An improved multistage preconditioner on GPUs for compositional reservoir simulation. 144-159 - Jiaquan Gao, Xinyue Chu, Yizhou Wang:
HeuriSPAI: a heuristic sparse approximate inverse preconditioning algorithm on GPU. 160-170 - Yu Li, Zijing Wang, Hehu Xie:
GCGE: a package for solving large scale eigenvalue problems by parallel block damping inverse power method. 171-190 - Chuanying Li, Stef Graillat, Zhe Quan, Tongxiang Gu, Hao Jiang, Kenli Li:
XHYPRE: a reliable parallel numerical algorithm library for solving large-scale sparse linear equations. 191-209 - Genghan Zhang, Yuetong Zhao, Yanting Tao, Zhongming Yu, Guohao Dai, Sitao Huang, Yuan Wen, Pavlos Petoumenos, Yu Wang:
Sgap: towards efficient sparse tensor algebra compilation for GPU. 210-227
Volume 5, Number 3, September 2023
- Liang Yuan, Junmin Xiao:
SI on parallel system and algorithm optimization. 229-230 - Yongtao Luo, Bo Yang, Jie Liu, Ruibo Wang, Jinmin Wen, Tiaojie Xiao, Xuguang Chen, Chunye Gong:
MT-office: parallel password recovery program for office on domestic heterogeneous multi-core processor. 231-244 - Xiaojun Lei, Tongxiang Gu, Xiaowen Xu:
ddRingAllreduce: a high-precision RingAllreduce algorithm. 245-257 - Kexing Zhou, Yong Dong, Juan Chen, Yuhan Cao, Zekai Li, Rongyu Deng, Yifei Guo, Zhixin Ou:
Processor power forecasting through model sample analysis and clustering. 258-276 - Yuan Zhang, Huawei Cao, Yan Liang, Jie Zhang, Junying Huang, Xiaochun Ye, Xuejun An:
FSGraph: fast and scalable implementation of graph traversal on GPUs. 277-291 - Xiaohui Wei, Xinyang Zheng, Chenyang Wang, Guangli Li, Hengshan Yue:
FASS-pruner: customizing a fine-grained CNN accelerator-aware pruning framework via intra-filter splitting and inter-filter shuffling. 292-303 - Jie Lou, Yiming Sun, Jie Zhang, Huawei Cao, Yuan Zhang, Ninghui Sun:
ArkGPU: enabling applications' high-goodput co-location execution on multitasking GPUs. 304-321 - Biao Sun, Mingzhen Li, Hailong Yang, Jun Xu, Zhongzhi Luan, Depei Qian:
Adapting combined tiling to stencil optimizations on sunway processor. 322-333 - Songwen Pei, Jie Luo, Sheng Liang, Haonan Ding, Xiaochun Ye, Mingsong Chen:
Carbon Emissions Reduction of Neural Network by Discrete Rank Pruning. 334-346
Volume 5, Number 4, December 2023
- Bin Zhao, Jiangkai Hu, Dapeng Wang, Bo Zhang, Fajing Chen, Ziwei Wan, Siyuan Sun:
The GRAPES evaluation tools based on Python (GetPy). 347-359 - Jiaxu Guo, Yidan Xu, Haohuan Fu, Wei Xue, Lin Gan, Mengxuan Tan, Tingye Wu, Yutong Shen, Xianwei Wu, Liang Hu, Xilong Che:
GEO-WMS: an improved approach to geoscientific workflow management system on HPC. 360-373 - D. Sirisha, S. Sambhu Prasad:
MPEFT: a makespan minimizing heuristic scheduling algorithm for workflows in heterogeneous computing systems. 374-389 - Faezeh Mollasalehi, Ehsan Mousavi Khaneghah, Amirhosein Reyhani ShowkatAbad, Seyed Alireza Seyednejad, Faeze Gholamrezaie:
ExaLB: a mathematical framework for load balancing to support distributed exascale computing environments. 390-415 - Pouria Fakhri, Ehsan Mousavi Khaneghah, Zohreh Esmaeili Bidhendi, Araz R. Aliev:
ExaSU: a mathematical model for selecting the structured or unstructured resource discovery mechanism in distributed exascale computing environments. 416-428 - Yan Zeng, Yong Ding, Dongyang Ou, Jilin Zhang, Yongjian Ren, Yunquan Zhang:
MP-DPS: adaptive distributed training for deep learning based on node merging and path prediction. 429-441 - Fang Lin, Yi Liu, Xin Wang, Xueyan Gai:
Leveraging simulation of high performance computing systems with node simulation using architecture simulator. 442-464 - Ou Wu, Binbin Huang, Shanshan Li, Yanze Wang, Haoming Li:
A performance evaluation method of queuing theory based on Cosmos cross-chain platform. 465-485
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.