IEEE Transactions on Parallel and Distributed Systems, Volume 32
Volume 32, Number 1, January 2021
- Haitao Zhang, Xin Geng, Huadong Ma:
Learning-Driven Interference-Aware Workload Parallelization for Streaming Applications in Heterogeneous Cluster. 1-15 - Weichen Huang, Juntao Fang, Shenggang Wan, Changsheng Xie, Xubin He:
Design and Evaluation of a Risk-Aware Failure Identification Scheme for Improved RAS in Erasure-Coded Data Centers. 16-30 - Xiaoyu Xia, Feifei Chen, Qiang He, John C. Grundy, Mohamed Abdelrazek, Hai Jin:
Cost-Effective App Data Distribution in Edge Computing. 31-44 - Shinobu Miwa, Ignacio Laguna, Martin Schulz:
PredCom: A Predictive Approach to Collecting Approximated Communication Traces. 45-58 - Moming Duan, Duo Liu, Xianzhang Chen, Renping Liu, Yujuan Tan, Liang Liang:
Self-Balancing Federated Learning With Global Imbalanced Data in Mobile Systems. 59-71 - Philipp Grete, Forrest Wolfgang Glines, Brian W. O'Shea:
K-Athena: A Performance Portable Structured Grid Finite Volume Magnetohydrodynamics Code. 85-97 - Sheng Wang, Zhijun Ding, Changjun Jiang:
Elastic Scheduling for Microservice Applications in Clouds. 98-115 - Zhiyong Ye, Yang Wang, Shuibing He, Chengzhong Xu, Xian-He Sun:
Sova: A Software-Defined Autonomic Framework for Virtual Network Allocations. 116-130 - Guoqing Xiao, Kenli Li, Yuedan Chen, Wangquan He, Albert Y. Zomaya, Tao Li:
CASpMV: A Customized and Accelerative SpMV Framework for the Sunway TaihuLight. 131-146 - M. Ozan Karsavuran, Seher Acer, Cevdet Aykanat:
Partitioning Models for General Medium-Grain Parallel Sparse Tensor Decomposition. 147-159 - Zhigao Zheng, Xuanhua Shi, Ligang He, Hai Jin, Shuo Wei, Hulin Dai, Xuan Peng:
Feluca: A Two-Stage Graph Coloring Algorithm With Color-Centric Paradigm on GPU. 160-173 - Sudip Misra, Anandarup Mukherjee, Arijit Roy, Nishant Saurabh, Yogachandran Rahulamathavan, Muttukrishnan Rajarajan:
Blockchain at the Edge: Performance of Resource-Constrained IoT Networks. 174-183 - Dimosthenis Masouros, Sotirios Xydis, Dimitrios Soudris:
Rusty: Runtime Interference-Aware Predictive Monitoring for Modern Multi-Tenant Systems. 184-198 - Tong Geng, Ang Li, Tianqi Wang, Chunshu Wu, Yanfei Li, Runbin Shi, Wei Wu, Martin C. Herbordt:
O3BNN-R: An Out-of-Order Architecture for High-Performance and Regularized BNN Inference. 199-213 - Yujuan Tan, Congcong Xu, Jing Xie, Zhichao Yan, Hong Jiang, Witawas Srisa-an, Xianzhang Chen, Duo Liu:
Improving the Performance of Deduplication-Based Storage Cache via Content-Driven Cache Management Methods. 214-228 - Marc González, Enric Morancho:
Multi-GPU Parallelization of the NAS Multi-Zone Parallel Benchmarks. 229-241 - Jin Wang, Jia Hu, Geyong Min, Albert Y. Zomaya, Nektarios Georgalas:
Fast Adaptive Task Offloading in Edge Computing Based on Meta Reinforcement Learning. 242-253
Volume 32, Number 2, February 2021
- Alvaro Wong, Elisa Heymann, Dolores Rexachs, Emilio Luque:
Middleware to Manage Fault Tolerance Using Semi-Coordinated Checkpoints. 254-268 - Md. S. Q. Zulkar Nine, Tevfik Kosar:
A Two-Phase Dynamic Throughput Optimization Model for Big Data Transfers. 269-280 - Xiaoyu Xia, Feifei Chen, Qiang He, John Grundy, Mohamed Abdelrazek, Hai Jin:
Online Collaborative Data Caching in Edge Computing. 281-294 - Song Yang, Fan Li, Stojan Trajanovski, Ramin Yahyapour, Xiaoming Fu:
Recent Advances of Resource Allocation in Network Function Virtualization. 295-314 - Purushottam Sigdel, Xu Yuan, Nian-Feng Tzeng:
Realizing Best Checkpointing Control in Computing Systems. 315-329 - Hui Guan, Xipeng Shen, Hamid Krim:
An Automatic Synthesizer of Advising Tools for High Performance Computing. 330-341 - Thi-Thanh-Quynh Nguyen, Christophe Bobineau, Vincent Debusschere, Quang-Huy Giap, Nouredine Hadjsaid:
CPDE: A Methodology for the Transparent Distribution of Centralized Smart Grid Programs. 342-354 - Linhuai Tang, Gang Cai, Yong Zheng, Jiamin Chen:
A Resource and Performance Optimization Reduction Circuit on FPGAs. 355-366 - Xia Liao, Shengguo Li, Yutong Lu, José E. Román:
A Parallel Structured Divide-and-Conquer Algorithm for Symmetric Tridiagonal Eigenvalue Problems. 367-378 - Ahmad Al Badawi, Bharadwaj Veeravalli, Jie Lin, Xiao Nan, Kazuaki Matsumura, Khin Mi Mi Aung:
Multi-GPU Design and Performance Evaluation of Homomorphic Encryption on GPU Clusters. 379-391 - Zhengjun Cao, Olivier Markowitch:
Comment on "Circuit Ciphertext-Policy Attribute-Based Hybrid Encryption With Verifiable Delegation in Cloud Computing". 392-393 - Cong Wang, Yuanyuan Yang, Pengzhan Zhou:
Towards Efficient Scheduling of Federated Mobile Devices Under Computational and Statistical Heterogeneity. 394-410 - Xiaojie Wang, Zhaolong Ning, Song Guo:
Multi-Agent Imitation Learning for Pervasive Edge Computing: A Decentralized Computation Offloading Algorithm. 411-425 - Yusen Li, Changjian Zhao, Xueyan Tang, Wentong Cai, Xiaoguang Liu, Gang Wang, Xiaoli Gong:
Towards Minimizing Resource Usage With QoS Guarantee in Cloud Gaming. 426-440 - Pu Pang, Quan Chen, Deze Zeng, Minyi Guo:
Adaptive Preference-Aware Co-Location for Improving Resource Utilization of Power Constrained Datacenters. 441-456 - Tong Zhang, Fengyuan Ren, Jiakun Bao, Ran Shu, Wenxue Cheng:
Minimizing Coflow Completion Time in Optical Circuit Switched Networks. 457-469 - Andrea Giordano, Alessio De Rango, Rocco Rongo, Donato D'Ambrosio, William Spataro:
Dynamic Load Balancing in Parallel Execution of Cellular Automata. 470-484 - Yaxing Chen, Qinghua Zheng, Zheng Yan, Dan Liu:
QShield: Protecting Outsourced Cloud Data Queries With Multi-User Access Control Based on SGX. 485-499
Volume 32, Number 3, March 2021
- Xiaoli Gong, Dingyuan Cao, Yuxuan Li, Ximing Liu, Yusen Li, Jin Zhang, Tao Li:
A Thread Level SLO-Aware I/O Framework for Embedded Virtualization. 500-513 - Sara Kardani-Moghaddam, Rajkumar Buyya, Kotagiri Ramamohanarao:
ADRL: A Hybrid Anomaly-Aware Deep Reinforcement Learning-Based Resource Scaling in Clouds. 514-526 - Kristina Spirovska, Diego Didona, Willy Zwaenepoel:
Optimistic Causal Consistency for Geo-Replicated Key-Value Stores. 527-542 - Hamidreza Khaleghzadeh, Muhammad Fahad, Arsalan Shahid, Ravi Reddy Manumachu, Alexey L. Lastovetsky:
Bi-Objective Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Performance and Energy Through Workload Distribution. 543-560 - Xueqiao Liu, Guomin Yang, Willy Susilo, Joseph Tonien, Ximeng Liu, Jian Shen:
Privacy-Preserving Multi-Keyword Searchable Encryption for Distributed Systems. 561-574 - Naina Gupta, Arpan Jati, Amit Kumar Chauhan, Anupam Chattopadhyay:
PQC Acceleration Using GPUs: FrodoKEM, NewHope, and Kyber. 575-586 - Xueqin Liang, Zheng Yan, Robert H. Deng, Qinghua Zheng:
Investigating the Adoption of Hybrid Encrypted Cloud Data Deduplication With Game Theory. 587-600 - Soojeong Cho, Wonbae Kim, Sehyeon Oh, Changdae Kim, Kwangwon Koh, Beomseok Nam:
Failure-Atomic Byte-Addressable R-tree for Persistent Memory. 601-614 - Changyuan Lin, Hamzeh Khazaei:
Modeling and Optimization of Performance and Cost of Serverless Applications. 615-632 - Florian Glaser, Giuseppe Tagliavini, Davide Rossi, Germain Haugou, Qiuting Huang, Luca Benini:
Energy-Efficient Hardware-Accelerated Synchronization for Shared-L1-Memory Multiprocessor Clusters. 633-648 - Kaixin Huang, Sumin Li, Linpeng Huang, Kian-Lee Tan, Hong Mei:
Lewat: A Lightweight, Efficient, and Wear-Aware Transactional Persistent Memory System. 649-664 - Guo Chen, Baolei Cheng, Dajin Wang:
Constructing Completely Independent Spanning Trees in Data Center Network Based on Augmented Cube. 665-673 - Rupesh Raj Karn, Prabhakar Kudva, Hai Huang, Sahil Suneja, Ibrahim M. Elfadel:
Cryptomining Detection in Container Clouds Using System Calls and Explainable Machine Learning. 674-691 - Xiangqiang Gao, Rongke Liu, Aryan Kaushik:
Hierarchical Multi-Agent Optimization for Resource Allocation in Cloud Computing. 692-707 - Mingzhen Li, Yi Liu, Xiaoyan Liu, Qingxiao Sun, Xin You, Hailong Yang, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian:
The Deep Learning Compiler: A Comprehensive Survey. 708-727 - Gongming Zhao, Hongli Xu, Jingyuan Fan, Liusheng Huang, Chunming Qiao:
Achieving Fine-Grained Flow Management Through Hybrid Rule Placement in SDNs. 728-742
Volume 32, Number 4, April 2021
- Manish Parashar:
Editor's Note. 743-745 - Shufeng Gong, Yanfeng Zhang, Ge Yu:
Accelerating Large-Scale Prioritized Graph Computations by Hotness Balanced Partition. 746-759 - Gizem S. Çetin, Erkay Savas, Berk Sunar:
Homomorphic Sorting With Better Scalability. 760-771 - Tommaso Pozzetti, Ajay D. Kshemkalyani:
Resettable Encoded Vector Clock for Causality Analysis With an Application to Dynamic Race Detection. 772-785 - Dian Chen, Haobo Yuan, Shengshan Hu, Qian Wang, Cong Wang:
BOSSA: A Decentralized System for Proofs of Data Retrievability and Replication. 786-798 - Zichuan Xu, Liqian Zhao, Weifa Liang, Omer F. Rana, Pan Zhou, Qiufen Xia, Wenzheng Xu, Guowei Wu:
Energy-Aware Inference Offloading for DNN-Driven Applications in Mobile Edge Clouds. 799-814 - Lingzhi Ouyang, Yu Huang, Hengfeng Wei, Jian Lu:
Achieving Probabilistic Atomicity With Well-Bounded Staleness and Low Read Latency in Distributed Datastores. 815-829 - Ning Chen, Siyi Quan, Sheng Zhang, Zhuzhong Qian, Yibo Jin, Jie Wu, Wenzhong Li, Sanglu Lu:
Cuttlefish: Neural Configuration Adaptation for Video Analysis in Live Augmented Reality. 830-841 - Youfu Li, Matteo Interlandi, Fotis Psallidas, Wei Wang, Carlo Zaniolo:
SEIZE: Runtime Inspection for Parallel Dataflow Systems. 842-854 - Tianchen Ding, Shiyou Qian, Jian Cao, Guangtao Xue, Yanmin Zhu, Jiadi Yu, Minglu Li:
MO-Tree: An Efficient Forwarding Engine for Spatiotemporal-Aware Pub/Sub Systems. 855-866 - Rong Gu, Zhiqiang Zuo, Xi Jiang, Han Yin, Zhaokang Wang, Linzhang Wang, Xuandong Li, Yihua Huang:
Towards Efficient Large-Scale Interprocedural Program Static Analysis on Distributed Data-Parallel Computation. 867-883 - Minghua Shen, Guojie Luo, Nong Xiao:
Coarse-Grained Parallel Routing With Recursive Partitioning for FPGAs. 884-899 - Qihua Zhou, Kun Wang, Haodong Lu, Wenyao Xu, Yanfei Sun, Song Guo:
Canary: Decentralized Distributed Deep Learning Via Gradient Sketch and Partition in Multi-Interface Networks. 900-917 - Nannan Zhao, Vasily Tarasov, Hadeel Albahar, Ali Anwar, Lukas Rupprecht, Dimitrios Skourtis, Arnab Kumar Paul, Keren Chen, Ali Raza Butt:
Large-Scale Analysis of Docker Images and Performance Implications for Container Storage Systems. 918-930 - Tarik Reza Toha, A. S. M. Rizvi, Jannatun Noor, Muhammad Abdullah Adnan, A. B. M. Alim Al Islam:
Towards Greening MapReduce Clusters Considering Both Computation Energy and Cooling Energy. 931-942 - Maciej Besta, Jens Domke, Marcel Schneider, Marek Konieczny, Salvatore Di Girolamo, Timo Schneider, Ankit Singla, Torsten Hoefler:
High-Performance Routing With Multipathing and Path Diversity in Ethernet and HPC Networks. 943-959 - Sudheer Kumar Battula, Malgorzata M. O'Reilly, Saurabh Garg, James Montgomery:
A Generic Stochastic Model for Resource Availability in Fog Computing Environments. 960-974 - Xin Liu, Jun Sun, Lin Zheng, Su Wang, Yao Liu, Tongquan Wei:
Parallelization and Optimization of NSGA-II on Sunway TaihuLight System. 975-987
Volume 32, Number 5, May 2021
- Franco Cicirelli, Andrea Giordano, Carlo Mastroianni:
Analysis of Global and Local Synchronization in Parallel Computing. 988-1000 - Gökhan Göktürk, Kamer Kaya:
Boosting Parallel Influence-Maximization Kernels for Undirected Networks With Fusing and Vectorization. 1001-1013 - Johannes de Fine Licht, Maciej Besta, Simon Meierhans, Torsten Hoefler:
Transformations of High-Level Synthesis Codes for High-Performance Computing. 1014-1029 - Qihua Zhou, Song Guo, Zhihao Qu, Peng Li, Li Li, Minyi Guo, Kun Wang:
Petrel: Heterogeneity-Aware Distributed Deep Learning Via Hybrid Synchronization. 1030-1043 - Shashikant Ilager, Kotagiri Ramamohanarao, Rajkumar Buyya:
Thermal Prediction for Efficient Energy Management of Clouds Using Machine Learning. 1044-1056 - Hamza Djigal, Jun Feng, Jiamin Lu, Jidong Ge:
IPPTS: An Efficient Algorithm for Scientific Workflow Scheduling in Heterogeneous Computing Systems. 1057-1071 - Wanyu Lin, Helei Cui, Baochun Li, Cong Wang:
Privacy-Preserving Similarity Search With Efficient Updates in Distributed Key-Value Stores. 1072-1084 - Xiaoyu Qiu, Weikun Zhang, Wuhui Chen, Zibin Zheng:
Distributed and Collective Deep Reinforcement Learning for Computation Offloading: A Practical Perspective. 1085-1101 - Rodrigo Cataldo, Ramon Fernandes, Kevin J. M. Martin, Jarbas Silveira, Gustavo Sanchez, Johanna Sepúlveda, César A. M. Marcon, Jean-Philippe Diguet:
Subutai: Speeding Up Legacy Parallel Applications Through Data Synchronization. 1102-1116 - Myeonggyun Han, Jinsu Park, Woongki Baek:
Design and Implementation of a Criticality- and Heterogeneity-Aware Runtime System for Task-Parallel Applications. 1117-1132 - Yuvraj Sahni, Jiannong Cao, Lei Yang, Yusheng Ji:
Multi-Hop Multi-Task Partial Computation Offloading in Collaborative Edge Computing. 1133-1145 - Wenyu Li, Chenglin Feng, Lei Zhang, Hao Xu, Bin Cao, Muhammad Ali Imran:
A Scalable Multi-Layer PBFT Consensus for Blockchain. 1146-1160 - Bang Di, Jianhua Sun, Hao Chen, Dong Li:
Efficient Buffer Overflow Detection on GPU. 1161-1177 - Ana Gainaru, Brice Goglin, Valentin Honoré, Guillaume Pallez:
Profiles of Upcoming HPC Applications and Their Impact on Reservation Strategies. 1178-1190 - Priyanka Ghosh, Sriram Krishnamoorthy, Ananth Kalyanaraman:
PaKman: A Scalable Algorithm for Generating Genomic Contigs on Distributed Memory Machines. 1191-1209 - Bo Li, Qiang He, Feifei Chen, Hai Jin, Yang Xiang, Yun Yang:
Auditing Cache Data Integrity in the Edge Computing Environment. 1210-1223 - Teng Yu, Runxin Zhong, Vladimir Janjic, Pavlos Petoumenos, Jidong Zhai, Hugh Leather, John Thomson:
Collaborative Heterogeneity-Aware OS Scheduler for Asymmetric Multicore Processors. 1224-1237 - Jianzhen Luo, Jun Li, Lei Jiao, Jun Cai:
On the Effective Parallelization and Near-Optimal Deployment of Service Function Chains. 1238-1255 - Li Chen, Yuan Feng, Baochun Li, Bo Li:
A Case for Pricing Bandwidth: Sharing Datacenter Networks With Cost Dominant Fairness. 1256-1269
Volume 32, Number 6, June 2021
- Zhaolong Ning, Peiran Dong, Xiaojie Wang, Shupeng Wang, Xiping Hu, Song Guo, Tie Qiu, Bin Hu, Ricky Yu-Kwong Kwok:
Distributed and Dynamic Service Placement in Pervasive Edge Computing Networks. 1277-1292 - Pu Zhang, Huifeng Xue, Shan Gao, Jialong Zhang:
Distributed Adaptive Consensus Tracking Control for Multi-Agent System With Communication Constraints. 1293-1306 - Weihao Cui, Quan Chen, Han Zhao, Mengze Wei, Xiaoxin Tang, Minyi Guo:
E2bird: Enhanced Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services. 1307-1321 - Yang Wang, Xu Jiang, Nan Guan, Zhishan Guo, Xue Liu, Wang Yi:
Partitioning-Based Scheduling of OpenMP Task Systems With Tied Tasks. 1322-1339 - Maria Luisa Merani, Daniele Croce, Ilenia Tinnirello:
Rings for Privacy: An Architecture for Large Scale Privacy-Preserving Data Mining. 1340-1352 - Guoqi Xie, Kehua Yang, Haibo Luo, Renfa Li, Shiyan Hu:
Reliability and Confidentiality Co-Verification for Parallel Applications in Distributed Systems. 1353-1368 - Tianhui Meng, Yubin Zhao, Katinka Wolter, Cheng-Zhong Xu:
On Consortium Blockchain Consistency: A Queueing Network Model Approach. 1369-1382 - Niloofar Moradi, Alireza Shameli-Sendi, Alireza Khajouei:
A Scalable Stateful Approach for Virtual Security Functions Orchestration. 1383-1394 - Uthayanath Suthakar, Luca Magnoni, David Ryan Smith, Akram Khan:
Optimised Lambda Architecture for Monitoring Scientific Infrastructure. 1395-1408 - Engin Kayraklioglu, Erwan Favry, Tarek A. El-Ghazawi:
A Machine-Learning-Based Framework for Productive Locality Exploitation. 1409-1424 - Carlos Galindo, Naoki Nishida, Josep Silva, Salvador Tamarit:
Reversible CSP Computations. 1425-1436 - Hui Sun, Shangshang Dai, Jianzhong Huang, Xiao Qin:
Co-Active: A Workload-Aware Collaborative Cache Management Scheme for NVMe SSDs. 1437-1451 - Afshin Ahmadi, Felice Manganiello, Amin Khademi, Melissa C. Smith:
A Parallel Jacobi-Embedded Gauss-Seidel Method. 1452-1464 - Yen-Lung Chen, Bo-Yi Chang, Chia-Hsiang Yang, Tzi-Dar Chiueh:
A High-Throughput FPGA Accelerator for Short-Read Mapping of the Whole Human Genome. 1465-1478 - Aakash Khochare, Aravindhan Krishnan, Yogesh Simmhan:
A Scalable Platform for Distributed Object Tracking Across a Many-Camera Network. 1479-1493 - Long Cheng, Ying Wang, Qingzhi Liu, Dick H. J. Epema, Cheng Liu, Ying Mao, John Murphy:
Network-Aware Locality Scheduling for Distributed Data Operators in Data Centers. 1494-1510
Volume 32, Number 7, July 2021
- Pavan Balaji, Jidong Zhai, Min Si:
Guest Editorial. 1511-1512 - Muhammad Shayan, Clement Fung, Chris J. M. Yoon, Ivan Beschastnikh:
Biscotti: A Blockchain System for Private and Secure Federated Learning. 1513-1525 - Md. Palash Uddin, Yong Xiang, Xuequan Lu, John Yearwood, Longxiang Gao:
Mutual Information Driven Federated Learning. 1526-1538 - Wentai Wu, Ligang He, Weiwei Lin, Rui Mao:
Accelerating Federated Learning Over Reliability-Agnostic Clients in Mobile Edge Computing Systems. 1539-1551 - Tiansheng Huang, Weiwei Lin, Wentai Wu, Ligang He, Keqin Li, Albert Y. Zomaya:
An Efficiency-Boosting Client Selection Scheme for Federated Learning With Fairness Guarantee. 1552-1564 - Xueyu Wu, Xin Yao, Cho-Li Wang:
FedSCR: Structure-Based Communication Reduction for Federated Learning. 1565-1577 - Atakan Aral, Ivona Brandic:
Learning Spatiotemporal Failure Dependencies for Resilient Edge Computing Services. 1578-1590 - Rui Han, Shilin Li, Xiangwei Wang, Chi Harold Liu, Gaofeng Xin, Lydia Y. Chen:
Accelerating Gossip-Based Deep Learning in Heterogeneous Edge Computing Platforms. 1591-1602 - Chubo Liu, Fan Tang, Yikun Hu, Kenli Li, Zhuo Tang, Keqin Li:
Distributed Task Migration Optimization in MEC by Extending Multi-Agent Deep Reinforcement Learning Approach. 1603-1614 - Liangyi Gong, Hao Lin, Zhenhua Li, Feng Qian, Yang Li, Xiaobo Ma, Yunhao Liu:
Systematically Landing Machine Learning onto Market-Scale Mobile Malware Detection. 1615-1628 - Saiqin Long, Weifan Long, Zhetao Li, Kenli Li, Yuanqing Xia, Zhuo Tang:
A Game-Based Approach for Cost-Aware Task Assignment With QoS Constraint in Collaborative Edge and Cloud Environments. 1629-1640 - Yosuke Oyama, Naoya Maruyama, Nikoli Dryden, Erin McCarthy, Peter Harrington, Jan Balewski, Satoshi Matsuoka, Peter Nugent, Brian Van Essen:
The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs With Hybrid Parallelism. 1641-1652 - Chunpeng Ge, Zhe Liu, Liming Fang, Huading Ling, Aiping Zhang, Changchun Yin:
A Hybrid Fuzzy Convolutional Neural Network Based Mechanism for Photovoltaic Cell Defect Detection With Electroluminescence Images. 1653-1664 - Jiangsu Du, Xin Zhu, Minghua Shen, Yunfei Du, Yutong Lu, Nong Xiao, Xiangke Liao:
Model Parallelism Optimization for Distributed Inference Via Decoupled CNN Structure. 1665-1676 - Kai Zhao, Sheng Di, Sihuan Li, Xin Liang, Yujia Zhai, Jieyang Chen, Kaiming Ouyang, Franck Cappello, Zizhong Chen:
FT-CNN: Algorithm-Based Fault Tolerance for Convolutional Neural Networks. 1677-1689 - Xiaqing Li, Guangyan Zhang, Weimin Zheng:
SmartTuning: Selecting Hyper-Parameters of a ConvNet System for Fast Training and Small Working Memory. 1690-1701 - Daning Cheng, Shigang Li, Hanping Zhang, Fen Xia, Yunquan Zhang:
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms. 1702-1712 - Janaina Schwarzrock, Charles Cardoso De Oliveira, Marcus Ritt, Arthur Francisco Lorenzon, Antonio Carlos Schneider Beck:
A Runtime and Non-Intrusive Approach to Optimize EDP by Tuning Threads and CPU Frequency for OpenMP Applications. 1713-1724 - Shigang Li, Tal Ben-Nun, Giorgi Nadiradze, Salvatore Di Girolamo, Nikoli Dryden, Dan Alistarh, Torsten Hoefler:
Breaking (Global) Barriers in Parallel Stochastic Optimization With Wait-Avoiding Group Averaging. 1725-1739 - Chenyang Zhang, Feng Zhang, Xiaoguang Guo, Bingsheng He, Xiao Zhang, Xiaoyong Du:
iMLBench: A Machine Learning Benchmark Suite for CPU-GPU Integrated Architectures. 1740-1752 - Qing Ye, Yanan Sun, Jixin Zhang, Jiancheng Lv:
A Distributed Framework for EA-Based NAS. 1753-1764 - Cody Blakeney, Xiaomin Li, Yan Yan, Ziliang Zong:
Parallel Blockwise Knowledge Distillation for Deep Neural Network Compression. 1765-1776 - Yunlong Mao, Wenbo Hong, Heng Wang, Qun Li, Sheng Zhong:
Privacy-Preserving Computation Offloading for Parallel Deep Neural Networks Training. 1777-1788 - Meng Hao, Weizhe Zhang, Yiming Wang, Gangzhao Lu, Farui Wang, Athanasios V. Vasilakos:
Fine-Grained Powercap Allocation for Power-Constrained Systems Based on Multi-Objective Machine Learning. 1789-1801 - Yang Cheng, Dan Li, Zhiyuan Guo, Binyao Jiang, Jinkun Geng, Wei Bai, Jianping Wu, Yongqiang Xiong:
Accelerating End-to-End Deep Learning Workflow With Codesign of Data Preprocessing and Scheduling. 1802-1814 - Khu-rai Kim, Youngjae Kim, Sungyong Park:
A Probabilistic Machine Learning Approach to Scheduling Parallel Loops With Bayesian Optimization. 1815-1827 - Hao Li, Zixuan Li, Kenli Li, Jan S. Rellermeyer, Lydia Y. Chen, Keqin Li:
SGD_Tucker: A Novel Stochastic Optimization Strategy for Parallel Sparse Tucker Decomposition. 1828-1841 - Min Li, Yulong Ao, Chao Yang:
Adaptive SpMV/SpMSpV on GPUs for Input Vectors of Varied Sparsity. 1842-1853 - Lei Gong, Chao Wang, Xi Li, Xuehai Zhou:
Improving HW/SW Adaptability for Accelerating CNNs on FPGAs Through A Dynamic/Static Co-Reconfiguration Approach. 1854-1865 - Qin Li, Xiaofan Zhang, Jinjun Xiong, Wen-Mei Hwu, Deming Chen:
Efficient Methods for Mapping Neural Machine Translator on FPGAs. 1866-1877 - Ang Li, Simon Su:
Accelerating Binarized Neural Networks via Bit-Tensor-Cores in Turing GPUs. 1878-1891 - Dongxu Yang, Junhong Liu, Junjie Lai:
EDGES: An Efficient Distributed Graph Embedding System on GPU Clusters. 1892-1902
Volume 32, Number 8, August 2021
- Shaohuai Shi, Xiaowen Chu, Bo Li:
MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning. 1903-1917 - Shuiguang Deng, Cheng Zhang, Chang Li, Jianwei Yin, Schahram Dustdar, Albert Y. Zomaya:
Burst Load Evacuation Based on Dispatching and Scheduling In Distributed Edge Networks. 1918-1932 - Hassan A. Youness, Aly Omar, Mohammed Moness:
An Optimized Weighted Average Makespan in Fault-Tolerant Heterogeneous MPSoCs. 1933-1946 - Yanghua Peng, Yixin Bao, Yangrui Chen, Chuan Wu, Chen Meng, Wei Lin:
DL2: A Deep Learning-Driven Scheduler for Deep Learning Clusters. 1947-1960 - Muhammad Saad, Zhan Qin, Kui Ren, DaeHun Nyang, David Mohaisen:
e-PoS: Making Proof-of-Stake Decentralized and Fair. 1961-1973 - Nabil Abubaker, Seher Acer, Cevdet Aykanat:
True Load Balancing for Matricized Tensor Times Khatri-Rao Product. 1974-1986 - Biru Zhu, Youyou Jiang, Ming Gu, Yangdong Deng:
A GPU Acceleration Framework for Motif and Discord Based Pattern Mining. 1987-2004 - Thilina Buddhika, Sangmi Lee Pallickara, Shrideep Pallickara:
Pebbles: Leveraging Sketches for Processing Voluminous, High Velocity Data Streams. 2005-2020 - Wenxin Li, Duowen Liu, Kai Chen, Keqiu Li, Heng Qi:
Hone: Mitigating Stragglers in Distributed Stream Processing With Tuple Scheduling. 2021-2034 - Mochamad Asri, Dhairya Malhotra, Jiajun Wang, George Biros, Lizy K. John, Andreas Gerstlauer:
Hardware Accelerator Integration Tradeoffs for High-Performance Computing: A Case Study of GEMM Acceleration in N-Body Methods. 2035-2048 - Yanjiao Chen, Long Lin, Baochun Li, Qian Wang, Qian Zhang:
Silhouette: Efficient Cloud Configuration Exploration for Large-Scale Analytics. 2049-2061 - Tian Zhou, Lixin Gao, Xiaohong Guan:
A Fault-Tolerant Distributed Framework for Asynchronous Iterative Computations. 2062-2073 - Xidi Qu, Shengling Wang, Qin Hu, Xiuzhen Cheng:
Proof of Federated Learning: A Novel Energy-Recycling Consensus Algorithm. 2074-2085 - Jiawei Zhang, Xiaochen Zhou, Tianyi Ge, Xudong Wang, Taewon Hwang:
Joint Task Scheduling and Containerizing for Efficient Edge Computing. 2086-2100 - Amit Gill, Lalith Maddegedara, Sebastian Poledna, Muneo Hori, Kohei Fujita, Tsuyoshi Ichimura:
High-Performance Computing Implementations of Agent-Based Economic Models for Realizing 1:1 Scale Simulations of Large Economies. 2101-2114 - Mutaz Barika, Saurabh Garg, Albert Y. Zomaya, Rajiv Ranjan:
Online Scheduling Technique To Handle Data Velocity Changes in Stream Workflows. 2115-2130 - Yu Liu, Xiaojun Shang, Yuanyuan Yang:
Joint SFC Deployment and Resource Management in Heterogeneous Edge for Latency Minimization. 2131-2143
Volume 32, Number 9, September 2021
- Shaoqi Wang, Aidi Pi, Xiaobo Zhou, Jun Wang, Cheng-Zhong Xu:
Overlapping Communication With Computation in Parameter Server for Scalable DL Training. 2144-2159 - Ricardo Nobre, Aleksandar Ilic, Sergio Santander-Jiménez, Leonel Sousa:
Retargeting Tensor Accelerators for Epistasis Detection. 2160-2174 - Shuai Zhang, Sheng Zhang, Zhuzhong Qian, Jie Wu, Yibo Jin, Sanglu Lu:
DeepSlicing: Collaborative and Adaptive CNN Inference With Low Latency. 2175-2187 - Zhiyao Hu, Dongsheng Li, Dongxiang Zhang, Yiming Zhang, Baoyun Peng:
Optimizing Resource Allocation for Data-Parallel Jobs Via GCN-Based Prediction. 2188-2201 - Heng Yu, Zhilong Zheng, Junxian Shen, Congcong Miao, Chen Sun, Hongxin Hu, Jun Bi, Jianping Wu, Jilong Wang:
Octans: Optimal Placement of Service Function Chains in Many-Core Systems. 2202-2215 - Masudul Hassan Quraishi, Erfan Bank Tavakoli, Fengbo Ren:
A Survey of System Architectures and Techniques for FPGA Virtualization. 2216-2230 - Rui Han, Dong Li, Junyan Ouyang, Chi Harold Liu, Guoren Wang, Dapeng Wu, Lydia Y. Chen:
Accurate Differentially Private Deep Learning on the Edge. 2231-2247 - Yuichi Nakatani:
Structured Allocation-Based Consistent Hashing With Improved Balancing for Cloud Infrastructure. 2248-2261 - Feng Zhang, Zheng Chen, Chenyang Zhang, Amelie Chi Zhou, Jidong Zhai, Xiaoyong Du:
An Efficient Parallel Secure Machine Learning Framework on GPUs. 2262-2276 - David Kozhaya, Jérémie Decouchant, Vincent Rahli, Paulo Esteves Veríssimo:
PISTIS: An Event-Triggered Real-Time Byzantine-Resilient Protocol Suite. 2277-2290 - Jing Chen, Jianbin Fang, Weifeng Liu, Canqun Yang:
BALS: Blocked Alternating Least Squares for Parallel Sparse Matrix Factorization on GPUs. 2291-2302 - Feng Zhang, Chenyang Zhang, Lin Yang, Shuhao Zhang, Bingsheng He, Wei Lu, Xiaoyong Du:
Fine-Grained Multi-Query Stream Processing on Integrated Architectures. 2303-2320 - Feng Zhang, Jiya Su, Weifeng Liu, Bingsheng He, Ruofan Wu, Xiaoyong Du, Rujia Wang:
YuenyeungSpTRSV: A Thread-Level and Warp-Level Fusion Synchronization-Free Sparse Triangular Solve. 2321-2337 - Kristof Jannes, Bert Lagaisse, Wouter Joosen:
OWebSync: Seamless Synchronization of Distributed Web Clients. 2338-2351 - Yi-Wen Wei, Wei-Mei Chen, Hsin-Hung Tsai:
Accelerating the Bron-Kerbosch Algorithm for Maximal Clique Enumeration Using GPUs. 2352-2366 - Guangming Tan, Chaoyang Shui, Yinshan Wang, Xianzhi Yu, Yujin Yan:
Optimizing the LINPACK Algorithm for Large-Scale PCIe-Based CPU-GPU Heterogeneous Systems. 2367-2380
Volume 32, Number 10, October 2021
- Manish Parashar:
Editor's Note. 2381-2385 - Soheil Shahrouz, Saber Salehkaleybar, Matin Hashemi:
gIM: GPU Accelerated RIS-Based Influence Maximization Algorithm. 2386-2399 - Gregory J. Herschlag, Seyong Lee, Jeffrey S. Vetter, Amanda Randles:
Analysis of GPU Data Access Patterns on Complex Geometries for the D3Q19 Lattice Boltzmann Algorithm. 2400-2414 - Saba Ahmadian, Reza Salkhordeh, Onur Mutlu, Hossein Asadi:
ETICA: Efficient Two-Level I/O Caching Architecture for Virtualized Platforms. 2415-2433 - Daniel Hernández Juárez, Antonio Espinosa, David Vázquez, Antonio M. López, Juan C. Moure:
3D Perception With Slanted Stixels on GPU. 2434-2447 - Shi Dong, Yifan Sun, Nicolas Bohm Agostini, Elmira Karimi, Daniel Lowell, Jing Zhou, José Cano, José L. Abellán, David R. Kaeli:
Spartan: A Sparsity-Adaptive Framework to Accelerate Deep Neural Network Training on GPUs. 2448-2463 - Rong Ge, Xizhou Feng, Tyler N. Allen, Pengfei Zou:
The Case for Cross-Component Power Coordination on Power Bounded Systems. 2464-2476 - He Li, Hang Yuan, Jianbin Huang, Jiangtao Cui, Xiaoke Ma, Senzhang Wang, Jaesoo Yoo, Philip S. Yu:
Group Reassignment for Dynamic Edge Partitioning. 2477-2490 - Hoa Tran-Dang, Dong-Seong Kim:
FRATO: Fog Resource Based Adaptive Task Offloading for Delay-Minimizing IoT Service Provisioning. 2491-2508 - Dandan Song, Feng Zhang, Meiyan Lu, Sicheng Yang, Heyan Huang:
DTransE: Distributed Translating Embedding for Knowledge Graph. 2509-2523 - Lingchen Zhao, Qian Wang, Cong Wang, Qi Li, Chao Shen, Bo Feng:
VeriML: Enabling Integrity Assurances and Fair Payments for Machine Learning as a Service. 2524-2540 - Youhui Bai, Cheng Li, Zhiqi Lin, Yufei Wu, Youshan Miao, Yunxin Liu, Yinlong Xu:
Efficient Data Loader for Fast Sampling-Based GNN Training on Large Graphs. 2541-2556 - Stijn Schildermans, Jianchen Shan, Kris Aerts, Jason Jackrel, Xiaoning Ding:
Virtualization Overhead of Multithreading in X86 State-of-the-Art & Remaining Challenges. 2557-2570 - Wei Rang, Donglin Yang, Dazhao Cheng, Yu Wang:
Data Life Aware Model Updating Strategy for Stream-Based Online Deep Learning. 2571-2581 - Yahya Hassanzadeh-Nazarabadi, Alptekin Küpçü, Öznur Özkasap:
LightChain: Scalable DHT-Based Blockchain. 2582-2593 - Huaqing Li, Jinhui Hu, Liang Ran, Zheng Wang, Qingguo Lü, Zhenyuan Du, Tingwen Huang:
Decentralized Dual Proximal Gradient Algorithms for Non-Smooth Constrained Composite Optimization Problems. 2594-2605
Volume 32, Number 11, November 2021
- Manish Parashar:
Guest Editorial: Special Section on SC19 Student Cluster Competition. 2606 - Beth Plale, Stephen Lien Harrell:
Transparency and Reproducibility Practice in Large-Scale Computational Science: A Preface to the Special Section. 2607-2608 - Jia Shi, Ruipeng Li, Yuanzhe Xi, Yousef Saad, Maarten V. de Hoop:
Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility. 2609-2622 - Wei-Fang Sun, Hung-Hsin Chen, ShaoFu Lin, YuanChing Lin, Jing-Wei Wu, En-Te Lin, Jerry Chou:
Critique of "Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility" by SCC Team From National Tsing Hua University. 2623-2626 - Manuel Burger, Jan Kleine:
Critique of "Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility" by SCC Team From ETH Zurich. 2627-2630 - Chen Zhang, Chenggang Zhao, Jiaao He, Shengqi Chen, Liyan Zheng, Kezhao Huang, Wentao Han, Jidong Zhai:
Critique of "Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility" by SCC Team From Tsinghua University. 2631-2634 - Marek Masiak, Iwona Kotlarska, Lukasz Kondraciuk, Maciej Szpindler:
Critique of "Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility" by SCC Team From University of Warsaw. 2635-2638 - David Liu, Matthew Cinnamon, Thorne Garvin, Andrei Karavanov, Sungchan Park, Darius Strobeck, Andrew Lumsdaine:
Critique of "Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility" by SCC Team From University of Washington. 2639-2642 - Yihua Cheng, Zejia Fan, Jing Mai, Yifan Wu, Pengcheng Xu, Yuxuan Yan, Zhenxin Fu, Yun Liang:
Critique of "Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility" by SCC Team From Peking University. 2643-2645 - Santosh Pandey, Zhibin Wang, Sheng Zhong, Chen Tian, Bolong Zheng, Xiaoye S. Li, Lingda Li, Adolfy Hoisie, Caiwen Ding, Dong Li, Hang Liu:
Trust: Triangle Counting Reloaded on GPUs. 2646-2660 - Li Shi, Yang Liu, Junwei Zhang, Thomas G. Robertazzi:
Coflow Scheduling in Data Centers: Routing and Bandwidth Allocation. 2661-2675 - Qi Li, Yunpeng Liu, Zhuotao Liu, Peng Zhang, Chunhui Pang:
Efficient Forwarding Anomaly Detection in Software-Defined Networks. 2676-2690 - Yunjian Zhao, Zhi Liu, Yidi Wu, Guanxian Jiang, James Cheng, Kunlong Liu, Xiao Yan:
Timestamped State Sharing for Stream Analytics. 2691-2704 - Lailong Luo, Deke Guo, Yawei Zhao, Ori Rottenstreich, Richard T. B. Ma, Xueshan Luo:
MCFsyn: A Multi-Party Set Reconciliation Protocol With the Marked Cuckoo Filter. 2705-2718 - Muhammad M. Rafique, Zhichun Zhu:
Memory-Side Prefetching Scheme Incorporating Dynamic Page Mode in 3D-Stacked DRAM. 2734-2747 - Gabriele Mencagli, Massimo Torquati, Andrea Cardaci, Alessandra Fais, Luca Rinaldi, Marco Danelutto:
WindFlow: High-Speed Continuous Stream Processing With Parallel Building Blocks. 2748-2763 - Mahdi Boroujeni, Mohammad Ghodsi, Saeed Seddighin:
Improved MPC Algorithms for Edit Distance and Ulam Distance. 2764-2776 - Gongming Zhao, Hongli Xu, Yangming Zhao, Chunming Qiao, Liusheng Huang:
Offloading Tasks With Dependency and Service Caching in Mobile Edge Computing. 2777-2792 - Weibei Fan, Fu Xiao, Xiaobai Chen, Lei Cui, Shui Yu:
Efficient Virtual Network Embedding of Cloud-Based Data Center Networks into Optical Networks. 2793-2808 - Najeeb Ahmad, Buse Yilmaz, Didem Unat:
A Split Execution Model for SpTRSV. 2809-2822 - Xuedong Zhang, Zhuo Tang, Lifan Du, Li Yang:
An Incremental Iterative Acceleration Architecture in Distributed Heterogeneous Environments With GPUs for Deep Learning. 2823-2837 - Shuang Wang, Xiaoping Li, Quan Z. Sheng, Rubén Ruiz, Jinquan Zhang, Amin Beheshti:
Multi-Queue Request Scheduling for Profit Maximization in IaaS Clouds. 2838-2851 - Mary Lai O. Salvaña, Sameh Abdulah, Huang Huang, Hatem Ltaief, Ying Sun, Marc G. Genton, David E. Keyes:
High Performance Multivariate Geospatial Statistics on Manycore Systems. 2719-2733
Volume 32, Number 12, December 2021
- Zheng Dong, Kecheng Yang, Nathan Fisher, Cong Liu:
Tardiness Bounds for Sporadic Gang Tasks Under Preemptive Global EDF Scheduling. 2867-2879 - Cheng Tan, Chenhao Xie, Tong Geng, Andres Marquez, Antonino Tumeo, Kevin J. Barker, Ang Li:
ARENA: Asynchronous Reconfigurable Accelerator Ring to Enable Data-Centric Parallel Computing. 2880-2892 - Ashutosh Kumar Singh, Deepika Saxena, Jitendra Kumar, Vrinda Gupta:
A Quantum Approach Towards the Adaptive Prediction of Cloud Workloads. 2893-2905 - Hiren Kumar Thakkar, Prasan Kumar Sahoo, Bharadwaj Veeravalli:
RENDA: Resource and Network Aware Data Placement Algorithm for Periodic Workloads in Cloud. 2906-2920 - Yijin Guo, Huasong Shan, Shixin Huang, Kai Hwang, Jianping Fan, Zhibin Yu:
GML: Efficiently Auto-Tuning Flink's Configurations Via Guided Machine Learning. 2921-2935 - Dylan Chapp, Nigel Tan, Sanjukta Bhowmick, Michela Taufer:
Identifying Degree and Sources of Non-Determinism in MPI Applications Via Graph Kernels. 2936-2952 - Zhaokang Wang, Weiwei Hu, Guowang Chen, Chunfeng Yuan, Rong Gu, Yihua Huang:
Towards Efficient Distributed Subgraph Enumeration Via Backtracking-Based Framework. 2953-2969 - Lukasz Szustak, Roman Wyrzykowski, Lukasz Kuczynski, Tomasz Olas:
Architectural Adaptation and Performance-Energy Optimization for CFD Application on AMD EPYC Rome. 2852-2866 - Andrea Formisano, Raffaella Gentilini, Flavio Vella:
Scalable Energy Games Solvers on GPUs. 2970-2982 - Pablo Prieto, Pablo Abad Fidalgo, José-Ángel Gregorio, Valentin Puente:
Fast, Accurate Processor Evaluation Through Heterogeneous, Sample-Based Benchmarking. 2983-2995 - Tao Jiang, Wenjuan Meng, Xu Yuan, Liangmin Wang, Jianhua Ge, Jianfeng Ma:
ReliableBox: Secure and Verifiable Cloud Storage With Location-Aware Backup. 2996-3010 - Yuichi Sudo, Masahiro Shibata, Junya Nakamura, Yonghwan Kim, Toshimitsu Masuzawa:
Self-Stabilizing Population Protocols With Global Knowledge. 3011-3023 - Guanghui Zhang, Jack Y. B. Lee, Ke Liu, Haibo Hu, Vaneet Aggarwal:
A Unified Framework for Flexible Playback Latency Control in Live Video Streaming. 3024-3037 - Rohit Zambre, Damodar Sahasrabudhe, Hui Zhou, Martin Berzins, Aparna Chandramowlishwaran, Pavan Balaji:
Logically Parallel Communication for Fast MPI+Threads Applications. 3038-3052 - Marco Antonio C. de Figueiredo, João Paulo Navarro, Edans Flavius de Oliveira Sandes, George Teodoro, Alba C. M. A. Melo:
Parallel Fine-Grained Comparison of Long DNA Sequences in Homogeneous and Heterogeneous GPU Platforms With Pruning. 3053-3065 - Longlong Chen, Jianfeng Zhu, Yangdong Deng, Zhaoshi Li, Jian Chen, Xiaowei Jiang, Shouyi Yin, Shaojun Wei, Leibo Liu:
An Elastic Task Scheduling Scheme on Coarse-Grained Reconfigurable Architectures. 3066-3080 - Pradeep Ambati, Noman Bashir, David E. Irwin, Prashant J. Shenoy:
Modeling and Analyzing Waiting Policies for Cloud-Enabled Schedulers. 3081-3100