ACM Transactions on Multimedia Computing, Communications, and Applications, Volume 19
Volume 19, Number 1, January 2023
- Xuan Shao, Ying Shen, Lin Zhang, Shengjie Zhao, Dandan Zhu, Yicong Zhou:
SLAM for Indoor Parking: A Comprehensive Benchmark Dataset and a Tightly Coupled Semantic Framework. 1:1-1:23
- Prasen Kumar Sharma, Ira Bisht, Arijit Sur:
Wavelength-based Attributed Deep Neural Network for Underwater Image Restoration. 2:1-2:23
- Jie Li, Ling Han, Chong Zhang, Qiyue Li, Zhi Liu:
Spherical Convolution Empowered Viewport Prediction in 360 Video Multicast with Limited FoV Feedback. 3:1-3:23
- Thi Ngoc Hanh Le, Chih-Kuo Yeh, Ying-Chi Lin, Tong-Yee Lee:
Animating Still Natural Images Using Warping. 4:1-4:24
- Lizhi Xiong, Xiao Han, Ching-Nung Yang, Zhihua Xia:
RDH-DES: Reversible Data Hiding over Distributed Encrypted-Image Servers Based on Secret Sharing. 5:1-5:19
- Peining Zhen, Shuqi Wang, Suming Zhang, Xiaotao Yan, Wei Wang, Zhigang Ji, Hai-Bao Chen:
Towards Accurate Oriented Object Detection in Aerial Images with Adaptive Multi-level Feature Fusion. 6:1-6:22
- Yue Song, Hao Tang, Nicu Sebe, Wei Wang:
Disentangle Saliency Detection into Cascaded Detail Modeling and Body Filling. 7:1-7:15
- Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang Wen Chen:
Boosting Scene Graph Generation with Visual Relation Saliency. 8:1-8:17
- Jingwen Chen, Jianjie Luo, Yingwei Pan, Yehao Li, Ting Yao, Hongyang Chao, Tao Mei:
Boosting Vision-and-Language Navigation with Direction Guiding and Backtracing. 9:1-9:16
- Yunbo Rao, Ziqiang Yang, Shaoning Zeng, Qifeng Wang, Jiansu Pu:
Dual Projective Zero-Shot Learning Using Text Descriptions. 10:1-10:17
- Hang Yu, Chilam Cheang, Yanwei Fu, Xiangyang Xue:
Multi-view Shape Generation for a 3D Human-like Body. 11:1-11:22
- Weidong Chen, Guorong Li, Xinfeng Zhang, Shuhui Wang, Liang Li, Qingming Huang:
Weakly Supervised Text-based Actor-Action Video Segmentation by Clip-level Multi-instance Learning. 12:1-12:22
- Feihong Shen, Jun Liu:
Quantum Fourier Convolutional Network. 13:1-13:14
- Xiaotian Wu, Peng Yao:
Boolean-based Two-in-One Secret Image Sharing by Adaptive Pixel Grouping. 14:1-14:23
- Ashima Yadav, Dinesh Kumar Vishwakarma:
A Deep Multi-level Attentive Network for Multimodal Sentiment Analysis. 15:1-15:19
- Honghao Gao, Baobin Dai, Huaikou Miao, Xiaoxian Yang, Ramón J. Durán Barroso, Walayat Hussain:
A Novel GAPG Approach to Automatic Property Generation for Formal Verification: The GAN Perspective. 16:1-16:22
- Pengyi Zhang, Huanzhang Dou, Wenhu Zhang, Yuhan Zhao, Zequn Qin, Dongping Hu, Yi Fang, Xi Li:
A Large-Scale Synthetic Gait Dataset Towards in-the-Wild Simulation and Comparison Study. 17:1-17:23
- Wei Zhou, Zhiwu Xia, Peng Dou, Tao Su, Haifeng Hu:
Double Attention Based on Graph Attention Network for Image Multi-Label Classification. 18:1-18:23
- Xianlin Zhang, Mengling Shen, Xueming Li, Xiaojie Wang:
AABLSTM: A Novel Multi-task Based CNN-RNN Deep Model for Fashion Analysis. 19:1-19:18
- Deyin Liu, Lin Wu, Richang Hong, Zongyuan Ge, Jialie Shen, Farid Boussaïd, Mohammed Bennamoun:
Generative Metric Learning for Adversarially Robust Open-world Person Re-Identification. 20:1-20:19
- Shuo Wang, Huixia Ben, Yanbin Hao, Xiangnan He, Meng Wang:
Boosting Hyperspectral Image Classification with Dual Hierarchical Learning. 21:1-21:19
- Dayan Wu, Qi Dai, Bo Li, Weiping Wang:
Deep Uncoupled Discrete Hashing via Similarity Matrix Decomposition. 22:1-22:22
- Ming Cheung, Weiwei Sun, James She, Jiantao Zhou:
Social Network Analytic-Based Online Counterfeit Seller Detection using User Shared Images. 23:1-23:18
- Feihong Lu, Hang Chen, Kang Li, Qiliang Deng, Jian Zhao, Kaipeng Zhang, Hong Han:
Toward High-quality Face-Mask Occluded Restoration. 24:1-24:23
- Yajing Liu, Zhiwei Xiong, Ya Li, Yuning Lu, Xinmei Tian, Zheng-Jun Zha:
Category-Stitch Learning for Union Domain Generalization. 25:1-25:19
Volume 19, Number 1s, February 2023
- Claudio Ferrari, Federico Becattini, Leonardo Galteri, Alberto Del Bimbo:
(Compress and Restore)N: A Robust Defense Against Adversarial Attacks on Image Classification. 26:1-26:16
- Yaguang Song, Xiaoshan Yang, Changsheng Xu:
Self-supervised Calorie-aware Heterogeneous Graph Networks for Food Recommendation. 27:1-27:23
- Feng Xue, Tian Yang, Kang Liu, Zikun Hong, Mingwei Cao, Dan Guo, Richang Hong:
LCSNet: End-to-end Lipreading with Channel-aware Feature Selection. 28:1-28:21
- Zilong Fu, Hongtao Xie, Shancheng Fang, Yuxin Wang, Mengting Xing, Yongdong Zhang:
Learning Pixel Affinity Pyramid for Arbitrary-Shaped Text Detection. 29:1-29:24
- João Baptista Cardia Neto, Claudio Ferrari, Aparecido Nilceu Marana, Stefano Berretti, Alberto Del Bimbo:
Learning Streamed Attention Network from Descriptor Images for Cross-Resolution 3D Face Recognition. 30:1-30:20
- Xin Huang:
On Teaching Mode of MTI Translation Workshop Based on IPT Corpus for Tibetan Areas of China. 31:1-31:16
- Liming Xu, Xianhua Zeng, Weisheng Li, Bochuan Zheng:
MFGAN: Multi-modal Feature-fusion for CT Metal Artifact Reduction Using GANs. 32:1-32:17
- Yuzhang Hu, Wenhan Yang, Jiaying Liu, Zongming Guo:
Deep Inter Prediction with Error-Corrected Auto-Regressive Network for Video Coding. 33:1-33:22
- Yue Li, Li Zhang, Kai Zhang:
iDAM: Iteratively Trained Deep In-loop Filter with Adaptive Model Selection. 34:1-34:22
- Rahul Kumar Jaiswal, Rajesh Kumar Dubey:
CAQoE: A Novel No-Reference Context-aware Speech Quality Prediction Metric. 35:1-35:23
- Tao Xiang, Honghong Zeng, Biwen Chen, Shangwei Guo:
BMIF: Privacy-preserving Blockchain-based Medical Image Fusion. 36:1-36:23
- Xiaoke Zhu, Changlong Li, Xiaopan Chen, Xinyu Zhang, Xiao-Yuan Jing:
Distance and Direction Based Deep Discriminant Metric Learning for Kinship Verification. 37:1-37:19
- Weiming Zhuang, Xin Gan, Yonggang Wen, Shuai Zhang:
Optimizing Performance of Federated Person Re-identification: Benchmarking and Analysis. 38:1-38:18
- Lavinia De Divitiis, Federico Becattini, Claudio Baecchi, Alberto Del Bimbo:
Disentangling Features for Fashion Recommendation. 39:1-39:21
- Ka-Hou Chan, Sio Kei Im:
Using Four Hypothesis Probability Estimators for CABAC in Versatile Video Coding. 40:1-40:17
- Mengqi Yuan, Bing-Kun Bao, Zhiyi Tan, Changsheng Xu:
Adaptive Text Denoising Network for Image Caption Editing. 41:1-41:18
- Xiaoyu Zhang, Wei Gao, Ge Li, Qiuping Jiang, Runmin Cong:
Image Quality Assessment-driven Reinforcement Learning for Mixed Distorted Image Restoration. 42:1-42:23
- Chongyang Bai, Maksim Bolonkin, Viney Regunath, V. S. Subrahmanian:
DIPS: A Dyadic Impression Prediction System for Group Interaction Videos. 43:1-43:24
- Yuqing Liu, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao:
Sequential Hierarchical Learning with Distribution Transformation for Image Super-Resolution. 44:1-44:21
- Haidong Wang, Xuan He, Zhiyong Li, Jin Yuan, Shutao Li:
JDAN: Joint Detection and Association Network for Real-Time Online Multi-Object Tracking. 45:1-45:17
- Mengyao Xiao, Xiaolong Li, Yao Zhao, Bin Ma, Guodong Guo:
A Novel Reversible Data Hiding Scheme Based on Pixel-Residual Histogram. 46:1-46:19
- Jiazhi Liu, Feng Liu:
Modified 2D-Ghost-Free Stereoscopic Display with Depth-of-Field Effects. 47:1-47:16
- Jingwen Chen, Yingwei Pan, Yehao Li, Ting Yao, Hongyang Chao, Tao Mei:
Retrieval Augmented Convolutional Encoder-decoder Networks for Video Captioning. 48:1-48:24
- Guanyu Zhu, Yong Zhou, Rui Yao, Hancheng Zhu, Jiaqi Zhao:
Cyclic Self-attention for Point Cloud Recognition. 49:1-49:19
- Dinghao Yang, Wei Gao, Ge Li, Hui Yuan, Junhui Hou, Sam Kwong:
Exploiting Manifold Feature Representation for Efficient Classification of 3D Point Clouds. 50:1-50:21
Volume 19, Number 2, March 2023
- Xiaohan Lan, Yitian Yuan, Xin Wang, Zhi Wang, Wenwu Zhu:
A Survey on Temporal Sentence Grounding in Videos. 51:1-51:33
- Yu Qiao, Yuhao Liu, Ziqi Wei, Yuxin Wang, Qiang Cai, Guofeng Zhang, Xin Yang:
Hierarchical and Progressive Image Matting. 52:1-52:23
- Fei Peng, Wenyan Jiang, Min Long:
A Low Distortion and Steganalysis-resistant Reversible Data Hiding for 2D Engineering Graphics. 53:1-53:20
- Sijie Mai, Songlong Xing, Jiaxuan He, Ying Zeng, Haifeng Hu:
Multimodal Graph for Unaligned Multimodal Sequence Analysis via Graph Convolution and Graph Pooling. 54:1-54:24
- Qi Zheng, Jianfeng Dong, Xiaoye Qu, Xun Yang, Yabing Wang, Pan Zhou, Baolong Liu, Xun Wang:
Progressive Localization Networks for Language-Based Moment Localization. 55:1-55:21
- Yue Zhang, Fanghui Zhang, Yi Jin, Yigang Cen, Viacheslav V. Voronin, Shaohua Wan:
Local Correlation Ensemble with GCN Based on Attention Features for Cross-domain Person Re-ID. 56:1-56:22
- Jacob Chakareski, Mahmudur Khan, Tanguy Ropitault, Steve Blandino:
Millimeter Wave and Free-space-optics for Future Dual-connectivity 6DOF Mobile Multi-user VR Streaming. 57:1-57:25
- Yun-Shao Lin, Yi-Ching Liu, Chi-Chun Lee:
An Interaction-process-guided Framework for Small-group Performance Prediction. 58:1-58:25
- Na Zheng, Xuemeng Song, Tianyu Su, Weifeng Liu, Yan Yan, Liqiang Nie:
Egocentric Early Action Prediction via Adversarial Knowledge Distillation. 59:1-59:21
- Li Wang, Ke Li, Jingjing Tang, Yuying Liang:
Image Super-Resolution via Lightweight Attention-Directed Feature Aggregation Network. 60:1-60:23
- Jiaying Lin, Xin Tan, Ke Xu, Lizhuang Ma, Rynson W. H. Lau:
Frequency-aware Camouflaged Object Detection. 61:1-61:16
- Shuang Liang, Anjie Zhu, Jiasheng Zhang, Jie Shao:
Hyper-node Relational Graph Attention Network for Multi-modal Knowledge Graph Completion. 62:1-62:21
- Yaya Shi, Haiyang Xu, Chunfeng Yuan, Bing Li, Weiming Hu, Zheng-Jun Zha:
Learning Video-Text Aligned Representations for Video Captioning. 63:1-63:21
- Yang Yang, Yingqiu Ding, Ming Cheng, Weiming Zhang:
No-reference Quality Assessment for Contrast-distorted Images Based on Gray and Color-gray-difference Space. 64:1-64:20
- Jia Wang, Jingcheng Ke, Hong-Han Shuai, Yung-Hui Li, Wen-Huang Cheng:
Referring Expression Comprehension Via Enhanced Cross-modal Graph Attention Networks. 65:1-65:21
- Dengyong Zhang, Pu Huang, Xiangling Ding, Feng Li, Wenjie Zhu, Yun Song, Gaobo Yang:
L2BEC2: Local Lightweight Bidirectional Encoding and Channel Attention Cascade for Video Frame Interpolation. 66:1-66:19
- Yushu Zhang, Qing Tan, Shuren Qi, Mingfu Xue:
PRNU-based Image Forgery Localization with Deep Multi-scale Fusion. 67:1-67:20
- Shanshan Dong, Tian-Zi Niu, Xin Luo, Wu Liu, Xinshun Xu:
Semantic Embedding Guided Attention with Explicit Visual Feature Fusion for Video Captioning. 68:1-68:18
- Shunxin Xu, Ke Sun, Dong Liu, Zhiwei Xiong, Zheng-Jun Zha:
Synergy between Semantic Segmentation and Image Denoising via Alternate Boosting. 69:1-69:23
- Dan Song, Chu-Meng Zhang, Xiao-Qian Zhao, Teng Wang, Wei-Zhi Nie, Xuanya Li, An-An Liu:
Self-supervised Image-based 3D Model Retrieval. 70:1-70:18
- Stavros Nousias, Gerasimos Arvanitis, Aris S. Lalos, Konstantinos Moustakas:
Deep Saliency Mapping for 3D Meshes and Applications. 71:1-71:22
- Yun Liu, Xiaohua Yin, Zuliang Wan, Guanghui Yue, Zhi Zheng:
Toward A No-reference Omnidirectional Image Quality Evaluation by Using Multi-perceptual Features. 72:1-72:19
- Hua Wu, Xin Li, Gang Wang, Guang Cheng, Xiaoyan Hu:
Resolution Identification of Encrypted Video Streaming Based on HTTP/2 Features. 73:1-73:23
- Qipu Qin, Cheolkon Jung:
Quality Enhancement of Compressed 360-Degree Videos Using Viewport-based Deep Neural Networks. 74:1-74:19
- Wei Zhou, Zhiwu Xia, Peng Dou, Tao Su, Haifeng Hu:
Aligning Image Semantics and Label Concepts for Image Multi-Label Classification. 75:1-75:23
Volume 19, Number 3, May 2023
- Yi Zhang, Fang-Yi Chao, Wassim Hamidouche, Olivier Déforges:
PAV-SOD: A New Task towards Panoramic Audiovisual Saliency Detection. 101:1-101:26
- Chi Xie, Zikun Zhuang, Shengjie Zhao, Shuang Liang:
Temporal Dropout for Weakly Supervised Action Localization. 102:1-102:24
- Yangyang Guo, Liqiang Nie, Harry Cheng, Zhiyong Cheng, Mohan S. Kankanhalli, Alberto Del Bimbo:
On Modality Bias Recognition and Reduction. 103:1-103:22
- Kang Xu, Weixin Li, Xia Wang, Xiaoyan Hu, Ke Yan, Xiaojie Wang, Xuan Dong:
CUR Transformer: A Convolutional Unbiased Regional Transformer for Image Denoising. 104:1-104:22
- Wenxin Huang, Xuemei Jia, Xian Zhong, Xiao Wang, Kui Jiang, Zheng Wang:
Beyond the Parts: Learning Coarse-to-Fine Adaptive Alignment Representation for Person Search. 105:1-105:19
- Hongchuan Yu, Mengqing Huang, Jian-Jun Zhang:
Domain Adaptation Problem in Sketch Based Image Retrieval. 106:1-106:17
- Han Yan, Haijun Zhang, Jianyang Shi, Jianghong Ma, Xiaofei Xu:
Toward Intelligent Fashion Design: A Texture and Shape Disentangled Generative Adversarial Network. 107:1-107:23
- Peng Dou, Ying Zeng, Zhuoqun Wang, Haifeng Hu:
Multiple Temporal Pooling Mechanisms for Weakly Supervised Temporal Action Localization. 108:1-108:19
- Lei Li, Zhiyuan Zhou, Suping Wu, Yongrong Cao:
Multi-scale Edge-guided Learning for 3D Reconstruction. 109:1-109:24
- Zhengxue Wang, Guangwei Gao, Juncheng Li, Hui Yan, Hao Zheng, Huimin Lu:
Lightweight Feature De-redundancy and Self-calibration Network for Efficient Image Super-resolution. 110:1-110:15
- Zhijie Huang, Jun Sun, Xiaopeng Guo:
FastCNN: Towards Fast and Accurate Spatiotemporal Network for HEVC Compressed Video Enhancement. 111:1-111:22
- Xiaohan Wang, Linchao Zhu, Fei Wu, Yi Yang:
A Differentiable Parallel Sampler for Efficient Video Classification. 112:1-112:18
- Junjie Li, Jin Yuan, Zhiyong Li:
TP-FER: An Effective Three-phase Noise-tolerant Recognizer for Facial Expression Recognition. 113:1-113:17
- Baojin Huang, Zhongyuan Wang, Guangcheng Wang, Zhen Han, Kui Jiang:
Local Eyebrow Feature Attention Network for Masked Face Recognition. 114:1-114:19
- Bincheng Yang, Gangshan Wu:
Efficient Single-image Super-resolution Using Dual path Connections with Multiple scale Learning. 115:1-115:21
- Wei Zhou, Yanke Hou, Dihu Chen, Haifeng Hu, Tao Su:
Attention-Augmented Memory Network for Image Multi-Label Classification. 116:1-116:24
- Shuaixiong Hui, Qiang Guo, Xiaoyu Geng, Caiming Zhang:
Multi-Guidance CNNs for Salient Object Detection. 117:1-117:19
- Kai Xing, Tao Li, Xuanhan Wang:
ProposalVLAD with Proposal-Intra Exploring for Temporal Action Proposal Generation. 118:1-118:18
- Hao Tang, Lei Ding, Songsong Wu, Bin Ren, Nicu Sebe, Paolo Rota:
Deep Unsupervised Key Frame Extraction for Efficient Video Classification. 119:1-119:17
- Ling Zhang, Chengjiang Long, Xiaolong Zhang, Chunxia Xiao:
Exploiting Residual and Illumination with GANs for Shadow Detection and Shadow Removal. 120:1-120:22
- Yushu Zhang, Nuo Chen, Shuren Qi, Mingfu Xue, Zhongyun Hua:
Detection of Recolored Image by Texture Features in Chrominance Components. 121:1-121:23
- Han Xue, Jun Ling, Anni Tang, Li Song, Rong Xie, Wenjun Zhang:
High-Fidelity Face Reenactment Via Identity-Matched Correspondence Learning. 122:1-122:23
- Haozhe Chen, Hang Zhou, Jie Zhang, Dongdong Chen, Weiming Zhang, Kejiang Chen, Gang Hua, Nenghai Yu:
Perceptual Hashing of Deep Convolutional Neural Networks for Model Copy Detection. 123:1-123:20
- Wei Duan, Yi Yu, Xulong Zhang, Suhua Tang, Wei Li, Keizo Oyama:
Melody Generation from Lyrics with Local Interpretability. 124:1-124:21
- Shiguang Liu, Huixin Wang:
Talking Face Generation via Facial Anatomy. 125:1-125:19
Volume 19, Number 4, July 2023
- Xuehu Yan, Longlong Li, Lei Sun, Jia Chen, Shudong Wang:
Fake and Dishonest Participant Immune Secret Image Sharing. 139:1-139:26
- Song Yang, Qiang Li, Wenhui Li, Xuanya Li, Ran Jin, Bo Lv, Rui Wang, Anan Liu:
Semantic Completion and Filtration for Image-Text Retrieval. 140:1-140:20
- Xuan Ma, Xiaoshan Yang, Changsheng Xu:
Multi-Source Knowledge Reasoning Graph Network for Multi-Modal Commonsense Inference. 141:1-141:17
- Shangxi Wu, Jitao Sang, Kaiyuan Xu, Jiaming Zhang, Jian Yu:
Attention, Please! Adversarial Defense via Activation Rectification and Preservation. 142:1-142:18
- Kan Wang, Changxing Ding, Jianxin Pang, Xiangmin Xu:
Context Sensing Attention Network for Video-based Person Re-identification. 143:1-143:20
- Wenjing Wang, Lilang Lin, Zejia Fan, Jiaying Liu:
Semi-supervised Learning for Mars Imagery Classification and Segmentation. 144:1-144:23
- Hui Liu, Shanshan Li, Jicheng Zhu, Kai Deng, Meng Liu, Liqiang Nie:
DDIFN: A Dual-discriminator Multi-modal Medical Image Fusion Network. 145:1-145:17
- Xintian Wu, Huanyu Wang, Yiming Wu, Xi Li:
D3T-GAN: Data-Dependent Domain Transfer GANs for Image Generation with Limited Data. 146:1-146:20
- Dandan Zhu, Xuan Shao, Qiangqiang Zhou, Xiongkuo Min, Guangtao Zhai, Xiaokang Yang:
A Novel Lightweight Audio-visual Saliency Model for Videos. 147:1-147:22
- Amr Abdussalam, Zhongfu Ye, Ammar Hawbani, Majjed Al-Qatf, Rashid Khan:
NumCap: A Number-controlled Multi-caption Image Captioning Network. 148:1-148:24