


ACM Transactions on Multimedia Computing, Communications, and Applications, Volume 21
Volume 21, Number 1, January 2025
- Bogdan Ionescu, Ioannis Patras, Henning Müller, Alberto Del Bimbo: Introduction to the Special Issue on Realistic Synthetic Data: Generation, Learning, Evaluation. 1:1-1:7
- Adam Westerski, Wee Teck Fong: Synthetic Data for Object Detection with Neural Networks: State-of-the-Art Survey of Domain Randomisation Techniques. 2:1-2:20
- Bruno Vaz, Álvaro Figueira: GANs in the Panorama of Synthetic Data Generation Methods. 3:1-3:28
- Azeez Idris, Mohammed Khaleel, Wallapak Tavanapong, Piet C. de Groen: Synthesized Image Training Techniques: On Improving Model Performance Using Confusion. 4:1-4:24
- Wenmiao Hu, Yifang Yin, Ying Kiat Tan, An Tran, Hannes Kruppa, Roger Zimmermann: GAN-Assisted Road Segmentation from Satellite Imagery. 5:1-5:29
- Fabio Hellmann, Silvan Mertes, Mohamed Benouis, Alexander Hustinx, Tzung-Chien Hsieh, Cristina Conati, Peter M. Krawitz, Elisabeth André: GANonymization: A GAN-Based Face Anonymization Framework for Preserving Emotional Expressions. 6:1-6:27
- Kaifeng Zou, Sylvain Faisan, Boyang Yu, Sébastien Valette, Hyewon Seo: 4D Facial Expression Diffusion Model. 7:1-7:23
- Anjali T, Masilamani V.: Text-Guided Synthesis of Masked Face Images. 8:1-8:14
- Xin Huang, Dong Liang, Hongrui Cai, Yunfeng Bai, Juyong Zhang, Feng Tian, Jinyuan Jia: Double Reference Guided Interactive 2D and 3D Caricature Generation. 9:1-9:21
- Chaitra Desai, Sujay Benur, Ujwala Patil, Uma Mudenagudi: RSUIGM: Realistic Synthetic Underwater Image Generation with Image Formation Model. 10:1-10:22
- Roberto Amoroso, Davide Morelli, Marcella Cornia, Lorenzo Baraldi, Alberto Del Bimbo, Rita Cucchiara: Parents and Children: Distinguishing Multimodal Deepfakes from Natural Images. 11:1-11:23
- Pedro Celard, Eva Lorenzo Iglesias, José Manuel Sorribes-Fdez, Lourdes Borrajo, Adrián Seara Vieira: New Metrics and Dataset for Biological Development Video Generation. 12:1-12:23
- Lysa Gramoli, Julien Cumin, Jérémy Lacoche, Anthony Foulonneau, Bruno Arnaldi, Valérie Gouranton: Generating and Evaluating Data of Daily Activities with an Autonomous Agent in a Virtual Smart Home. 13:1-13:25
- Louis Airale, Xavier Alameda-Pineda, Stéphane Lathuilière, Dominique Vaufreydaz: Autoregressive GAN for Semantic Unconditional Head Motion Generation. 14:1-14:14
- Kerim Hodzic, Mirsad Cosovic, Sasa Mrdovic, Jason J. Quinlan, Darijo Raca: DashReStreamer: Framework for Creation of Impaired Video Clips under Realistic Network Conditions. 15:1-15:26
- Mihai Gabriel Constantin, Dan-Cristian Stanciu, Liviu-Daniel Stefan, Mihai Dogariu, Dan Mihailescu, George Ciobanu, Matt Bergeron, Winston Liu, Konstantin Belov, Octavian Radu, Bogdan Ionescu: Exploring Generative Adversarial Networks for Augmenting Network Intrusion Detection Tasks. 16:1-16:19
- Jialin Yang, Chunyu Lin, Lang Nie, Zisen Kong, Jiapeng Wang, Yao Zhao: Toward Oriented Fisheye Object Detection: Dataset and Baseline. 17:1-17:19
- Enji Liang, Kuiyuan Zhang, Zhongyun Hua, Xiaohua Jia: Multi-Scale Feature Attention Fusion for Image Splicing Forgery Detection. 18:1-18:20
- Qingxin Sheng, Chong Fu, Zhaonan Lin, Junxin Chen, Xingwei Wang, Chiu-Wing Sham: Content-Aware Selective Encryption for H.265/HEVC Using Deep Hashing Network and Steganography. 19:1-19:22
- Xu Cheng, Zichun Wang, Yan Jiang, Xingyu Liu, Hao Yu, Jingang Shi, Zitong Yu: Dual-Path Imbalanced Feature Compensation Network for Visible-Infrared Person Re-Identification. 20:1-20:24
- Pan Liao, Feng Yang, Di Wu, Bo Liu, Xingle Zhang, Shangjun Zhou: Enhanced Multi-Object Tracking: Inferring Motion States of Tracked Objects. 21:1-21:25
- Hong Zhang, Jiaxu Wan, Jing Zhang, Ding Yuan, Xuliang Li, Yifan Yang: P2FTrack: Multi-Object Tracking with Motion Prior and Feature Posterior. 22:1-22:22
- Loris Sauter, Ralph Gasser, Heiko Schuldt, Abraham Bernstein, Luca Rossetto: Performance Evaluation in Multimedia Retrieval. 23:1-23:23
- Linhua Kong, Yiming Wang, Dongxia Chang, Yao Zhao: Temporal-Enhanced Radar and Camera Fusion for Object Detection. 24:1-24:16
- Yuxiao Huang, Zhicong Huang, Jingwen Zhao, Haifeng Hu, Dihu Chen: AMVFNet: Attentive Multi-View Fusion Network for 3D Object Detection. 25:1-25:18
- Chao Wang, Zhongyuan Wang, Ruimin Hu, Xiaochen Wang, Wen Zhou: Optimal Illumination Distance Metrics for Person Re-Identification in Complex Lighting Conditions. 26:1-26:18
- Tinghui Wu, Shuhe Zhang, Dihu Chen, Haifeng Hu: Text-and-Image Learning Transformer for Cross-Modal Person Re-Identification. 27:1-27:18
- Xichu Ma, Varun Sharma, Min-Yen Kan, Wee Sun Lee, Ye Wang: KeYric: Unsupervised Keywords Extraction and Expansion from Music for Coherent Lyrics Generation. 28:1-28:28
- Huazhong Zhao, Lei Qi, Xin Geng: CLIP-DFGS: A Hard Sample Mining Method for CLIP in Generalizable Person Re-Identification. 29:1-29:20
- Xingchen Li, Jun Xiao, Guikun Chen, Yinfu Feng, Yi Yang, An-An Liu, Long Chen: Decomposed Prototype Learning for Few-Shot Scene Graph Generation. 30:1-30:24
- Zan Chen, Tao Wang, Jun Li, Wenlong Guo, Yuanjing Feng, Xueming Qian, Xingsong Hou: Discard Significant Bits of Compressed Sensing: A Robust Image Coding for Resource-Limited Contexts. 31:1-31:25
- Chongyu Liu, Dezhi Peng, Yuliang Liu, Lianwen Jin: CTRNet++: Dual-Path Learning with Local-Global Context Modeling for Scene Text Removal. 32:1-32:22
- Kai Han, Jin Wang, Yunhui Shi, Hanqin Cai, Nam Ling, Baocai Yin: WTDUN: Wavelet Tree-Structured Sampling and Deep Unfolding Network for Image Compressed Sensing. 33:1-33:22
- Guilin Lan, Ye-Qian Du, Zhouwang Yang: Robust Multimodal Representation under Uncertain Missing Modalities. 34:1-34:23
- Nayoung Kim, Jung-Kyung Lee, Je-Won Kang: Reference-based In-loop Filter with Robust Neural Feature Transfer for Video Coding. 35:1-35:24
- Kehua Guo, Xuyang Tan, Xiangyuan Zhu, Shaojun Guo, Zhipeng Xi: ATMNet: Adaptive Texture Migration Network for Guided Depth Super-Resolution. 36:1-36:21
- Federico Becattini, Xiaolin Chen, Andrea Puccia, Haokun Wen, Xuemeng Song, Liqiang Nie, Alberto Del Bimbo: Interactive Garment Recommendation with User in the Loop. 37:1-37:21
- Nan Wang, Qi Wang: Dynamic Weighted Gating for Enhanced Cross-Modal Interaction in Multimodal Sentiment Analysis. 38:1-38:19
- Yangchun Zhu, Yufei Zheng, Jiawei Liu, Yao Li, Zhengjun Zha: Noise-Resistance Learning via Multi-Granularity Consistency for Unsupervised Domain Adaptive Person Re-Identification. 39:1-39:23
- Wei Ji, Li Li, Hao Fei, Xiangyan Liu, Xun Yang, Juncheng Li, Roger Zimmermann: Toward Complex-query Referring Image Segmentation: A Novel Benchmark. 40:1-40:18
Volume 21, Number 2, February 2025
- Yushu Zhang, William Puech, Anderson Rocha, Rongxing Lu, Stefano Cresci, Roberto Di Pietro: Introduction to the Special Issue on Security and Privacy of Avatar in Metaverse. 41:1-41:3
- Fan Wang, Zhangjie Fu, Xiang Zhang: A Self-Defense Copyright Protection Scheme for NFT Image Art Based on Information Embedding. 42:1-42:23
- Jinwei Wang, Haihua Wang, Jiawei Zhang, Hao Wu, Xiangyang Luo, Bin Ma: Invisible Adversarial Watermarking: A Novel Security Mechanism for Enhancing Copyright Protection. 43:1-43:22
- Rui Zhai, Rongrong Ni, Yang Yu, Yao Zhao: FaceDefend: Copyright Protection to Prevent Face Embezzle. 44:1-44:19
- Hanqing Zhao, Wenbo Zhou, Dongdong Chen, Weiming Zhang, Ying Guo, Zhen Cheng, Pengfei Yan, Nenghai Yu: Audio-Visual Contrastive Pre-train for Face Forgery Detection. 45:1-45:16
- Long Tang, Dengpan Ye, Zhenhao Lu, Yunming Zhang, Chuanxi Chen: Feature Extraction Matters More: An Effective and Efficient Universal Deepfake Disruptor. 46:1-46:22
- Jian Zhang, Jiangqun Ni, Fan Nie, Jiwu Huang: Domain-invariant and Patch-discriminative Feature Learning for General Deepfake Detection. 47:1-47:19
- Dengyong Zhang, Wenjie Zhu, Xin Liao, Feifan Qi, Gaobo Yang, Xiangling Ding: Spatiotemporal Inconsistency Learning and Interactive Fusion for Deepfake Video Detection. 48:1-48:24
- Rui Yang, Rushi Lan, Zhenrong Deng, Xiaonan Luo, Xiyan Sun: Deepfake Video Detection Using Facial Feature Points and Ch-Transformer. 49:1-49:22
- Jianheng Tang, Kejia Fan, Wenjie Yin, Shihao Yang, Yajiang Huang, Anfeng Liu, Naixue Xiong, Mianxiong Dong, Tian Wang, Shaobo Zhang: A Quality-Aware and Obfuscation-Based Data Collection Scheme for Cyber-Physical Metaverse Systems. 50:1-50:23
- Xiaoxuan Han, Songlin Yang, Wei Wang, Ziwen He, Jing Dong: Exploiting Backdoors of Face Synthesis Detection with Natural Triggers. 51:1-51:24
- Jiuzhen Zeng, Laurence T. Yang, Chao Wang, Junjie Su, Xianjun Deng: A New Tensor Summary Statistic for Real-Time Detection of Stealthy Anomaly in Avatar Interaction. 52:1-52:23
- Letian Sha, Xiao Chen, Fu Xiao, Zhong Wang, Zhangbo Long, Qianyu Fan, Jiankuo Dong: VRVul-Discovery: BiLSTM-based Vulnerability Discovery for Virtual Reality Devices in Metaverse. 53:1-53:19
- Gui Xiao, Zhen Ling, Qunqun Fan, Xiangyu Xu, Wenjia Wu, Ding Ding, Chen Chen, Xinwen Fu: Pivot: Panoramic-Image-Based VR User Authentication against Side-Channel Attacks. 54:1-54:19
- Yalin Song, Wenbin Jiang, Xiuli Chai, Zhihua Gan, Mengyuan Zhou, Lei Chen: Cross-Attention Based Two-Branch Networks for Document Image Forgery Localization in the Metaverse. 55:1-55:24
- Yuanman Li, Lanhao Ye, Haokun Cao, Wei Wang, Zhongyun Hua: Cascaded Adaptive Graph Representation Learning for Image Copy-Move Forgery Detection. 56:1-56:24
- Cong Hu, Xiao-Zhong Wei, Xiaojun Wu: DIRformer: A Novel Image Restoration Approach Based on U-shaped Transformer and Diffusion Models. 57:1-57:23
- Yuyu Xu, Pingping Zhang, Minghui Chen, Qiudan Zhang, Wenhui Wu, Yun Zhang, Xu Wang: RGB-D Data Compression via Bi-Directional Cross-Modal Prior Transfer and Enhanced Entropy Modeling. 58:1-58:17
- Jiayu Yang, Yongqi Zhai, Wei Jiang, Chunhui Yang, Feng Gao, Ronggang Wang: Adaptive Prediction Structure for Learned Video Compression. 59:1-59:23
- Yifan Wang, Liang Feng, Fenglin Cai, Lusi Li, Rui Wu, Jie Li: TEC-CNN: Toward Efficient Compressing of Convolutional Neural Nets with Low-rank Tensor Decomposition. 60:1-60:23
- Chong-Yang Xiang, Xiao Wu, Jun-Yan He, Zhaoquan Yuan, Tingquan He: Person in Uniforms Re-Identification. 61:1-61:23
- Xiyao Liu, Cundian Yang, Jianbiao He, Hui Fang, Gerald Schaefer, Jian Zhang, Yuesheng Zhu, Shichao Zhang: Attack-Defending Contrastive Learning for Volumetric Medical Image Zero-Watermarking. 62:1-62:23
- Anqi Cao, Zhijing Wan, Xiao Wang, Wei Liu, Wei Wang, Zheng Wang, Xin Xu: Diversity-Representativeness Replay and Knowledge Alignment for Lifelong Vehicle Re-identification. 63:1-63:20
- Xiaonuo Dongye, Haiyan Jiang, Dongdong Weng, Zhenliang Zhang: Demonstrative Learning for Human-Agent Knowledge Transfer. 64:1-64:24
- Chengxin Zhao, Hefei Ling, Jialie Shen, Han Fang, Sijing Xie, Yaokun Fang, Zongyi Li, Ping Li: GSyncCode: Geometry Synchronous Hidden Code for One-step Photography Decoding. 65:1-65:21
- Xiaolin Chen, Xuemeng Song, Jianhui Zuo, Yinwei Wei, Liqiang Nie, Tat-Seng Chua: Domain-aware Multimodal Dialog Systems with Distribution-based User Characteristic Modeling. 66:1-66:22
- Chenghao Li, Lei Qi, Xin Geng: A SAM-guided Two-stream Lightweight Model for Anomaly Detection. 67:1-67:23
- Ji-Yan Wu, Kasun Gamlath, Archan Misra: Pr-Ge-Ne: Efficient Encoding of Pervasive Video Sensing Streams by Pruned Generative Networks. 68:1-68:22
- Wei Ji, Li Li, Zheqi Lv, Wenqiao Zhang, Mengze Li, Zhen Wan, Wenqiang Lei, Roger Zimmermann: Backpropagation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration. 69:1-69:17
- Heqi Peng, Yunhong Wang, Ruijie Yang, Beichen Li, Rui Wang, Yuanfang Guo: AED-PADA: Improving Generalizability of Adversarial Example Detection via Principal Adversarial Domain Adaptation. 70:1-70:24
- Ning Xu, Xiaowen Wang, Jing Liu, Lanjun Wang, Xuanya Li, Mengxiao Zhu, Yongdong Zhang, An-An Liu: Model Can Be Subtle: Two Important Mechanisms for Social Media Popularity Prediction. 71:1-71:20
Volume 21, Number 3, March 2025
- Jiapeng Wang, Zening Lin, Dayi Huang, Longfei Xiong, Lianwen Jin: LiLTv2: Language-substitutable Layout-image Transformer for Visual Information Extraction. 72:1-72:27
- Yili Jin, Jiahao Li, Bin Li, Yan Lu: Neural Image Compression with Regional Decoding. 73:1-73:18
- Xiaotian Wu, Xinjie Feng, Bing Chen, Ching-Nung Yang, Qing-Yu Peng, Wei Qi Yan: EVCS-DAS: Evolving Visual Cryptography Schemes for Dynamic Access Structures. 74:1-74:27
- Mohamed Zakariya Talhaoui, Zhelong Wang, Mohamed Amine Midoun, Abdelkarim Smaili, Mekkaoui Djamel Eddine, Mourad Lablack, Ke Zhang: Vulnerability Detection and Improvements of an Image Cryptosystem for Real-Time Visual Protection. 75:1-75:23
- Kai Xu, Lichun Wang, Shuang Li, Tong Gao, Baocai Yin: Scene Adaptive Context Modeling and Balanced Relation Prediction for Scene Graph Generation. 76:1-76:19
- Khouloud Samrouth, Pia El Housseini, Olivier Déforges: Siamese Network-Based Detection of Deepfake Impersonation Attacks with a Person of Interest Approach. 77:1-77:23
- Yiping Yang, Baiyun Cui, Yingming Li: A Multimodal Hierarchical Attentional Ordering Network. 78:1-78:20
- Haoxian Ruan, Zhihua Xu, Zhijing Yang, Yongyi Lu, Jinghui Qin, Tianshui Chen: Learning Semantic-aware Representation in Visual-Language Models for Multi-label Recognition with Partial Labels. 79:1-79:19
- Kun Yan, Zied Bouraoui, Fangyun Wei, Chang Xu, Ping Wang, Shoaib Jameel, Steven Schockaert: Modeling Multi-modal Cross-interaction for Multi-label Few-shot Image Classification Based on Local Feature Selection. 80:1-80:28
- Yajie Liu, Pu Ge, Guodong Wang, Qingjie Liu, Di Huang: Multi-Grained Contrastive Learning for Text-Supervised Open-Vocabulary Semantic Segmentation. 81:1-81:21
- Yipei Chen, Hua Yuan, Baojun Ma, Limin Wang, Yu Qian: Beyond Songs: Analyzing User Sentiment through Music Playlists and Multimodal Data. 82:1-82:24
- Yuzhen Niu, Yeyuan Xu, Yuezhou Li, Jiabang Zhang, Yuzhong Chen: Skeleton-Boundary-Guided Network for Camouflaged Object Detection. 83:1-83:21
- Xiaofeng Zhang, Zishan Xu, Hao Tang, Chaochen Gu, Wei Chen, Abdulmotaleb El-Saddik: Wakeup-Darkness: When Multimodal Meets Unsupervised Low-Light Image Enhancement. 84:1-84:25
- Jiahang Tu, Wei Ji, Hanbin Zhao, Chao Zhang, Roger Zimmermann, Hui Qian: DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving Data Generation. 85:1-85:29
- Yifan Jiao, Chenglong Cai, Bing-Kun Bao: Unified Text-Image Space Alignment with Cross-Modal Prompting in CLIP for UDA. 86:1-86:20
- Feifei Kou, Bingwei Wang, Hai-Sheng Li, Chuangying Zhu, Lei Shi, Jiwei Zhang, Limei Qi: Potential Features Fusion Network for Multimodal Fake News Detection. 87:1-87:24
- Shihao Zou, Yuanlu Xu, Nikolaos Sarafianos, Federica Bogo, Tony Tung, Weixin Si, Li Cheng: Generating High-Fidelity Clothed Human Dynamics with Temporal Diffusion. 88:1-88:21
- Jiaxin Chen, Xin Liao, Zhenxing Qian, Zheng Qin: PRest-Net: Multi-domain Probability Estimation Network for Robust Image Forgery Detection. 89:1-89:20
- Qiang Li, Di Liu, Guang Zu, Sen Li, Hui Sun, Jianzhong Wang: Multigranularity Feature Aggregation and Cross-level Boundary Modeling for Temporal Action Detection. 90:1-90:24
- Lin Huang, Chuan Qin, Guorui Feng, Xiangyang Luo, Xinpeng Zhang: New Framework of Robust Image Encryption. 91:1-91:22
- Jiayue Chen, Xiaomeng Wang, Tong Xu, Shiwei Wu: Towards Scene-Centric Multi-Level Interest Mining for Video Recommendation. 92:1-92:24
- Xiusheng Lu, Yanbin Hao, Lechao Cheng, Sicheng Zhao, Yutao Liu, Mingli Song: Mixed Attention and Channel Shift Transformer for Efficient Action Recognition. 93:1-93:20
- Haifeng Zhao, Chi Zhang, Deyin Liu, Lin Yuanbo Wu: Deformation Field Fusion for Medical Image Registration. 94:1-94:17
- Lisong Ou, Zhixin Li: Multi-modal Sarcasm Detection on Social Media via Multi-Granularity Information Fusion. 95:1-95:23
- Ao Fu, Jiaqi Zhao, Yong Zhou, Wen-Liang Du, Rui Yao, Abdulmotaleb El-Saddik: Similarity Regulation and Calibration Alignment for Weakly Supervised Text-Based Person Re-Identification. 96:1-96:19
- Shaojun Zhu, Bincheng Zhu, Kaikai Chi, Jiefan Qiu, Hailong Shi, Xingyu Gao: Maximizing Long-Term Task Completion Ratio of UAV-Enabled Wirelessly Powered MEC Systems. 97:1-97:25
- Xuanqing Cao, Wengang Zhou, Qi Sun, Weilun Wang, Li Li, Houqiang Li: DISA: Disentangled Dual-Branch Framework for Affordance-Aware Human Insertion. 98:1-98:18
- Marco Mameli, Marina Paolanti, Adriano Mancini, Primo Zingaretti, Roberto Pierdicca: RenderGAN: Enhancing Real-time Rendering Efficiency with Deep Learning. 99:1-99:22
- Lv Tang, Xinfeng Zhang, Li Zhang: UVC: A Unified Deep Video Compression Framework. 100:1-100:23
- Shen Wang, Yu Wang, Renjie Qiao, Kejun Wu, Chia-Wen Lin, Chengtao Cai: Multi-Scale Dynamic Fusion for Visible-Infrared Person Re-Identification. 101:1-101:24
- Yucheng Li, Siwang Zhou, Deyan Tang, Liubo Ouyang, Jia Liu: GFPNet: Generalizable Face Privacy Network with Dynamic Defense Training. 102:1-102:22
Volume 21, Number 4, April 2025
- Dan Guo, Troy McDaniel, Shuhui Wang, Meng Wang: Introduction to the Special Issue on Deep Learning for Robust Human Body Language Understanding. 103:1-103:7
- Jian Zhang, Kaihao He, Ting Yu, Jun Yu, Zhenming Yuan: Semi-Supervised RGB-D Hand Gesture Recognition via Mutual Learning of Self-Supervised Models. 104:1-104:20
- Shengeng Tang, Feng Xue, Jingjing Wu, Shuo Wang, Richang Hong: Gloss-driven Conditional Diffusion Models for Sign Language Production. 105:1-105:17
- Kaixin Chen, Lin Zhang, Zhong Wang, Shengjie Zhao, Yicong Zhou: Skeleton-Aware Graph-Based Adversarial Networks for Human Pose Estimation from Sparse IMUs. 106:1-106:22
- Zhewei Tu, Xiangbo Shu, Peng Huang, Rui Yan, Zhenxing Liu, Jiachao Zhang: Leveraging Frame- and Feature-level Progressive Augmentation for Semi-supervised Action Recognition. 107:1-107:21
- Linhua Xiang, Zengfu Wang: Joint Mixing Data Augmentation for Skeleton-Based Action Recognition. 108:1-108:24
- Zenan Shi, Wenyu Liu, Haipeng Chen: Face Reconstruction-Based Generalized Deepfake Detection Model with Residual Outlook Attention. 109:1-109:19
- Peng He, Jun Yu, Chengjie Ge, Ye Yu, Wei Xu, Lei Wang, Tianyu Liu, Zhen Kan: Domain-Separated Bottleneck Attention Fusion Framework for Multimodal Emotion Recognition. 110:1-110:21
- Yan Gan, Chenxue Yang, Mao Ye, Renjie Huang, Deqiang Ouyang: Generative Adversarial Networks with Learnable Auxiliary Module for Image Synthesis. 111:1-111:21
- Wei Liu, Xin Xu, Hua Chang, Xin Yuan, Zheng Wang: Mix-Modality Person Re-Identification: A New and Practical Paradigm. 112:1-112:21
- Nianzi Li, Guijuan Zhang, Ping Du, Dianjie Lu: GP-HSI: Human-Scene Interaction with Geometric and Physical Constraints. 113:1-113:22
- Enyuan Zhao, Ning Song, Ze Zhang, Jie Nie, Xinyue Liang, Zhiqiang Wei: Language-guided Bias Generation Contrastive Strategy for Visual Question Answering. 114:1-114:21
- Kun Wang, Jiuxin Cao, Jiawei Ge, Chang Liu, Bo Liu: Dual-Domain Triple Contrast for Cross-Dataset Skeleton-Based Action Recognition. 115:1-115:23
- Runing Li, Jiangyan Dai, Qibing Qin, Chengduan Wang, Huihui Zhang, Yugen Yi: Texture and Structure-Guided Dual-Attention Mechanism for Image Inpainting. 116:1-116:25
- Nana Zhang, Min Xiong, Dandan Zhu, Kun Zhu, Guangtao Zhai, Xiaokang Yang: Audio-Visual Saliency Prediction Model with Implicit Neural Representation. 117:1-117:23
- Zhenqiang Zhang, Kun Li, Shengeng Tang, Yanyan Wei, Fei Wang, Jinxing Zhou, Dan Guo: Temporal Boundary Awareness Network for Repetitive Action Counting. 118:1-118:22
- Zicheng Zhang, Yingjie Zhou, Chunyi Li, Wei Sun, Xiongkuo Min, Xiaohong Liu, Guangtao Zhai: MM-PCQA+: Advancing Multi-Modal Learning for Point Cloud Quality Assessment. 119:1-119:22
- Xiao Cui, Qi Sun, Min Wang, Li Li, Wengang Zhou, Houqiang Li: LayoutEnc: Leveraging Enhanced Layout Representations for Transformer-based Complex Scene Synthesis. 120:1-120:21
- Chintha Sri Pothu Raju, Rabul Hussain Laskar, Zulfiqar Ali, Ghulam Muhammad: Attention-based Fusion for Stroke Lesion Segmentation on Computed Tomography Perfusion Data. 121:1-121:23
- Qianxing Li, Dehui Kong, Jinghua Li, Dongpan Chen, Baocai Yin: Multi-Anchor Offset Representation Based Coarse-to-Fine Diffusion Model for Human Pose Estimation. 122:1-122:21
- Wasim Ahmad, Yan-Tsung Peng, Yuan-Hao Chang, Gaddisa Olani Ganfure, Sarwar Khan: CapST: Leveraging Capsule Networks and Temporal Attention for Accurate Model Attribution in Deep-fake Videos. 123:1-123:23
- Zekun Sun, Na Ruan: GANK: Dynamic Geometric and Appearance Features for Efficient and Robust Detection of Face Forgery. 124:1-124:24
- Hancheng Zhu, Li Yan, Yong Zhou, Rui Yao, Zhiwen Shao, Jiaqi Zhao, Leida Li: Image Cropping with Content and Composition Attribute-aware Global Relation Reasoning. 125:1-125:19
- Wenying Wen, Yu Ye, Ziye Yuan, Baolin Qiu, Dingli Hua: LFIZW-GRHFMR: Robust Zero-Watermarking with GRHFMR for Light Field Image. 126:1-126:17
- Fan Chen, Lingfeng Qu, Hadi Amirpour, Christian Timmerer, Hongjie He: Counterfeiting Attacks on an RDH-EI Scheme Based on Block-Permutation and Co-XOR. 127:1-127:25
- Shangrong Yang, Chunyu Lin, Kang Liao, Yao Zhao: FishFormer: Annulus Slicing-based Transformer for Fisheye Rectification. 128:1-128:16
- Jiahui Wang, Qin Xu, Bo Jiang, Bin Luo: Transductive Few-shot Learning via Joint Message Passing and Prototype-based Soft-label Propagation. 129:1-129:21
- Jie Wang, Tingfa Xu, Liqiang Song, Lihe Ding, Hui Li, Peng Jiang, Yuqi Han, Jianan Li: PAPooling: Graph-based Position Adaptive Aggregation of Local Geometry in Point Clouds. 130:1-130:18
- Tao Song, Kunlin Yang, Fan Meng, Xin Li, Handan Sun, Chenglizhao Chen: Tropical Cyclone Image Super-Resolution via Multimodality Fusion. 131:1-131:22
- Qianjiang Hu, Wei Hu: Dynamic Point Cloud Denoising via Gradient Fields. 132:1-132:24
- Jiannan Huang, Mengxue Qu, Longfei Li, Yunchao Wei: AdGPT: Explore Meaningful Advertising with ChatGPT. 133:1-133:23
