


default search action
IEEE Transactions on Multimedia, Volume 27
Volume 27, 2025
- Yuan Yuan

, Hongjie He
, Yaolin Yang
, Hadi Amirpour
, Christian Timmerer
, Fan Chen
:
JPEG Image Encryption With DC Rotation and Undivided RSV-Based AC Group Permutation. 1-15 - Dizhan Xue

, Shengsheng Qian
, Quan Fang
, Changsheng Xu
:
LININ: Logic Integrated Neural Inference Network for Explanatory Visual Question Answering. 16-27 - Pingping Zhang, Shiqi Wang

, Meng Wang
, Peilin Chen
, Wenhui Wu
, Xu Wang
, Sam Kwong
:
HNR-ISC: Hybrid Neural Representation for Image Set Compression. 28-40 - Qingxin Sheng, Chong Fu

, Zhaonan Lin, Junxin Chen
, Xingwei Wang
, Chiu-Wing Sham
:
Content-Aware Tunable Selective Encryption for HEVC Using Sine-Modular Chaotification Model. 41-55 - Qiguang Miao, Wentian Xin

, Ruyi Liu, Yi Liu, Mengyao Wu, Cheng Shi
, Chi-Man Pun
:
Adaptive Pitfall: Exploring the Effectiveness of Adaptation in Skeleton-Based Action Recognition. 56-71 - Shizhou Zhang

, Dexuan Kong
, Yinghui Xing
, Yue Lu
, Lingyan Ran
, Guoqiang Liang
, Hexu Wang, Yanning Zhang
:
Frequency-Guided Spatial Adaptation for Camouflaged Object Detection. 72-83 - Yu Wang

, Shengjie Zhao
, Shiwei Chen
:
SQL-Net: Semantic Query Learning for Point-Supervised Temporal Action Localization. 84-94 - Kefan Tang

, Lihuo He
, Nannan Wang
, Xinbo Gao
:
Dual Semantic Reconstruction Network for Weakly Supervised Temporal Sentence Grounding. 95-107 - Yiting Liu

, Liang Li
, Yunbin Tu
, Beichen Zhang
, Zheng-Jun Zha
, Qingming Huang
:
Dynamic Strategy Prompt Reasoning for Emotional Support Conversation. 108-119 - Yunlong Tang, Yuxuan Wan, Lei Qi

, Xin Geng
:
DPStyler: Dynamic PromptStyler for Source-Free Domain Generalization. 120-132 - Zhenyu Shu

, Shiyang Li
, Shiqing Xin
, Ligang Liu
:
3D Shape Segmentation With Potential Consistency Mining and Enhancement. 133-144 - Min Dang

, Gang Liu
, Hao Li
, Di Wang
, Rong Pan
, Quan Wang
:
PRA-Det: Anchor-Free Oriented Object Detection With Polar Radius Representation. 145-157 - Yizhen Jia

, Rong Quan
, Haiyan Chen
, Jiamei Liu, Yichao Yan
, Song Bai
, Jie Qin
:
Disaggregation Distillation for Person Search. 158-170 - Shiqi Gao

, Huiyu Duan
, Xinyue Li
, Kang Fu
, Yicong Peng, Qihang Xu, Yuanyuan Chang, Jia Wang
, Xiongkuo Min
, Guangtao Zhai
:
Quality-Guided Skin Tone Enhancement for Portrait Photography. 171-185 - Yue Dai

, Shihui Ying
, Yue Gao
:
Exploring Local and Global Consistent Correlation on Hypergraph for Rotation Invariant Point Cloud Analysis. 186-197 - Hao Tan

, Zichang Tan
, Dunfang Weng
, Ajian Liu
, Jun Wan
, Zhen Lei
, Stan Z. Li
:
Vision Transformer With Relation Exploration for Pedestrian Attribute Recognition. 198-208 - Zhaofeng Shi

, Qingbo Wu
, Fanman Meng
, Linfeng Xu
, Hongliang Li
:
Cross-Modal Cognitive Consensus Guided Audio-Visual Segmentation. 209-223 - Ge Li

, Jiale Cao
, Hanqing Sun
, Rao Muhammad Anwer
, Jin Xie
, Fahad Khan
, Yanwei Pang
:
Video Instance Segmentation Without Using Mask and Identity Supervision. 224-235 - Guanghui Yue

, Shangjie Wu
, Tianwei Zhou
, Gang Li
, Jie Du
, Yu Luo
, Qiuping Jiang
:
Progressive Region-to-Boundary Exploration Network for Camouflaged Object Detection. 236-248 - Yumo Zhang

, Zhanchuan Cai
:
DNP-AUT: Image Compression Using Double-Layer Non-Uniform Partition and Adaptive U Transform. 249-262 - Sijia Wen

, Yinqiang Zheng
, Feng Lu
:
Polarization State Attention Dehazing Network With a Simulated Polar-Haze Dataset. 263-274 - Jiapeng Li

, Ruonan Zhang
, Ge Li
, Thomas H. Li:
SDE2D: Semantic-Guided Discriminability Enhancement Feature Detector and Descriptor. 275-286 - Xu Han

, Junyu Gao
, Chuang Yang
, Yuan Yuan, Qi Wang
:
Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection. 287-299 - Kai Hu

, Xiaobo Chen
, Zhineng Chen
, Yuan Zhang
, Xieping Gao
:
Multi-Perspective Pseudo-Label Generation and Confidence-Weighted Training for Semi-Supervised Semantic Segmentation. 300-311 - Xinru Guo

, Huaxiang Zhang
, Li Liu
, Dongmei Liu
, Xu Lu
, Hui Meng
:
Primary Code Guided Targeted Attack against Cross-modal Hashing Retrieval. 312-326 - Shichao Zhang

, Yibo Ding
, Tianxiang Huo
, Shukai Duan
, Lidan Wang
:
PointAttention: Rethinking Feature Representation and Propagation in Point Cloud. 327-339 - Mengzan Qi

, Sixian Chan
, Chen Hang
, Guixu Zhang
, Tieyong Zeng
, Zhi Li
:
Auxiliary Representation Guided Network for Visible-Infrared Person Re-Identification. 340-355 - Li Huang

, Yaping Huang
, Qingji Guan
:
Improving Image Inpainting via Adversarial Collaborative Training. 356-370 - Lin Jiang

, Jigang Wu
, Shuping Zhao
, Jiaxing Li
:
Cross-Scatter Sparse Dictionary Pair Learning for Cross-Domain Classification. 371-384 - Yusra Alkendi

, Rana Azzam
, Sajid Javed
, Lakmal D. Seneviratne
, Yahya H. Zweiri
:
Neuromorphic Vision-Based Motion Segmentation With Graph Transformer Neural Network. 385-400 - Guangzhao Dai

, Xiangbo Shu
, Wenhao Wu, Rui Yan
, Jiachao Zhang
:
GPT4Ego: Unleashing the Potential of Pre-Trained Models for Zero-Shot Egocentric Action Recognition. 401-413 - Nan Wang

, Shaohui Mei
, Yi Wang
, Yifan Zhang
, Duo Zhan
:
WHANet:Wavelet-Based Hybrid Asymmetric Network for Spectral Super-Resolution From RGB Inputs. 414-428 - Haojin Deng

, Yimin Yang
:
Context-Enriched Contrastive Loss: Enhancing Presentation of Inherent Sample Connections in Contrastive Learning Framework. 429-441 - Jingyi Xu, Xin Deng

, Yibing Fu, Mai Xu
, Shengxi Li
:
MDSC-Net: Multi-Modal Discriminative Sparse Coding Driven RGB-D Classification Network. 442-454 - Chen Guo

, Weiling Chen
, Aiping Huang
, Tiesong Zhao
:
Prototype Alignment With Dedicated Experts for Test-Agnostic Long-Tailed Recognition. 455-465 - Hefeng Wang

, Jiale Cao
, Jin Xie
, Aiping Yang, Yanwei Pang
:
Implicit and Explicit Language Guidance for Diffusion-Based Visual Perception. 466-476 - Meijing Zhang, Mengxue Chen, Qi Li

, Yanchen Chen, Rui Lin, Xiaolian Li, Shengfeng He
, Wenxi Liu
:
Category-Contrastive Fine-Grained Crowd Counting and Beyond. 477-488 - Kaiwei Zhang

, Dandan Zhu
, Xiongkuo Min
, Huiyu Duan
, Guangtao Zhai
:
Explain Vision Focus: Blending Human Saliency Into Synthetic Face Images. 489-502 - Shaowei Weng

, Jianhao Zhang, Tanguo Zhu, Lifang Yu
, Chunyu Zhang
:
DCM-Net: A Diffusion Model-Based Detection Network Integrating the Characteristics of Copy-Move Forgery. 503-514 - Meng Yang

, Jun Chen
, Xin Tian
, Longsheng Wei
, Jiayi Ma
:
VRTNet: Vector Rectifier Transformer for Two-View Correspondence Learning. 515-530 - Kai Ye, Zepeng Huang

, Yilei Xiong, Yu Gao, Jinheng Xie, Linlin Shen
:
Progressive Pseudo Labeling for Multi-Dataset Detection Over Unified Label Space. 531-543 - Yuxiu Lin

, Hui Liu
, Ren Wang, Qiang Guo
, Caiming Zhang
:
Multiview Feature Decoupling for Deep Subspace Clustering. 544-556 - Lili Huang

, Yiming Cao, Pengcheng Jia, Chenglong Li
, Jin Tang
, Chuanfu Li:
Knowledge-Guided Cross-Modal Alignment and Progressive Fusion for Chest X-Ray Report Generation. 557-567 - Min Liu

, Zhu Zhang
, Yuan Bian
, Xueping Wang
, Yeqing Sun
, Baida Zhang, Yaonan Wang
:
Cross-Modality Semantic Consistency Learning for Visible-Infrared Person Re-Identification. 568-580 - Ben Fei

, Liwen Liu
, Tianyue Luo
, Weidong Yang
, Lipeng Ma
, Zhijun Li
, Wenming Chen
:
Point Patches Contrastive Learning for Enhanced Point Cloud Completion. 581-596 - Shunjie Yuan

, Xinghua Li
, Yinbin Miao
, Haiyan Zhang
, Ximeng Liu
, Robert H. Deng
:
Combating Noisy Labels by Alleviating the Memorization of DNNs to Noisy Labels. 597-609 - Jiaping Yu, Muli Yang

, Aming Wu
, Cheng Deng
:
Memory-Enhanced Confidence Calibration for Class-Incremental Unsupervised Domain Adaptation. 610-621 - Yi Jin

, Xiaoxiao Ma
, Rui Zhang
, Huaian Chen
, Yuxuan Gu
, Pengyang Ling
, Enhong Chen
:
Masked Video Pretraining Advances Real-World Video Denoising. 622-636 - Kun Dai

, Zhiqiang Jiang
, Tao Xie
, Ke Wang
, Dedong Liu
, Zhendong Fan
, Ruifeng Li
, Lijun Zhao
, Mohamed Omar
:
SOFW: A Synergistic Optimization Framework for Indoor 3D Object Detection. 637-651 - Abdullah Aman Khan

, Jie Shao
, Yunbo Rao
, Lei She, Heng Tao Shen
:
LRDNet: Lightweight LiDAR Aided Cascaded Feature Pools for Free Road Space Detection. 652-664 - Shuhua Wang

, Ke Lv
, Jian Xue
, Yang Zhao
:
DA-Net: Density-Aware 3D Object Detection Network for Point Clouds. 665-678 - Congcong Wen

, Xiang Li
, Hao Huang, Yu-Shen Liu
, Yi Fang
:
3D Shape Contrastive Representation Learning With Adversarial Examples. 679-692 - Dong Liang, Dong Zhang, Qiong Wang, Zongqi Wei, Liyan Zhang:

CrossNet: Cross-Scene Background Subtraction Network via 3D Optical Flow. 693-706 - Zhanwen Liu

, Juanru Cheng
, Jin Fan
, Shan Lin
, Yang Wang
, Xiangmo Zhao
:
Multi-Modal Fusion Based on Depth Adaptive Mechanism for 3D Object Detection. 707-717 - Hui Tian

, Zheng Qin, Renjiao Yi, Chenyang Zhu, Kai Xu
:
Tensorformer: Normalized Matrix Attention Transformer for High-Quality Point Cloud Reconstruction. 718-730 - Mingtao Feng

, Haoran Hou
, Liang Zhang
, Yulan Guo
, Hongshan Yu
, Yaonan Wang
, Ajmal Mian
:
Exploring Hierarchical Spatial Layout Cues for 3D Point Cloud Based Scene Graph Prediction. 731-743 - Qiaoyun Wu

, Jun Wang
, Yi Zhang, Hua Dong, Cheng Yi
:
Accelerating Point Cloud Registration With Low Overlap Using Graphs and Sparse Convolutions. 744-753 - Qijian Zhang

, Junhui Hou
, Yue Qian:
PointMCD: Boosting Deep Point Cloud Encoders via Multi-View Cross-Modal Distillation for 3D Shape Recognition. 754-767 - Shuaihang Yuan

, Congcong Wen
, Yu-Shen Liu
, Yi Fang
:
Retrieval-Specific View Learning for Sketch-to-Shape Retrieval. 768-779 - Jing-Yu Yang

, Wenqiang Xu
, Yusen Hou, Xinchen Ye
, Pascal Frossard
, Kun Li
:
High-Quality Reconstruction of Depth Maps From Graph-Based Non-Uniform Sampling. 780-791 - Shaojie Zhuang

, Guangshun Wei
, Zhiming Cui, Yuanfeng Zhou
:
Robust Hybrid Learning for Automatic Teeth Segmentation and Labeling on 3D Dental Models. 792-803 - Jiawen Zhao

, Qing Zhu
, Yaonan Wang
, Weixing Peng
, Hui Zhang, Jianxu Mao
:
Registration of Multiview Point Clouds With Unknown Overlap. 804-819 - Jincen Jiang

, Xuequan Lu
, Lizhi Zhao
, Richard Dazeley
, Meili Wang
:
Masked Autoencoders in 3D Point Cloud Representation Learning. 820-831 - Xu Wang

, Yi Jin
, Yigang Cen
, Tao Wang
, Bowen Tang
, Yidong Li
:
LighTN: Light-Weight Transformer Network for Performance-Overhead Tradeoff in Point Cloud Downsampling. 832-847 - Shuangzhi Li

, Zhijie Wang
, Felix Juefei-Xu
, Qing Guo
, Xingyu Li
, Lei Ma
:
Common Corruption Robustness of Point Cloud Detectors: Benchmark and Enhancement. 848-859 - Shanshan Li

, Pan Gao
, Xiaoyang Tan
, Wei Xiang
:
RLGrid: Reinforcement Learning Controlled Grid Deformation for Coarse-to-Fine Point Cloud Completion. 860-874 - Xianglin Guo

, Yifan Wang
, Heng Liu
, Haoran Xie
, Gary Cheng
, Fu Lee Wang
:
Steerable Graph Neural Network on Point Clouds via Second-Order Random Walks. 875-888 - Junteng Zhang

, Jianqiang Wang
, Dandan Ding
, Zhan Ma
:
Scalable Point Cloud Attribute Compression. 889-899 - Wenting Cui

, Shaoyi Du
, Runzhao Yao
, Canhui Tang
, Aixue Ye
, Feng Wen
, Zhiqiang Tian
:
RDD: Learning Reinforced 3D Detectors and Descriptors Based on Policy Gradient. 900-913 - André F. R. Guarda

, Manuel Ruivo
, Luís Coelho
, Abdelrahman Seleem
, Nuno M. M. Rodrigues
, Fernando Pereira
:
Deep Learning-Based Point Cloud Coding and Super-Resolution: A Joint Geometry and Color Approach. 914-926 - Zicheng Zhang

, Wei Sun
, Yucheng Zhu
, Xiongkuo Min
, Wei Wu
, Ying Chen
, Guangtao Zhai
:
Evaluating Point Cloud From Moving Camera Videos: A No-Reference Metric. 927-939 - Lintai Wu

, Qijian Zhang
, Junhui Hou
, Yong Xu
:
Leveraging Single-View Images for Unsupervised 3D Point Cloud Completion. 940-953 - Xin Kang

, Chaoqun Wang
, Xuejin Chen
:
Region-Enhanced Feature Learning for Scene Semantic Segmentation. 954-964 - Weiquan Liu

, Minghao Liu, Shijun Zheng
, Siqi Shen
, Xuesheng Bian
, Yu Zang, Ping Zhong
, Cheng Wang
:
Interpreting Hidden Semantics in the Intermediate Layers of 3D Point Cloud Classification Neural Network. 965-977 - Elena Camuffo

, Umberto Michieli
, Simone Milani
:
Learning From Mistakes: Self-Regularizing Hierarchical Representations in Point Cloud Semantic Segmentation. 978-989 - Xiantong Zhao

, Yinan Han
, Shengjing Tian
, Jian Liu
, Xiuping Liu
:
OST: Efficient One-Stream Network for 3D Single Object Tracking in Point Clouds. 990-1002 - Shuai Guo

, Lei Shi
, Xiaoheng Jiang
, Pei Lv
, Qidong Liu
, Yazhou Hu
, Rongrong Ji
, Mingliang Xu
:
An Efficient Ungrouped Mask Method With two Learnable Parameters for 3D Object Detection. 1003-1017 - Yuan Liang

, Zitian Zhang
, Chuhua Xian
, Shengfeng He
:
Delving Into Multi-Illumination Monocular Depth Estimation: A New Dataset and Method. 1018-1032 - Yanyang Xiao

, Tieyi Zhang
, Juan Cao
, Zhonggui Chen
:
Accelerated Lloyd's Method for Resampling 3D Point Clouds. 1033-1046 - Qing Guo

, Zhijie Wang
, Lubo Wang, Haotian Dong, Felix Juefei-Xu
, Di Lin
, Lei Ma
, Wei Feng
, Yang Liu
:
CarveNet: Carving Point-Block for Complex 3D Shape Completion. 1047-1058 - Jingtao Sun

, Yaonan Wang
, Mingtao Feng
, Xiaofeng Guo
, Huimin Lu
, Xieyuanli Chen
:
Category-Level Multi-Object 9D State Tracking Using Object-Centric Multi-Scale Transformer in Point Cloud Stream. 1072-1085 - Xingyu Gao

, Zhenyu Chen
, Jianze Wei
, Rubo Wang, Zhijun Zhao:
Deep Mutual Distillation for Unsupervised Domain Adaptation Person Re-Identification. 1059-1071 - Yuanpeng Zeng, Ru Zhang, Hao Zhang

, Shaojie Qiao
, Faliang Huang
, Qing Tian
, Yuzhong Peng
:
GCCNet: A Novel Network Leveraging Gated Cross-Correlation for Multi-View Classification. 1086-1099 - Liangchen Liu

, Nannan Wang
, Dawei Zhou
, Decheng Liu
, Xi Yang
, Xinbo Gao
, Tongliang Liu
:
Generalizable Prompt Learning via Gradient Constrained Sharpness-Aware Minimization. 1100-1113 - Liangwei Chen

, Xiren Zhou
, Qiuju Chen, Fang Xiong
, Huanhuan Chen
:
Investigating the Effective Dynamic Information of Spectral Shapes for Audio Classification. 1114-1126 - Abdullah Aman Khan

, Jie Shao
, Sidra Shafiq, Shuyuan Zhu
, Heng Tao Shen
:
Enhancing Few-Shot 3D Point Cloud Classification With Soft Interaction and Self-Attention. 1127-1141 - Guanglin Zhou

, Zhongyi Han
, Shiming Chen
, Biwei Huang, Liming Zhu
, Tongliang Liu
, Lina Yao
, Kun Zhang
:
HCVP: Leveraging Hierarchical Contrastive Visual Prompt for Domain Generalization. 1142-1152 - Cairong Zhao

, Rui Shu
, Shuyang Feng
, Liang Zhu
, Xuekuan Wang:
Scene Text Image Super-Resolution Via Semantic Distillation and Text Perceptual Loss. 1153-1164 - Yuhui Quan

, Xi Wan, Tianxiang Zheng, Yan Huang
, Hui Ji
:
Dual-Path Deep Unsupervised Learning for Multi-Focus Image Fusion. 1165-1176 - Zihan Gao

, Lingling Li
, Xu Liu
, Licheng Jiao
, Fang Liu
, Shuyuan Yang
:
Uncertainty Guided Progressive Few-Shot Learning Perception for Aerial View Synthesis. 1177-1192 - Lingtong Min

, Ziman Fan
, Shunzhou Wang
, Feiyang Dou, Xin Li
, Binglu Wang
:
Adaptive Fusion Learning for Compositional Zero-Shot Recognition. 1193-1204 - Jian Yang

, Jun Li
, Yunong Cai, Guoming Wu
, Zhi-Ping Shi
, Chaodong Tan, Xianglong Liu
:
Hard-Sample Style Guided Patch Attack With RL-Enhanced Motion Pattern for Video Recognition. 1205-1215 - Gaosheng Liu

, Huanjing Yue
, Bihan Wen
, Jing-Yu Yang
:
Learned Focused Plenoptic Image Compression With Local-Global Correlation Learning. 1216-1227 - Jingyun Tian

, Jinjing Gu
, Yuanyuan Pu
, Zhengpeng Zhao
:
Leveraging Enriched Skeleton Representation With Multi-Relational Metrics for Few-Shot Action Recognition. 1228-1241 - Shaocan Liu

, Xingtao Wang
, Ruiqin Xiong
, Xiaopeng Fan
:
GCN-Based Multi-Modality Fusion Network for Action Recognition. 1242-1253 - Deng Xu

, Chao Zhang
, Zechao Li
, Chunlin Chen
, Huaxiong Li
:
Fast Disentangled Slim Tensor Learning for Multi-View Clustering. 1254-1265 - Tae-Young Kim, Jufeng Yang

, Eunil Park
:
MSDLF-K: A Multimodal Feature Learning Approach for Sentiment Analysis in Korean Incorporating Text and Speech. 1266-1276 - Lei Zhao

, Bo Li
, Jixiang Jiang, Xingxing Wei
:
Classification Committee for Active Deep Object Detection. 1277-1288 - Lingzhi Zhao

, Ying Cui
, Yuhang Jia, Yunfei Zhang
, Klara Nahrstedt
:
Enhancing Neural Adaptive Wireless Video Streaming via Cross-Layer Information Exposure and Online Tuning. 1289-1304 - Wenyang Liu

, Kejun Wu
, Tianyi Liu
, Yi Wang
, Kim-Hui Yap
, Lap-Pui Chau
:
ByteNet: Rethinking Multimedia File Fragment Classification Through Visual Perspectives. 1305-1319 - Weikang Wang

, Yuting Su, Jing Liu
, Wei Sun
, Guangtao Zhai
:
Weakly Supervised Referring Video Object Segmentation With Object-Centric Pseudo-Guidance. 1320-1333 - Zeke Zexi Hu

, Xiaoming Chen
, Vera Yuk Ying Chung
, Yiran Shen
:
Beyond Subspace Isolation: Many-to-Many Transformer for Light Field Image Super-Resolution. 1334-1348 - Yu Jiang

, Yuehang Wang
, Siqi Li
, Yongji Zhang
, Qianren Guo
, Qi Chu
, Yue Gao
:
EvCSLR: Event-Guided Continuous Sign Language Recognition and Benchmark. 1349-1361 - Bingzheng Liu

, Jianjun Lei
, Bo Peng
, Zhe Zhang
, Jie Zhu
, Qingming Huang
:
Advancing Generalizable Occlusion Modeling for Neural Human Radiance Field. 1362-1373 - Rui Tian

, Zuxuan Wu
, Qi Dai
, Micah Goldblum, Han Hu
, Yu-Gang Jiang
:
The Role of ViT Design and Training in Robustness to Common Corruptions. 1374-1385 - Yalan Qin

, Nan Pu
, Hanzhou Wu
, Nicu Sebe
:
Discriminative Anchor Learning for Efficient Multi-View Clustering. 1386-1396 - Jingyao Wang

, Luntian Mou
, Changwen Zheng
, Wen Gao:
Image-Based Freeform Handwriting Authentication With Energy-Oriented Self-Supervised Learning. 1397-1409 - Dong Chen

, Kaihang Pan, Guangyu Dai
, Guoming Wang
, Yueting Zhuang
, Siliang Tang
, Mingliang Xu
:
Improving Vision Anomaly Detection With the Guidance of Language Modality. 1410-1419 - Li Wang

, Yunzhou Zhang
, Fawei Ge
, Wenjing Bai
, Yifan Wang
:
Learning Local Features by Reinforcing Spatial Structure Information. 1420-1431 - Feiwei Qin

, Gaoyang Zhan, Meie Fang
, C. L. Philip Chen
, Ping Li
:
VGNet: Multimodal Feature Extraction and Fusion Network for 3D CAD Model Retrieval. 1432-1447 - Chuanming Wang

, Huiyuan Fu
, Peiye Liu, Huadong Ma
:
Part-Level Relationship Learning for Fine-Grained Few-Shot Image Classification. 1448-1460 - Jinpu Zhang

, Ziwen Li
, Ruonan Wei
, Yuehuan Wang
:
Augment One With Others: Generalizing to Unforeseen Variations for Visual Tracking. 1461-1474 - Chunlei Peng

, Boyu Wang, Decheng Liu
, Nannan Wang
, Ruimin Hu, Xinbo Gao
:
Masked Attribute Description Embedding for Cloth-Changing Person Re-Identification. 1475-1485 - Xingfeng Li

, Yuangang Pan
, Yuan Sun
, Quansen Sun
, Yinghui Sun
, Ivor W. Tsang
, Zhenwen Ren
:
Incomplete Multi-View Clustering With Paired and Balanced Dynamic Anchor Learning. 1486-1497 - Guanghui Wu

, Lili Chen, Zengping Chen
:
Uni-DPM: Unifying Self-Supervised Monocular Depth, Pose, and Object Motion Estimation With a Shared Representation. 1498-1511 - Yi Liu

, Qiuping Jiang
, Xinyi Wang, Ting Luo
, Jingchun Zhou
:
Underwater Image Enhancement With Cascaded Contrastive Learning. 1512-1525 - Md. Moniruzzaman

, Zhaozheng Yin
:
Progressive Knowledge Distillation From Different Levels of Teachers for Online Action Detection. 1526-1537 - Mingze Yao

, Huibing Wang
, Yawei Chen
, Xianping Fu
:
Between/Within View Information Completing for Tensorial Incomplete Multi-View Clustering. 1538-1550 - Dongqing Wu

, Huihui Li
, Cang Gu
, Lei Guo, Hang Liu
:
Dual Stream Relation Learning Network for Image-Text Retrieval. 1551-1565 - Hailong Ma

, Sibo Feng
, Xi Xiao
, Chenyu Dong, Xingyue Cheng:
Image Shooting Parameter-Guided Cascade Image Retouching Network: Think Like an Artist. 1566-1573 - Song Chang

, Youfang Lin
, Shuo Zhang
:
Structure-Aware Pre-Selected Neural Rendering for Light Field Reconstruction. 1574-1587 - Jianxin Shi

, Miao Zhang
, Linfeng Shen
, Jiangchuan Liu
, Lingjun Pu
, Jingdong Xu
:
Towards Neural Codec-Empowered 360$^\circ$ Video Streaming: A Saliency-Aided Synergistic Approach. 1588-1600 - Xiating Jin

, Jiajun Bu
, Zhi Yu
, Hui Zhang
, Yaonan Wang
:
Federated Hallucination Translation and Source-Free Regularization Adaptation in Decentralized Domain Adaptation for Foggy Scene Understanding. 1601-1616 - Mehwish Ghafoor, Arif Mahmood

, Muhammad Bilal
:
Enhancing 3D Human Pose Estimation Amidst Severe Occlusion With Dual Transformer Fusion. 1617-1624 - Wei Gao

, Jintian Feng
, Mengqi Wei
, Rui Zou
, Jianwen Sun
:
Towards a Multi-Granulated Statistical Framework for Human-Machine Collaboration in Image Classification. 1625-1636 - Shishun Tian

, Tiantian Zeng
, Zhengyu Zhang
, Wenbin Zou
, Xia Li
:
Dual Residual-Guided Interactive Learning for the Quality Assessment of Enhanced Images. 1637-1651 - Weida Chen

, Jie Jiang
, Linfei Wang
, Huafeng Li
, Yibing Zhan
, Dapeng Tao
:
Cps-STS: Bridging the Gap Between Content and Position for Coarse-Point-Supervised Scene Text Spotter. 1652-1664 - Zhongwei Shen

, Xiaojun Wu
, Hui Li
, Tianyang Xu
, Cong Wu
:
I Know How You Move: Explicit Motion Estimation for Human Action Recognition. 1665-1676 - Hai Liu

, Cheng Zhang
, Yongjian Deng
, Bochen Xie
, Tingting Liu
, Youfu Li
:
TransIFC: Invariant Cues-Aware Feature Concentration Learning for Efficient Fine-Grained Bird Image Classification. 1677-1690 - Quanquan Xiao

, Haiyan Jin
, Haonan Su
, Yuanlin Zhang, Zhaolin Xiao
, Bin Wang
:
SPDFusion:A Semantic Prior Knowledge-Driven Method for Infrared and Visible Image Fusion. 1691-1705 - Renjie Zhang

, Di Lin
, Xin Wang
, George Baciu
, C. L. Philip Chen
, Ping Li
:
Accurate-PGNet: Learning to Assemble Perceptual Body Parts for Accurate Human Skeleton Establishment. 1706-1721 - Ke Liang

, Lingyuan Meng
, Hao Li, Meng Liu
, Siwei Wang
, Sihang Zhou
, Xinwang Liu
, Kunlun He
:
MGKsite: Multi-Modal Knowledge-Driven Site Selection via Intra and Inter-Modal Graph Fusion. 1722-1735 - Haoran Li

, Yulan Guo
, Jiali You
, Xiaojian You, Zhenwen Ren
:
Graph Proxy Fusion: Consensus Graph Intermediated Multi-View Local Information Fusion Clustering. 1736-1747 - Mina Han

, Kailong Yu
, Weiran Li
, Qiannan Guo
, Zhenbo Li
:
Colliding Depths and Fusion: Leveraging Adaptive Feature Maps and Restorable Depth Recharge for Infrared and Visible Scene Fusion. 1748-1759 - Yijun Chen

, Xianwei Zheng
, Zhulun Yang
, Xutao Li
, Jiantao Zhou
, Yuanman Li
:
DuPMAM: An Efficient Dual Perception Framework Equipped With a Sharp Testing Strategy for Point Cloud Analysis. 1760-1771 - Guozhang Li

, Xinpeng Ding, De Cheng
, Jie Li
, Nannan Wang
, Xinbo Gao
:
ETC: Temporal Boundary Expand Then Clarify for Weakly Supervised Video Grounding With Multimodal Large Language Model. 1772-1782 - Yi Xiao

, Qiangqiang Yuan
, Kui Jiang
, Yuzeng Chen
, Qiang Zhang
, Chia-Wen Lin
:
Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution. 1783-1796 - Shuo Wang

, Xinyu Zhang, Meng Wang
, Xiangnan He
:
Symmetric Hallucination With Knowledge Transfer for Few-Shot Learning. 1797-1807 - Yu Luo

, Xuanrong Chen, Jie Ling
, Chao Huang
, Wei Zhou
, Guanghui Yue
:
Unsupervised Low-Light Image Enhancement With Self-Paced Learning. 1808-1820 - Xiaoyang Hao

, Han Li
, Jing Sun
, Lei Wang
, Jianping Fan
:
A Twist Representation and Shape Refinement Method for Human Mesh Recovery. 1821-1834 - Yidi Li

, Hong Liu
, Bing Yang
:
STNet: Deep Audio-Visual Fusion Network for Robust Speaker Tracking. 1835-1847 - Hua Yu

, Yaqing Hou
, Wenbin Pei
, Yew-Soon Ong
, Qiang Zhang
:
DivDiff: A Conditional Diffusion Model for Diverse Human Motion Prediction. 1848-1859 - Leiyu Xie

, Yuxing Yang, Zeyu Fu
, Syed Mohsen Naqvi
:
Position and Orientation Aware One-Shot Learning for Medical Action Recognition From Signal Data. 1860-1873 - Yanan Zhu

, Jiaqiu Ai
, Le Wu
, Dan Guo
, Wei Jia
, Richang Hong
:
An Active Multi-Target Domain Adaptation Strategy: Progressive Class Prototype Rectification. 1874-1886 - Jie Zhang

, Kangneng Zhou
, Yan Luximon
, Tong-Yee Lee
, Ping Li
:
3DCMM: 3D Comprehensive Morphable Models With UV-UNet for Accurate Head Creation. 1887-1900 - Sheng Zheng

, Chaoning Zhang
, Xinhong Hao
:
Black-Box Targeted Adversarial Attack on Segment Anything (SAM). 1901-1913 - Hao Feng

, Wendi Wang
, Shaokai Liu
, Jiajun Deng
, Wengang Zhou
, Houqiang Li
:
DeepEraser: Deep Iterative Context Mining for Generic Text Eraser. 1914-1925 - Bo Ding

, Libao Zhang
, Hongbo Sun, Yongjun He
, Jian Qin
:
Semantic-Enhanced ULIP for Zero-Shot 3D Shape Recognition. 1926-1936 - Xu Han

, Junyu Gao
, Chuang Yang
, Yuan Yuan, Qi Wang
:
Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera. 1937-1949 - Jingcheng Ke

, Dele Wang
, Jun-Cheng Chen
, I-Hong Jhuo
, Chia-Wen Lin
, Yen-Yu Lin
:
Make Graph-Based Referring Expression Comprehension Great Again Through Expression-Guided Dynamic Gating and Regression. 1950-1961 - Zhiqiang Fu

, Yao Zhao
, Dongxia Chang
, Yiming Wang
, Jie Wen
:
Reordered $k$-Means: A New Baseline for View-Unaligned Multi-View Clustering. 1962-1972 - Huafeng Li

, Shedan Yang
, Yafei Zhang
, Dapeng Tao
, Zhengtao Yu
:
Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval. 1973-1987 - Zhengyi Liu

, Sheng Deng
, Xinrui Wang, Linbo Wang
, Xianyong Fang
, Bin Tang
:
SSFam: Scribble Supervised Salient Object Detection Family. 1988-2000 - Hao Luo

, Baoliang Chen
, Lingyu Zhu
, Peilin Chen
, Shiqi Wang
:
RCNet: Deep Recurrent Collaborative Network for Multi-View Low-Light Image Enhancement. 2001-2014 - Xu Cheng

, Hao Yu
, Kevin Ho Man Cheng, Zitong Yu
, Guoying Zhao
:
MDANet: Modality-Aware Domain Alignment Network for Visible-Infrared Person Re-Identification. 2015-2027 - Ying Fu

, Xinyu Zhu
, Xiaojie Li
, Xin Wang
, Xi Wu, Shu Hu
, Yi Wu
, Siwei Lyu
, Wei Liu
:
VB-KGN: Variational Bayesian Kernel Generation Networks for Motion Image Deblurring. 2028-2042 - Yiting Lu

, Xin Li
, Jianzhao Liu, Zhibo Chen
:
StyleAM: Perception-Oriented Unsupervised Domain Adaption for No-Reference Image Quality Assessment. 2043-2058 - Wenhao Xu

, Changwei Wang
, Rongtao Xu
, Shibiao Xu
, Weiliang Meng
, Man Zhang
, Xiaopeng Zhang
:
Token Masking Transformer for Weakly Supervised Object Localization. 2059-2069 - Rongqun Lin, Wenhan Yang

, Baoliang Chen
, Pingping Zhang, Yue Liu
, Shiqi Wang
, Sam Kwong
:
HFGlobalFormer: When High-Frequency Recovery Meets Global Context Modeling for Compressed Image Deraindrop. 2070-2082 - Zhen Lan

, Zixing Li, Chao Yan
, Xiaojia Xiang
, Dengqing Tang
, Han Zhou, Jun Lai
:
Adaptive Knowledge Distillation With Attention-Based Multi-Modal Fusion for Robust Dim Object Detection. 2083-2096 - Kai Zhang

, Ludan Sun
, Jun Yan
, Wenbo Wan
, Jiande Sun
, Shuyuan Yang
, Huaxiang Zhang
:
Texture-Content Dual Guided Network for Visible and Infrared Image Fusion. 2097-2111 - Gang Hu

, Yafei Lv
, Jianting Zhang, Qian Wu
, Zaidao Wen
:
CLIP-Based Modality Compensation for Visible-Infrared Image Re-Identification. 2112-2126 - Bowen Shi

, Xiaopeng Zhang
, Yaoming Wang
, Wenrui Dai
, Junni Zou
, Hongkai Xiong
:
MENSA: Multi-Dataset Harmonized Pretraining for Semantic Segmentation. 2127-2140 - Shaowei Wang

, Lingling Zhang
, Wenjun Wu
, Tao Qin
, Xinyu Zhang
, Jun Liu
:
Alignment-Guided Self-Supervised Learning for Diagram Question Answering. 2141-2154 - Fan Nie

, Jiangqun Ni
, Jian Zhang, Bin Zhang, Weizhe Zhang
:
DIP: Diffusion Learning of Inconsistency Pattern for General DeepFake Detection. 2155-2167 - Xingjian He

, Sihan Chen, Fan Ma
, Zhicheng Huang, Xiaojie Jin
, Zikang Liu, Dongmei Fu
, Yi Yang, Jing Liu
, Jiashi Feng
:
VLAB: Enhancing Video Language Pretraining by Feature Adapting and Blending. 2168-2180 - Wentao Chao

, Fuqing Duan
, Yulan Guo
, Guanghui Wang
:
MaskBlur: Spatial and Angular Data Augmentation for Light Field Image Super-Resolution. 2181-2193 - Di Li

, Susanto Rahardja
:
Rethinking Affine Transform for Efficient Image Enhancement: A Color Space Perspective. 2194-2205 - Zijun Wang

, Shijie Li
, Jun Peng, Yonghang Tai
, Zhengtao Yu
:
Viscoelastic Cluster-Constrained PBD-Based Soft Tissue Behavior and Interactive Media Applications for Surgical Simulation. 2206-2220 - Mengqi Yuan

, Gengyun Jia
, Bing-Kun Bao
:
Relation Inference Enhancement Network for Visual Commonsense Reasoning. 2221-2231 - Yuanyuan Wang

, Meng Liu
, Xuemeng Song
, Liqiang Nie
:
TR-Adapter: Parameter-Efficient Transfer Learning for Video Question Answering. 2232-2242 - Nannan Lu

, Zhiyuan Han, Zhen Tan:
A Hypergraph Based Contextual Relationship Modeling Method for Multimodal Emotion Recognition in Conversation. 2243-2255 - Wenjie Li

, Juncheng Li
, Guangwei Gao
, Weihong Deng
, Jian Yang
, Guo-Jun Qi
, Chia-Wen Lin
:
Efficient Image Super-Resolution With Feature Interaction Weighted Hybrid Network. 2256-2267 - Ruoyu Zhao

, Yushu Zhang
, Junhao Ji
, Shuang Yi
, Wenying Wen
, Rushi Lan
:
AES-AUDIO: An Encryption Scheme for Audio Supporting Differentiated Decryption. 2268-2280 - Yang Li

, Licheng Jiao
, Xu Liu
, Fang Liu
, Lingling Li
, Puhua Chen
:
LGSNet: Local-Global Semantics Learning Object Detection. 2281-2292 - Yang Chen

, Tian He
, Junfeng Fu, Ling Wang
, Jingcai Guo
, Ting Hu, Hong Cheng
:
Vision-Language Meets the Skeleton: Progressively Distillation With Cross-Modal Knowledge for 3D Action Representation Learning. 2293-2303 - Ruohong Huan

, Guowei Zhong
, Peng Chen
, Ronghua Liang
:
MulDeF: A Model-Agnostic Debiasing Framework for Robust Multimodal Sentiment Analysis. 2304-2319 - Zhanxuan Mei

, Yun-Cheng Wang
, C.-C. Jay Kuo
:
Blind Video Quality Assessment at the Edge. 2320-2334 - Shijian Deng

, Erin E. Kosloski
, Siddhi Patel
, Zeke A. Barnett, Yiyang Nan
, Alexander Kaplan, Sisira Aarukapalli, William T. Doan
, Matthew Wang, Harsh Singh, Pamela R. Rollins
, Yapeng Tian
:
Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition. 2335-2346 - Xin Ma, Qiang Li

, Yuan Yuan, Qi Wang
:
Confident Multi-View Stereo. 2347-2361 - Ruijie Tao

, Xinyuan Qian
, Rohan Kumar Das
, Xiaoxue Gao, Jiadong Wang
, Haizhou Li
:
Enhancing Real-World Active Speaker Detection With Multi-Modal Extraction Pre-Training. 2362-2373 - Miao Zhang

, Beiqi Hu
, Shunyu Yao
, Yongri Piao
, Huchuan Lu
:
PMNet: Predator-Mimicking Network for Video Camouflaged Object Detection. 2374-2383 - Ye Wang

, Shaohui Mei
, Mingyang Ma
, Yuheng Liu
, Yuru Su
:
HTACPE: A Hybrid Transformer With Adaptive Content and Position Embedding for Sample Learning Efficiency of Hyperspectral Tracker. 2384-2398 - Wenyao Zhang

, Letian Wu
, Zequn Zhang
, Tao Yu
, Chao Ma
, Xin Jin
, Xiaokang Yang
, Wenjun Zeng:
Unleash the Power of Vision-Language Models by Visual Attention Prompt and Multimodal Interaction. 2399-2411 - Yande Li

, Mingjie Wang, Minglun Gong
, Yonggang Lu
, Li Liu
:
FER-Former: Multimodal Transformer for Facial Expression Recognition. 2412-2422 - Binwei Xu

, Qiuping Jiang
, Haoran Liang
, Dingwen Zhang
, Ronghua Liang
, Peng Chen
:
Learning Video Salient Object Detection Progressively From Unlabeled Videos. 2423-2435 - Ze Song

, Xudong Kang
, Xiaohui Wei, Renwei Dian
, Jinyang Liu
, Shutao Li
:
Multi-Granularity Context Perception Network for Open Set Recognition of Camouflaged Objects. 2436-2449 - Hailiang Gao

, Guo-Sen Xie
, Rui Yan
, Qiongjie Cui
, Hongyu Qu
, Xiangbo Shu
:
Hierarchical Motion-Enhanced Matching Framework for Few-Shot Action Recognition. 2450-2462 - Shuzhao Xie

, Yuan Xue
, Yifei Zhu
, Zhi Wang
:
SkyML: A MLaaS Federation Design for Multicloud-Based Multimedia Analytics. 2463-2476 - Jing Zhang, Ruiheng Zhang

, Lixin Xu
, Xiankai Lu
, Yushu Yu
, Min Xu
, He Zhao
:
FasterSal: Robust and Real-Time Single-Stream Architecture for RGB-D Salient Object Detection. 2477-2488 - Wu Chen, Qiuping Jiang

, Wei Zhou
, Feng Shao
, Guangtao Zhai
, Weisi Lin
:
No-Reference Point Cloud Quality Assessment via Graph Convolutional Network. 2489-2502 - Lizhi Xiong

, Linsen Ding
, Mengqi Cao, Zhihua Xia
, Yun-Qing Shi:
SEDN: A Spatiotemporal Encoder-Decoder Network for End-to-End Object Removal Forgery Detection in High-Resolution Videos. 2503-2515 - Yuhao Qing

, Si Liu
, Hai Wang
, Yueying Wang
:
DiffUIE: Learning Latent Global Priors in Diffusion Models for Underwater Image Enhancement. 2516-2529 - Zhenbing Liu

, Jieyu Huang
, Wenhao Wang
, Haoxiang Lu
, Rushi Lan
:
Learning Distinguishable Degradation Maps for Unknown Image Super-Resolution. 2530-2542 - Rongjian Xu

, Zhilu Zhang
, Renlong Wu, Wangmeng Zuo
:
NIR-Assisted Image Denoising: A Selective Fusion Approach and a Real-World Benchmark Dataset. 2543-2555 - Hangwei Chen

, Feng Shao
, Weiyi Jing
, Huizhi Wang
, Qiuping Jiang
:
Cross-Modal Hierarchical Knowledge Distillation for Image Aesthetics Assessment. 2556-2569 - Tianshun Han

, Shengnan Gui
, Yiqing Huang
, Baihui Li, Lijian Liu, Benjia Zhou
, Ning Jiang
, Quan Lu
, Ruicong Zhi
, Yanyan Liang
, Du Zhang, Jun Wan
:
PMMTalk$:$ Speech-Driven 3D Facial Animation From Complementary Pseudo Multi-Modal Features. 2570-2581 - Hao Xu

, Bin Tan
, Yihao Chen
, Die Hu
, Jun Wu
:
Enhancing Distributed Source Coding With Encoder-Centric Frequency Adaptation and Spatial Transformation. 2582-2592 - Baoyang Mu

, Feng Shao
, Zhengxuan Xie
, Hangwei Chen
, Zhongjie Zhu
, Qiuping Jiang
:
MISF-Net: Modality-Invariant and -Specific Fusion Network for RGB-T Crowd Counting. 2593-2607 - Yixiong Yang

, Hassan Ahmed Sial
, Ramon Baldrich, María Vanrell
:
Relighting From a Single Image: Datasets and Deep Intrinsic-Based Architecture. 2608-2622 - Heng Wang

, Cong Wang
, Yuan Yuan:
Hierarchical Context Measurement Network for Single Hyperspectral Image Super-Resolution. 2623-2637 - Baoliang Chen

, Hanwei Zhu
, Lingyu Zhu
, Shanshe Wang
, Jingshan Pan, Shiqi Wang
:
Debiased Mapping for Full-Reference Image Quality Assessment. 2638-2649 - Chao Sun

, Min Chen
, Chuanbo Zhu
, Sheng Zhang, Ping Lu, Jincai Chen
:
Listen With Seeing: Cross-Modal Contrastive Learning for Audio-Visual Event Localization. 2650-2665 - Renyang Liu

, Kwok-Yan Lam
, Wei Zhou
, Sixing Wu
, Jun Zhao
, Dongting Hu, Mingming Gong
:
STBA: Towards Evaluating the Robustness of DNNs for Query-Limited Black-Box Scenario. 2666-2681 - Shih-Fang Chen

, Jun-Cheng Chen
, I-Hong Jhuo
, Yen-Yu Lin
:
Improving Visual Object Tracking Through Visual Prompting. 2682-2694 - Jiebin Yan

, Jiale Rao
, Xuelin Liu
, Yuming Fang
, Yifan Zuo
, Weide Liu
:
Subjective and Objective Quality Assessment of Non-Uniformly Distorted Omnidirectional Images. 2695-2707 - Weiqing Min

, Shuqiang Jiang, Petia Radeva, Vladimir Pavlovic, Chong-Wah Ngo, Kiyoharu Aizawa, Wanqing Li:
Guest Editorial: When Multimedia Meets Food: Multimedia Computing for Food Data Analysis and Applications. 2708-2712 - Gaojie Li

, Yaochen Li
, Jingle Liu
, Wei Guo
, Wenneng Tang
, Yuehu Liu:
ESE-GAN: Zero-Shot Food Image Classification Based on Low Dimensional Embedding of Visual Features. 2713-2723 - Guoshan Liu

, Yang Jiao
, Jingjing Chen
, Bin Zhu
, Yu-Gang Jiang
:
From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios. 2724-2733 - Wenrui Li, Jiahui Li, Mengyao Ma, Xiaopeng Hong, Xiaopeng Fan:

Multi-Scale Spiking Pyramid Wireless Communication Framework for Food Recognition. 2734-2746 - Zhe Xue

, Yawen Li
, Zhongchao Guan
, Wenling Li
, Meiyu Liang
, Hai Zhou
:
Robust Multi-Graph Contrastive Network for Incomplete Multi-View Clustering. 2747-2759 - Chunlai Dong

, Haochao Ying
, Renjun Hu, Yuyang Xu
, Jintai Chen
, Fuzhen Zhuang
, Jian Wu
:
A Progressively-Passing-Then-Disentangling Approach to Recipe Recommendation. 2760-2771 - Mengling Xu

, Jie Wang
, Ming Tao
, Bing-Kun Bao
, Changsheng Xu
:
CookGALIP: Recipe Controllable Generative Adversarial CLIPs With Sequential Ingredient Prompts for Food Image Generation. 2772-2782 - Xu Huang

, Jin Liu
, Zhizhong Zhang
, Yuan Xie
, Yongqiang Tang
, Wensheng Zhang
, Xiaohui Cui
:
Cross-Modal Recipe Retrieval With Fine-Grained Prompting Alignment and Evidential Semantic Consistency. 2783-2794 - Xing Lan

, Jiayi Lyu
, Hanyu Jiang
, Kun Dong
, Zehai Niu
, Yi Zhang
, Jian Xue
:
FoodSAM: Any Food Segmentation. 2795-2808 - Samuel Ortega

, Tatiana N. Ageeva
, Silje Kristoffersen, Karsten Heia
, Heidi Nilsen
:
High Throughput Shelf Life Determination of Atlantic Cod (Gadus morhua L.) by Use of Hyperspectral Imaging. 2809-2824 - Zhaoyan Ming

, Zeyu Xie, Chao Zhang, Kui Su
, Changzheng Yuan
, Tat-Seng Chua
:
Robust Visual Food Recognition for Enriching Nutrition Knowledge Bases. 2825-2835 - Yi Chen

, Qiuxu Fan
, Xianpeng Yuan
, Qinghui Zhang
, Yu Dong
:
PGD-GP: A Chinese Named Entity Recognition Model for Constructing Food Safety Standard Knowledge Graph. 2836-2847 - Qi Wang

, Dong Wang
, Weidong Min
, Di Gai
, Qing Han
, Cheng Zha
, Yuling Zhong
:
Threefold Encoder Interaction: Hierarchical Multi-Grained Semantic Alignment for Cross-Modal Food Retrieval. 2848-2862 - Lanjun Wang

, Chenyu Zhang
, An-An Liu
, Bo Yang
, Mingwang Hu
, Xinran Qiao
, Lei Wang
, Jianlin He
, Qiang Liu
:
Toward Chinese Food Understanding: A Cross-Modal Ingredient-Level Benchmark. 2863-2874 - Hong Chen

, Xin Wang
, Guanning Zeng, Yipeng Zhang
, Yuwei Zhou
, Feilin Han
, Yaofei Wu
, Wenwu Zhu
:
VideoDreamer: Customized Multi-Subject Text-to-Video Generation With Disen-Mix Finetuning on Language-Video Foundation Models. 2875-2885 - Chaofan Luo

, Donglin Di
, Xun Yang
, Yongjia Ma, Zhou Xue, Wei Chen, Xiaofei Gou, Yebin Liu
:
TrAME: Trajectory-Anchored Multi-View Editing for Text-Guided 3D Gaussian Manipulation. 2886-2898 - Haomiao Xiong, Yunzhi Zhuge

, Jiawen Zhu
, Lu Zhang
, Huchuan Lu
:
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding. 2899-2911 - Qingwang Wang

, Chaohui Li
, Yi Liu, Qiubai Zhu
, Jian Song
, Tao Shen
:
An Adaptive Framework Embedded With LLM for Knowledge Graph Construction. 2912-2923 - Xinhao Li, Yun Liu

, Guolei Sun
, Min Wu
, Le Zhang
, Ce Zhu
:
Towards Open-Vocabulary Video Semantic Segmentation. 2924-2934 - Lu Shi

, Shichao Kan
, Yi Jin
, Linna Zhang, Yigang Cen
:
Multi-Modal Self-Perception Enhanced Large Language Model for 3D Region-of-Interest Captioning With Limited Data. 2935-2948 - Binglu Wang

, Yao Tian
, Shunzhou Wang
, Le Yang
:
Multimodal Large Models are Effective Action Anticipators. 2949-2960 - Zhenwei Shao, Zhou Yu

, Jun Yu
, Xuecheng Ouyang, Lihao Zheng
, Zhenbiao Gai, Mingyang Wang, Zhenzhong Kuang
, Jiajun Ding
:
Imp: Highly Capable Large Multimodal Models for Mobile Devices. 2961-2974 - Peng Liu

, Jinhong Deng
, Lixin Duan
, Wen Li
, Fengmao Lv
:
Segmenting Anything in the Dark via Depth Perception. 2975-2986 - Jiaxing Yang

, Lihe Zhang
, Huchuan Lu
:
Semantics Alternating Enhancement and Bidirectional Aggregation for Referring Video Object Segmentation. 2987-2998 - Yue Zhu

, Kun Li
, Zongxin Yang
:
Exploiting EfficientSAM and Temporal Coherence for Audio-Visual Segmentation. 2999-3008 - Run Li, Dawei Zhang, Yanchao Wang, Yunliang Jiang, Zhonglong Zheng, Sang-Woon Jeon, Hua Wang:

Open-Vocabulary Multi-Object Tracking With Domain Generalized and Temporally Adaptive Features. 3009-3022 - Zixuan Ding

, Zihan Zhou
, Hui Chen
, Tianxiang Hao
, Yizhe Xiong
, Sicheng Zhao
, Qiang Zhang
, Jungong Han
:
Cross-Modality Prompts: Few-Shot Multi-Label Recognition With Single-Label Training. 3023-3033 - Yizhe Li

, Sanping Zhou
, Zheng Qin
, Le Wang
:
Visual-Linguistic Feature Alignment With Semantic and Kinematic Guidance for Referring Multi-Object Tracking. 3034-3044 - Shidong Cao

, Zhonghan Zhao
, Shengyu Hao
, Wenhao Chai
, Jenq-Neng Hwang
, Hongwei Wang
, Gaoang Wang
:
Efficient Transfer From Image-Based Large Multimodal Models to Video Tasks. 3045-3056 - Yunpeng Mei

, Jian Sun
, Zhihong Peng
, Fang Deng
, Gang Wang
, Jie Chen
:
RoG-SAM: A Language-Driven Framework for Instance-Level Robotic Grasping Detection. 3057-3068 - Xing Lan

, Jian Xue
, Ji Qi, Dongmei Jiang
, Ke Lu
, Tat-Seng Chua
:
ExpLLM: Towards Chain of Thought for Facial Expression Recognition. 3069-3081 - Huaiwen Zhang

, Tianci Wu
, Yinwei Wei
:
Multi-View User Preference Modeling for Personalized Text-to-Image Generation. 3082-3091 - Weijun Zhuang

, Bowen Dong, Zhilin Zhu, Zhijun Li
, Jie Liu
, Yaowei Wang
, Xiaopeng Hong
, Xin Li
, Wangmeng Zuo
:
Spatial-Temporal Saliency Guided Unbiased Contrastive Learning for Video Scene Graph Generation. 3092-3104 - Zihan Huang

, Tao Wu
, Wang Lin
, Shengyu Zhang
, Jingyuan Chen
, Fei Wu
:
AutoGeo: Automating Geometric Image Dataset Creation for Enhanced Geometry Understanding. 3105-3116 - Yixuan Zhang

, Chuanbin Liu
, Yizhi Liu
, Yifan Gao
, Zhiying Lu
, Hongtao Xie
, Yongdong Zhang
:
Leveraging Concise Concepts With Probabilistic Modeling for Interpretable Visual Recognition. 3117-3131 - Chao Huang

, Weiliang Huang, Qiuping Jiang
, Wei Wang
, Jie Wen
, Bob Zhang
:
Multimodal Evidential Learning for Open-World Weakly-Supervised Video Anomaly Detection. 3132-3143 - Rui Yao

, Anqi Zhang, Yong Zhou
, Jiaqi Zhao
, Bing Liu
, Abdulmotaleb El-Saddik
:
Adversarial Geometric Attacks for 3D Point Cloud Object Tracking. 3144-3157 - Lili Wei

, Congyan Lang
, Zheming Xu
, Liqian Liang
, Jun Liu
:
Few-Shot 3D Point Cloud Segmentation via Relation Consistency-Guided Heterogeneous Prototypes. 3158-3170 - Yuqi Ma

, Mengyin Liu
, Chao Zhu
, Xu-Cheng Yin
:
HA-FGOVD: Highlighting Fine-Grained Attributes via Explicit Linear Composition for Open-Vocabulary Object Detection. 3171-3183 - Yu Weng

, Wenbin He
, Jun Dong
, Chaomurilige
, Xuan Liu
, Zheng Liu
:
Cross-Lingual Adaptation for Vision-Language Model via Multimodal Semantic Distillation. 3184-3196 - Xiaoshuo Jia

, Qingzhen Xu
, Aiqing Zhu
, Xiaomei Kuang
:
Multi-Target Pose Estimation and Behavior Analysis Based on Symmetric Cascaded AdderNet. 3197-3209 - Haoqian Wu, Minda Zhao

, Zhipeng Hu
, Changjie Fan
, Lincheng Li
, Weijie Chen, Rui Zhao, Xin Yu
:
ICE: Interactive 3D Game Character Facial Editing via Dialogue. 3210-3223 - Wai Keung Wong

, Lunke Fei
, Jianyang Qin, Shuping Zhao
, Jie Wen
, Zhihao He:
Heterogeneous Pairwise-Semantic Enhancement Hashing for Large-Scale Cross-Modal Retrieval. 3238-3250 - Lei Wang

, Yibing Zhan
, Leilei Ma
, Dapeng Tao
, Liang Ding
, Chen Gong
:
SpliceMix: A Cross-Scale and Semantic Blending Augmentation Strategy for Multi-Label Image Classification. 3251-3265 - Fupeng Chu

, Yang Cong
, Yanmei Wang
, Ronghan Chen
:
DetailRecon: Focusing on Detailed Regions for Online Monocular 3D Reconstruction. 3266-3278 - Yang Chen

, Lin Zhang
, Shengjie Zhao
, Yicong Zhou
:
ATM-NeRF: Accelerating Training for NeRF Rendering on Mobile Devices via Geometric Regularization. 3279-3293 - Sheng-Yu Huang

, Chi-Pin Huang
, Kai-Po Chang
, Zi-Ting Chou, I-Jieh Liu
, Yu-Chiang Frank Wang
:
Learning Shape-Color Diffusion Priors for Text-Guided 3D Object Generation. 3294-3306 - Jiahe Zhao

, Ruibing Hou
, Hong Chang
, Xinqian Gu
, Bingpeng Ma
, Shiguang Shan
, Xilin Chen
:
Clothes-Changing Person Re-Identification With Feasibility-Aware Intermediary Matching. 3307-3319 - Yuehai Chen

, Qingzhong Wang
, Jing Yang
, Badong Chen
, Haoyi Xiong
, Shaoyi Du
:
CSCC: Cross-Scene Crowd Counting via Learning to Diversify for Domain Generalization. 3320-3330 - Shuyi Mao

, Xinpeng Li
, Fan Zhang
, Xiaojiang Peng
, Yang Yang
:
Facial Action Units as a Joint Dataset Training Bridge for Facial Expression Recognition. 3331-3342 - Yuanbo Wen

, Tao Gao
, Ziqi Li
, Jing Zhang
, Kaihao Zhang
, Ting Chen
:
All-in-One Weather-Degraded Image Restoration Via Adaptive Degradation-Aware Self-Prompting Model. 3343-3355 - Hanlin Bai

, Xin Gao
, Wei Deng, Jianwang Gan
, Yijin Xiong
, Kangkang Kou, Guoying Zhang
:
QRNet: Quaternion-Based Refinement Network for Surface Normal Estimation. 3356-3369 - Songlin Dong

, Yingjie Chen, Yuhang He
, Yuhan Jin, Alex C. Kot
, Yihong Gong
:
Analogical Augmentation and Significance Analysis for Online Task-Free Continual Learning. 3370-3382 - Kangle Wu

, Jun Huang
, Yong Ma
, Fan Fan
, Jiayi Ma
:
Universal Infrared Image Nonuniformity Correction via Stripe-Aware Attention Network. 3383-3398 - Zheling Meng

, Bo Peng
, Jing Dong
:
Latent Watermark: Inject and Detect Watermarks in Latent Diffusion Space. 3399-3410 - Bo Wang

, Zhao Zhang
, Mingbo Zhao
, Xiaojie Jin
, Mingliang Xu
, Meng Wang
:
SeaCap: Multi-Sight Embedding and Alignment for One-Stage Image Captioner. 3411-3425 - Jingjing Wu

, Yifan Sun, Richang Hong
:
Local Fine-Grained Visual Tracking. 3426-3436 - Musrea Abdo Ghaseb

, Ahmed Elhayek
, Fawaz Alsolami
, Abdullah Marish Ali
:
S3GAAR: Segmented Spatiotemporal Skeleton Graph-Attention for Action Recognition. 3437-3446 - Yuwu Lu

, Dewei Lin, Linlin Shen
, Yicong Zhou
, Jiahui Pan
:
Heterogeneous Domain Adaptation via Correlative and Discriminative Feature Learning. 3447-3461 - Mengru Ma

, Wenping Ma
, Licheng Jiao
, Lingling Li
, Xu Liu
, Fang Liu
, Shuyuan Yang
, Yuwei Guo:
A 3D Self-Awareness Diffusion Network for Multimodal Classification. 3462-3475 - Xiaoqing Liu

, Huanqiang Zeng
, Yifan Shi
, Jianqing Zhu
, Kaixiang Yang
, Zhiwen Yu
:
Ensemble Prototype Networks for Unsupervised Cross-Modal Hashing With Cross-Task Consistency. 3476-3488 - Zhao-Min Chen

, Quan Cui
, Xiaoqin Zhang
, Ruoxi Deng, Chaoqun Xia
, Shijian Lu
:
Towards Gradient Equalization and Feature Diversification for Long-Tailed Multi-Label Image Recognition. 3489-3500 - Jing Hao

, Moyun Liu
, Jinrong Yang, Kuo Feng Hung
:
GEM: Boost Simple Network for Glass Surface Segmentation via Vision Foundation Models. 3501-3512 - Xiaotong Li

, Licheng Jiao
, Fang Liu
, Shuyuan Yang
, Hao Zhu
, Xu Liu
, Lingling Li
, Wenping Ma
:
Adaptive Complex Wavelet Informed Transformer Operator. 3513-3526 - Chuanyang Zhang

, Guijuan Zhang
, Zhuoran Zheng
, Dianjie Lu
:
Group-PTP: A Pedestrian Trajectory Prediction Method Based on Group Features. 3527-3541 - Jingchun Zhou

, Chunjiang Liu
, Dehuan Zhang, Zongxin He, Ferdous Sohel
, Qiuping Jiang
:
RSUIA: Dynamic No-Reference Underwater Image Assessment via Reinforcement Sequences. 3542-3555 - Yaolin Yang

, Hongjie He
, Zhuo Feng, Fan Chen
, Yuan Yuan
:
Cloud-Based Privacy-Preserving Medical Images Storage Scheme With Low Consumption. 3556-3570 - Yule Duan, Chuang Chen

, Maixia Fu
, Xiuwen Gong
, Yingying Niu, Fulin Luo
:
GITANet: Group Interactive Threshold-Based Attention Network for Hyperspectral Image Classification. 3571-3584 - Liyang Chen

, Weihong Bao
, Shun Lei
, Boshi Tang, Zhiyong Wu
, Shiyin Kang
, Haozhi Huang
, Helen Meng
:
AdaMesh: Personalized Facial Expressions and Head Poses for Adaptive Speech-Driven 3D Facial Animation. 3598-3609 - Wei Zhou

, Kang Lin, Weipeng Hu
, Chao Xie
, Tao Su
, Haifeng Hu
, Yap-Peng Tan:
Snippet-Inter Difference Attention Network for Weakly-Supervised Temporal Action Localization. 3610-3624 - Weihao Jiang

, Zhaozhi Xie
, Yuxiang Lu
, Longjie Qi, Jingyong Cai, Hiroyuki Uchiyama, Bin Chen, Yue Ding
, Hongtao Lu
:
Learning Auxiliary Representations With Inconsistency-Guided Detail Regularization for Mask-Guided Matting. 3625-3636 - Rongshan Chen

, Hao Sheng
, Da Yang
, Ruixuan Cong
, Zhenglong Cui
, Sizhe Wang
, Tun Wang
, Mingyuan Zhao
:
Towards Depth-Continuous Scene Representation With a Displacement Field for Robust Light Field Depth Estimation. 3637-3649 - Yesong Xu

, Shuo Chen
, Jun Li
, Jian Yang
:
Asymptotics-Aware Multi-View Subspace Clustering. 3650-3663 - Lei Zhang

, Xin Chen, Zichen Wang:
IMU-Assisted Gray Pixel Shift for Video White Balance Stabilization. 3664-3676 - Ping Kong

, An Li, Daidou Guo
, Liang Zhou
, Chuan Qin
, Xinpeng Zhang
:
Privacy-Preserving Image Inpainting Using Markov Random Field Modeling. 3688-3701 - Yingli Hou

, Wei Zhang
, Zhiliang Zhu
, Hai Yu
:
CLIP-GAN: Stacking CLIPs and GAN for Efficient and Controllable Text-to-Image Synthesis. 3702-3715 - Ke Gu

, Yuchen Liu
, Hongyan Liu
, Bo Liu
, Junfei Qiao
, Weisi Lin
, Wenjun Zhang
:
Air Pollution Monitoring by Integrating Local and Global Information in Self-Adaptive Multiscale Transform Domain. 3716-3728 - Fengyong Li

, Qiankuan Wang
, Hang Cheng
, Xinpeng Zhang
, Chuan Qin
:
JPEG Reversible Data Hiding via Block Sorting Optimization and Dynamic Iterative Histogram Modification. 3729-3743 - Wei Tang

, Fazhi He
:
EAT: Multi-Exposure Image Fusion With Adversarial Learning and Focal Transformer. 3744-3754 - Yingwei Pan

, Yehao Li
, Ting Yao
, Chong-Wah Ngo
, Tao Mei
:
Stream-ViT: Learning Streamlined Convolutions in Vision Transformer. 3755-3765 - Yu Wang

, Yuanyuan Liu
, Shunping Zhou
, Yuxuan Huang, Chang Tang
, Wujie Zhou
, Zhe Chen
:
Emotion-Oriented Cross-Modal Prompting and Alignment for Human-Centric Emotional Video Captioning. 3766-3780 - Wenzhang Wei

, Zhipeng Gui
, Changguang Wu
, Anqi Zhao
, Dehua Peng
, Huayi Wu
:
Dynamic Visual Semantic Sub-Embeddings and Fast Re-Ranking for Image-Text Retrieval. 3781-3796 - Yong-Hoon Kwon

, Ju Hong Yoon
, Min-Gyu Park
:
Text2Avatar: Articulated 3D Avatar Creation With Text Instructions. 3797-3806 - Xu Wang

, Ziyan He, Qiudan Zhang
, You Yang
, Tiesong Zhao
, Jianmin Jiang
:
Geometry-Aware Self-Supervised Indoor 360$^{\circ }$ Depth Estimation via Asymmetric Dual-Domain Collaborative Learning. 3224-3237 - Guowei Wang

, Changxing Ding
, Wentao Tan, Mingkui Tan
:
Decoupled Prototype Learning for Reliable Test-Time Adaptation. 3585-3597 - Wujian Peng

, Zejia Weng
, Hengduo Li
, Zuxuan Wu
, Yu-Gang Jiang
:
BMB: Balanced Memory Bank for Long-Tailed Semi-Supervised Learning. 3677-3687 - Chaoyang Zhou

, Zengmao Wang
, Bo Du
:
Learning Intrinsic Invariance Within Intra-Class for Domain Generalization. 3807-3820 - Jing Liu

, Qingying Li
, Huiyu Duan
, Zhiwei Fan
, Yuting Su
, Guangtao Zhai
:
Learning to Generate Realistic Images for Bit-Depth Enhancement via Camera Imaging Processing. 3821-3832 - Tianyi Qin

, Bo Peng
, Jianjun Lei
, Yuxuan Yao
, Qingming Huang
:
Modeling Intra- and Inter-Modal Correlations for Incomplete Multi-Modal 3D Shape Clustering. 3833-3843 - Kun Li

, Xinge Peng
, Dan Guo
, Xun Yang
, Meng Wang
:
Repetitive Action Counting With Hybrid Temporal Relation Modeling. 3844-3855 - Guoming Wu

, Jun Li
, Yangfan Xu, Zhi-Ping Shi
, Xianglong Liu
:
Video Motion Blur Attack via Grad-Weighted and Discrete-Fusion Based Perturbation Generation. 3856-3868 - Yanmin Wu

, Qiankun Gao, Renrui Zhang, Haijie Li, Jian Zhang
:
Language-Assisted 3D Scene Understanding. 3869-3879 - Jia Lei

, Jiawei Li
, Jinyuan Liu
, Bin Wang
, Shihua Zhou
, Qiang Zhang
, Xiaopeng Wei
, Nikola K. Kasabov
:
MLFuse: Multi-Scenario Feature Joint Learning for Multi-Modality Image Fusion. 3880-3894 - Yu Zhou

, Jialun Pei
, Weixin Si
, Jing Qin
, Pheng-Ann Heng
:
Delving Into Quaternion Wavelet Transformer for Facial Expression Recognition in the Wild. 3895-3909 - Xiaoyu Zhang

, Yulin Jin, Haoyu Tong
, Jian Lou, Kai Wu
, Xiaofeng Chen
:
Purifier$^{+}$: Plug-and-Play Backdoor Mitigation for Pre-Trained Models via Activation Alignment. 3910-3924 - Pengkun Jiao

, Na Zhao
, Jingjing Chen
, Yu-Gang Jiang
:
Domain Expansion and Boundary Growth for Open-Set Single-Source Domain Generalization. 3925-3938 - Yuzhu Ji

, Chuanxia Zheng
, Tat-Jen Cham
:
One-Shot Human Motion Transfer via Occlusion-Robust Flow Prediction and Neural Texturing. 3939-3952 - Mingkui Tan

, Peihao Chen
, Hongyan Zhi, Jiajie Mai
, Benjamin Rosman
, Dongyu Ji, Runhao Zeng
:
Source-Free Elastic Model Adaptation for Vision-and-Language Navigation. 3953-3965 - Chengyu Zheng

, Xiu Li
, Xinyue Liang
, Lei Huang
, Shan Du
, Jie Nie
, Junyu Dong
:
Cross-Modal Progressive Perspective Matching Network for Remote Sensing Image-Text Retrieval. 3966-3978 - Minghong Xie

, Mengzhao Wang
, Huafeng Li
, Yafei Zhang
, Dapeng Tao
, Zhengtao Yu
:
Phrase Decoupling Cross-Modal Hierarchical Matching and Progressive Position Correction for Visual Grounding. 3979-3991 - Chao Li

, Tianyi Li, Fanyang Meng
, Qingyu Mao
, Youneng Bao
, Yonghong Tian
, Yongsheng Liang
:
One is All: A Unified Rate-Distortion-Complexity Framework for Learned Image Compression Under Energy Concentration Criteria. 3992-4007 - Yiming Wang

, Qun Li
, Dongxia Chang
, Jie Wen
, Fu Xiao
, Yao Zhao
:
A Category-Driven Contrastive Recovery Network for Double Incomplete Multi-View Multi-Label Classification. 4008-4017 - Jiaqi Ma

, Guo-Sen Xie
, Fang Zhao
, Zechao Li
:
AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation. 4018-4028 - Qiankun Ma

, Ziyao Zhang
, Pengchong Qiao
, Yu Wang, Rongrong Ji
, Chang Liu
, Jie Chen
:
Dual-Level Masked Semantic Inference for Semi-Supervised Semantic Segmentation. 4029-4042 - Yongbiao Gao

, Sijie Niu
, Guohua Lv
, Miaogen Ling
, Xin Geng
:
Long and Recent Preference Learning With Recent-K Items Distribution for Recommender System. 4043-4057 - Qiuping Jiang

, Xiwen Li, Xinyi Wang, Zhihua Wang
, Guangtao Zhai
:
Dataset and Metric for Quality Assessment of HDR Tone Mapping: Detail Visibility, Color Naturalness, and Overall Quality. 4058-4071 - Wenfang Sun

, Yuedong Tan
, Jingyuan Li
, Shuwei Hou, Xiaobo Li, Yingzhao Shao, Zhe Wang
, Beibei Song
:
HotMoE: Exploring Sparse Mixture-of-Experts for Hyperspectral Object Tracking. 4072-4083 - Yuxuan Shi

, Shaowei Weng
, Lifang Yu
, Li Li
:
A Copy-Move Forgery Detection Network Based on Selective Sampling Attention and Low-Cost Two-Step Self-Correlation Calculation. 4084-4094 - Ruitao Pu, Yang Qin

, Dezhong Peng
, Xiaomin Song, Huiming Zheng:
Deep Reversible Consistency Learning for Cross-Modal Retrieval. 4095-4106 - Fukun Yin

, Xin Chen
, Chi Zhang, Biao Jiang, Zibo Zhao
, Wen Liu
, Gang Yu
, Tao Chen
:
ShapeGPT: 3D Shape Generation With a Unified Multi-Modal Language Model. 4107-4120 - Yilong Chen

, Zongyi Xu
, Xiaoshui Huang
, Shanshan Zhao
, Xinqi Jiang, Xinyu Gao, Xinbo Gao
:
Weakly Supervised LiDAR Semantic Segmentation via Scatter Image Annotation. 4121-4136 - Xiaoyan Yu, Neng Dong

, Liehuang Zhu
, Hao Peng
, Dapeng Tao
:
CLIP-Driven Semantic Discovery Network for Visible-Infrared Person Re-Identification. 4137-4150 - Shaoxu Cheng

, Kanglei Geng, Chiyuan He
, Zihuan Qiu
, Linfeng Xu
, Heqian Qiu
, Lanxiao Wang
, Qingbo Wu
, Fanman Meng
, Hongliang Li
:
Distribution-Level Memory Recall for Continual Learning: Preserving Knowledge and Avoiding Confusion. 4151-4166 - Zhijie Zheng

, Zhicong Huang
, Jingwen Zhao, Kang Lin, Haifeng Hu
, Dihu Chen
:
RAFDet: Range View Augmented Fusion Network for Point-Based 3D Object Detection. 4167-4180 - Jun Zhou

, Chi Xu
, Li Cheng
:
Hand Gesture Recognition From an Open-Set Perspective. 4181-4192 - Ji-Feng Luo

, Yuzhen Chen
, Kaixun Zhang, Xudong An, Menghan Hu
, Guangtao Zhai
, Xiao-Ping Zhang
:
Human-Centered Financial Signal Analysis Based on Visual Patterns in Stock Charts. 4193-4205 - Ziyou Ren

, Guozhang Li
, Nan Cheng
, Anqi Wu, Nannan Wang
, Xinbo Gao
:
Cluster Assumption-Guided Timestamp-Supervised Temporal Action Segmentation. 4206-4216 - Fei Wang, Luhui Zhao

, Shijie Hong, Zhe Wang, Chen Liu, Changxin Gao
, Jinsheng Li, Xin Li, Dapeng Luo
:
Dual-Domain Teacher for Unsupervised Domain Adaptation Detection. 4217-4226 - Xinyang Huang

, Chuang Zhu
, Ruiying Ren
, Shengjie Liu, Tie-Jun Huang
:
Source-Free Semantic Regularization Learning for Semi-Supervised Domain Adaptation. 4227-4239 - Zhu Yin

, Zhongcheng Wu, Wuzhen Shi
, Guyue Hu
, Weisi Lin
:
Video Compressed Sensing Via Wavelet Residual Sampling and Dual-Domain Fusion. 4240-4255 - Junbin Yuan

, Yiqi Wang
, Zhoutao Wang
, Qingzhen Xu
, Bharadwaj Veeravalli
, Xulei Yang
:
DPPNet: A Depth Pixel-Wise Potential-Aware Network for RGB-D Salient Object Detection. 4256-4268 - Jiaqi Wu, Shihao Zhang, Mingshuo Hou, Zehua Wang

, Wei Chen
, Zijian Tian
, F. Richard Yu
, Victor C. M. Leung
:
CLIP-AE: A Multi-Modal Unsupervised Images Enhancement Method Based on High-Order Adaptive Curve for Visual Disbalance Defects. 4269-4283 - Zhenling Mo

, Zijun Zhang
, Kwok-Leung Tsui
:
Domain Generalization Study of Empirical Risk Minimization From Causal Perspectives. 4284-4296 - Han Jiang

, Chaofan Chen
, Xiaoshan Yang
, Changsheng Xu
:
Compact Latent Primitive Space Learning for Compositional Zero-Shot Learning. 4297-4308 - Jian Zhu

, Lei Liu
, Yu Zhang, Chang Tang
, Li-Rong Dai
:
Adaptive Confidence Multi-View Learning. 4309-4320 - Mohammad Adiban

, Kalin Stefanov
, Sabato Marco Siniscalchi, Giampiero Salvi
:
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction. 4321-4332 - Chen Hui

, Debin Zhao
, Weisi Lin
, Shaohui Liu
, Feng Jiang
:
Image Compressive Sensing With Scale-Variable Adaptive Sampling and Hybrid-Attention Transformer Reconstruction. 4333-4347 - Jian Zhong

, Yifan Jiao
, Bing-Kun Bao
:
Replay-Based Incremental Object Detection With Local Response Exploration. 4348-4360 - Wuzhen Shi

, Xuping Chen
, Biyun Yao, Yang Wen
, Bin Sheng
:
Identity and Modality Attributes Driven Multimodal Fusion Networks for Emotion Recognition in Conversations. 4361-4371 - Qinyang Zeng

, Ronghao Dang
, Xun Zhou, Chengju Liu
, Qijun Chen
:
Contrastive Feedback Vision-Language for 3D Skeleton-Based Action Recognition. 4372-4385 - Kejun Wu

, Zhenxing Li, You Yang
, Qiong Liu
, Xiao-Ping Zhang
:
End-to-End Deep Video Compression Based on Hierarchical Temporal Context Learning. 4386-4399 - Yu Hu

, Xiaobo Chen
, Sheng Wang
, Luyang Liu
, Hengyang Shi
, Lihong Fan
, Jing Tian, Jun Liang
:
Deformable Cross-Attention Transformer for Weakly Aligned RGB-T Pedestrian Detection. 4400-4411 - Ju-Yeon Shin, Jung-Kyung Lee

, Gun Bang, Junsik Kim
, Je-Won Kang
:
Neural Volumetric Video Coding With Hierarchical Coded Representation of Dynamic Volume. 4412-4426 - Zhaojie Chu

, Kailing Guo
, Xiaofen Xing
, Pengsheng Liu, Bolun Cai
, Xiangmin Xu
:
DCPTalk: Speech-Driven 3D Face Animation With Personalized Facial Dynamic Coupling Properties. 4427-4440 - Canlin Li, Haowen Su

, Xin Tan
, Xiangfei Zhang
, Lizhuang Ma
:
WV-LUT: Wide Vision Lookup Tables for Real-Time Low-Light Image Enhancement. 4441-4453 - Hao Tang

, Bin Ren
, Pingping Wu
, Nicu Sebe
:
Hierarchical Cross-Attention Network for Virtual Try-On. 4454-4466 - Wei Yu

, Rui Wang
, Weizhi Yang
, Wenjian Hu, Wei Xiang
:
AplusN: Progressively Integrating Attention and Normalization in Wavelet Domain for Pose Transfer. 4467-4479 - Yi-Zeng Hsieh

, Ji-Jie Lin
, Mu-Chun Su
, Wei-Jen Lin:
Strumming in the Metaverse: A Deep-Learning-Enabled Virtual Air Guitar System in VR With Enhanced Chord Recognition and Simulated Pedal Effects. 4480-4493 - Jianqi Chen

, Yilan Zhang
, Zhengxia Zou
, Keyan Chen
, Zhenwei Shi
:
Zero-Shot Image Harmonization With Generative Model Prior. 4494-4507 - Lifang Yu

, Zichao Yu, Shaowei Weng
, Dewang Chen
:
Adaptive PUPM-Based HEVC Video Steganography Balancing Embedding Performance and Security. 4508-4519 - Yikun Ma

, Haoran Qi
, Zhi Jin
:
Eliminating Moiré Patterns Across Diverse Image Resolutions via DMMNet. 4520-4530 - Di Wang

, Xianghao Jiao, Jinyuan Liu
, Xin Fan
:
Robust One-Stop Multi-Modality Image Registration-Fusion-Segmentation Framework Against Misalignments and Adversarial Attacks. 4531-4543 - Xiaomei Feng

, Qi Jia
, Yu Liu
, Weimin Wang
, Yuqing Liu
, Xinwei Xue
:
Rectangling for Stitched Image via Pixel-Wise Deformation Learning. 4544-4557 - Jiaming Wang

, Xitong Chen
, Xiao Huang
, Ruiqian Zhang
, Yu Wang
, Tao Lu
:
Rethinking the Role of Panchromatic Images in Pan-Sharpening. 4558-4570 - Yu Gan, Yunning You, Junjie Huang

, Sen Xiang
, Chang Tang
, Wei Hu
, Shan An:
Multi-View Clustering via Multi-Stage Fusion. 4571-4583 - Tengpeng Li

, Hanli Wang
, Qinyu Li
, Zhangkai Ni
:
Vision-Language Relational Transformer for Video-to-Text Generation. 4584-4596 - Chaobo Li, Hongjun Li

, Guoan Zhang
:
Detecting Adversarial Attacks Based on Tracking Differences in Frequency Bands. 4597-4612 - Qibing Qin

, Lei Wu
, Wenfeng Zhang
, Lei Huang
, Jie Nie
:
Deep Semantic-Consistent Penalizing Hashing for Cross-Modal Retrieval. 4613-4626 - Yang Xu

, Yifan Feng
, Xiaopin Zhong
, Yue Gao
, Zongze Wu
:
Hypergraph-Based Remaining Prototype Alignment for Open-Set Cross-Domain Image Retrieval. 4627-4642 - Cheng Cheng

, Wenzhe Liu
, Xinying Wang, Lin Feng
, Ziyu Jia
:
DISD-Net: A Dynamic Interactive Network With Self-Distillation for Cross-Subject Multi-Modal Emotion Recognition. 4643-4655 - Jinsheng Yang

, Bineng Zhong
, Qihua Liang
, Zhiyi Mo
, Shengping Zhang
, Shuxiang Song
:
Uncertainty-Guided Diffusion Model for Camouflaged Object Detection. 4656-4669 - Junzhu Mao

, Yang Shen
, Jinyang Guo
, Yazhou Yao
, Xian-Sheng Hua
, Hengtao Shen
:
Prune and Merge: Efficient Token Compression for Vision Transformer With Spatial Information Preserved. 4670-4683 - Mingjie Wei

, Xuemei Xie
, Yutong Zhong
, Guangming Shi
:
Learning Pyramid-Structured Long-Range Dependencies for 3D Human Pose Estimation. 4684-4697 - Zhiqiang Jiang

, Kun Dai
, Ke Wang
, Tao Xie
, Zhendong Fan
, Ruifeng Li
, Peng Kang, Lijun Zhao
:
Centra-Net: A Centralized Network for Visual Localization Spanning Multiple Scenes. 4698-4712 - Zhaoqing Pan

, Jixing Chen, Bo Peng
, Jianjun Lei
, Fu Lee Wang
, Nam Ling
, Sam Kwong
:
Efficient Chroma Intra Prediction via Exemplar Colorization Network for Versatile Video Coding. 4713-4724 - Kaijiang Li

, Haining Li
, Miduo Cui
, Junxin Li
, Pei Lv
, Bing Zhou
, Mingliang Xu
:
TITFormer: Combining Textual Modality and Simulating Infrared Modality Based on Transformer for Image Enhancement. 4725-4735 - Yue Zhang

, Akin Caliskan
, Mai Xu
, Adrian Hilton
, Jean-Yves Guillemaut
:
MVL-Net: Pairwise Learning for Multi-View Multiple People Labelling. 4736-4751 - Yingjie Liu

, Dan Wang
, Bin Song
:
Viewport Prediction With Unsupervised Multiscale Causal Representation Learning for Virtual Reality Video Streaming. 4752-4764 - Yan-Tsung Peng

, Wei-Hua Li
, Zihao Chen:
Rain2Avoid: Learning Deraining by Self-Supervision. 4765-4779 - Zhongjie Mi

, Xinghao Jiang
, Tanfeng Sun
, Ke Xu
, Qiang Xu
:
Preemptive Defense Algorithm Based on Generalizable Black-Box Feedback Regulation Strategy Against Face-Swapping Deepfake Models. 4780-4794 - Tong Liu

, Jing Li
, Jia Wu
, Bo Du
, Yibing Zhan
, Dapeng Tao
, Jun Wan
:
Facial Expression Recognition With Heatmap Neighbor Contrastive Learning. 4795-4807 - Xinchen Ye

, Yue Chang, Rui Xu
, Haojie Li
:
UW-Adapter: Adapting Monocular Depth Estimation Model in Underwater Scenes. 4808-4818 - Jing Liu

, Zongbing Zhang
, Yuting Su
, Bing Yang
, Xiongkuo Min
, Guangtao Zhai
:
Aggregate and Discriminate: Pseudo Clips-Guided Boundary Perception for Video Moment Retrieval. 4819-4830 - Pinghai Gao, Longguang Wang

, Sheng Ao
, Ye Zhang
, Yulan Guo
:
Enhancing Event-Based Video Reconstruction With Bidirectional Temporal Information. 4831-4843 - Yuqi Bu

, Xin Wu
, Yi Cai
, Qiong Liu
, Tao Wang
, Qingbao Huang
:
Error-Aware Generative Reasoning for Zero-Shot Visual Grounding. 4844-4855 - Zhiyan Wang

, Deyin Liu
, Lin Yuanbo Wu
, Song Wang
, Xin Guo
, Lin Qi
:
A Deep Semantic Segmentation Network With Semantic and Contextual Refinements. 4856-4868 - Yuyang Chang, Yifan Jiao

, Bing-Kun Bao
:
SVSRD: Spatial Visual and Statistical Relation Distillation for Class-Incremental Semantic Segmentation. 4869-4881 - Haojun Xu, Yan Gao

, Jie Li
, Xinbo Gao
:
An Information Compensation Framework for Zero-Shot Skeleton-Based Action Recognition. 4882-4894 - Yuan Luo

, Xiaorun Li
, Shuhan Chen
:
Spatial-Temporal Aware-Based Unsupervised Network for Infrared Small Target Detection. 4895-4909 - Yiping Xie

, Haihong Xiao
, Wenxiong Kang
:
$\mathrm{Tri^{2}plane}$: Advancing Neural Implicit Surface Reconstruction for Indoor Scenes. 4910-4923 - Xincheng Ju

, Dong Zhang
, Junhui Li
, Shoushan Li, Guodong Zhou
:
Enhanced Generative Framework With LLMs for Multimodal Emotion-Cause Pair Extraction in Conversations. 4924-4935 - Xiaoqin Zhang

, Kenan Bi
, Sixian Chan
, Shijian Lu
, Xiaolong Zhou
:
SyNet: A Synergistic Network for 3D Object Detection Through Geometric-Semantic-Based Multi-Interaction Fusion. 4950-4960 - Cong Xu

, Feiyu Chen
, Qi Jia
, Yihua Wang
, Liang Jin, Yunji Li
, Yaqian Zhao, Changming Zhao
:
A Multi-Granularity Relation Graph Aggregation Framework With Multimodal Clues for Social Relation Reasoning. 4961-4970 - Weiqing Yan

, Tingyu Yang, Chang Tang
:
Self-Supervised Semantic Soft Label Learning Network for Deep Multi-View Clustering. 4971-4983 - Ziyi Liu

, Zengmao Wang
, Bo Du
:
Medical Transformer With Mix Mask Generation for Thorax Disease Classification. 4984-4995 - Hang Lu, Xinmeng Tan, Mingkai Chen

, Zhe Zhang
, Xuguang Zhang, Jianxin Chen, Xin Wei
, Tiesong Zhao
:
Cross-Modal Haptic Compression Inspired by Embodied AI for Haptic Communications. 4996-5008 - Zehong Ma

, Hao Chen
, Wei Zeng
, Limin Su, Shiliang Zhang
:
Multi-Modal Reference Learning for Fine-Grained Text-to-Image Retrieval. 5009-5022 - Yanbing Xue

, Xinyu Tian
, Feifei Zhang
, Xianbin Wen
, Zan Gao
, Shengyong Chen:
CACP: Covariance-Aware Cross-Domain Prototypes for Domain Adaptive Semantic Segmentation. 5023-5034 - Abrham Shiferaw Alemaw

, Giulia Slavic
, Pamela Zontone
, Lucio Marcenaro
, David Martín Gómez
, Carlo S. Regazzoni
:
Modeling Interactions Between Autonomous Agents in a Multi-Agent Self-Awareness Architecture. 5035-5049 - Jian Zhu

, Jianrong Yan
, Jiebin Huang
, Yongwei Nie
, Bin Sheng
, Tong-Yee Lee
:
SGG-Nets: Generic Rotation-Invariant Plugin Networks for Point Cloud Analysis. 5062-5076 - Tianming Liang

, Linhui Li
, Jian-Fang Hu
, Xiangyang Yu, Wei-Shi Zheng
, Jianhuang Lai
:
Rethinking Temporal Context in Video-QA: A Comprehensive Study of Single-Frame Static Bias. 5077-5091 - Rui Song

, Guohong Liu
, Yan Zhang
, Xiaoying Sun
:
A Cross-Modal Generation Algorithm for Temporal Force Tactile Data for Multidimensional Haptic Rendering. 5092-5102 - Heng Wang

, Hongxia Wang
, Fei Zhang
, Zhenhao Shi, Xinyi Huang:
Moiré-Watermark: Robust Watermarking Against Screen-Shooting Using Moiré Patterns. 5103-5118 - Zhong Zhang

, Jianglin Zhou
, Shuang Liu
, Baihua Xiao
:
Completed Interaction Networks for Pedestrian Trajectory Prediction. 5119-5129 - Wenyi Zhao, Wei Li

, Yuhan Li, Lu Yang
, Zhenhao Liang, Enwen Hu, Weidong Zhang
, Huihua Yang
:
Constructing Balanced Training Samples: A New Perspective on Long-Tailed Classification. 5130-5143 - Jiangtao Zhang

, Qingshan Wang
, Qi Wang
:
GFTLS-SLT: Gloss-Free Transformer Based Lexical and Semantic Awareness Framework for Multimodal Sign Language Translation. 5144-5155 - Shaowu Wu

, Wei Lu
, Xiangyang Luo
:
Robust Watermarking Based on Multi-Layer Watermark Feature Fusion. 5156-5169 - Cheng Tan

, Zhangyang Gao
, Siyuan Li
, Stan Z. Li
:
SimVPv2: Towards Simple Yet Powerful Spatiotemporal Predictive Learning. 5170-5184 - Huiting Liu

, Xinlong Lv
, Peng Zhao
, Peipei Li
, Xindong Wu
:
Unbiased Meta Reinforcement Learning for Interactive Recommender Systems. 5185-5197 - Zhenying Fang

, Jun Yu
, Richang Hong
:
Boundary Discretization and Reliable Classification Network for Temporal Action Detection. 5198-5211 - Congrui Fu

, Hui Yuan
, Shiqi Jiang, Guanghui Zhang
, Liquan Shen
, Raouf Hamzaoui
:
Global Spatial-Temporal Information-Based Residual ConvLSTM for Video Space-Time Super-Resolution. 5212-5224 - Lei Zhang

, Haoran Ning, Jiaxin Tang
, Zhenxiang Chen, Yaping Zhong
, Yahong Han
:
WiViPose: A Video-Aided Wi-Fi Framework for Environment-Independent 3D Human Pose Estimation. 5225-5240 - Xinyi Wu

, Haohong Wang
, Aggelos K. Katsaggelos
:
Automatic Camera Movement Generation With Enhanced Immersion for Virtual Cinematography. 5241-5254 - Dengyong Zhang

, Nuo Fu, Xin Liao
, Jiaxin Chen
, Hengfu Yang, Gaobo Yang
:
Efficient Hierarchical Feature Collaboration Transformer for Image Inpainting. 5255-5266 - Jun Yu

, Guochen Xie
, Quansheng Liu
, Zhen Kan
, Lei Wang, Tianyu Liu, Qiang Ling
, Wei Xu
, Fang Gao
:
Contrastive Learning With Multiple Prototypes for Unsupervised Domain Adaptive Semantic Segmentation. 5267-5282 - Xunsong Li

, Pengzhan Sun, Yangcen Liu
, Lixin Duan
, Wen Li
:
Simultaneous Detection and Interaction Reasoning for Object-Centric Action Recognition. 5283-5295 - Jieyu Chen

, Ping An
, Xinpeng Huang
, Yilei Chen
, Chao Yang
, Liquan Shen
:
Mask-Aware Light Field De-Occlusion With Gated Feature Aggregation and Texture-Semantic Attention. 5296-5311 - Enping Li

, Tianrui Li
, Huaishao Luo
, Jielei Chu
, Lixin Duan
, Fengmao Lv
:
Adaptive Multi-Scale Language Reinforcement for Multimodal Named Entity Recognition. 5312-5323 - Li Wang

, Shoujin Wang
, Quangui Zhang
, Qiang Wu
, Min Xu
:
Federated User Preference Modeling for Privacy-Preserving Cross-Domain Recommendation. 5324-5336 - Igor Morawski, Kai He, Shusil Dangi, Winston H. Hsu

:
Leveraging Content and Context Cues for Low-Light Image Enhancement. 5337-5351 - Kehua Guo

, Zheng Wu
, Xianhong Wen
, Shaojun Guo
, Zhipeng Xi
, Tianyu Chen
:
GAN Prior-Enhanced Novel View Synthesis From Monocular Degraded Images. 5352-5362 - Ruoyi Xue, Cheng Cheng, Hang Wang

, Hongbin Sun
:
TBag: Three Recipes for Building up a Lightweight Hybrid Network for Real-Time SISR. 5363-5375 - Shaozu Yuan

, Yiwei Wei
, Hengyang Zhou
, Qinfu Xu, Meng Chen
, Xiaodong He:
Enhancing Semantic Awareness by Sentimental Constraint With Automatic Outlier Masking for Multimodal Sarcasm Detection. 5376-5386 - Yuqing Chen

, Jiayu Wang, Qianchen Zhou, Huosheng Hu
:
ArbiTrack: A Novel Multi-Object Tracking Framework for a Moving AAV to Detect and Track Arbitrarily Oriented Targets. 5387-5397 - Yichen Liu

, Lixuan Wei, Yufei Guo
, Lei Yu
:
All-in-Focus Imaging From Events With Occlusions. 5398-5412 - Sitong Gong

, Yunzhi Zhuge
, Lu Zhang
, Yifan Wang
, Pingping Zhang, Lijun Wang
, Huchuan Lu
:
AVS-Mamba: Exploring Temporal and Multi-Modal Mamba for Audio-Visual Segmentation. 5413-5425 - Wentao Zhang

, Tong Yu, Ruixuan Wang
, Jianhui Xie, Emanuele Trucco
, Wei-Shi Zheng
, Xiaobo Yang
:
Visual Class Incremental Learning With Textual Priors Guidance Based on an Adapted Vision-Language Model. 5426-5438 - Lizhi Hou

, Tingyu Fan, Yiling Xu
, Zhu Li
:
Lossless LiDAR Point Cloud Reflectance Compression With a Deep Hierarchical KNN Context Model. 5439-5451 - Jingyu Wang

, Jie Nie
, Niantai Jing
, Xinyue Liang
, Xiaodong Wang
, Chi-Hung Chi
, Zhiqiang Wei
:
Copy-Move Forgery Image Detection Based on Cross-Scale Modeling and Alternating Refinement. 5452-5465 - Daizong Liu

, Wei Hu
:
Imperceptible Backdoor Attacks on Text-Guided 3D Scene Grounding. 5466-5479 - Xiaojie Wei, Jielian Lin

, Jiawei Xu, Wei Gao
, Tiesong Zhao
:
RDVC: Efficient Deep Video Compression With Regulable Rate and Complexity Optimization. 5480-5491 - Yanzhao Su

, Nian Wang
, Zhigao Cui
, Yanping Cai
, Chuan He
, Aihua Li:
Real Scene Single Image Dehazing Network With Multi-Prior Guidance and Domain Transfer. 5492-5506 - Xusheng Cao, Haori Lu, Xialei Liu

, Ming-Ming Cheng
:
Class Incremental Learning for Image Classification With Out-of-Distribution Task Identification. 5507-5520 - Zhuomin Liang

, Liang Bai
, Xian Yang
, Jiye Liang
:
Graph Contrastive Learning for Fusion of Graph Structure and Attribute Information. 5521-5532 - Yawen Cui

, Jian Zhao
, Zitong Yu
, Rizhao Cai, Xun Wang, Lei Jin
, Alex C. Kot
, Li Liu
, Xuelong Li
:
CMoA: Contrastive Mixture of Adapters for Generalized Few-Shot Continual Learning. 5533-5547 - Xiaoqiang Lu

, Lingling Li
, Licheng Jiao
, Xu Liu
, Fang Liu
, Wenping Ma
, Shuyuan Yang
:
Uncertainty-Aware Semi-Supervised Learning Segmentation for Remote Sensing Images. 5548-5562 - Huixin Luo

, Li Li
, Xinpeng Zhang
:
Secure Neural Network Watermarking Protocol Against Evidence Exposure Attack. 5563-5574 - Qiang Li

, Shihao Wang, Wei Zhang, Shaojin Bai
, Weizhi Nie
, Anan Liu
:
DCDL: Dual Causal Disentangled Learning for Zero-Shot Sketch-Based Image Retrieval. 5575-5590 - Yan Jiang

, Xu Cheng
, Hao Yu
, Xingyu Liu
, Haoyu Chen
, Guoying Zhao
:
DSAF: Dual Space Alignment Framework for Visible-Infrared Person Re-Identification. 5591-5603 - Alessandro Ragano

, Helard Becerra Martinez, Andrew Hines
:
Beyond Correlation: Evaluating Multimedia Quality Models With the Constrained Concordance Index. 5604-5616 - Lei Wang

, Qingbo Wu
, Desen Yuan
, King Ngi Ngan, Hongliang Li
, Fanman Meng
, Linfeng Xu
:
Learning With Noisy Low-Cost MOS for Image Quality Assessment via Dual-Bias Calibration. 5617-5631 - Xihua Sheng

, Li Li
, Dong Liu
, Shiqi Wang
:
Bi-Directional Deep Contextual Video Compression. 5632-5646 - Lingzhi He

, Yakun Chang
, Runmin Cong
, Hongyu Liu
, Shujuan Huang, Renshuai Tao, Yao Zhao
:
Rethinking Depth Guided Reflection Removal. 5647-5658 - Jiao Liu

, Bin Pan, Zhenwei Shi
:
CR-Famba: A Frequency-Domain Assisted Mamba for Thin Cloud Removal in Optical Remote Sensing Imagery. 5659-5668 - Yan Feng

, Longting Xu
, Xiaochen Lu
, Guanglin Zhang
, Wei Rao
:
A Robust Coverless Audio Steganography Based on Differential Privacy Clustering. 5669-5684 - Lei Zhao

, Mengwei Li
, Bo Li
, Xingxing Wei
:
Diverse Visible-to-Thermal Image Translation via Controllable Temperature Encoding. 5685-5695 - Zechu Zhang, Weilong Peng

, Jinyu Wen
, Keke Tang
, Meie Fang
, David Dagan Feng
, Ping Li
:
Continuous Bijection Supervised Pyramid Diffeomorphic Deformation for Learning Tooth Meshes From CBCT Images. 5696-5708 - Junjie Shi

, Puhong Duan
, Xiaoguang Ma
, Jianning Chi
, Yong Dai
:
Frefusion: Frequency Domain Transformer for Infrared and Visible Image Fusion. 5722-5730 - Yuwu Lu

, Haoyu Huang
, Xue Hu
, Zhihui Lai
:
Multiple Adaptation Network for Multi-Source and Multi-Target Domain Adaptation. 5731-5745 - Hao Wang

, Shuo Zhang
, Biao Leng
:
HGFormer: Topology-Aware Vision Transformer With HyperGraph Learning. 5746-5757 - Yu Qiao

, Wei Lu
, Peiguang Jing
, Weiming Wang
, Yuting Su
:
Multimodal Dual-Graph Collaborative Network With Serial Attentive Aggregation Mechanism for Micro-Video Multi-Label Classification. 5758-5769 - Jianqiu Chen, Mingshan Sun

, Ye Zheng, Tianpeng Bao
, Zhenyu He
, Donghai Li, Guoqiang Jin, Rui Zhao
, Liwei Wu, Xiaoke Jiang:
Geo6D: Geometric-Constraints-Guided Direct Object 6D Pose Estimation Network. 5770-5783 - Haojun Xu, Yan Gao

, Zheng Hui, Jie Li
, Xinbo Gao
:
Language Knowledge-Assisted Representation Learning for Skeleton-Based Action Recognition. 5784-5799 - Jiyou Chen

, Wenqi Ren
, Huihuang Zhao
, Qunbing Xia, Gaobo Yang
:
You Only Need Clear Images: Self-Supervised Single Image Dehazing. 5800-5814 - Shaojin Bai

, Liang Zheng, Jing Bai
, Xiangyu Ma:
DLS-HCAN: Duplex Label Smoothing Based Hierarchical Context-Aware Network for Fine-Grained 3D Shape Classification. 5815-5830 - Riyu Lu

, Yingwen Zhang
, Hengyu Man
, Meng Wang
, Shiqi Wang
, Xiaopeng Fan
:
Learning the Scale in Reference Picture Resampling for Versatile Video Coding. 5831-5842 - Ruifan Zuo

, Chaoqun Zheng
, Lei Zhu
, Wenpeng Lu
, Jiasheng Si, Weiyu Zhang
:
Compact-Yet-Separate: Proto-Centric Multi-Modal Hashing With Pronounced Category Differences for Multi-Modal Retrieval. 5843-5856 - Lei Lei

, Xianxian Li
:
Multi-Modal Hybrid Interaction Vision-Language Tracking. 5857-5865 - Bo Han

, Lihuo He
, Junjie Ke
, Jinjian Wu
, Xinbo Gao
:
Progressive Semi-Decoupled Detector for Accurate Object Detection. 5866-5878 - Ruoyue Shen

, Nakamasa Inoue
, Dayan Guan
, Rizhao Cai
, Alex C. Kot
, Koichi Shinoda
:
ContextualCoder: Adaptive In-Context Prompting for Programmatic Visual Question Answering. 4936-4949 - Xin Mei

, Libin Yang
, Dehong Gao
, Xiaoyan Cai
, Junwei Han
, Tianming Liu
:
Adaptive Medical Topic Learning for Enhanced Fine-Grained Cross-Modal Alignment in Medical Report Generation. 5050-5061 - Huake Wang

, Xingsong Hou
, Jutao Li, Yadi Yan
, Wenke Sun, Xin Zeng
, Kaibing Zhang
, Xiangyong Cao
:
Multi-Scale Retinex Unfolding Network for Low-Light Image Enhancement. 5709-5721 - Mengzhao Wang

, Huafeng Li
, Yafei Zhang
, Jinxing Li
, Minghong Xie
, Dapeng Tao
:
Dual-Task Mutual Reinforcing Embedded Joint Video Paragraph Retrieval and Grounding. 5879-5894 - Wanlin Liang

, Hongbin Xu
, Wanshui Gan
, Wenxiong Kang
:
Zero-Shot Text-Driven Dynamic Neural Radiance Fields Stylization. 5895-5908 - Jimiao Yu

, Honglong Chen
, Junjian Li
, Linghan Chen, Yudong Gao
, Weifeng Liu
, Lei Zhang:
Black-Box Adversarial Defense Based on Image Decomposition and Reconstruction. 5909-5921 - Jun Zhou

, Chunsheng Liu
, Faliang Chang
, Wenqian Wang
, Penghui Hao
, Yiming Huang
, Zhiqiang Yang:
EraW-Net: Enhance-Refine-Align W-Net for Scene-Associated Driver Attention Estimation. 5922-5935 - Sheng Zheng

, Chaoning Zhang
, Dongshen Han
, Fachrina Dewi Puspitasari, Xinhong Hao
, Yang Yang
, Heng Tao Shen
:
Exploring Kernel Transformations for Implicit Neural Representations. 5936-5945 - Xue Wang

, Zheng Guan
, Wenhua Qian
, Jinde Cao
, Runzhuo Ma:
PID Controller-Driven Network for Image Fusion. 5977-5988 - Junkang Zhang

, Faming Fang
, Tingting Wang, Guixu Zhang
, Haichuan Song
:
FrDiff: Framelet-Based Conditional Diffusion Model for Multispectral and Panchromatic Image Fusion. 5989-6002 - Xing Yan

, Tanfeng Sun
, Qiang Xu
, Ke Xu
, Xinghao Jiang
:
Detection of HEVC Double Compression Based on Deep Representations of In-Loop Filtering and CU Depth Maps. 6003-6018 - Xu Yin

, Fei Pan, Guoyuan An
, Yuchi Huo
, Zixuan Xie
, Sung-Eui Yoon
:
OpenSlot: Mixed Open-Set Recognition With Object-Centric Learning. 6019-6030 - Pengpeng Yu

, Ye Zhang
, Fan Liang
, Haoran Li, Yulan Guo
:
Hierarchical Distortion Learning for Fast Lossy Compression of Point Clouds. 6031-6046 - Zhengyi Kwan, Wei Zhang, Zhengkui Wang, Aik Beng Ng, Simon See:

Nutrition Estimation for Dietary Management: A Transformer Approach With Depth Sensing. 6047-6058 - Weize Li

, Zhicheng Zhao
, Haochen Bai, Fei Su
:
Bring Adaptive Binding Prototypes to Generalized Referring Expression Segmentation. 6059-6069 - Feng Luan

, Jiarui Hu, Zhipeng Wang
, Jiguang Yue
, Yanmin Zhou
, Bin He
:
Transition-Aware Point Cloud Completion by a Progressive Refinement Generative Adversarial Network. 6070-6079 - Zewei Xin

, Qinya Li
, Bowen Sheng, Fan Wu
, Guihai Chen
:
Scale-Shift Attention in Polarization Domain for Fine-Grained Classification of Satellite ISAR Images. 6092-6101 - Xiaoxing Guo

, Ming Yang
, Gui-Fu Lu
:
Tensor-Based Late Fusion Incomplete Multiview Clustering. 6102-6112 - Bingzhi Chen

, Zhanhao Ye
, Yishu Liu
, Xiaozhao Fang
, Guangming Lu
, Shengli Xie
, Xuelong Li
:
Toward Robust Semi-Supervised Distribution Alignment Against Label Distribution Shift With Noisy Annotations. 6127-6139 - Anh H. Vo

, Tae-Seok Kim
, Hulin Jin
, Soo-Mi Choi
, Yong-Guk Kim
:
Instruction-Driven 3D Facial Expression Generation and Transition. 6140-6153 - Tongyu Zong

, Yixiang Mao
, Chen Li
, Yong Liu
, Yao Wang
:
Progressive Frame Patching for FoV-Based Point Cloud Video Streaming. 6154-6167 - Fengyuan Liu

, Lingyun Yu
, Quanwei Yang
, Meng Shao
, Hongtao Xie
:
High Fidelity Face Swapping via Facial Texture and Structure Consistency Mining. 6168-6181 - Xiaoyan Yang

, Licheng Jiao
, Yangyang Li
, Xu Liu
, Lingling Li
, Puhua Chen
, Fang Liu
, Wenping Ma
, Shuyuan Yang
:
Tracking Like Human: Dynamic Scene Learning Reasoning Tracker in Satellite Videos. 6182-6197 - Yang Wen

, Bin Luo, Wuzhen Shi
, Jianhua Ji
, Wenming Cao
, Xiaokang Yang
, Bin Sheng
:
SAT-Net: Structure-Aware Transformer-Based Attention Fusion Network for Low-Quality Retinal FunduImages Enhancement. 6198-6210 - Guokai Zhang

, Lanjun Wang
, Yuting Su
, An-An Liu
:
MarkPlugger: Generalizable Watermark Framework for Latent Diffusion Models Without Retraining. 6211-6220 - Guojun Fan, Lei Lu, Zijing Li, Ping Li, Quan Zhou, Zhibin Pan

:
Generalized Skewed Histogram Shifting Based Reversible Data Hiding by Differential Evolution. 6221-6234 - Yi Zhang, Yi Wang, Yawen Cui, Lap-Pui Chau:

3DGeoDet: General-Purpose Geometry-Aware Image-Based 3D Object Detection. 6235-6247 - Qiudan Zhang, Kaiyu Ji, Jie Zhang, Xu Wang

, Zhaoqing Pan
, Jianmin Jiang
:
Hierarchical Uncertainty-Aware Salient Object Detection for $360 ^{\circ }$ Images via Bi-Projection Collaborative Learning. 6248-6261 - Yan Gan

, Xinyao Xiao, Tao Xiang
, Chengqian Wu
, Deqiang Ouyang
:
SFCM-AEG: Source-Free Cross-Modal Adversarial Example Generation. 6262-6272 - Mingsheng Li

, Sijin Chen, Shengji Tang, Hongyuan Zhu
, Yanyan Fang, Xin Chen
, Zhuoyuan Li, Fukun Yin
, Tao Chen
:
WI3D: Weakly Incremental 3D Detection via Vision Foundation Models. 6273-6283 - Yunzhi Teng, Xiaoke Huang

, Kejie Li, Xiao-Ping Zhang
, Yansong Tang
:
DEHand: Deformable Encoding for Photo-Realistic Free-View and Free-Pose Hand Rendering. 6284-6295 - Weijia Liu

, Bo Miao
, Jiuxin Cao
, Xuelin Zhu
, Jiawei Ge
, Bo Liu
, Mehwish Nasim, Ajmal Mian
:
Context-Enhanced Video Moment Retrieval With Large Language Models. 6296-6306 - Bowen Wang, Lei Zhu

, Fengling Li
, Hui Cui
, Jingjing Li
:
Plug-In Open-Set Cross-Modal Hashing. 6319-6334 - Yifeng Ma

, Suzhen Wang
, Yu Ding
, Bowen Ma
, Tangjie Lv
, Changjie Fan
, Zhipeng Hu
, Zhidong Deng
, Xin Yu
:
TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles. 6335-6346 - Jihyun Kim, Junho Park, Kyeongbo Kong

, Suk-Ju Kang
:
Programmable-Room: Interactive Textured 3D Room Meshes Generation Empowered by Large Language Models. 6358-6368 - Ning Li

, Bineng Zhong
, Qihua Liang
, Zhiyi Mo
, Shuxiang Song
:
Robust Multi-Stage Tracking via Multi-Scale and Multi-Level Representation Learning. 6369-6381 - Qiang Zheng

, Chao Zhang, Jian Sun
:
PointMT: Efficient Point Cloud Analysis With Hybrid MLP-Transformer Architecture. 6382-6396 - Linhao Zhang

, Li Jin
, Xiaoyu Li
, Xian Sun, Xin Wang, Zequn Zhang, Jian Liu
, Zhicong Lu
, Guangluan Xu:
Flexible Optimal Transport With Contrastive Graphical Modeling for Multimodal Hate Detection. 6397-6409 - Youze Wang

, Wenbo Hu
, Yinpeng Dong
, Hanwang Zhang
, Hang Su
, Richang Hong
:
Exploring Transferability of Multimodal Adversarial Samples for Vision-Language Pre-Training Models With Contrastive Learning. 6410-6421 - Shuowen Yang

, Fernando Pérez-Bueno, Hanlin Qin
, Rafael Molina
, Aggelos K. Katsaggelos
:
LCNet: Lightweight Cycle Network Driven by Physical and Deep Prior for Compressed Sensing. 6422-6433 - Jianjian Yin

, Tao Chen
, Gensheng Pei
, Huafeng Liu, Yazhou Yao
, Liqiang Nie
, XianSheng Hua
:
Semi-Supervised Semantic Segmentation With Multi-Constraint Consistency Learning. 6449-6461 - Ran Ran

, Jiwei Wei
, Yuyang Zhou, Xiang Guan
, Yang Yang
, Heng Tao Shen
:
HCFMN: Hierarchical Cross-Modal Fine-Grained Mining Network for Temporal Sentence Grounding. 6462-6474 - Nan Wu

, Chunfang Yang
, Baojun Qi
, Ma Zhu
, Jiangshan Li
, Xiangyang Luo
:
CCIGeo: Cross-View and Cross-Day-Night Image Geo-Localization Using Daytime Image Supervision. 6475-6488 - Qing Wang

, Yajian Wang, Hang Chen
, Shuxian Wang, Jun Du
, Chin-Hui Lee
:
Video Segmentation and Tokenization for Model-Based Video Scene Classification. 6489-6502 - Zhuo Feng

, Hongjie He
, Fan Chen
, Jie Bai:
Lightweight and Controllable Privacy-Preserving Image Retrieval in Multi-User Settings. 6503-6515 - Yabin Zhu

, Xiao Wang
, Chenglong Li
, Bo Jiang
, Lin Zhu
, Zhixiang Huang
, Yonghong Tian
, Jin Tang
:
CRSOT: Cross-Resolution Object Tracking Using Unaligned Frame and Event Cameras. 6529-6542 - Lijun Dong, Wei Ma

, Hongbin Zha
:
EdgeMaskFormer: Adapting Mask Transformer for Semantic Edge Detection. 6543-6554 - Jiaxin Han, Feng Li

, Anqi Li, Mengmeng Zhang
, Huihui Bai
, Jimin Xiao
, Yao Zhao
:
Enhancing Light Field Salient Object Detection With Variance-Maximized Key Focal Slice Selection. 6555-6567 - Liuqing Zhao

, Zichen Tian, Peng Zou
, Richang Hong
, Qianru Sun
:
Synthesizing Multi-Person and Rare Pose Images for Human Pose Estimation. 6568-6580 - Ke Wang

, Qi Ma
, Xingcan Li, Chongqiang Shen, Rui Leng, Jianbo Lu
:
UBTransformer: Uncertainty-Based Transformer Model for Complex Scenarios Detection in Autonomous Driving. 6581-6592 - Hongyu Qu

, Rui Yan
, Xiangbo Shu
, Hailiang Gao
, Peng Huang
, Guosen Xie
:
MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition. 6593-6605 - Xiaoyan Sun, De Cheng

, Yan Li
, Nannan Wang
, Dingwen Zhang
, Xinbo Gao
, Jiande Sun
:
Progressive Prompt-Driven Low-Light Image Enhancement With Frequency Aware Learning. 6620-6634 - Guangyong Gao

, Tongchao Feng
, Chongtao Guo
, Zhihua Xia
, Yun Q. Shi:
A Blockchain and Improved Perception Hash Based Copyright Protection Scheme for Purely Chromatic Background Images. 6635-6647 - Feng Hou

, Jin Yuan
, Ying Yang
, Yao Zhang
, Yang Liu
, Yang Zhang
, Cheng Zhong, Zhongchao Shi
, Jianping Fan
, Zhiqiang He
, Yong Rui
:
DomainVerse: A Benchmark Towards Real-World Distribution Shifts for Training-Free Adaptive Domain Generalization. 6648-6660 - Qingshan Hou

, Yaqi Wang
, Peng Cao
, Jianguo Ju
, Huijuan Tu
, Xiaoli Liu
, Jinzhu Yang
, Huazhu Fu
, Yih Chung Tham
, Osmar R. Zaïane:
Pathology-Preserving Transformer Based on Multicolor Space for Low-Quality Medical Image Enhancement. 6661-6676 - Zhuangzi Li

, Shan Liu
, Wei Gao
, Guanbin Li
, Ge Li
:
S4R: Rethinking Point Cloud Sampling via Guiding Upsampling-Aware Perception. 6677-6689 - Ting Yu

, Yifei Wu
, Qiongjie Cui, Qingming Huang
, Jun Yu
:
MossVLN: Memory-Observation Synergistic System for Continuous Vision-Language Navigation. 6690-6704 - Wenbo Xu, Huaxi Huang

, Yongshun Gong
, Litao Yu
, Qiang Wu
, Jian Zhang
:
Hierarchical Multi-Prototype Discrimination: Boosting Support-Query Matching for Few-Shot Segmentation. 6705-6718 - Xiao He

, Chang Tang
, Xinwang Liu
, Wei Zhang
, Zhimin Gao, Chuankun Li
, Shaohua Qiu
, Jiangfeng Xu:
Spectral Discrepancy and Cross-Modal Semantic Consistency Learning for Object Detection in Hyperspectral Images. 6719-6731 - Peipei Song, Long Zhang, Long Lan, Weidong Chen, Dan Guo, Xun Yang, Meng Wang:

Towards Efficient Partially Relevant Video Retrieval With Active Moment Discovering. 6740-6751 - Huixin Hu, Feng Shao

, Hangwei Chen
, Xiongli Chai
, Qiuping Jiang
:
Cross-Projection Distilling Knowledge for Omnidirectional Image Quality Assessment. 6752-6765 - Ziming Li

, Yaxin Liu
, Chuanpeng Yang, Yan Zhou
, Songlin Hu
:
ROSA: A Robust Self-Adaptive Model for Multimodal Emotion Recognition With Uncertain Missing Modalities. 6766-6779 - Huimin Yan

, Xian Yang
, Liang Bai
, Jiamin Li
, Jiye Liang
:
Multi-Grained Vision-and-Language Model for Medical Image and Text Alignment. 6780-6792 - Xiaolong Guo, Chengxu Liu

, Xueming Qian
, Zhixiao Wang, Xubin Feng
, Yao Xue
:
Single-Domain Generalized Object Detection With Frequency Whitening and Contrastive Learning. 6805-6818 - Jiancong Feng, Yuan-Gen Wang

, Mingjie Li
, Fengchuang Xing
:
Image Super-Resolution With Taylor Expansion Approximation and Large Field Reception. 6819-6830 - Zhuolin Tan

, Chenqiang Gao
, Anyong Qin
, Ruixin Chen, Tiecheng Song
, Feng Yang
, Deyu Meng
:
Towards Student Actions in Classroom Scenes: New Dataset and Baseline. 6831-6844 - Jiahao Wang

, Gang Pan
, Di Sun
, Jinyuan Li
, Jiawan Zhang
:
AFAN: An Attention-Driven Forgery Adversarial Network for Blind Image Inpainting. 6845-6856 - Sida Tian, Can Zhang

, Wei Yuan, Wei Tan, Wenjie Zhu:
XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework. 6857-6871 - Lianqiang Gan

, Junyu Lai
, Junhong Zhu
, Huashuo Liu
, Lianli Gao
:
Motion Direction Awareness: A Biomimetic Dynamic Capture Mechanism for Video Prediction. 5946-5960 - Kun Ouyang, Liqiang Jing, Xuemeng Song, Meng Liu, Yupeng Hu, Liqiang Nie:

Sentiment-Enhanced Graph-Based Sarcasm Explanation in Dialogue. 6080-6091 - Xiaoheng Jiang

, Yingjie Li
, Feng Yan
, Yang Lu
, Changsheng Xu
, Mingliang Xu
:
MGDefect: A Mask-Guided High-Quality Defect Image Generation Method for Improving Defect Inspection. 6113-6126 - Siqi Zhang

, Yanyuan Qiao
, Qunbo Wang
, Longteng Guo
, Zhihua Wei
, Jing Liu
:
FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks. 6307-6318 - Ting-Wei Zhou

, Xi-Le Zhao
, Jian-Li Wang
, Yi-Si Luo
, Min Wang
, Xiao-Xuan Bai, Hong Yan
:
DTR: A Unified Deep Tensor Representation Framework for Multimedia Data Recovery. 6347-6357 - Laijin Meng

, Xinghao Jiang
, Qiang Xu
, Tanfeng Sun
:
A Robust Coverless Video Steganography Based on Two-Level DCT Features Against Video Attacks. 6434-6448 - Hengsheng Lun, Ke Lu

, Liping Hou, Shuhua Wang
, Jian Xue
:
Beyond 3D: Generic IoU for 3D Object Detection. 6516-6528 - Sentao Chen

, Ping Xuan, Zhifeng Hao
:
Joint Distribution Weighted Alignment for Multi-Source Domain Adaptation via Kernel Relative Entropy Estimation. 6606-6619 - Mulin Chen

, Yajie Wang
, Xuelong Li
:
PIMG: Progressive Image-to-Music Generation With Contrastive Diffusion Models. 6732-6739 - Xin Jiang, Lihuo He

, Fei Gao
, Kaifan Zhang, Jie Li
, Xinbo Gao
:
Boosting Modal-Specific Representations for Sentiment Analysis With Incomplete Modalities. 6793-6804 - Zhiyuan Zhou

, Yanrong Guo
, Shijie Hao
, Richang Hong
:
Multi-Modal Depression Detection in Interview via Exploring Emotional Distribution Information. 6872-6883 - Yixuan Zhou

, Xing Xu
, Zhe Sun
, Jingkuan Song
, Andrzej Cichocki
, Heng Tao Shen
:
VQ-Flow: Taming Normalizing Flows for Multi-Class Anomaly Detection via Hierarchical Vector Quantization. 6884-6895 - Mingfu Xiong

, Kaikang Hu, Zhongyuan Wang
, Ruimin Hu
, Khan Muhammad
, Javier Del Ser
, Xiaokang Yang
, Bin Sheng
:
Adaptive Clustering and Weighted Regularization Contrastive Learning Framework for Unsupervised Person Re-Identification. 6896-6907 - Yan Liu

, Hongyuan Zhu
, Yinjie Lei
, Hao Liu
, Yun Pei, Yulan Guo
:
SF-City: A Source-Free Domain Adaptation Method for City-Scale Point Cloud Semantic Segmentation. 6908-6921 - Shuhong Chen

, Kairen Chen, Guojun Wang
, Sheng Wen
, Zhili Zhou:
DSLL-Face: Distributed Supervision-Integrated Framework for Low-Light Face Detection. 6922-6932 - Tengyu Yin

, Hongmei Chen
, Jihong Wan, Keyu Liu
, Zhong Yuan
, Chuan Luo
, Shi-Jinn Horng
, Tianrui Li
:
Leveraging Fuzzy Manifold Intra-Class Correlation and Inter-Class Separability for Online Multilabel Streaming Features Analysis. 6933-6948 - Yuehao Yin

, Huiyan Qi
, Bin Zhu
, Jingjing Chen
, Yu-Gang Jiang
, Chong-Wah Ngo
:
FoodLMM: A Versatile Food Assistant Using Large Multi-Modal Model. 6949-6961 - Bohao Fan

, Wenzhao Zheng
, Jianjiang Feng
, Jie Zhou
:
LiDAR-HMR: 3D Human Mesh Recovery From LiDAR. 6962-6975 - Yanglin Feng

, Yang Qin
, Dezhong Peng
, Hongyuan Zhu
, Xi Peng
, Peng Hu
:
PointCloud-Text Matching: Benchmark Dataset and Baseline. 6986-6995 - Hexu Xing

, Torsten Braun
:
A Hybrid Network for Extended Reality Environments. 6996-7011 - Si Chen

, Liuxiang Qiu, Da-Han Wang
, Wentao Zhu
, Yang Hua
, Yan Yan
:
Hierarchical Token-Aware Cross-Modality Reconstruction for Visible-Infrared Person Re-Identification. 7012-7027 - Siyu Yi

, Zhengyang Mao
, Yifan Wang
, Yiyang Gu
, Zhiping Xiao
, Chong Chen
, Xian-Sheng Hua
, Ming Zhang
, Wei Ju
:
Hypergraph Consistency Learning With Relational Distillation. 7028-7039 - Zhenzhen Hu

, Ao Sun, Zhenshan Wang, Jia Li
, Zijie Song
, Richang Hong
, Meng Wang
:
Adaptive Dual Video Summarization: From Dynamic Keyframes to Captions. 7040-7052 - Xingyu Zhu

, Xiangbo Shu
, Peng Huang
, Jinhui Tang
:
Prompt-Guided Prototype-Aware Commonality and Discrimination Learning for Zero-Shot Skeleton-Based Action Recognition. 7053-7066 - Zhong Ji

, Zhihao Li
, Yan Zhang
, Yanwei Pang
, Xuelong Li
:
Visual Semantic Contextualization Network for Multi-Query Image Retrieval. 7067-7080 - Pan Liu, Jing Li

, Meng Zhao
, Wanli Xue
, Qinghua Hu
, Shengyong Chen
:
Domain-Division Based Progressive Learning for Source-Free Domain Adaptation. 7081-7092 - Bosheng Qin

, Wentao Ye, Chi Zhang
, Qifan Yu, Wenqiao Zhang
, Siliang Tang
, Yueting Zhuang
:
Truncate Diffusion: Efficient Video Editing With Low-Rank Truncate. 7093-7108 - Duo Chen

, Zixin Tang
, Ke Song, Xingyu Peng, Wuque Cai
, Hongze Sun
, Dezhong Yao
, Daqing Guo
:
Manifold Embedding for Fast and Accurate 3D Reconstruction. 7109-7124 - Chi Yung

, Scott C.-H. Huang
, Hsiao-Chun Wu
, Che-Hua Li:
Novel Secure and Robust Recoverable Cryptographic Mosaic Technique. 7137-7151 - Shuofeng Sun, Yongming Rao, Jiwen Lu

, Haibin Yan
:
PointMax: Self-Boosted Local Sampling for 3D Point Cloud Analysis. 7152-7165 - Zhiquan Wen

, Mingkui Tan
, Yaowei Wang
, Qingyao Wu
, Qi Wu
:
Enhanced Reasoning via Multimodal LLMs and Collaborative Inference. 7166-7178 - Sarah Fachada

, Daniele Bonatto
, Gauthier Lafruit
, Mehrdad Teratani
:
Micro-Image Domain View Synthesizer for Free Navigation With Focused Plenoptic Cameras. 7179-7191 - Wanting Zhou

, Yiwei Ru
, Yushan Han
, Longteng Kong
, Zijian Wang
, Yong He, Zhenan Sun
:
CASIA-PR-V1: A Multi-Ethnic, Multi-Device and Cross-Spectral Dataset and a Multiscale Disentangled Model for Periocular Recognition. 7192-7204 - Zihui Zhang

, Chenghao Xu
, Jiexi Yan
, Cheng Deng
:
Bilevel Direction Preserving for Few-Shot Open-Set Recognition. 7205-7214 - Kehua Qu

, Rui Ding
, Jin Tang
:
Relation Learning and Aggregate-Attention for Multi-Person Motion Prediction. 7215-7229 - Yubo Cui

, Zhikang Zou
, Xiaoqing Ye
, Xiao Tan
, Zhiheng Li
, Zheng Fang
:
Coupling and Decoupling: Towards Temporal Feedback for 3D Object Detection. 7230-7242 - Jing Huo, Shiyin Jin, Jiashen Li, Pinzhuo Tian, Wenbin Li, Jing Wu, Yu-Kun Lai

, Yang Gao
:
Dictionary Based Generative Adversarial Network for Multi-Collection Style Transfer. 7243-7254 - Jianwei Zheng

, Ni Xu
, Wei Li
, Jiawei Jiang
, Xiaoqin Zhang
:
Semantic-Spatial Attention for Refined Object Placement in Text-to-Image Synthesis. 7255-7270 - Ruoyu Guo

, Maurice Pagnucco
, Yang Song
:
Exploring Multi-Feature Relationship in Retinex Decomposition for Low-Light Image Enhancement. 7271-7284 - Yang Xu

, Yifan Feng
, Xu Zhuang, Jason Wang, Zongze Wu
, Yue Gao
:
Residual Fuzzy Alignment on Hypergraph for Open-Set 3D Cross-Modal Retrieval. 7285-7298 - Zhaoran Zhao

, Peng Lu
, Xujun Peng, Wenhao Guo
:
Self-Supervised Photographic Image Layout Representation Learning. 7299-7313 - Shipei Wang, Ping An, Chao Yang, Gongyang Li

, Xinpeng Huang
, Shiqi Wang:
Feature Quality Assessment: A Database and A Lightweight Objective Method. 7314-7325 - Zhijie Lin

, Zhaoshui He
, Chang Liu
, Hao Liang
, Wenqing Su
, Ji Tan, Jing Guo
:
A Collaborative Learning Framework With Coupling Graph Transformers for 3D Tooth Segmentation. 7340-7352 - Wenhe Chen

, Yuan Chai, Xiaojun Wu
, Hongjin Zhu, Qian Yu
, Zhuo-Ming Du
, Feilong Han
, Wei Gao,


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID