


default search action
IEEE Transactions on Multimedia, Volume 25
Volume 25, 2023
- Zan-Xia Jin
, Heran Wu, Chun Yang, Fang Zhou
, Jingyan Qin, Lei Xiao, Xu-Cheng Yin
:
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering. 1-12 - Yu Wang
, Shiwei Chen:
Multi-Agent Trajectory Prediction With Spatio-Temporal Sequence Fusion. 13-23 - Jiayi Xie, Yaochen Zhu
, Zhenzhong Chen
:
Micro-Video Popularity Prediction Via Multimodal Variational Information Bottleneck. 24-37 - Zhicheng Guo
, Jiaxuan Zhao
, Licheng Jiao
, Xu Liu
, Fang Liu
:
A Universal Quaternion Hypergraph Network for Multimodal Video Question Answering. 38-49 - Xiao Lin
, Shuzhou Sun, Wei Huang, Bin Sheng
, Ping Li
, David Dagan Feng
:
EAPT: Efficient Attention Pyramid Transformer for Image Processing. 50-61 - Zhi Li
, Haoliang Li
, Xin Luo, Yongjian Hu
, Kwok-Yan Lam
, Alex C. Kot
:
Asymmetric Modality Translation for Face Presentation Attack Detection. 62-76 - Wei Lu
, Desheng Li, Liqiang Nie
, Peiguang Jing
, Yuting Su
:
Learning Dual Low-Rank Representation for Multi-Label Micro-Video Classification. 77-89 - Yun Wang
, Tong Zhang
, Chuanwei Zhou
, Zhen Cui
, Jian Yang
:
Instance-Aware Deep Graph Learning for Multi-Label Classification. 90-99 - Jae Young Choi
, Bumshik Lee
:
Combining Deep Convolutional Neural Networks With Stochastic Ensemble Weight Optimization for Facial Expression Recognition in the Wild. 100-111 - Zerui Shao
, Yifei Pu, Jiliu Zhou, Bihan Wen
, Yi Zhang
:
Hyper RPCA: Joint Maximum Correntropy Criterion and Laplacian Scale Mixture Modeling on-the-Fly for Moving Object Detection. 112-125 - Yajing Liu, Zhiwei Xiong
, Ya Li, Xinmei Tian
, Zheng-Jun Zha
:
Domain Generalization Via Encoding and Resampling in a Unified Latent Space. 126-139 - Hangwei Chen
, Xiongli Chai
, Feng Shao
, Xuejin Wang, Qiuping Jiang
, Xiangchao Meng
, Yo-Sung Ho
:
Perceptual Quality Assessment of Cartoon Images. 140-153 - Yang Li
, Shengbin Meng, Xinfeng Zhang
, Meng Wang
, Shiqi Wang
, Yue Wang, Siwei Ma
:
User-Generated Video Quality Assessment: A Subjective and Objective Study. 154-166 - Yan Yang
, Jun Yu
, Jian Zhang
, Weidong Han
, Hanliang Jiang, Qingming Huang
:
Joint Embedding of Deep Visual and Semantic Features for Medical Image Report Generation. 167-178 - Hancheng Zhu
, Yong Zhou
, Leida Li
, Yaqian Li
, Yandong Guo:
Learning Personalized Image Aesthetics From Subjective and Objective Attributes. 179-190 - Jun Cheng
, Fusheng Hao
, Fengxiang He
, Liu Liu
, Qieshi Zhang
:
Mixer-Based Semantic Spread for Few-Shot Learning. 191-202 - Haojie Yuan
, Qi Chu
, Feng Zhu
, Rui Zhao, Bin Liu
, Nenghai Yu
:
AutoMA: Towards Automatic Model Augmentation for Transferable Adversarial Attacks. 203-213 - Zefan Li
, Bingbing Ni
, Xiaokang Yang
, Wenjun Zhang
, Wen Gao:
Residual Quantization for Low Bit-Width Neural Networks. 214-227 - Zhaoliang Chen
, Jie Yao, Guobao Xiao
, Shiping Wang
:
Efficient and Differentiable Low-Rank Matrix Completion With Back Propagation. 228-242 - Tong Xue
, Abdallah El Ali
, Tianyi Zhang
, Gangyi Ding, Pablo César
:
CEAP-360VR: A Continuous Physiological and Behavioral Emotion Annotation Dataset for 360$^\circ$ VR Videos. 243-255 - Gaosheng Liu
, Huanjing Yue
, Jiamin Wu
, Jing-Yu Yang
:
Intra-Inter View Interaction Network for Light Field Image Super-Resolution. 256-266 - Zhihao Wu
, Jie Wen
, Yong Xu
, Jian Yang
, David Zhang
:
Multiple Instance Detection Networks With Adaptive Instance Refinement. 267-279 - Yanhua Yang, Xiaozhe Zhang, Muli Yang
, Cheng Deng
:
Adaptive Bias-Aware Feature Generation for Generalized Zero-Shot Learning. 280-290 - Tung-I Chen, Yueh-Cheng Liu
, Hung-Ting Su, Yu-Cheng Chang, Yu-Hsiang Lin, Jia-Fong Yeh
, Wen-Chin Chen, Winston H. Hsu
:
Dual-Awareness Attention for Few-Shot Object Detection. 291-301 - Laizhong Cui
, Erchao Ni, Yipeng Zhou
, Zhi Wang
, Lei Zhang
, Jiangchuan Liu
, Yuedong Xu
:
Towards Real-Time Video Caching at Edge Servers: A Cost-Aware Deep Q-Learning Solution. 302-314 - Sutong Wang
, Jiacheng Zhu, Yunqiang Yin, Dujuan Wang
, T. C. Edwin Cheng
, Yanzhang Wang:
Interpretable Multi-Modal Stacking-Based Ensemble Learning Method for Real Estate Appraisal. 315-328 - Zhihao Zhang
, Xianqiang Yang
, Chao Xu
:
Natural Image Stitching With Layered Warping Constraint. 329-338 - Hao Tang
, Guoshuai Zhao
, Yuxia Wu
, Xueming Qian
:
Multisample-Based Contrastive Loss for Top-K Recommendation. 339-351 - Ke Zhang
, Chun Yuan
, Yiming Zhu, Yong Jiang
, Lishu Luo:
Weakly Supervised Instance Segmentation by Exploring Entire Object Regions. 352-363 - Astha Verma
, A. Venkata Subramanyam
, Zheng Wang
, Shin'ichi Satoh
, Rajiv Ratn Shah
:
Unsupervised Domain Adaptation for Person Re-Identification Via Individual-Preserving and Environmental-Switching Cyclic Generation. 364-377 - Carlos M. Lentisco
, Luis Bellido
, Andrés Cárdenas
, Ricardo Flores Moyano
, David Fernández
:
Design of a 5G Multimedia Broadcast Application Function Supporting Adaptive Error Recovery. 378-388 - Huicong Wu
, Liang Xiao, Le Sun
, Byeungwoo Jeon
:
A Novel Video Stabilization Model With Motion Morphological Component Priors. 389-404 - Xuehao Gao
, Yang Yang
, Yimeng Zhang
, Maosen Li
, Jin-Gang Yu
, Shaoyi Du
:
Efficient Spatio-Temporal Contrastive Learning for Skeleton-Based 3-D Action Recognition. 405-417 - Cheng Xue, Xionghu Zhong
, Minjie Cai
, Hao Chen
, Wenwu Wang
:
Audio-Visual Event Localization by Learning Spatial and Semantic Co-Attention. 418-429 - Guang Han
, Jinpeng Su, Yaoming Liu, Yuqiu Zhao, Sam Kwong
:
Multi-Stage Visual Tracking With Siamese Anchor-Free Proposal Network. 430-442 - Lei Yu
, Bishan Wang
, Jingwei He, Gui-Song Xia
, Wen Yang
:
Single Image Deraining With Continuous Rain Density Estimation. 443-456 - Jianjun Xiang
, Gangyi Jiang
, Mei Yu
, Zhidi Jiang
, Yo-Sung Ho
:
No-Reference Light Field Image Quality Assessment Using Four-Dimensional Sparse Transform. 457-472 - Mehdi Rahmati
, Zhuoran Qi
, Dario Pompili:
Underwater Adaptive Video Transmissions Using MIMO-Based Software-Defined Acoustic Modems. 473-485 - Nan Jiang
, Kuiran Wang, Xiaoke Peng
, Xuehui Yu, Qiang Wang, Junliang Xing, Guorong Li
, Guodong Guo, Qixiang Ye
, Jianbin Jiao
, Jian Zhao
, Zhenjun Han
:
Anti-UAV: A Large-Scale Benchmark for Vision-Based UAV Tracking. 486-500 - Yujie Huang
, Ming-e Jing, Jinjia Zhou
, Yuhao Liu
, Yibo Fan
:
LCCStyle: Arbitrary Style Transfer With Low Computational Complexity. 501-514 - Jing Yi, Yaochen Zhu
, Jiayi Xie, Zhenzhong Chen
:
Cross-Modal Variational Auto-Encoder for Content-Based Micro-Video Background Music Recommendation. 515-528 - Luntian Mou
, Chao Zhou, Pengtao Xie, Pengfei Zhao
, Ramesh C. Jain
, Wen Gao, Baocai Yin
:
Isotropic Self-Supervised Learning for Driver Drowsiness Detection With Attention-Based Multimodal Fusion. 529-542 - Wenhui Li
, Yan Wang, Yuting Su
, Xuanya Li
, An-An Liu
, Yongdong Zhang
:
Multi-Scale Fine-Grained Alignments for Image and Sentence Matching. 543-556 - Yongqiang Kong
, Yunhong Wang
, Annan Li
, Qiuyu Huang:
Self-Sufficient Feature Enhancing Networks for Video Salient Object Detection. 557-571 - Qinchuan Zhang
, Yi Jiang, Qin Zhou, Yiru Zhao, Yao Liu, Hongtao Lu
, Xian-Sheng Hua
:
Single Person Dense Pose Estimation via Geometric Equivariance Consistency. 572-583 - Kailun Zhou
, Liping Zhao
, Zigao Ye
, Huihui Wang, Tao Lin
, Sheng Feng
, Yufen Yang
:
Equal Value String and Copy Above String Based String Prediction for SCC in AVS3. 584-592 - Maja Krivokuca
, Ehsan Miandji
, Christine Guillemot
, Philip A. Chou
:
Compression of Plenoptic Point Cloud Attributes Using 6-D Point Clouds and 6-D Transforms. 593-607 - Xiaoqing Luo
, Yuanhao Gao, Anqi Wang
, Zhancheng Zhang
, Xiaojun Wu
:
IFSepR: A General Framework for Image Fusion Based on Separate Representation Learning. 608-623 - Shihao Xu
, Haocong Rao
, Xiping Hu
, Jun Cheng
, Bin Hu
:
Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton Based Action Recognition. 624-634 - Huabing Zhou
, Wei Wu
, Yanduo Zhang
, Jiayi Ma
, Haibin Ling
:
Semantic-Supervised Infrared and Visible Image Fusion Via a Dual-Discriminator Generative Adversarial Network. 635-648 - Ming Li, Bin Fu
, Zhengfu Zhang, Yu Qiao
:
Character-Aware Sampling and Rectification for Scene Text Recognition. 649-661 - Mingyue Su
, Guanghua Gu
, Xianlong Ren, Hao Fu, Yao Zhao
:
Semi-Supervised Knowledge Distillation for Cross-Modal Hashing. 662-675 - Lei Zhu
, Xiaoqiang Wang
, Ping Li
, Xin Yang, Qing Zhang, Weiming Wang
, Carola-Bibiane Schönlieb
, C. L. Philip Chen
:
S $^3$ Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection. 676-689 - Xinjue Hu
, Yuxuan Pan
, Yumei Wang
, Lin Zhang, Shervin Shirmohammadi
:
Multiple Description Coding for Best-Effort Delivery of Light Field Video Using GNN-Based Compression. 690-705 - Le Wang
, Qing Li, Sanping Zhou, Nanning Zheng
:
Multi-Panda Tracking. 706-720 - Changsheng Gao, Dong Liu
, Li Li
, Feng Wu:
Towards Task-Generic Image Compression: A Study of Semantics-Oriented Metrics. 721-735 - Pei Lv
, Jianqi Fan
, Xixi Nie
, Weiming Dong
, Xiaoheng Jiang
, Bing Zhou
, Mingliang Xu
, Changsheng Xu
:
User-Guided Personalized Image Aesthetic Assessment Based on Deep Reinforcement Learning. 736-749 - Xiao Tan
, Huaian Chen
, Kai Xu
, Yi Jin
, Changan Zhu
:
Deep SR-HDR: Joint Learning of Super-Resolution and High Dynamic Range Imaging for Dynamic Scenes. 750-763 - Zhen Bai
, Zhi Liu
, Gongyang Li
, Yang Wang
:
Adaptive Group-Wise Consistency Network for Co-Saliency Detection. 764-776 - Chenghu Du
, Feng Yu
, Minghua Jiang
, Ailing Hua, Xiong Wei, Tao Peng
, Xinrong Hu:
VTON-SCFA: A Virtual Try-On Network Based on the Semantic Constraints and Flow Alignment. 777-791 - Shiji Zhou
, Zhi Wang
, Chenghao Hu, Yinan Mao, Haopeng Yan, Shanghang Zhang
, Chuan Wu
, Wenwu Zhu
:
Caching in Dynamic Environments: A Near-Optimal Online Learning Approach. 792-804 - Shuyi Li
, Bob Zhang
, Lunke Fei
, Shuping Zhao
, Yicong Zhou
:
Learning Sparse and Discriminative Multimodal Feature Codes for Finger Recognition. 805-815 - Wenxue Cui
, Shaohui Liu
, Feng Jiang
, Debin Zhao
:
Image Compressed Sensing Using Non-Local Neural Network. 816-830 - Nastaran Nourbakhsh Kaashki
, Pengpeng Hu
, Adrian Munteanu
:
Anet: A Deep Neural Network for Automatic 3D Anthropometric Measurement Extraction. 831-844 - Xiaoyan Cai
, Sen Liu, Junwei Han
, Libin Yang
, Zhenguo Liu, Tianming Liu
:
ChestXRayBERT: A Pretrained Language Model for Chest Radiology Report Summarization. 845-855 - Xuemeng Song
, Shi-Ting Fang
, Xiaolin Chen
, Yinwei Wei
, Zhongzhou Zhao, Liqiang Nie
:
Modality-Oriented Graph Learning Toward Outfit Compatibility Modeling. 856-867 - Jie Nie
, Zian Zhao
, Lei Huang
, Weizhi Nie
, Zhiqiang Wei:
Cross-Domain Recommendation Via User-Clustering and Multidimensional Information Fusion. 868-880 - Haimin Zhang
, Min Xu
:
Recognition of Emotions in User-Generated Videos through Frame-Level Adaptation and Emotion Intensity Learning. 881-891 - Fei Peng
, Bo Long, Min Long
:
A Semi-Fragile Reversible Watermarking for Authenticating 3D Models Based on Virtual Polygon Projection and Double Modulation Strategy. 892-906 - Karam Park
, Jae Woong Soh
, Nam Ik Cho
:
A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution. 907-918 - Ming Li
, Jun Liu
, Ce Zheng
, Xinming Huang
, Ziming Zhang:
Exploiting Multi-View Part-Wise Correlation via an Efficient Transformer for Vehicle Re-Identification. 919-929 - Liyuan Ma
, Kejie Huang
, Dongxu Wei
, Zhaoyan Ming
, Haibin Shen
:
FDA-GAN: Flow-Based Dual Attention GAN for Human Pose Transfer. 930-941 - Chongyang Bai
, Haipeng Chen
, Srijan Kumar, Jure Leskovec
, V. S. Subrahmanian
:
M2P2: Multimodal Persuasion Prediction Using Adaptive Fusion. 942-952 - Prasen Kumar Sharma
, Arun Abraham
, Vikram Nelvoy Rajendiran
:
A Generalized Zero-Shot Quantization of Deep Convolutional Neural Networks Via Learned Weights Statistics. 953-965 - Fan Zhao
, Wenda Zhao
, Huimin Lu
, Yong Liu
, Libo Yao, Yu Liu
:
Depth-Distilled Multi-Focus Image Fusion. 966-978 - Xuanhan Wang
, Yuyu Guo
, Jingkuan Song
, Lianli Gao
, Heng Tao Shen
:
AMANet: Adaptive Multi-Path Aggregation for Learning Human 2D-3D Correspondences. 979-992 - Tiejian Zhang
, Xinwang Liu
, Lei Gong
, Siwei Wang
, Xin Niu
, Li Shen:
Late Fusion Multiple Kernel Clustering With Local Kernel Alignment Maximization. 993-1007 - Yiming Wang
, Dongxia Chang
, Zhiqiang Fu, Yao Zhao
:
Consistent Multiple Graph Embedding for Multi-View Clustering. 1008-1018 - Jingjing Xiong
, Lai-Man Po
, Wing Yin Yu
, Yuzhi Zhao
, Kwok-Wai Cheung:
Distortion Map-Guided Feature Rectification for Efficient Video Semantic Segmentation. 1019-1032 - Wei Qin
, Hanwang Zhang
, Richang Hong
, Ee-Peng Lim
, Qianru Sun
:
Causal Interventional Training for Image Recognition. 1033-1044 - Shikun Li
, Tongliang Liu
, Jiyong Tan
, Dan Zeng
, Shiming Ge
:
Trustable Co-Label Learning From Multiple Noisy Annotators. 1045-1057 - Jiebo Luo
:
Editorial. 1058-1059 - Yonggang Wen
:
Editorial. 1060 - Wenqian Wang
, Faliang Chang
, Chunsheng Liu
, Guangxin Li
, Bin Wang:
GA-Net: A Guidance Aware Network for Skeleton-Based Early Activity Recognition. 1061-1073 - Qifan Wang
, Yinwei Wei
, Jianhua Yin
, Jianlong Wu
, Xuemeng Song
, Liqiang Nie
:
DualGNN: Dual Graph Neural Network for Multimedia Recommendation. 1074-1084 - Xiaoping Liang
, Zhenjun Tang
, Jingli Wu, Zhixin Li
, Xinpeng Zhang
:
Robust Image Hashing With Isomap and Saliency Map for Copy Detection. 1085-1097 - Shuping Zhao
, Lunke Fei
, Jie Wen
, Jigang Wu
, Bob Zhang
:
Intrinsic and Complete Structure Learning Based Incomplete Multiview Clustering. 1098-1110 - Shixiang Wu, Chao Dong
, Yu Qiao
:
Blind Image Restoration Based on Cycle-Consistent Network. 1111-1124 - Jose Jaena Mari Ople, Tai-Ming Huang
, Ming-Chih Chiu
, Yi-Ling Chen
, Kai-Lung Hua
:
Adjustable Model Compression Using Multiple Genetic Algorithm. 1125-1132 - Le Wang
, Mo Zhou
, Zhenxing Niu, Qilin Zhang
, Nanning Zheng
:
Adaptive Ladder Loss for Learning Coherent Visual-Semantic Embedding. 1133-1147 - Weide Liu
, Xiangfei Kong, Tzu-Yi Hung, Guosheng Lin
:
Cross-Image Region Mining With Region Prototypical Network for Weakly Supervised Segmentation. 1148-1160 - Ziqiang Wang
, Zhi Liu
, Gongyang Li
, Yang Wang
, Tianhong Zhang, Lihua Xu, Jijun Wang:
Spatio-Temporal Self-Attention Network for Video Saliency Prediction. 1161-1174 - Rui Wang
, Jun Liu
, Qiuhong Ke
, Duo Peng
, Yinjie Lei
:
Dear-Net: Learning Diversities for Skeleton-Based Early Action Recognition. 1175-1189 - Cheng Wang
, Bingpeng Ma
, Hong Chang
, Shiguang Shan
, Xilin Chen
:
Person Search by a Bi-Directional Task-Consistent Learning Model. 1190-1203 - Jipeng Wu
, Rongrong Ji
, Qiang Wang, Shengchuan Zhang
, Xiaoshuai Sun
, Yan Wang
, Mingliang Xu
, Feiyue Huang:
Fast Monocular Depth Estimation via Side Prediction Aggregation with Continuous Spatial Refinement. 1204-1216 - Di Wang
, Caiping Zhang, Quan Wang
, Yumin Tian, Lihuo He
, Lin Zhao
:
Hierarchical Semantic Structure Preserving Hashing for Cross-Modal Retrieval. 1217-1229 - Min Cao
, Cong Ding
, Chen Chen
, Hao Dou, Xiyuan Hu
, Junchi Yan
:
Progressive Context-Aware Graph Feature Learning for Target Re-Identification. 1230-1242 - Yuting Su
, Wei Zhao, Peiguang Jing
, Liqiang Nie
:
Exploiting Low-Rank Latent Gaussian Graphical Model Estimation for Visual Sentiment Distributions. 1243-1255 - Gaoang Wang
, Yizhou Wang
, Renshu Gu
, Weijie Hu
, Jenq-Neng Hwang
:
Split and Connect: A Universal Tracklet Booster for Multi-Object Tracking. 1256-1268 - Qiao Liu
, Di Yuan
, Nana Fan, Peng Gao
, Xin Li
, Zhenyu He
:
Learning Dual-Level Deep Representation for Thermal Infrared Tracking. 1269-1281 - Wenhao Li
, Hong Liu
, Runwei Ding
, Mengyuan Liu
, Pichao Wang
, Wenming Yang
:
Exploiting Temporal Contexts With Strided Transformer for 3D Human Pose Estimation. 1282-1293 - Mengxi Jia
, Xinhua Cheng, Shijian Lu
, Jian Zhang
:
Learning Disentangled Representation Implicitly Via Transformer for Occluded Person Re-Identification. 1294-1305 - Zhe Tang
, Yi Yang
, Wen Li
, Defu Lian
, Lixin Duan:
Deep Cross-Attention Network for Crowdfunding Success Prediction. 1306-1319 - Kun Zhang
, Zhendong Mao
, An-An Liu
, Yongdong Zhang
:
Unified Adaptive Relevance Distinguishable Attention Network for Image-Text Matching. 1320-1332 - Dongnan Liu
, Chaoyi Zhang
, Yang Song
, Heng Huang
, Chenyu Wang
, Michael Barnett
, Tom Weidong Cai
:
Decompose to Adapt: Cross-Domain Object Detection Via Feature Disentanglement. 1333-1344 - Bin Chen
, Kunhong Liu
, Yong Xu, Qingqiang Wu, Junfeng Yao
:
Block Division Convolutional Network With Implicit Deep Features Augmentation for Micro-Expression Recognition. 1345-1358 - Yingjian Li
, Zheng Zhang
, Bingzhi Chen
, Guangming Lu
, David Zhang
:
Deep Margin-Sensitive Representation Learning for Cross-Domain Facial Expression Recognition. 1359-1373 - Jianjun Sun
, Yan Zhao
, Shigang Wang
, Jian Wei:
3D Holoscopic Image Compression Based on Gaussian Mixture Model. 1374-1389 - Huan Liu
, Wentao Liu, Zhixiang Chi
, Yang Wang
, Yuanhao Yu
, Jun Chen
, Jin Tang:
Fast Human Pose Estimation in Compressed Videos. 1390-1400 - Yujian Feng
, Yimu Ji
, Fei Wu
, Guangwei Gao
, Yang Gao, Tianliang Liu
, Shangdong Liu
, Xiao-Yuan Jing
, Jiebo Luo
:
Occluded Visible-Infrared Person Re-Identification. 1401-1413 - Haoyu Zhao
, Qi Wang, Guowei Zhan, Weidong Min
, Yi Zou, Shimiao Cui:
Need Only One More Point (NOOMP): Perspective Adaptation Crowd Counting in Complex Scenes. 1414-1426 - Jianjun Qian
, Shumin Zhu
, Chaoyu Zhao, Jian Yang
, Wai Keung Wong
:
OTFace: Hard Samples Guided Optimal Transport Loss for Deep Face Representation. 1427-1438 - Tianyu Shen
, Deqi Li, Fei-Yue Wang
, Hua Huang
:
Depth-Aware Multi-Person 3D Pose Estimation With Multi-Scale Waterfall Representations. 1439-1451 - Qianqian Yu
, Keqi Fan
, Yuhui Zheng
:
Domain Adaptive Transformer Tracking Under Occlusions. 1452-1461 - Zhihao Liu
, Yuanyuan Shang, Timing Li
, Guanlin Chen, Yu Wang
, Qinghua Hu
, Pengfei Zhu
:
Robust Multi-Drone Multi-Target Tracking to Resolve Target Occlusion: A Benchmark. 1462-1476 - Zhijing Yang
, Junyang Chen
, Yukai Shi
, Hao Li
, Tianshui Chen
, Liang Lin
:
OccluMix: Towards De-Occlusion Virtual Try-on by Semantically-Guided Mixup. 1477-1488 - Kunyu Peng
, Alina Roitberg
, Kailun Yang
, Jiaming Zhang
, Rainer Stiefelhagen:
Delving Deep Into One-Shot Skeleton-Based Action Recognition With Diverse Occlusions. 1489-1504 - Guangwei Gao
, Lei Tang, Fei Wu
, Huimin Lu
, Jian Yang
:
JDSR-GAN: Constructing an Efficient Joint Learning Network for Masked Face Super-Resolution. 1505-1512 - Puning Zhang
, Fengyi Huang
, Dapeng Wu
, Boran Yang
, Zhigang Yang, Lei Tan:
Device-Edge-Cloud Collaborative Acceleration Method Towards Occluded Face Recognition in High-Traffic Areas. 1513-1520 - Qun Li
, Ziyi Zhang
, Feng Zhang, Fu Xiao
:
HRNeXt: High-Resolution Context Network for Crowd Pose Estimation. 1521-1528 - Chunjie Ma
, Li Zhuo
, Jiafeng Li
, Yutong Zhang, Jing Zhang
:
Cascade Transformer Decoder Based Occluded Pedestrian Detection With Dynamic Deformable Convolution and Gaussian Projection Channel Attention Mechanism. 1529-1537 - Rui Wang
, Yixue Hao
, Long Hu
, Jincai Chen
, Min Chen
, Di Wu
:
Self-Supervised Learning With Data-Efficient Supervised Fine-Tuning for Crowd Counting. 1538-1546 - Yun Lan
, Ruimin Hu
, Xin Xu
, Dengshi Li
, Chao Wang
, Xiaochen Wang:
From Collective Attribute Association of Groups to Precise Attribute Association of Individuals. 1547-1554 - Xingyu Yang, Mengya Han, Yong Luo
, Han Hu
, Yonggang Wen
:
Two-Stream Prototype Learning Network for Few-Shot Face Recognition Under Occlusions. 1555-1563 - Qinyang Zeng
, Chengju Liu
, Ming Liu
, Qijun Chen
:
Contrastive 3D Human Skeleton Action Representation Learning via CrossMoCo With Spatiotemporal Occlusion Mask Data Augmentation. 1564-1574 - Jianping Gou
, Xia Yuan
, Baosheng Yu, Jiali Yu
, Zhang Yi
:
Intra- and Inter-Class Induced Discriminative Deep Dictionary Learning for Visual Recognition. 1575-1583 - Zheng Cao
, Liming Xu, Danny Z. Chen
, Honghao Gao
, Jian Wu
:
A Robust Shape-Aware Rib Fracture Detection and Segmentation Framework With Contrastive Learning. 1584-1591 - Junzhu Mao
, Yazhou Yao
, Zeren Sun
, Xingguo Huang
, Fumin Shen
, Heng Tao Shen
:
Attention Map Guided Transformer Pruning for Occluded Person Re-Identification on Edge Device. 1592-1599 - Yun Li
, Zhe Liu
, Lina Yao
, Xiaojun Chang
:
Attribute-Modulated Generative Meta Learning for Zero-Shot Learning. 1600-1610 - Mingjie Sun
, Jimin Xiao
, Eng Gee Lim
, Yao Zhao
:
Cycle-Free Weakly Referring Expression Grounding With Self-Paced Learning. 1611-1621 - Yang Chen
, Lin Zhang
, Ying Shen
, Brian Nlong Zhao, Yicong Zhou
:
Extrinsic Self-Calibration of the Surround-View System: A Weakly Supervised Approach. 1622-1635 - Rui Gao
, Xingsong Hou
, Jie Qin
, Yuming Shen
, Yang Long, Li Liu, Zhao Zhang
, Ling Shao
:
Visual-Semantic Aligned Bidirectional Network for Zero-Shot Learning. 1649-1664 - Rui Wang
, Zuxuan Wu, Zejia Weng
, Jingjing Chen
, Guo-Jun Qi
, Yu-Gang Jiang
:
Cross-Domain Contrastive Learning for Unsupervised Domain Adaptation. 1665-1673 - Peng Wu
, Xiaotao Liu
, Jing Liu
:
Weakly Supervised Audio-Visual Violence Detection. 1674-1685 - Jinlong Li
, Zequn Jie, Xu Wang
, Yu Zhou
, Xiaolin Wei
, Lin Ma
:
Weakly Supervised Semantic Segmentation Via Progressive Patch Learning. 1686-1699 - Yucheng Shu
, Hengbo Li, Bin Xiao
, Xiuli Bi
, Weisheng Li
:
Cross-Mix Monitoring for Medical Image Segmentation With Limited Supervision. 1700-1712 - Bin Fan
, Yuzhu Yang, Wensen Feng, Fuchao Wu, Jiwen Lu
, Hongmin Liu
:
Seeing Through Darkness: Visual Localization at Night via Weakly Supervised Learning of Domain Invariant Features. 1713-1726 - Tao Chen
, Yazhou Yao
, Lei Zhang
, Qiong Wang
, Guo-Sen Xie
, Fumin Shen
:
Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic Segmentation. 1727-1737 - Yan Luo
, Yongkang Wong
, Mohan S. Kankanhalli
, Qi Zhao
:
Learning to Minimize the Remainder in Supervised Learning. 1738-1748 - Yuhang Zhang
, Xiaopeng Zhang
, Jie Li
, Robert C. Qiu
, Haohang Xu
, Qi Tian
:
Semi-Supervised Contrastive Learning With Similarity Co-Calibration. 1749-1759 - Jingwei Yan
, Jingjing Wang, Qiang Li, Chunmao Wang, Shiliang Pu
:
Weakly Supervised Regional and Temporal Learning for Facial Action Unit Recognition. 1760-1772 - Anran Zhang
, Yandan Yang
, Jun Xu
, Xianbin Cao
, Xiantong Zhen
, Ling Shao
:
Latent Domain Generation for Unsupervised Domain Adaptation Object Counting. 1773-1783 - Pedro H. T. Gama
, Hugo N. Oliveira
, José Marcato Junior
, Jefersson A. dos Santos
:
Weakly Supervised Few-Shot Segmentation via Meta-Learning. 1784-1797 - Xing Lan
, Qinghao Hu, Jian Cheng
:
ATF: An Alternating Training Framework for Weakly Supervised Face Alignment. 1798-1809 - Xiaoliang Qian
, Yinfeng Zeng
, Wei Wang
, Qiuwen Zhang
:
Co-Saliency Detection Guided by Group Weakly Supervised Learning. 1810-1818 - Zhigang Tu
, Jiaxu Zhang
, Hongyan Li
, Yujin Chen
, Junsong Yuan
:
Joint-Bone Fusion Graph Convolutional Network for Semi-Supervised Skeleton Action Recognition. 1819-1831 - Guoliang Hua
, Hong Liu
, Wenhao Li
, Qian Zhang, Runwei Ding
, Xin Xu
:
Weakly-Supervised 3D Human Pose Estimation With Cross-View U-Shaped Graph Convolutional Network. 1832-1843 - Zhuo Huang
, Jian Yang
, Chen Gong
:
They are Not Completely Useless: Towards Recycling Transferable Unlabeled Data for Class-Mismatched Semi-Supervised Learning. 1844-1857 - Peipei Song
, Dan Guo
, Jun Cheng
, Meng Wang
:
Contextual Attention Network for Emotional Video Captioning. 1858-1867 - Huifang Li
, Yidong Li
, Yuanzhouhan Cao
, Yushan Han
, Yi Jin
, Yunchao Wei
:
Weakly Supervised Object Detection With Class Prototypical Network. 1868-1878 - Guangwei Gao
, Yi Yu
, Huimin Lu
, Jian Yang
, Dong Yue
:
Context-Patch Representation Learning With Adaptive Neighbor Embedding for Robust Face Image Super-Resolution. 1879-1889 - Yufei Yin
, Jiajun Deng
, Wengang Zhou
, Li Li
, Houqiang Li
:
FI-WSOD: Foreground Information Guided Weakly Supervised Object Detection. 1890-1902 - Jun Kong
, Xuefeng Tao
, Min Jiang
, Tianshan Liu
:
Weakly Supervised Distribution Discrepancy Minimization Learning With State Information for Person Re-Identification. 1903-1915 - Xiao Dong
, Gengwei Zhang
, Xunlin Zhan, Yi Ding
, Yunchao Wei
, Minlong Lu, Xiaodan Liang
:
Caption-Aided Product Detection via Collaborative Pseudo-Label Harmonization. 1916-1927 - Guodong Ding
, Angela Yao
:
Temporal Action Segmentation With High-Level Complex Activity Labels. 1928-1939 - Cheng Qi
, Zhiyong Feng
, Meng Xing
, Yong Su
, Jinqing Zheng
, Yiming Zhang
:
Energy-Based Temporal Summarized Attentive Network for Zero-Shot Action Recognition. 1940-1953 - Yuke Li
, Pin Wang
, Ching-Yao Chan
:
RESTEP Into the Future: Relational Spatio-Temporal Learning for Multi-Person Action Forecasting. 1954-1963 - Jialun Pei
, Tianyang Cheng, He Tang
, Chuanbo Chen
:
Transformer-Based Efficient Salient Instance Segmentation Networks With Orientative Query. 1964-1978 - Xian Zhong
, Cheng Gu
, Mang Ye
, Wenxin Huang
, Chia-Wen Lin
:
Graph Complemented Latent Representation for Few-Shot Image Classification. 1979-1990 - Yu Qiu
, Yun Liu
, Yanan Chen, Jianwen Zhang
, Jinchao Zhu
, Jing Xu
:
A2SPPNet: Attentive Atrous Spatial Pyramid Pooling Network for Salient Object Detection. 1991-2006 - Li Li
, Zhu Li
, Shan Liu
, Houqiang Li
:
Plenoptic Point Cloud Compression Using Multiview Extension of High Efficiency Video Coding. 2007-2021 - Siwang Zhou
, Xiaoning Deng, Chengqing Li, Yonghe Liu
, Hongbo Jiang
:
Recognition-Oriented Image Compressive Sensing With Deep Learning. 2022-2032 - Zipeng Ye
, Mengfei Xia, Ran Yi
, Juyong Zhang
, Yu-Kun Lai
, Xuwei Huang, Guo-Xin Zhang, Yong-Jin Liu
:
Audio-Driven Talking Face Video Generation With Dynamic Convolution Kernels. 2033-2046 - Chen Li
, Li Song
, Shuai Chen
, Rong Xie, Wenjun Zhang
:
Deep Online Video Stabilization Using IMU Sensors. 2047-2060 - Yufan Hu
, Junyu Gao
, Changsheng Xu
:
Learning Scene-Aware Spatio-Temporal GNNs for Few-Shot Early Action Prediction. 2061-2073 - Mingjie Wang
, Hao Cai
, Xian-Feng Han
, Jun Zhou, Minglun Gong
:
STNet: Scale Tree Network With Multi-Level Auxiliator for Crowd Counting. 2074-2084 - Ming Lu
, Tong Chen
, Zhenyu Dai
, Dong Wang, Dandan Ding
, Zhan Ma
:
Decoder-Side Cross Resolution Synthesis for Video Compression Enhancement. 2097-2110 - Hongguang Zhang
, Hongdong Li
, Piotr Koniusz
:
Multi-Level Second-Order Few-Shot Learning. 2111-2126 - Wenli Song
, Lei Zhang
, Xinbo Gao
:
Compound Projection Learning for Bridging Seen and Unseen Objects. 2127-2139 - Yunxin Li
, Qian Yang, Qingcai Chen
, Baotian Hu
, Xiaolong Wang, Yuxin Ding, Lin Ma
:
Fast and Robust Online Handwritten Chinese Character Recognition With Deep Spatial and Contextual Information Fusion Network. 2140-2152 - Jin Xie
, Yanwei Pang
, Jing Nie
, Jiale Cao
, Jungong Han
:
Latent Feature Pyramid Network for Object Detection. 2153-2163 - Min Wang
, Wengang Zhou
, Qi Tian
, Houqiang Li
:
Deep Graph Convolutional Quantization Networks for Image Retrieval. 2164-2175 - Zelong Zeng
, Zheng Wang
, Fan Yang, Shin'ichi Satoh
:
Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval. 2176-2188 - Yu Pang
, Chengdong Wu
, Hao Wu, Xiaosheng Yu:
Unsupervised Multi-Subclass Saliency Classification for Salient Object Detection. 2189-2202 - Haimin Zhang
, Min Xu
:
Multiscale Emotion Representation Learning for Affective Image Recognition. 2203-2212 - Jiahao Zheng
, Sen Zhang
, Zilu Wang
, Xiaoping Wang
, Zhigang Zeng
:
Multi-Channel Weight-Sharing Autoencoder Based on Cascade Multi-Head Attention for Multimodal Emotion Recognition. 2213-2225 - Nan Jiang, Bin Sheng
, Ping Li
, Tong-Yee Lee
:
PhotoHelper: Portrait Photographing Guidance Via Deep Feature Retrieval and Fusion. 2226-2238 - Xiao Li
, Dong Zhang
, Ming Li
, Dah-Jye Lee
:
Accurate Head Pose Estimation Using Image Rectification and a Lightweight Convolutional Neural Network. 2239-2251 - Yalan Ye
, Tongjie Pan
, Tonghoujun Luo
, Jingjing Li
, Heng Tao Shen
:
Learning MLatent Representations for Generalized Zero-Shot Learning. 2252-2265 - Min Meng
, Mengcheng Lan
, Jun Yu
, Jigang Wu
, Ligang Liu
:
Dual-Level Adaptive and Discriminative Knowledge Transfer for Cross-Domain Recognition. 2266-2279 - Debashri Roy
, Yuanyuan Li
, Tong Jian, Peng Tian, Kaushik Roy Chowdhury
, Stratis Ioannidis
:
Multi-Modality Sensing and Data Fusion for Multi-Vehicle Detection. 2280-2295 - Zhejing Hu
, Yan Liu, Gong Chen
, Yongxu Liu
:
Can Machines Generate Personalized Music? A Hybrid Favorite-Aware Method for User Preference Music Transfer. 2296-2308 - Nayyer Aafaq
, Ajmal Mian
, Naveed Akhtar
, Wei Liu
, Mubarak Shah
:
Dense Video Captioning With Early Linguistic Information Fusion. 2309-2322 - Han Yan, Haijun Zhang
, Linlin Liu, Dongliang Zhou
, Xiaofei Xu, Zhao Zhang
, Shuicheng Yan
:
Toward Intelligent Design: An AI-Based Fashion Designer Using Generative Adversarial Networks Aided by Sketch and Rendering Generators. 2323-2338 - Jiayao Shan
, Sifan Zhou
, Yubo Cui
, Zheng Fang
:
Real-Time 3D Single Object Tracking With Transformer. 2339-2353 - Zheng Chang
, Xinfeng Zhang
, Shanshe Wang
, Siwei Ma
, Wen Gao:
STAM: A SpatioTemporal Attention Based Memory for Video Prediction. 2354-2367 - Dezhi Peng
, Lianwen Jin
, Weihong Ma
, Canyu Xie, Hesuo Zhang, Shenggao Zhu, Jing Li:
Recognition of Handwritten Chinese Text by Segmentation: A Segment-Annotation-Free Approach. 2368-2381 - Junke Wang
, Shaoxiang Chen, Zuxuan Wu, Yu-Gang Jiang
:
FT-TDR: Frequency-Guided Transformer and Top-Down Refinement Network for Blind Face Inpainting. 2382-2392 - Yi-Xing Peng, Jile Jiao
, Xuetao Feng
, Wei-Shi Zheng
:
Consistent Discrepancy Learning for Intra-Camera Supervised Person Re-Identification. 2393-2403 - Lintai Wu
, Yong Xu
, Junhui Hou
, C. L. Philip Chen
, Cheng-Lin Liu
:
A Two-Level Rectification Attention Network for Scene Text Recognition. 2404-2414 - Hang Liu, Menghan Hu
, Yuzhen Chen, Qingli Li
, Guangtao Zhai
, Simon X. Yang
, Xiao-Ping Zhang
, Xiaokang Yang
:
Angel's Girl for Blind Painters: An Efficient Painting Navigation System Validated by Multimodal Evaluation Approach. 2415-2429 - Huakui Zhang
, Yi Cai
, Haopeng Ren, Qing Li
:
Multimodal Topic Modeling by Exploring Characteristics of Short Text Social Media. 2430-2445 - Mengyang Sun
, Wei Suo
, Peng Wang
, Yanning Zhang
, Qi Wu
:
A Proposal-Free One-Stage Framework for Referring Expression Comprehension and Generation via Dense Cross-Attention. 2446-2458 - Jinkun You
, Yuan-Gen Wang
, Guopu Zhu
, Ligang Wu
, Hongli Zhang
, Sam Kwong
:
Estimating the Secret Key of Spread Spectrum Watermarking Based on Equivalent Keys. 2459-2473 - Ziqiang Zheng, Yi Bin
, Xiaoou Lv, Yang Wu, Yang Yang
, Heng Tao Shen
:
Asynchronous Generative Adversarial Network for Asymmetric Unpaired Image-to-Image Translation. 2474-2487 - Weihe Li
, Jiawei Huang
, Shiqi Wang, Chuliang Wu, Sen Liu
, Jian-xin Wang
:
An Apprenticeship Learning Approach for Adaptive Video Streaming Based on Chunk Quality and User Preference. 2488-2502 - Xiaoya Zhang
, Shumin Zhang, Zhen Cui
, Zechao Li
, Jin Xie, Jian Yang
:
Tube-Embedded Transformer for Pixel Prediction. 2503-2514 - Zhi Jin
, Junjia Huang, Wenjin Wang
, Aolin Xiong, Xiaojun Tan
:
Estimating Human Weight From a Single Image. 2515-2527 - Chuan Qin
, Jinchuan Hu
, Fengyong Li
, Zhenxing Qian
, Xinpeng Zhang
:
JPEG Image Encryption With Adaptive DC Coefficient Prediction and RS Pair Permutation. 2528-2542 - Lisha Wang
, Chenglin Li
, Wenrui Dai
, Shaohui Li
, Junni Zou
, Hongkai Xiong
:
QoE-Driven Adaptive Streaming for Point Clouds. 2543-2558 - Mengmeng Jing
, Lichao Meng
, Jingjing Li
, Lei Zhu
, Heng Tao Shen
:
Adversarial Mixup Ratio Confusion for Unsupervised Domain Adaptation. 2559-2572 - Shaocan Liu
, Xin Ma
:
Attention-Driven Appearance-Motion Fusion Network for Action Recognition. 2573-2584 - Farzad Tashtarian
, Abdelhak Bentaleb
, Alireza R. Erfanian
, Hermann Hellwagner
, Christian Timmerer
, Roger Zimmermann
:
$\mathsf{HxL3}$: Optimized Delivery Architecture for HTTP Low-Latency Live Streaming. 2585-2600 - Xiaomei Zhang
, Yingying Chen
, Ming Tang
, Jinqiao Wang
, Xiangyu Zhu
, Zhen Lei
:
Human Parsing With Part-Aware Relation Modeling. 2601-2612 - Chengrun Qiu
, Dongheng Zhang
, Yang Hu
, Houqiang Li
, Qibin Sun, Yan Chen
:
Radio-Assisted Human Detection. 2613-2623 - Pengfei Wang
, Changxing Ding
, Wentao Tan, Mingming Gong
, Kui Jia
, Dacheng Tao:
Uncertainty-Aware Clustering for Unsupervised Domain Adaptive Object Re-Identification. 2624-2635 - Liyang Sun
, Yixiang Mao
, Tongyu Zong
, Yong Liu
, Yao Wang
:
Live 360 Degree Video Delivery Based on User Collaboration in a Streaming Flock. 2636-2647 - Han Fang
, Zhaoyang Jia, Hang Zhou
, Zehua Ma
, Weiming Zhang
:
Encoded Feature Enhancement in Watermarking Network for Distortion in Real Scenes. 2648-2660 - Wei Wang, Junyu Gao
, Xiaoshan Yang, Changsheng Xu
:
Many Hands Make Light Work: Transferring Knowledge From Auxiliary Tasks for Video-Text Retrieval. 2661-2674 - A. Sophia Koepke
, Andreea-Maria Oncescu
, João F. Henriques, Zeynep Akata
, Samuel Albanie
:
Audio Retrieval With Natural Language Queries: A Benchmark Study. 2675-2685 - En Yu
, Zhuoling Li
, Shoudong Han
, Hongwei Wang
:
RelationTrack: Relation-Aware Multiple Object Tracking With Decoupled Representation. 2686-2697 - Yi Dong
, Xinghao Jiang
, Zhaohong Li
, Tanfeng Sun
, Zhenzhen Zhang:
Multi-Channel HEVC Steganography by Minimizing IPM Steganographic Distortions. 2698-2709 - Liang Chen
, Jun Liu
, Weidong Chen
, Bo Du
:
A GLRT-Based Multi-Pixel Target Detector in Hyperspectral Imagery. 2710-2722 - Anyi Rao
, Linning Xu, Zhizhong Li
, Qingqiu Huang, Zhanghui Kuang, Wayne Zhang
, Dahua Lin:
A Coarse-to-Fine Framework for Automatic Video Unscreen. 2723-2733 - Shuo Liu
, Weize Quan
, Chaoqun Wang, Yuan Liu, Bin Liu
, Dong-Ming Yan
:
Dense Modality Interaction Network for Audio-Visual Event Localization. 2734-2748 - Shaokun Wang
, Tian Gan
, Yuan Liu
, Jianlong Wu
, Yuan Cheng, Liqiang Nie
:
Micro-Influencer Recommendation by Multi-Perspective Account Representation Learning. 2749-2760 - Desheng Cai, Shengsheng Qian
, Quan Fang, Jun Hu, Wenkui Ding
, Changsheng Xu
:
Heterogeneous Graph Contrastive Learning Network for Personalized Micro-Video Recommendation. 2761-2773 - Lingfeng Ma
, Hongtao Xie
, Chuanbin Liu
, Yongdong Zhang
:
Learning Cross-Channel Representations for Semantic Segmentation. 2774-2787 - Zhenxiao Luo
, Zelong Wang
, Miao Hu
, Yipeng Zhou
, Di Wu
:
LiveSR: Enabling Universal HD Live Video Streaming With Crowdsourced Online Learning. 2788-2798 - Qiyao Deng
, Qi Li
, Jie Cao
, Yunfan Liu
, Zhenan Sun
:
Semantic-Aware Noise Driven Portrait Synthesis and Manipulation. 2799-2811 - Yuanjie Dang
, Chong Huang, Peng Chen
, Ronghua Liang
, Xin Yang
, Kwang-Ting Cheng
:
Path-Analysis-Based Reinforcement Learning Algorithm for Imitation Filming. 2812-2824 - Feng Li, Yixuan Wu, Huihui Bai
, Weisi Lin
, Runmin Cong
, Yao Zhao
:
Learning Detail-Structure Alternative Optimization for Blind Super-Resolution. 2825-2838 - Zhangyu Chang
, S.-H. Gary Chan
:
Bi-Criteria Approximation for a Multi-Origin Multi-Channel Auto-Scaling Live Streaming Cloud. 2839-2850 - Yaxin Liu
, Jianlong Wu
, Leigang Qu, Tian Gan
, Jianhua Yin
, Liqiang Nie
:
Self-Supervised Correlation Learning for Cross-Modal Retrieval. 2851-2863 - Fan Chen
, Yaolin Yang, Hongjie He
, Yuan Yuan:
Adaptive Coding and Ordered-Index Extended Scrambling Based RDH in Encrypted Images. 2864-2875 - Ce Wang
, Dejia Xu
, Renjie Wan
, Bin He
, Boxin Shi
, Ling-Yu Duan
:
Background Scene Recovery From an Image Looking Through Colored Glass. 2876-2887 - Yongri Piao
, Wei Wu
, Miao Zhang
, Yongyao Jiang
, Huchuan Lu
:
Noise-Sensitive Adversarial Learning for Weakly Supervised Salient Object Detection. 2888-2897 - Laure Prétet
, Gaël Richard
, Clément Souchier, Geoffroy Peeters:
Video-to-Music Recommendation Using Temporal Alignment of Segments. 2898-2911 - Simeng Sun
, Tao Yu
, Jiahua Xu
, Wei Zhou
, Zhibo Chen
:
GraphIQA: Learning Distortion Graph Representations for Blind Image Quality Assessment. 2912-2925 - Cong Yu, Zhi Wu, Dongheng Zhang, Zhi Lu, Yang Hu, Yan Chen:
RFGAN: RF-Based Human Synthesis. 2926-2938 - Jie Li
, Cong Zhang
, Zhi Liu
, Richang Hong, Han Hu
:
Optimal Volumetric Video Streaming With Hybrid Saliency Based Tiling. 2939-2953 - Dechao Meng
, Liang Li
, Xuejing Liu
, Lin Gao
, Qingming Huang
:
Viewpoint Alignment and Discriminative Parts Enhancement in 3D Space for Vehicle ReID. 2954-2965 - Depeng Wang
, Zhenzhen Hu
, Yuanen Zhou
, Richang Hong
, Meng Wang
:
A Text-Guided Generation and Refinement Model for Image Captioning. 2966-2977 - Jie Huang
, Xueyang Fu
, Zeyu Xiao
, Feng Zhao
, Zhiwei Xiong
:
Low-Light Stereo Image Enhancement. 2978-2992 - Youguang Yu
, Wei Zhang
, Fuzheng Yang
, Ge Li
:
Rate-Distortion Optimized Geometry Compression for Spinning LiDAR Point Cloud. 2993-3005 - Kenan E. Ak
, Ying Sun
, Joo Hwee Lim
:
Learning by Imagination: A Joint Framework for Text-Based Image Manipulation and Change Captioning. 3006-3016 - Yuzhi Zhao
, Lai-Man Po
, Wing Yin Yu
, Yasar Abbas Ur Rehman
, Mengyang Liu
, Yujia Zhang
, Weifeng Ou
:
VCGAN: Video Colorization With Hybrid Generative Adversarial Network. 3017-3032 - Pengfei Zhu
, Xinjie Yao
, Yu Wang
, Meng Cao, Binyuan Hui
, Shuai Zhao, Qinghua Hu
:
Latent Heterogeneous Graph Network for Incomplete Multi-View Learning. 3033-3045 - Na Li
, Xinbo Zhao
:
A Strong and Robust Skeleton-Based Gait Recognition Method with Gait Periodicity Priors. 3046-3058 - Yuchen Zhang
, Wenrui Dai
, Yong Li, Chenglin Li
, Junhui Hou
, Junni Zou
, Hongkai Xiong
:
Light Field Compression With Graph Learning and Dictionary-Guided Sparse Coding. 3059-3072 - Cheng-Hao Wu
, Chih-Fan Hsu
, Tzu-Kuan Hung, Carsten Griwodz
, Wei Tsang Ooi, Cheng-Hsin Hsu
:
Quantitative Comparison of Point Cloud Compression Algorithms With PCC Arena. 3073-3088 - Cunyi Lin
, Xianwei Rong
, Xiaoyan Yu
:
MSAFF-Net: Multiscale Attention Feature Fusion Networks for Single Image Dehazing and Beyond. 3089-3100 - Xiaoming Zhao
, Xingming Wu
, Jinyu Miao
, Weihai Chen
, Peter C. Y. Chen
, Zhengguo Li
:
ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction. 3101-3112 - Yuxia Wu
, Lizi Liao
, Gangyi Zhang, Wenqiang Lei, Guoshuai Zhao
, Xueming Qian
, Tat-Seng Chua
:
State Graph Reasoning for Multimodal Conversational Recommendation. 3113-3124 - Xianxu Hou
, Xiaokang Zhang
, Hanbang Liang
, Linlin Shen
, Zhong Ming
:
Lifelong Age Transformation With a Deep Generative Prior. 3125-3139 - Yiming Li, Xiaoshan Yang, Xuhui Huang, Zhe Ma, Changsheng Xu
:
Zero-Shot Predicate Prediction for Scene Graph Parsing. 3140-3153 - Pengfei Wang
, Changxing Ding
, Zhiyin Shao, Zhibin Hong, Shengli Zhang
, Dacheng Tao:
Quality-Aware Part Models for Occluded Person Re-Identification. 3154-3165 - Shu-Yu Chen
, Yu-Kun Lai
, Shihong Xia
, Paul L. Rosin
, Lin Gao
:
3D Face Reconstruction and Gaze Tracking in the HMD for Virtual Interaction. 3166-3179 - Shaojie Li
, Mingbao Lin
, Yan Wang
, Fei Chao
, Ling Shao
, Rongrong Ji
:
Learning Efficient GANs for Image Translation via Differentiable Masks and Co-Attention Distillation. 3180-3189 - Yutong Gao
, Liqian Liang
, Congyan Lang
, Songhe Feng
, Yidong Li
, Yunchao Wei
:
Clicking Matters: Towards Interactive Human Parsing. 3190-3203 - Yangbo Feng, Junyu Gao
, Changsheng Xu
:
Learning Dual-Routing Capsule Graph Neural Network for Few-Shot Video Classification. 3204-3216 - Ni Zhang
, Nian Liu
, Junwei Han
, Kaiyuan Wan, Ling Shao
:
Face De-Occlusion With Deep Cascade Guidance Learning. 3217-3229 - Xiaoke Li
, Zufan Zhang
, Chenquan Gan
, Yong Xiang
:
Multi-Label Speech Emotion Recognition via Inter-Class Difference Loss Under Response Residual Network. 3230-3244 - Peter Szabó, Anderson Augusto Simiscuka
, Stefano Masneri
, Mikel Zorrilla
, Gabriel-Miro Muntean
:
A CNN-Based Framework for Enhancing 360° VR Experiences With Multisensorial Effects. 3245-3258 - Guangwei Gao
, Guoan Xu, Juncheng Li
, Yi Yu
, Huimin Lu
, Jian Yang
:
FBSNet: A Fast Bilateral Symmetrical Network for Real-Time Semantic Segmentation. 3273-3283 - Zeren Sun
, Yazhou Yao
, Xiu-Shen Wei, Fumin Shen
, Jian Zhang
, Xian-Sheng Hua
:
Boosting Robust Learning Via Leveraging Reusable Samples in Noisy Web Data. 3284-3295 - Nayu Liu
, Xian Sun
, Hongfeng Yu, Fanglong Yao
, Guangluan Xu
, Kun Fu
:
Abstractive Summarization for Video: A Revisit in Multistage Fusion Network With Forget Gate. 3296-3310 - Mehwish Ghafoor
, Arif Mahmood
:
Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework. 3311-3318 - Lei Zhang
, Yingjun Du
, Jiayi Shen, Xiantong Zhen
:
Learning to Learn With Variational Inference for Cross-Domain Image Classification. 3319-3328 - Jian Xiong
, Hao Gao
, Miaohui Wang
, Hongliang Li
, King Ngi Ngan
, Weisi Lin
:
Efficient Geometry Surface Coding in V-PCC. 3329-3342 - Yahui Liu
, Yajing Chen, Linchao Bao
, Nicu Sebe
, Bruno Lepri
, Marco De Nadai
:
ISF-GAN: An Implicit Style Function for High-Resolution Image-to-Image Translation. 3343-3353 - Mingjie Sun
, Jimin Xiao
, Eng Gee Lim
, Yao Zhao
:
Starting Point Selection and Multiple-Standard Matching for Video Object Segmentation With Language Annotation. 3354-3363 - Lei Jin
, Xiaojuan Wang, Xuecheng Nie
, Luoqi Liu, Yandong Guo, Jian Zhao
:
Grouping by Center: Predicting Centripetal Offsets for the Bottom-up Human Pose Estimation. 3364-3374 - Tong Zhu
, Leida Li
, Jufeng Yang
, Sicheng Zhao
, Hantao Liu
, Jiansheng Qian:
Multimodal Sentiment Analysis With Image-Text Interaction Network. 3375-3385 - Kaiwen Yang
, Xinmei Tian
:
Domain-Class Correlation Decomposition for Generalizable Person Re-Identification. 3386-3396 - Weilun Wang
, Wengang Zhou
, Jianmin Bao, Houqiang Li
:
Coherent Image Animation Using Spatial-Temporal Correspondence. 3397-3408 - Xianxu Hou
, Xiaokang Zhang
, Yudong Li
, Linlin Shen
:
TextFace: Text-to-Style Mapping Based Face Generation and Manipulation. 3409-3419 - Qing Li
, Changqing Zhang
, Qinghua Hu
, Huazhu Fu
, Pengfei Zhu
:
Confidence-Aware Fusion Using Dempster-Shafer Theory for Multispectral Pedestrian Detection. 3420-3431 - Zhuangzi Li
, Ge Li
, Thomas H. Li, Shan Liu
, Wei Gao
:
Semantic Point Cloud Upsampling. 3432-3442 - Qi Liang
, Qiang Li, Weizhi Nie
, An-An Liu
:
Unsupervised Cross-Media Graph Convolutional Network for 2D Image-Based 3D Model Retrieval. 3443-3455 - Yunhao Zhou
, Yi Wang
, Lap-Pui Chau
:
Moving Towards Centers: Re-Ranking With Attention and Memory for Re-Identification. 3456-3468 - Lei Zhang
, Hua Huang
:
Image Stitching With Manifold Optimization. 3469-3482 - Wujie Zhou
, Enquan Yang, Jingsheng Lei, Jian Wan, Lu Yu:
PGDENet: Progressive Guided Fusion and Depth Enhancement Network for RGB-D Indoor Scene Parsing. 3483-3494 - Yi-Jen Shih
, Shih-Lun Wu
, Frank Zalkow
, Meinard Müller
, Yi-Hsuan Yang
:
Theme Transformer: Symbolic Music Generation With Theme-Conditioned Transformer. 3495-3508 - Jiayi Ma
, Yang Wang, Aoxiang Fan
, Guobao Xiao
, Riqing Chen
:
Correspondence Attention Transformer: A Context-Sensitive Network for Two-View Correspondence Learning. 3509-3524 - Fatemeh Nikoonezhad, Mohammed Ghanbari
:
PRAM: Penalized Resource Allocation Method for Video Services. 3525-3533 - Di Hu
, Zheng Wang
, Feiping Nie
, Rong Wang
, Xuelong Li
:
Self-Supervised Learning for Heterogeneous Audiovisual Scene Analysis. 3534-3545 - Songsong Wu
, Hao Tang
, Xiao-Yuan Jing, Haifeng Zhao
, Jianjun Qian
, Nicu Sebe
, Yan Yan:
Cross-View Panorama Image Synthesis. 3546-3559 - Shihao Zou
, Xinxin Zuo
, Sen Wang
, Yiming Qian, Chuan Guo
, Li Cheng
:
Human Pose and Shape Estimation From Single Polarization Images. 3560-3572 - Long Ma
, Risheng Liu
, Yiyang Wang
, Xin Fan
, Zhongxuan Luo:
Low-Light Image Enhancement via Self-Reinforced Retinex Projection Model. 3573-3586 - Chunhui Bao
, Qianru Sun
:
Generating Music With Emotions. 3602-3614 - Yunqing Li
, Jun Du
, Jianshu Zhang, Changjie Wu:
A Tree-Structure Analysis Network on Handwritten Chinese Character Error Correction. 3615-3627 - Zichen Zhao
, Hai-Miao Hu
, Hongda Zhang
, Fei Chen, Qiang Guo
:
Improving Color Constancy Using Chromaticity-Line Prior. 3642-3656 - Chang Liu, Xudong Jiang
, Henghui Ding
:
Instance-Specific Feature Propagation for Referring Segmentation. 3657-3667 - Liang Han
, Zhaozheng Yin
:
Global Memory and Local Continuity for Video Object Detection. 3681-3693 - Md Mofijul Islam
, Mohammad Samin Yasar
, Tariq Iqbal
:
MAVEN: A Memory Augmented Recurrent Approach for Multimodal Fusion. 3694-3708 - Ercheng Pei
, Yong Zhao
, Meshia Cédric Oveneke
, Dongmei Jiang
, Hichem Sahli
:
A Bayesian Filtering Framework for Continuous Affect Recognition From Facial Images. 3709-3722 - Yiwei Ma
, Jiayi Ji
, Xiaoshuai Sun
, Yiyi Zhou
, Yongjian Wu, Feiyue Huang, Rongrong Ji
:
Knowing What it is: Semantic-Enhanced Dual Attention Transformer. 3723-3736 - Yuzhi Zhao
, Lai-Man Po
, Xuehui Wang
, Qiong Yan, Wei Shen
, Yujia Zhang
, Wei Liu, Chun Kit Wong, Chiu-Sing Pang, Weifeng Ou
, Wing Yin Yu
, Buhua Liu:
ChildPredictor: A Child Face Prediction Framework With Disentangled Learning. 3737-3752 - Tongtong Feng
, Qi Qi
, Jingyu Wang
, Jianxin Liao
, Jiangchuan Liu
:
Timely and Accurate Bitrate Switching in HTTP Adaptive Streaming With Date-Driven I-Frame Prediction. 3753-3762 - Tianyi Zhang
, Abdallah El Ali
, Alan Hanjalic
, Pablo César
:
Few-Shot Learning for Fine-Grained Emotion Recognition Using Physiological Signals. 3773-3787 - Hengyue Bi
, Canhui Xu
, Cao Shi
, Guozhu Liu
, Yuteng Li
, Honghong Zhang
, Jing Qu
:
SRRV: A Novel Document Object Detector Based on Spatial-Related Relation and Vision. 3788-3798 - Minggang Gan, Yan Zhang
:
Temporal Attention-Pyramid Pooling for Temporal Action Detection. 3799-3810 - Xin Liu
, Jinhan Yi, Yiu-ming Cheung
, Xing Xu
, Zhen Cui
:
OMGH: Online Manifold-Guided Hashing for Flexible Cross-Modal Retrieval. 3811-3824 - Wujiang Xu
, Yifei Xu
, Genan Sang, Li Li, Aichen Wang
, Pingping Wei, Li Zhu
:
Recursive Multi-Relational Graph Convolutional Network for Automatic Photo Selection. 3825-3840 - Guanglei Yang
, Enrico Fini
, Dan Xu
, Paolo Rota
, Mingli Ding
, Hao Tang
, Xavier Alameda-Pineda
, Elisa Ricci
:
Continual Attentive Fusion for Incremental Learning in Semantic Segmentation. 3841-3854 - Li Li
, Zhu Li
, Shan Liu
, Houqiang Li
:
Frame-Level Rate Control for Geometry-Based LiDAR Point Cloud Compression. 3855-3867 - Hongrun Zhang
, Yanda Meng
, Yitian Zhao
, Xuesheng Qian
, Yihong Qiao
, Xiaoyun Yang, Yalin Zheng
:
3D Human Pose and Shape Reconstruction From Videos via Confidence-Aware Temporal Feature Aggregation. 3868-3880 - Weiming Yang
, Xianke Wang
, Bowen Tian
, Wei Xu
, Wenqing Cheng:
A Multi-Stage Automatic Evaluation System for Sight-Singing. 3881-3893 - Kehua Guo
, Changchun Shen
, Bin Hu
, Min Hu
, Xiaoyan Kui
:
RSNet: Relation Separation Network for Few-Shot Similar Class Recognition. 3894-3904 - Zhong Wang
, Lin Zhang
, Ying Shen
, Yicong Zhou
:
D-LIOM: Tightly-Coupled Direct LiDAR-Inertial Odometry and Mapping. 3905-3920 - Yunxiao Wang
, Meng Liu
, Yinwei Wei
, Zhiyong Cheng
, Yinglong Wang, Liqiang Nie
:
Siamese Alignment Network for Weakly Supervised Video Moment Retrieval. 3921-3933 - Tuxin Guan
, Chaofeng Li
, Ke Gu
, Hantao Liu
, Yuhui Zheng
, Xiaojun Wu
:
Visibility and Distortion Measurement for No-Reference Dehazed Image Quality Assessment via Complex Contourlet Transform. 3934-3949 - Tianwen Qian
, Jingjing Chen
, Shaoxiang Chen, Bo Wu
, Yu-Gang Jiang
:
Scene Graph Refinement Network for Visual Question Answering. 3950-3961 - Kejun Wu
, You Yang
, Qiong Liu
, Xiao-Ping Zhang
:
Focal Stack Image Compression Based on Basis-Quadtree Representation. 3975-3988 - Changwei Wang
, Rongtao Xu
, Shibiao Xu
, Weiliang Meng
, Xiaopeng Zhang
:
CNDesc: Cross Normalization for Local Descriptors Learning. 3989-4001 - Yao Xue
, Yu Cao
, Xubin Feng
, Meilin Xie, Ke Li
, Xingjun Zhang
, Xueming Qian
:
Towards Handling Sudden Changes in Feature Maps During Depth Estimation. 4002-4012 - Xiang Deng, Songhe Feng
, Gengyu Lyu, Tao Wang
, Congyan Lang
:
Beyond Word Embeddings: Heterogeneous Prior Knowledge Driven Multi-Label Image Classification. 4013-4025 - Chuntao Wang
, Tianjian Zhang, Hao Chen, Qiong Huang
, Jiangqun Ni
, Xinpeng Zhang
:
A Novel Encryption-Then-Lossy-Compression Scheme of Color Images Using Customized Residual Dense Spatial Network. 4026-4040 - Hengmin Zhang
, Feng Qian
, Bob Zhang
, Wenli Du
, Jianjun Qian
, Jian Yang
:
Incorporating Linear Regression Problems Into an Adaptive Framework With Feasible Optimizations. 4041-4051 - Jiafeng Li
, Yaopeng Li
, Li Zhuo
, Lingyan Kuang, Tianjian Yu:
USID-Net: Unsupervised Single Image Dehazing Network via Disentangled Representations. 3587-3601 - Bairong Li
, Biao Guo
, Yuesheng Zhu
, Jianfeng Yin, Xiangli Ji:
Superframe-Based Temporal Proposals for Weakly Supervised Temporal Action Detection. 3628-3641 - Jiaqi Zhao
, Hanzheng Wang
, Yong Zhou
, Rui Yao
, Silin Chen
, Abdulmotaleb El-Saddik
:
Spatial-Channel Enhanced Transformer for Visible-Infrared Person Re-Identification. 3668-3680 - Jiayi Ji
, Xiaoyang Huang
, Xiaoshuai Sun
, Yiyi Zhou
, Gen Luo
, Liujuan Cao
, Jianzhuang Liu
, Ling Shao
, Rongrong Ji
:
Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning. 3962-3974 - Chengpei Xu
, Wenjing Jia
, Tingcheng Cui, Ruomei Wang
, Yuan-fang Zhang
, Xiangjian He
:
Arbitrary-Shape Scene Text Detection via Visual-Relational Rectification and Contour Approximation. 4052-4066 - Wenlong Cheng
, Wei Tang, Yan Huang
, Yiwen Luo, Liang Wang
:
A Reconstruction-Based Visual-Acoustic-Semantic Embedding Method for Speech-Image Retrieval. 4067-4080 - Ming Li, Bin Fu
, Han Chen, Junjun He
, Yu Qiao
:
Dual Relation Network for Scene Text Recognition. 4094-4107 - Xin Deng
, Hao Wang, Mai Xu
, Li Li
, Zulin Wang:
Omnidirectional Image Super-Resolution via Latitude Adaptive Network. 4108-4120 - Sijie Mai
, Ying Zeng, Haifeng Hu
:
Multimodal Information Bottleneck: Learning Minimal Sufficient Unimodal and Multimodal Representations. 4121-4134 - Xiang Wen
, Shiwei Zhao
, Haobo Wang
, Runze Wu
, Manhu Qu, Tianlei Hu, Gang Chen, Jianrong Tao
, Changjie Fan:
Multi-Source Multi-Label Learning for User Profiling in Online Games. 4135-4147 - Yang Yang
, Hao Zheng
, Lanling Zeng, Xiangjun Shen
, Yongzhao Zhan
:
$L_{1}$-Regularized Reconstruction Model for Edge-Preserving Filtering. 4148-4162 - Zhengzheng Tu
, Yan Ma, Zhun Li
, Chenglong Li
, Jieming Xu, Yongtao Liu:
RGBT Salient Object Detection: A Large-Scale Dataset and Benchmark. 4163-4176 - Yu Zhou
, Weikang Gong, Yanjing Sun
, Leida Li
, Jinjian Wu
, Xinbo Gao
:
Pyramid Feature Aggregation for Hierarchical Quality Prediction of Stitched Panoramic Images. 4177-4186 - Lingxiang Yao
, Worapan Kusakunniran
, Peng Zhang, Qiang Wu
, Jian Zhang
:
Improving Disentangled Representation Learning for Gait Recognition Using Group Supervision. 4187-4198 - Chengpei Xu
, Wenjing Jia
, Ruomei Wang
, Xiaonan Luo
, Xiangjian He
:
MorphText: Deep Morphology Regularized Accurate Arbitrary-Shape Scene Text Detection. 4199-4212 - Si Liu
, Renda Bao, Defa Zhu, Shaofei Huang
, Qiong Yan, Liang Lin
, Chao Dong
:
Fine-Grained Face Editing via Personalized Spatial-Aware Affine Modulation. 4213-4224 - Haodan Zhang
, Yixuan Ban
, Zongming Guo
, Ken Chen, Xinggong Zhang
:
RAM360: Robust Adaptive Multi-Layer 360$^\circ$ Video Streaming With Lyapunov Optimization. 4225-4239 - Hongyi Sun
, Wanhua Li
, Yueqi Duan
, Jie Zhou
, Jiwen Lu
:
Learning Adaptive Patch Generators for Mask-Robust Image Inpainting. 4240-4252 - Huiyu Duan
, Wei Shen
, Xiongkuo Min
, Yuan Tian
, Jae-Hyun Jung
, Xiaokang Yang
, Guangtao Zhai
:
Develop Then Rival: A Human Vision-Inspired Framework for Superimposed Image Decomposition. 4267-4281 - Bosheng Qin
, Haoji Hu
, Yueting Zhuang:
Deep Residual Weight-Sharing Attention Network With Low-Rank Attention for Visual Question Answering. 4282-4295 - Shihui Zhang
, Dongxu Zuo
, Yongliang Yang
, Xiaowei Zhang
:
A Transferable Adversarial Belief Attack With Salient Region Perturbation Restriction. 4296-4306 - Dong Wei
, Xiaobo Shen
, Quansen Sun
, Xizhan Gao
, Zhenwen Ren
:
Sparse Representation Classifier Guided Grassmann Reconstruction Metric Learning With Applications to Image Set Analysis. 4307-4322 - Tongzhen Si
, Fazhi He
, Zhong Zhang
, Yansong Duan
:
Hybrid Contrastive Learning for Unsupervised Person Re-Identification. 4323-4334 - Xiao Wang
, Xiujun Shu
, Shiliang Zhang
, Bo Jiang
, Yaowei Wang
, Yonghong Tian
, Feng Wu:
MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking. 4335-4348 - Zhengyan Chen
, Hong Liu
, Linlin Zhang, Xin Liao
:
Multi-Dimensional Attention With Similarity Constraint for Weakly-Supervised Temporal Action Localization. 4349-4360 - Jiacheng Chen
, Bin-Bin Gao
, Zongqing Lu, Jing-Hao Xue
, Chengjie Wang
, Qingmin Liao
:
APANet: Adaptive Prototypes Alignment Network for Few-Shot Semantic Segmentation. 4361-4373 - Bowen Ma
, Tong Jia, Min Su, Xiaodong Jia, Dongyue Chen, Yichun Zhang:
Automated Segmentation of Prohibited Items in X-Ray Baggage Images Using Dense De-Overlap Attention Snake. 4374-4386 - Deyang Liu
, Yan Huang
, Yuming Fang
, Yifan Zuo
, Ping An
:
Multi-Stream Dense View Reconstruction Network for Light Field Image Compression. 4400-4414 - Liang Xu
, Cuiling Lan
, Wenjun Zeng
, Cewu Lu
:
Skeleton-Based Mutually Assisted Interacted Object Localization and Human Action Recognition. 4415-4425 - Chaoqin Huang
, Qinwei Xu
, Yanfeng Wang, Yu Wang
, Ya Zhang
:
Self-Supervised Masking for Unsupervised Anomaly Detection and Localization. 4426-4438 - Xiaozhou Lei
, Zixiang Fei
, Wenju Zhou
, Huiyu Zhou
, Minrui Fei
:
Low-Light Image Enhancement Using the Cell Vibration Model. 4439-4454 - Kaijun Liu
, Shujing Lyu
, Yue Lu
:
Few-Shot Segmentation for Prohibited Items Inspection With Patch-Based Self-Supervised Learning and Prototype Reverse Validation. 4455-4463 - Aite Zhao
, Yue Wang, Jianbo Li:
Transferable Self-Supervised Instance Learning for Sleep Recognition. 4464-4477 - Souradeep Chakraborty
, Zijun Wei
, Conor Kelton, Seoyoung Ahn
, Aruna Balasubramanian, Gregory J. Zelinsky, Dimitris Samaras
:
Predicting Visual Attention in Graphic Design Documents. 4478-4493 - Sheng Liu
, Annan Li
, Jiahao Wang
, Yunhong Wang
:
Bidirectional Maximum Entropy Training With Word Co-Occurrence for Video Captioning. 4494-4507 - Sanchita Ghose
, John J. Prevost
:
FoleyGAN: Visually Guided Generative Adversarial Network-Based Synchronous Sound Generation in Silent Videos. 4508-4519 - Wentao Tan
, Lei Zhu
, Jingjing Li
, Huaxiang Zhang
, Junwei Han
:
Teacher-Student Learning: Efficient Hierarchical Message Aggregation Hashing for Cross-Modal Retrieval. 4520-4532 - Qi Liu
, Honglei Su
, Tianxin Chen
, Hui Yuan
, Raouf Hamzaoui
:
No-Reference Bitstream-Layer Model for Perceptual Quality Assessment of V-PCC Encoded Point Clouds. 4533-4546 - Jin Li
, Wanyun Li
, Zichen Xu
, Yuhao Wang
, Qiegen Liu
:
Wavelet Transform-Assisted Adaptive Generative Modeling for Colorization. 4547-4562 - Guangzhi Wang
, Yangyang Guo
, Ziwei Xu
, Yongkang Wong
, Mohan S. Kankanhalli
:
Semantic-Aware Triplet Loss for Image Classification. 4563-4572 - Sangwook Park, David K. Han
, Mounya Elhilali
:
Cross-Referencing Self-Training Network for Sound Event Detection in Audio Mixtures. 4573-4585 - Yusheng Tao
, Jian Zhang
, Jiajing Hong, Yuesheng Zhu
:
DREAMT: Diversity Enlarged Mutual Teaching for Unsupervised Domain Adaptive Person Re-Identification. 4586-4597 - Jingtao Xu
, Yali Li
, Shengjin Wang
:
AdaZoom: Towards Scale-Aware Large Scene Object Detection. 4598-4609 - Xuesong Wang
, Ke Jin
, Yi Kong
, C. L. Philip Chen
, Yuhu Cheng
:
Discriminator-Quality Evaluation GAN. 4081-4093 - Xiaolong Cheng
, Xuan Zheng
, Jialun Pei
, He Tang
, Zehua Lyu, Chuanbo Chen
:
Depth-Induced Gap-Reducing Network for RGB-D Salient Object Detection: An Interaction, Guidance and Refinement Approach. 4253-4266 - Xuena Ren
, Dongming Zhang
, Xiuguo Bao, Yongdong Zhang
:
S$^{2}$-Net:Semantic and Saliency Attention Network for Person Re-Identification. 4387-4399 - Pingyu Wang
, Zhicheng Zhao
, Fei Su
, Hongying Meng
:
LTReID: Factorizable Feature Generation With Independent Components for Long-Tailed Person Re-Identification. 4610-4622 - Wenbin Zou
, Liang Chen
, Yi Wu
, Yunchen Zhang, Yuxiang Xu, Jun Shao:
Joint Wavelet Sub-Bands Guided Network for Single Image Super-Resolution. 4623-4637 - Yihao Liu
, Jingwen He
, Xiangyu Chen
, Zhengwen Zhang, Hengyuan Zhao, Chao Dong
, Yu Qiao
:
Very Lightweight Photo Retouching Network With Conditional Sequential Modulation. 4638-4652 - Mingrui Zhang
, Mading Li, Jiahao Yu, Li Chen
:
Aesthetic Photo Collage With Deep Reinforcement Learning. 4653-4664 - Guanchen Ding
, Daiqin Yang
, Tao Wang, Sihan Wang, Yunfei Zhang
:
Crowd Counting via Unsupervised Cross-Domain Feature Adaptation. 4665-4678 - Qingping Sun
, Yi Xiao
, Jie Zhang, Shizhe Zhou
, Chi-Sing Leung
, Xin Su:
A Local Correspondence-Aware Hybrid CNN-GCN Model for Single-Image Human Body Reconstruction. 4679-4690 - Chuanyi Zhang
, Guosheng Lin
, Qiong Wang
, Fumin Shen
, Yazhou Yao
, Zhenmin Tang:
Guided by Meta-Set: A Data-Driven Method for Fine-Grained Visual Recognition. 4691-4703 - Yunan Li
, Huizhou Chen, Qiguang Miao
, Daohui Ge, Siyu Liang
, Zhuoqi Ma
, Bocheng Zhao:
Image Hazing and Dehazing: From the Viewpoint of Two-Way Image Translation With a Weakly Supervised Framework. 4704-4717 - Qitong Wang
, Bin Fu
, Ming Li, Junjun He
, Xi Peng
, Yu Qiao
:
Region-Aware Arbitrary-Shaped Text Detection With Progressive Fusion. 4718-4729 - Zhi Wu
, Dongheng Zhang
, Chunyang Xie
, Cong Yu
, Jinbo Chen
, Yang Hu
, Yan Chen
:
RFMask: A Simple Baseline for Human Silhouette Segmentation With Radio Signals. 4730-4741 - Hai Wang
, Wenming Yang
, Qingmin Liao
, Jie Zhou
:
Bi-RSTU: Bidirectional Recurrent Upsampling Network for Space-Time Video Super-Resolution. 4742-4751 - Hao Li
, Jinghui Qin
, Zhijing Yang
, Pengxu Wei
, Jinshan Pan
, Liang Lin
, Yukai Shi
:
Real-World Image Super-Resolution by Exclusionary Dual-Learning. 4752-4763 - Li Zhang, Tong Qiao
, Ming Xu
, Ning Zheng
, Shichuang Xie
:
Unsupervised Learning-Based Framework for Deepfake Video Detection. 4785-4799 - Xulun Ye
, Jieyu Zhao
:
Graph Convolutional Network With Unknown Class Number. 4800-4813 - Xiangyu Hu, Liquan Shen
, Mingxing Jiang
, Ran Ma
, Ping An
:
LA-HDR: Light Adaptive HDR Reconstruction Framework for Single LDR Image Considering Varied Light Conditions. 4814-4829 - Zhuo Chen, Fei Yin
, Qing Yang
, Cheng-Lin Liu
:
Cross-Lingual Text Image Recognition via Multi-Hierarchy Cross-Modal Mimic. 4830-4841 - Liping Nong
, Jie Peng
, Wenhui Zhang, Jiming Lin, Hongbing Qiu
, Junyi Wang
:
Adaptive Multi-Hypergraph Convolutional Networks for 3D Object Classification. 4842-4855 - Lei Qi
, Lei Wang
, Yinghuan Shi
, Xin Geng
:
A Novel Mix-Normalization Method for Generalizable Multi-Source Person Re-Identification. 4856-4867 - Linhui Dai
, Xiang Song, Xiaohong Liu
, Chengqi Li, Zhihao Shi, Jun Chen
, Martin Brooks:
Enabling Trimap-Free Image Matting With a Frequency-Guided Saliency-Aware Network via Joint Learning. 4868-4879 - Junda Cheng, Xin Yang
, Yuechuan Pu
, Peng Guo
:
Region Separable Stereo Matching. 4880-4893 - Jiehang Xie
, Xuanbai Chen, Tianyi Zhang
, Yixuan Zhang, Shao-Ping Lu
, Pablo César
, Yulu Yang:
Multimodal-Based and Aesthetic-Guided Narrative Video Summarization. 4894-4908 - Di Wang
, Shuai Liu
, Quan Wang
, Yumin Tian
, Lihuo He
, Xinbo Gao
:
Cross-Modal Enhancement Network for Multimodal Sentiment Analysis. 4909-4921 - Wenfeng Pang
, Wei Xie
, Qianhua He
, Yanxiong Li
, Jichen Yang
:
Audiovisual Dependency Attention for Violence Detection in Videos. 4922-4932 - Chuangchuang Tan
, Guanghua Gu
, Tao Ruan
, Shikui Wei
, Yao Zhao
:
Dual-Gradients Localization Framework With Skip-Layer Connections for Weakly Supervised Object Localization. 4933-4942 - Kangjian He
, Xuejie Zhang
, Dan Xu
, Jian Gong
, Lisiqi Xie
:
Fidelity-driven Optimization Reconstruction and Details Preserving Guided Fusion for Multi-Modality Medical Image. 4943-4957 - Hamed RahmaniKhezri
, Suhong Kim, Mohamed Hefeeda
:
Unsupervised Single-Image Reflection Removal. 4958-4971 - Lele Fu
, Zhaoliang Chen
, Yongyong Chen
, Shiping Wang
:
Unified Low-Rank Tensor Learning and Spectral Embedding for Multi-View Subspace Clustering. 4972-4985 - Dongliang Zhou
, Haijun Zhang
, Qun Li
, Jianghong Ma
, Xiaofei Xu:
COutfitGAN: Learning to Synthesize Compatible Outfits Supervised by Silhouette Masks and Fashion Styles. 4986-5001 - Jingjing Jiang
, Ziyi Liu
, Nanning Zheng
:
LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering. 5002-5013 - Yuchen Su
, Zhiwen Shao
, Yong Zhou
, Fanrong Meng, Hancheng Zhu
, Bing Liu
, Rui Yao
:
TextDCT: Arbitrary-Shaped Text Detection via Discrete Cosine Transform Mask. 5030-5042 - Tiesong Zhao
, Ying Fang, Kai Wang, Qian Liu
, Yuzhen Niu
:
High Efficiency Vibrotactile Codec Based on Gate Recurrent Network. 5043-5052 - Zhi Lu
, Yang Hu
, Cong Yu
, Yunchao Jiang, Yan Chen
, Bing Zeng
:
Personalized Fashion Recommendation With Discrete Content-Based Tensor Factorization. 5053-5064 - An-An Liu
, Heyu Zhou
, Xuanya Li
, Lanjun Wang
:
Vulnerability of Feature Extractors in 2D Image-Based 3D Object Retrieval. 5065-5076 - Sanaz Nami
, Farhad Pakdaman
, Mahmoud Reza Hashemi
, Shervin Shirmohammadi
:
BL-JUNIPER: A CNN-Assisted Framework for Perceptual Video Coding Leveraging Block-Level JND. 5077-5092 - Pengfei Guo
, Hantao Liu
, Delu Zeng
, Tao Xiang
, Leida Li
, Ke Gu
:
An Underwater Image Quality Assessment Metric. 5093-5106 - Zhulin Tao
, Xiaohao Liu
, Yewei Xia, Xiang Wang
, Lifang Yang
, Xianglin Huang
, Tat-Seng Chua:
Self-Supervised Learning for Multimedia Recommendation. 5107-5116 - Devanshu Anand
, Mohammed Amine Togou
, Gabriel-Miro Muntean
:
A Machine Learning Solution for Video Delivery to Mitigate Co-Tier Interference in 5G HetNets. 5117-5129 - Weide Liu
, Chi Zhang, Henghui Ding
, Tzu-Yi Hung, Guosheng Lin
:
Few-Shot Segmentation With Optimal Transport Matching and Message Flow. 5130-5141 - Miao Zhang
, Shunyu Yao
, Beiqi Hu, Yongri Piao
, Wei Ji
:
C$^{2}$DFNet: Criss-Cross Dynamic Filter Network for RGB-D Salient Object Detection. 5142-5154 - Wei Zhai
, Yang Cao
, Haiyong Xie, Zheng-Jun Zha
:
Deep Texton-Coherence Network for Camouflaged Object Detection. 5155-5165 - Jiande Sun
, Fanfu Xue
, Jing Li, Lei Zhu
, Huaxiang Zhang
, Jia Zhang
:
TSINIT: A Two-Stage Inpainting Network for Incomplete Text. 5166-5177 - Haidong Qin
, Jing Li
, Yuqi Jiang
, Yanran Dai, Shikuan Hong
, Feng Zhou
, Zhijun Wang
, Tao Yang
:
Bullet-Time Video Synthesis Based on Virtual Dynamic Target Axis. 5178-5191 - Jiaqi Zhou
, Zehua Fu, Qiuyu Huang, Qingjie Liu
, Yunhong Wang
:
LgNet: A Local-Global Network for Action Recognition and Beyond. 5192-5205 - Xiaodi Guan
, Fan Li
, Yangfan Zhang, Pamela C. Cosman
:
End-to-End Blind Video Quality Assessment Based on Visual and Memory Attention Modeling. 5206-5221 - Yiqing Cai
, Zhenwei Ma, Changhong Lu, Changbo Wang
, Gaoqi He
:
Global Representation Guided Adaptive Fusion Network for Stable Video Crowd Counting. 5222-5233 - Junna Gao
, Dehui Kong
, Shaofan Wang
, Jinghua Li, Baocai Yin
:
DASI: Learning Domain Adaptive Shape Impression for 3D Object Reconstruction. 5248-5262 - Jingwen Hou
, Weisi Lin
, Guanghui Yue
, Weide Liu
, Baoquan Zhao
:
Interaction-Matrix Based Personalized Image Aesthetics Assessment. 5263-5278 - Nam Joon Kim, Hyun Kim
:
FP-AGL: Filter Pruning With Adaptive Gradient Learning for Accelerating Deep Convolutional Neural Networks. 5279-5290 - Hanqi Zhu
, Jiajun Deng
, Yu Zhang
, Jianmin Ji
, Qiuyu Mao, Houqiang Li
, Yanyong Zhang
:
VPFNet: Improving 3D Object Detection With Virtual Point Based LiDAR and Stereo Data Fusion. 5291-5304 - Lingyun Song
, Xuequn Shang
, Chen Yang
, Mingxuan Sun:
Attribute-Guided Multiple Instance Hashing Network for Cross-Modal Zero-Shot Hashing. 5305-5318 - Xianjing Han
, Xuemeng Song
, Xingning Dong, Yinwei Wei
, Meng Liu
, Liqiang Nie
:
DBiased-P: Dual-Biased Predicate Predictor for Unbiased Scene Graph Generation. 5319-5329 - Tianpeng Liu
, Jing Li
, Jia Wu
, Jun Chang
, Beihang Song, Bowen Yao:
Tracking With Mutual Attention Network. 5330-5343 - Yuhang Liu
, Wei Wei
, Daowan Peng, Xian-Ling Mao
, Zhiyong He, Pan Zhou
:
Depth-Aware and Semantic Guided Relational Attention Network for Visual Question Answering. 5344-5357 - Jianzhao Liu, Wei Zhou
, Xin Li
, Jiahua Xu
, Zhibo Chen
:
LIQA: Lifelong Blind Image Quality Assessment. 5358-5373 - Zhi Chen
, Yadan Luo
, Sen Wang
, Jingjing Li
, Zi Huang
:
GSMFlow: Generation Shifts Mitigating Flow for Generalized Zero-Shot Learning. 5374-5385 - Qi Zhang
, Jianchao Wei
, Shanshe Wang
, Siwei Ma
, Wen Gao:
RealVR: Efficient, Economical, and Quality-of- Experience-Driven VR Video System Based on MPEG OMAF. 5386-5399 - Lanxiao Wang
, Hongliang Li
, Wenzhe Hu
, Xiaoliang Zhang
, Heqian Qiu
, Fanman Meng
, Qingbo Wu
:
What Happens in Crowd Scenes: A New Dataset About Crowd Scenes for Image Captioning. 5400-5412 - Wei Tang
, Fazhi He
, Yu Liu
:
YDTR: Infrared and Visible Image Fusion via Y-Shape Dynamic Transformer. 5413-5428 - Xianye Ben
, Chen Gong
, Tianhuan Huang
, Chuanye Li, Rui Yan, Yujun Li
:
Tackling Micro-Expression Data Shortage via Dataset Alignment and Active Learning. 5429-5443 - Ze Zhou
, Quansen Sun
, Hongjun Li
, Chaobo Li
, Zhenwen Ren
:
Regression-Selective Feature-Adaptive Tracker for Visual Object Tracking. 5444-5457 - Naishan Zheng
, Jie Huang
, Feng Zhao
, Xueyang Fu
, Feng Wu:
Unsupervised Underexposed Image Enhancement via Self-Illuminated and Perceptual Guidance. 5469-5484 - Xiaochuang Shu, Xiangdong Zhang
, Quanxue Gao
, Ming Yang
, Rong Wang
, Xinbo Gao:
Self-Weighted Anchor Graph Learning for Multi-View Clustering. 5485-5499 - Maregu Assefa
, Wei Jiang
, Kumie Gedamu
, Getinet Yilma
, Bulbula Kumeda
, Melese Ayalew
:
Self-Supervised Scene-Debiasing for Video Representation Learning via Background Patching. 5500-5515 - Zhi Lu
, Yang Hu
, Cong Yu
, Yan Chen
, Bing Zeng
:
Learning Fashion Compatibility With Context Conditioning Embedding. 5516-5526 - Xin Wei
, Yuyuan Yao, Haoyu Wang
, Liang Zhou
:
Perception-Aware Cross-Modal Signal Reconstruction: From Audio-Haptic to Visual. 5527-5538 - Chengliang Liu
, Zhihao Wu
, Jie Wen
, Yong Xu
, Chao Huang
:
Localized Sparse Incomplete Multi-View Clustering. 5539-5551 - Liming Zou
, Jing Li
, Wenbo Wan
, Q. M. Jonathan Wu
, Jiande Sun
:
Robust Coverless Image Steganography Based on Neglected Coverless Image Dataset Construction. 5552-5564 - Xixi Nie, Bo Hu
, Xinbo Gao
:
MLNet: A Multi-Domain Lightweight Network for Multi-Focus Image Fusion. 5565-5579 - Qianting Ma
, Yang Wang
, Tieyong Zeng
:
Retinex-Based Variational Framework for Low-Light Image Enhancement and Denoising. 5580-5588 - Huanjing Yue
, Yijia Cheng, Yan Mao, Cong Cao, Jing-Yu Yang
:
Recaptured Screen Image Demoiréing in Raw Domain. 5589-5600 - Jie Nie
, Chenglong Wang
, Shusong Yu
, Jinjin Shi, Xiaowei Lv, Zhiqiang Wei
:
MIGN: Multiscale Image Generation Network for Remote Sensing Image Semantic Segmentation. 5601-5613 - Lei Li
, Kai Fan
, Chun Yuan
:
StrokeNet: Stroke Assisted and Hierarchical Graph Reasoning Networks. 5614-5625 - Rui Li
, Danna Xue, Yu Zhu
, Hao Wu, Jinqiu Sun
, Yanning Zhang
:
Self-Supervised Monocular Depth Estimation With Frequency-Based Recurrent Refinement. 5626-5637 - Xian-Feng Han
, Yi-Fei Jin, Hui-Xian Cheng, Guoqiang Xiao
:
Dual Transformer for Point Cloud Analysis. 5638-5648 - Jiaheng Liu
, Jinyang Guo
, Dong Xu
:
GeometryMotion-Transformer: An End-to-End Framework for 3D Action Recognition. 5649-5661 - Weihe Li
, Jiawei Huang
, Wenjun Lyu
, Baoshen Guo
, Wanchun Jiang
, Jianxin Wang
:
RAV: Learning-Based Adaptive Streaming to Coordinate the Audio and Video Bitrate Selections. 5662-5675 - Kuiyuan Zhang
, Zhongyun Hua
, Yuanman Li
, Yongyong Chen
, Yicong Zhou
:
AMS-Net: Adaptive Multi-Scale Network for Image Compressive Sensing. 5676-5689 - Kangle Wu
, Jun Chen
, Jiayi Ma
:
DMEF: Multi-Exposure Image Fusion Based on a Novel Deep Decomposition Method. 5690-5703 - Huaian Chen
, Jianfeng Wang
, Minghui Duan
, Yi Jin
, Yan Kan
, Changan Zhu
:
Video Denoising for Scenes With Challenging Motion: A Comprehensive Analysis and a New Framework. 5704-5719 - Xiaofeng Ding
, Tieyong Zeng
, Jian Tang
, Zhengping Che
, Yaxin Peng
:
SRRNet: A Semantic Representation Refinement Network for Image Segmentation. 5720-5732 - Kaihua Zhang
, Yang Wu
, Mingliang Dong
, Bo Liu, Dong Liu, Qingshan Liu
:
Deep Object Co-Segmentation and Co-Saliency Detection via High-Order Spatial-Semantic Network Modulation. 5733-5746 - Shaowei Weng
, Ye Zhou, Tiancong Zhang
, Mengyao Xiao
, Yao Zhao
:
General Framework to Reversible Data Hiding for JPEG Images With Multiple Two-Dimensional Histograms. 5747-5762 - Shule Deng
, Jin-Gang Yu
, Zihao Wu
, Hongxia Gao, Yansheng Li
, Yang Yang
:
Learning Relative Feature Displacement for Few-Shot Open-Set Recognition. 5763-5774 - Jian Jin
, Xingxing Zhang
, Lili Meng
, Weisi Lin
, Jie Liang
, Huaxiang Zhang
, Yao Zhao
:
Auto-Weighted Layer Representation Based View Synthesis Distortion Estimation for 3-D Video Coding. 5775-5788 - Ginam Kim, Hyunsung Kim, Kyeongbo Kong, Jou Won Song
, Suk-Ju Kang
:
Human Body-Aware Feature Extractor Using Attachable Feature Corrector for Human Pose Estimation. 5789-5799 - Junwen Xiong
, Yu Zhou
, Peng Zhang
, Lei Xie
, Wei Huang
, Yufei Zha
:
Look&listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement. 5800-5812 - Haoyu Chen, Minggui Teng
, Boxin Shi
, Yizhou Wang
, Tie-Jun Huang
:
A Residual Learning Approach to Deblur and Generate High Frame Rate Video With an Event Camera. 5826-5839 - Frank Po Wen Lo
, Yao Guo
, Yingnan Sun
, Jianing Qiu
, Benny Lo
:
An Intelligent Vision-Based Nutritional Assessment Method for Handheld Food Items. 5840-5851 - Longrong Yang
, Hongliang Li
, Qingbo Wu
, Fanman Meng
, Heqian Qiu
, Linfeng Xu
:
Bias-Correction Feature Learner for Semi-Supervised Instance Segmentation. 5852-5863 - Yanli Ji
, Shuo Ma, Xing Xu
, Xuelong Li
, Heng Tao Shen
:
Self-Supervised Fine-Grained Cycle-Separation Network (FSCN) for Visual-Audio Separation. 5864-5876 - Yushu Zhang
, Wentao Zhou, Ruoyu Zhao
, Xinpeng Zhang
, Xiaochun Cao:
F-TPE: Flexible Thumbnail-Preserving Encryption Based on Multi-Pixel Sum-Preserving Encryption. 5877-5891 - Muli Yang
, Chenghao Xu, Aming Wu, Cheng Deng
:
A Decomposable Causal View of Compositional Zero-Shot Learning. 5892-5902 - Tianxin Huang
, Hao Zou, Jinhao Cui, Jiangning Zhang
, Xuemeng Yang, Lin Li
, Yong Liu
:
Adaptive Recurrent Forward Network for Dense Point Cloud Completion. 5903-5915 - Lin Zhang, Mingxin Zhang, Ran Song
, Ziying Zhao, Xiaolei Li:
Unsupervised Embedding Learning With Mutual-Information Graph Convolutional Networks. 5916-5926 - Jie Li
, Yong Xiang
, Hao Wu, Shaowen Yao
, Dan Xu
:
Optimal Transport-Based Patch Matching for Image Style Transfer. 5927-5940 - Jacob Chakareski
, Xavier Corbillon, Gwendal Simon, Viswanathan (Vishy) Swaminathan:
User Navigation Modeling, Rate-Distortion Analysis, and End-to-End Optimization for Viewport-Driven 360$^\circ $ Video Streaming. 5941-5956 - Jiaqi Zhang
, Yunrui Jian
, Suhong Wang
, Chuanmin Jia
, Shanshe Wang
, Siwei Ma
, Wen Gao:
Textural and Directional Information Based Offset In-Loop Filtering in AVS3. 5957-5971 - Jun-Sang Yoo
, Dong-Wook Kim, Yucheng Lu
, Seung-Won Jung
:
RZSR: Reference-Based Zero-Shot Super-Resolution With Depth Guided Self-Exemplars. 5972-5983 - Majjed Al-Qatf
, Xingfu Wang
, Ammar Hawbani
, Amr Abdussalam
, Saeed Hamood Alsamhi
:
Image Captioning With Novel Topics Guidance and Retrieval-Based Topics Re-Weighting. 5984-5999 - Zhentan Zheng
, Jianyi Liu
, Nanning Zheng
:
P$^{2}$-GAN: Efficient Stroke Style Transfer Using Single Style Image. 6000-6012 - Ali Ak
, Abhishek Goswami
, Wolf Hauser, Patrick Le Callet
, Frédéric Dufaux
:
RV-TMO: Large-Scale Dataset for Subjective Quality Assessment of Tone Mapped Images. 6013-6025 - Pei Wang
, Yun Yang
, Yuelong Xia, Kun Wang, Xingyi Zhang
, Song Wang
:
Information Maximizing Adaptation Network With Label Distribution Priors for Unsupervised Domain Adaptation. 6026-6039 - Dingkang Liang
, Wei Xu
, Yingying Zhu
, Yu Zhou
:
Focal Inverse Distance Transform Maps for Crowd Localization. 6040-6052 - Sheikh Tania
, Gour C. Karmakar
, Shyh Wei Teng
, M. Manzur Murshed
:
A Robust Local Texture Descriptor in the Parametric Space of the Weibull Distribution. 6053-6066 - Huihui Yue, Jichang Guo
, Xiangjun Yin
, Yi Zhang, Sida Zheng:
Deep Label Prior: Pre-Training-Free Salient Object Detection Network Based on Label Learning. 6067-6078 - Jakub Nawala
, Lucjan Janowski
, Bogdan Cmiel, Krzysztof Rusek
, Pablo Pérez
:
Generalized Score Distribution: A Two-Parameter Discrete Distribution Accurately Describing Responses From Quality of Experience Subjective Experiments. 6090-6104 - Congcong Li
, Jing Li
, Yuguang Xie, Jiayang Nie, Tao Yang
, Zhaoyang Lu:
Calibration-Free Cross-Camera Target Association Using Interaction Spatiotemporal Consistency. 6105-6120 - Zhe Xu
, Kun Wei
, Xu Yang, Cheng Deng
:
Point-Supervised Video Temporal Grounding. 6121-6131 - Wenfeng Song, Xia Hou, Shuai Li, Chenglizhao Chen, Danyang Gao, Xian'e Wang, Yuzhe Sun, Jianxia Hou, Aimin Hao:
An Intelligent Virtual Standard Patient for Medical Students Training Based on Oral Knowledge Graph. 6132-6145 - Xu Yin
, Dongbo Min
, Yuchi Huo
, Sung-Eui Yoon
:
Contour-Aware Equipotential Learning for Semantic Segmentation. 6146-6156 - Junbao Zhuo
, Shuhui Wang
, Qingming Huang
:
Uncertainty Modeling for Robust Domain Adaptation Under Noisy Environments. 6157-6170 - Zhiqi Pang
, Lingling Zhao, Qiuyang Liu, Chunyu Wang
:
Camera Invariant Feature Learning for Unsupervised Person Re-Identification. 6171-6182 - Jiahao Nie
, Zhiwei He
, Yuxiang Yang
, Mingyu Gao
, Zhekang Dong
:
Learning Localization-Aware Target Confidence for Siamese Visual Tracking. 6194-6206 - Chao Sun, Zhedong Zheng
, Xiaohan Wang
, Mingliang Xu
, Yi Yang:
Self-Supervised Point Cloud Representation Learning via Separating Mixed Shapes. 6207-6218 - Fuxiang Wu
, Liu Liu
, Fusheng Hao
, Fengxiang He, Jun Cheng
:
Language-Based Image Manipulation Built on Language-Guided Ranking. 6219-6231 - Yamin Sepehri
, Pedram Pad, Clément Kündig, Pascal Frossard
, L. Andrea Dunbar:
Privacy-Preserving Image Acquisition for Neural Vision Systems. 6232-6244 - Sweta Anmulwar
, Ning Wang
, Vu San Ha Huynh
, Stewart Bryant, Jinze Yang, Regius Rahim Tafazolli
:
HoloSync: Frame Synchronisation for Multi-Source Holographic Teleportation Applications. 6245-6257 - Jingzhao Xu
, Mengke Yuan
, Dong-Ming Yan
, Tieru Wu
:
Illumination Guided Attentive Wavelet Network for Low-Light Image Enhancement. 6258-6271 - Xixia Xu
, Qi Zou
, Xue Lin
:
Structure-Enriched Topology Learning For Cross-Domain Multi-Person Pose Estimation. 6272-6284 - Jun Chen
, Hui Duan
, Yuanxin Song
, Zemin Cai
, Guangguang Yang
:
Optical Flow Computation for Video Under the Dynamic Illumination. 6285-6300 - Jiandian Zeng
, Jiantao Zhou
, Tianyi Liu
:
Robust Multimodal Sentiment Analysis via Tag Encoding of Uncertain Missing Modalities. 6301-6314 - Wei Wang, Junyu Gao
, Changsheng Xu
:
Weakly-Supervised Video Object Grounding via Learning Uni-Modal Associations. 6329-6340 - Qing Li
, Ying Chen
, Aoyang Zhang, Yong Jiang
, Longhao Zou
, Zhimin Xu
, Gabriel-Miro Muntean
:
A Super-Resolution Flexible Video Coding Solution for Improving Live Streaming Quality. 6341-6355 - Hanyang Jin, Shenqi Lai, Qi Tang, Tianyu Zhu, Xueming Qian
:
MPPM: A Mobile-Efficient Part Model for Object re-ID. 6356-6370 - Qiangqiang Shen
, Shuangyan Yi
, Yongsheng Liang
, Yongyong Chen
, Wei Liu:
Bilateral Fast Low-Rank Representation With Equivalent Transformation for Subspace Clustering. 6371-6383 - Qiaokang Xie
, Zhenbo Lu, Wengang Zhou
, Houqiang Li
:
Improving Person Re-Identification With Multi-Cue Similarity Embedding and Propagation. 6384-6396 - Wenhong Duan
, Zhenhua Liu, Chuanmin Jia
, Shanshe Wang
, Siwei Ma
, Wen Gao:
Differential Weight Quantization for Multi-Model Compression. 6397-6410 - Tiesong Zhao
, Yuhang Huang, Weize Feng, Yiwen Xu
, Sam Kwong
:
Efficient VVC Intra Prediction Based on Deep Feature Fusion and Probability Estimation. 6411-6421 - Yawen Cui
, Wanxia Deng
, Xin Xu
, Zhen Liu
, Zhong Liu, Matti Pietikäinen
, Li Liu
:
Uncertainty-Guided Semi-Supervised Few-Shot Class-Incremental Learning With Knowledge Distillation. 6422-6435 - Dongyan Nie
, Jialin Liu
, Hong Fei, Xiaoying Sun
:
Neuromorphic Similarity Measurement of Tactile Stimuli in Human-Machine Interface. 6436-6445 - Huaxin Pang
, Shikui Wei
, Gangjian Zhang
, Shiyin Zhang
, Shuang Qiu
, Yao Zhao
:
Heterogeneous Feature Alignment and Fusion in Cross-Modal Augmented Space for Composed Image Retrieval. 6446-6457 - Chuang Yang
, Mulin Chen
, Yuan Yuan, Qi Wang
:
Reinforcement Shrink-Mask for Text Detection. 6458-6470 - Junjie Wu
, Changqun Xia
, Tianshu Yu
, Jia Li
:
View-Aware Salient Object Detection for $360^{\circ }$ Omnidirectional Image. 6471-6484 - Wendong Mao
, Shuai Yang
, Huihong Shi
, Jiaying Liu
, Zhongfeng Wang
:
Intelligent Typography: Artistic Text Style Transfer for Complex Texture and Structure. 6485-6498 - Guanghui Yue
, Di Cheng
, Leida Li
, Tianwei Zhou
, Hantao Liu
, Tianfu Wang
:
Semi-Supervised Authentically Distorted Image Quality Assessment With Consistency-Preserving Dual-Branch Convolutional Neural Network. 6499-6511 - Naiyu Fang
, Lemiao Qiu
, Shuyou Zhang
, Zili Wang
, Kerui Hu, Liangyu Dong:
A Novel Human Image Sequence Synthesis Method by Pose-Shape-Content Inference. 6512-6524 - Jinchao Zhu
, Xiaoyu Zhang, Xian Fang, Yuxuan Wang
, Panlong Tan, Junnan Liu
:
Perception-and-Regulation Network for Salient Object Detection. 6525-6537 - Dehui Zhu
, Bo Du
, Yanni Dong
, Liangpei Zhang
:
Target Detection With Spatial-Spectral Adaptive Sample Generation and Deep Metric Learning for Hyperspectral Imagery. 6538-6550 - Yiming Wang
, Dongxia Chang
, Zhiqiang Fu
, Jie Wen
, Yao Zhao
:
Graph Contrastive Partial Multi-View Clustering. 6551-6562 - Changchong Sheng
, Li Liu
, Wanxia Deng
, Liang Bai, Zhong Liu, Songyang Lao
, Gangyao Kuang, Matti Pietikäinen
:
Importance-Aware Information Bottleneck Learning Paradigm for Lip Reading. 6563-6574 - Huan Deng
, Zhenguo Yang
, Tianyong Hao
, Qing Li
, Wenyin Liu
:
Multimodal Affective Computing With Dense Fusion Transformer for Inter- and Intra-Modality Interactions. 6575-6587 - Gaosheng Liu
, Huanjing Yue
, Jiamin Wu
, Jing-Yu Yang
:
Efficient Light Field Angular Super-Resolution With Sub-Aperture Feature Learning and Macro-Pixel Upsampling. 6588-6600 - Yukun Qiu
, Fa-Ting Hong
, Wei-Hong Li
, Wei-Shi Zheng
:
Learning Relation Models to Detect Important People in Still Images. 6601-6615 - Congcong Zhu
, Xiaoqiang Li
, Jide Li
, Songmin Dai, Weiqin Tong
:
Multi-Sourced Knowledge Integration for Robust Self-Supervised Facial Landmark Tracking. 6616-6628 - Huibing Wang
, Guangqi Jiang, Jinjia Peng, Ruoxi Deng, Xianping Fu:
Towards Adaptive Consensus Graph: Multi-View Clustering via Graph Collaboration. 6629-6641 - Yong Li
, Qiang Hao
, Jianguo Hu, Xinmiao Pan
, Zechao Li
, Zhen Cui
:
3D3M: 3D Modulated Morphable Model for Monocular Face Reconstruction. 6642-6652 - Tingyu Weng
, Jun Xiao
, Feilong Yan, Haiyong Jiang
:
Context-Aware 3D Point Cloud Semantic Segmentation With Plane Guidance. 6653-6664 - Wei Xia
, Qianqian Wang
, Quanxue Gao
, Ming Yang
, Xinbo Gao:
Self-Consistent Contrastive Attributed Graph Clustering With Pseudo-Label Prompt. 6665-6677 - Xin Yao
, Min Wang
, Wengang Zhou
, Houqiang Li
:
Hash Bit Selection With Reinforcement Learning for Image Retrieval. 6678-6687 - Chen Ju
, Peisen Zhao
, Siheng Chen
, Ya Zhang
, Xiaoyun Zhang
, Yanfeng Wang
, Qi Tian
:
Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization. 6688-6701 - Peipei Zhu
, Xiao Wang
, Yong Luo, Zhenglong Sun
, Wei-Shi Zheng
, Yaowei Wang
, Changwen Chen
:
Unpaired Image Captioning by Image-Level Weakly-Supervised Visual Concept Recognition. 6702-6716 - Xinsheng Wang
, Qicong Xie, Jihua Zhu
, Lei Xie
, Odette Scharenborg
:
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Persons. 6717-6728 - Taro Narahara
, Toshihiko Yamasaki
:
Subjective Functionality and Comfort Prediction for Apartment Floor Plans and Its Application to Intuitive Online Property Search. 6729-6742 - Zhenrong Zhang
, Jiefeng Ma, Jun Du
, Licheng Wang, Jianshu Zhang:
Multimodal Pre-Training Based on Graph Attention Network for Document Understanding. 6743-6755 - Chunjie Zhang
, Huihui Bai
, Yao Zhao
:
Fine-Grained Image Classification by Class and Image-Specific Decomposition With Multiple Views. 6756-6766 - Zehua Sheng
, Xiongwei Liu
, Si-Yuan Cao, Hui-Liang Shen
, Huaqi Zhang:
Frequency-Domain Deep Guided Image Denoising. 6767-6781 - Kunpeng Niu
, Yanli Liu
, Enhua Wu, Guanyu Xing
:
A Boundary-Aware Network for Shadow Removal. 6782-6793 - Lirong Zheng
, Yanshan Li
, Kaihao Zhang
, Wenhan Luo
:
T-Net: Deep Stacked Scale-Iteration Network for Image Dehazing. 6794-6807 - Dengyan Luo
, Mao Ye
, Shuai Li
, Ce Zhu
, Xue Li
:
Spatio-Temporal Detail Information Retrieval for Compressed Video Quality Enhancement. 6808-6820 - Jian Zhu
, Qingwu Zhang, Lunke Fei
, Ruichu Cai
, Yuan Xie
, Bin Sheng
, Xiaokang Yang
:
FFFN: Frame-By-Frame Feedback Fusion Network for Video Super-Resolution. 6821-6835 - Shijia Ni
, Feng Shao
, Xiongli Chai
, Hangwei Chen
, Yo-Sung Ho
:
Composition-Guided Neural Network for Image Cropping Aesthetic Assessment. 6836-6851 - Yuanman Li
, Jiaxiang You, Jiantao Zhou
, Wei Wang
, Xin Liao
, Xia Li
:
Image Operation Chain Detection with Machine Translation Framework. 6852-6867 - Tong Zhu
, Leida Li
, Jufeng Yang
, Sicheng Zhao
, Xiao Xiao
:
Multimodal Emotion Classification With Multi-Level Semantic Reasoning Network. 6868-6880 - Pinzhuo Tian
, Shaorong Xie
:
An Adversarial Meta-Training Framework for Cross-Domain Few-Shot Learning. 6881-6891 - Minsoo Song
, Gi-Mun Um, Heekyung Lee
, Jeongil Seo
, Wonjun Kim
:
Dynamic Residual Filtering With Laplacian Pyramid for Instance Segmentation. 6892-6903 - Nannan Hu, Yue Ming
, Chunxiao Fan, Fan Feng
, Boyang Lyu:
TSFNet: Triple-Steam Image Captioning. 6904-6916 - Weimin Tan
, Ganghui Ru, Yueming Jiang, Jichun Li
, Bo Yan
:
Rethinking and Improving Few-Shot Segmentation From a Contour-Aware Perspective. 6917-6929 - Yuchun Fang
, Sirui Cai, Yiting Cao
, Zhengchen Li, Zhaoxiang Zhang
:
Adversarial Learning Guided Task Relatedness Refinement for Multi-Task Deep Learning. 6946-6957 - Haonan Zhang
, Longjun Liu
, Bingyao Kang, Nanning Zheng
:
Hierarchical Model Compression via Shape-Edge Representation of Feature Maps - an Enlightenment From the Primate Visual System. 6958-6970 - Runmin Cong
, Kepu Zhang
, Chen Zhang, Feng Zheng
, Yao Zhao
, Qingming Huang
, Sam Kwong
:
Does Thermal Really Always Matter for RGB-T Salient Object Detection? 6971-6982 - Yadong Qu
, Hongtao Xie
, Shancheng Fang
, Yuxin Wang
, Yongdong Zhang
:
ADNet: Rethinking the Shrunk Polygon-Based Approach in Scene Text Detection. 6983-6996 - Aihua Mao
, Zhi Yang
, Ken Lin, Jun Xuan, Yong-Jin Liu
:
Positional Attention Guided Transformer-Like Architecture for Visual Question Answering. 6997-7009 - Haijin Zeng
, Jize Xue
, Hiep Luong, Wilfried Philips:
Multimodal Core Tensor Factorization and its Applications to Low-Rank Tensor Completion. 7010-7024 - Fengda Hao
, Jiaojiao Li
, Rui Song
, Yunsong Li
, Kailang Cao:
Structure-Aware Graph Convolution Network for Point Cloud Parsing. 7025-7036 - Wei Huang
, Yintao Zhou
, Yiu-ming Cheung
, Peng Zhang
, Yufei Zha
, Meng Pang
:
Facial Expression Guided Diagnosis of Parkinson's Disease via High-Quality Data Augmentation. 7037-7050 - Wuyang Li
, Xinyu Liu
, Yixuan Yuan
:
SCAN++: Enhanced Semantic Conditioned Adaptation for Domain Adaptive Object Detection. 7051-7061 - Qingrong Cheng
, Keyu Wen
, Xiaodong Gu
:
Vision-Language Matching for Text-to-Image Synthesis via Generative Adversarial Networks. 7062-7075 - Cankun Zhong
, Wing W. Y. Ng
:
A Robust Frequency-Domain-Based Graph Adaptive Network for Parkinson's Disease Detection From Gait Data. 7076-7088 - Zhonghong Ou
, Zhongjie Chen
, Shengyi Shen
, Lina Fan
, Siyuan Yao, Meina Song
, Pan Hui
:
Free$\rm ^{3}$Net: Gliding Free, Orientation Free, and Anchor Free Network for Oriented Object Detection. 7089-7100 - Yuchen Hong
, Youwei Lyu
, Si Li
, Gang Cao, Boxin Shi
:
Reflection Removal With NIR and RGB Image Feature Fusion. 7101-7112 - Lu Yang
, Qing Song
, Zhihui Wang, Zhiwei Liu, Songcen Xu, Zhihao Li
:
Quality-Aware Network for Human Parsing. 7128-7138 - SangEun Lee
, Chaeeun Ryu, Eunil Park
:
OSANet: Object Semantic Attention Network for Visual Sentiment Analysis. 7139-7148 - Fan Liu
, Huilin Chen, Zhiyong Cheng
, Anan Liu
, Liqiang Nie
, Mohan S. Kankanhalli
:
Disentangled Multimodal Representation Learning for Recommendation. 7149-7159 - Yuanwei Zhu
, Yakun Huang
, Xiuquan Qiao
, Zhijie Tan, Boyuan Bai, Huadong Ma
, Schahram Dustdar
:
A Semantic-Aware Transmission With Adaptive Control Scheme for Volumetric Video Service. 7160-7172 - Yue Zhang
, Chao Liang
, Longxiang Jiang
:
Confidence-Aware Active Feedback for Interactive Instance Search. 7173-7184 - Soushi Ueno
, Takuya Fujihashi
, Toshiaki Koike-Akino
, Takashi Watanabe
:
Point Cloud Soft Multicast for Untethered XR Users. 7185-7195 - Bianca Jansen Van Rensburg
, William Puech
, Jean-Pierre Pedeboy:
A Format Compliant Encryption Method for 3D Objects Allowing Hierarchical Decryption. 7196-7207 - Yuqi Bu
, Liuwu Li, Jiayuan Xie
, Qiong Liu
, Yi Cai
, Qingbao Huang
, Qing Li
:
Scene-Text Oriented Referring Expression Comprehension. 7208-7221 - Yan Wang
, Tongtong Su
, Yusen Li
, Jiuwen Cao
, Gang Wang
, Xiaoguang Liu
:
DDistill-SR: Reparameterized Dynamic Distillation Network for Lightweight Image Super-Resolution. 7222-7234 - Dawei Zhao
, Qingwei Gao
, Yixiang Lu
, Dong Sun
:
Non-Aligned Multi-View Multi-Label Classification via Learning View-Specific Labels. 7235-7247 - Lianli Gao
, Qike Zhao
, Junchen Zhu
, Sitong Su
, Lechao Cheng
, Lei Zhao
:
From External to Internal: Structuring Image for Text-to-Image Attributes Manipulation. 7248-7261 - Yangfan Sun
, Li Li
, Zhu Li
, Shizheng Wang, Shan Liu
, Ge Li
:
Learning a Compact Spatial-Angular Representation for Light Field. 7262-7273 - Yiyun Chen
, Yunmeng Liu
, Mingliang Chen, Zirui Wang
, Wenming Yang
, Qingmin Liao
:
Blind JPEG Compression Artifacts Removal by Integrating Channel Regulation With Exit Strategy. 7274-7286 - Yan Bai
, Jile Jiao
, Yihang Lou
, Shengsen Wu, Jun Liu
, Xuetao Feng, Ling-Yu Duan
:
Dual-Tuning: Joint Prototype Transfer and Structure Regularization for Compatible Feature Learning. 7287-7298 - Ruisong Zhang
, Weize Quan, Yong Zhang, Jue Wang
, Dong-Ming Yan
:
W-Net: Structure and Texture Interaction for Image Inpainting. 7299-7310 - Xihua Sheng
, Jiahao Li
, Bin Li
, Li Li
, Dong Liu
, Yan Lu:
Temporal Context Mining for Learned Video Compression. 7311-7322 - Zhening Xing
, Yuchen Wu
, Si Liu
, Shangzhe Di, Huimin Ma
:
Virtual Try-On With Garment Self-Occlusion Conditions. 7323-7336 - Jiaxiang Chen
, Jiayuan Fan
, Hancheng Ye
, Jie Li
, Yongbin Liao
, Tao Chen
:
Exploring Kernel-Based Texture Transfer for Pose-Guided Person Image Generation. 7337-7349 - Tong Qiao
, Jiasheng Wu, Ning Zheng
, Ming Xu
, Xiangyang Luo
:
FGDNet: Fine-Grained Detection Network Towards Face Anti-Spoofing. 7350-7363 - Jun Jia
, Zhongpai Gao
, Dandan Zhu
, Xiongkuo Min
, Menghan Hu
, Guangtao Zhai
:
RIVIE: Robust Inherent Video Information Embedding. 7364-7377 - Biwei Cao
, Jiuxin Cao
, Jie Gui
, Jiayun Shen, Bo Liu
, Lei He, Yuan Yan Tang, James Tin-Yau Kwok
:
AlignVE: Visual Entailment Recognition Based on Alignment Relations. 7378-7387 - Guoqiang Gong
, Linchao Zhu
, Yadong Mu
:
Language-Guided Multi-Granularity Context Aggregation for Temporal Sentence Grounding. 7402-7414 - Fuxiang Huang, Lei Zhang
, Yuhang Zhou, Xinbo Gao
:
Adversarial and Isotropic Gradient Augmentation for Image Retrieval With Text Feedback. 7415-7427 - Chengyin Xu
, Zenghao Chai
, Zhengzhuo Xu
, Hongjia Li, Qiruyi Zuo
, Lingyu Yang, Chun Yuan
:
HHF: Hashing-Guided Hinge Function for Deep Hashing Retrieval. 7428-7440 - Ziming Liu
, Song Guo
, Jingcai Guo
, Yuanyuan Xu, Fushuo Huo
:
Towards Unbiased Multi-Label Zero-Shot Learning With Pyramid and Semantic Attention. 7441-7455 - Pan Yang
, Xiong Luo
, Jiankun Sun
:
A Simple but Effective Method for Balancing Detection and Re-Identification in Multi-Object Tracking. 7456-7468 - Xiang Li
, Jinglu Wang
, Xiao Li
, Yan Lu:
Video Instance Segmentation by Instance Flow Assembly. 7469-7479 - Masum Shah Junayed
, Md Baharul Islam
:
Consistent Video Inpainting Using Axial Attention-Based Style Transformer. 7494-7504 - Jiaxiang Wang
, Chenglong Li
, Aihua Zheng
, Jin Tang
, Bin Luo
:
Looking and Hearing Into Details: Dual-Enhanced Siamese Adversarial Network for Audio-Visual Matching. 7505-7516 - Xiang Fang
, Daizong Liu
, Pan Zhou
, Yuchong Hu
:
Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval. 7517-7532 - Kai Yang
, Haijun Zhang
, Feng Gao, Jianyang Shi, Yanfeng Zhang, Q. M. Jonathan Wu
:
DETA: A Point-Based Tracker With Deformable Transformer and Task-Aligned Learning. 7545-7558 - Hezhen Hu, Junfu Pu
, Wengang Zhou
, Houqiang Li
:
Collaborative Multilingual Continuous Sign Language Recognition: A Unified Framework. 7559-7570 - Han Fang
, Zhaoyang Jia, Yupeng Qiu
, Jiyi Zhang, Weiming Zhang
, Ee-Chien Chang
:
De-END: Decoder-Driven Watermarking Network. 7571-7581 - Yiran Yang
, Xian Sun
, Wenhui Diao
, Xuee Rong
, Shiyao Yan, Dongshuo Yin
, Xinming Li:
Optimal Partition Assignment for Universal Object Detection. 7582-7593 - Zhao Xie
, Jiansong Chen
, Kewei Wu
, Dan Guo
, Richang Hong
:
Global Temporal Difference Network for Action Recognition. 7594-7606 - Yucheng Zhu
, Yunhao Li
, Wei Sun
, Xiongkuo Min