default search action
ICME 2023: Brisbane, Australia
- IEEE International Conference on Multimedia and Expo, ICME 2023, Brisbane, Australia, July 10-14, 2023. IEEE 2023, ISBN 978-1-6654-6891-6
- Prashant Pandey, Mustafa Chasmai, Monish Natarajan, Brejesh Lall:
Weakly Supervised Few-Shot and Zero-Shot Semantic Segmentation with Mean Instance Aware Prompt Learning. 1-6 - Qianwen Cao, Heyan Huang, Minpeng Liao, Xianling Mao:
Ada-SwinBERT: Adaptive Token Selection for Efficient Video Captioning with Online Self-Distillation. 7-12 - Jiuxiang You, Zhenguo Yang, Qing Li, Wenyin Liu:
A Retriever-Reader Framework with Visual Entity Linking for Knowledge-Based Visual Question Answering. 13-18 - Pufen Zhang, Peng Shi, Song Zhang:
2S-DFN: Dual-semantic Decoding Fusion Networks for Fine-grained Image Recognition. 19-24 - Yongzhu Miao, Shasha Li, Jintao Tang, Ting Wang:
MuDPT: Multi-modal Deep-symphysis Prompt Tuning for Large Pre-trained Vision-Language Models. 25-30 - Sai Shashank Kalakonda, Shubh Maheshwari, Ravi Kiran Sarvadevabhatla:
Action-GPT: Leveraging Large-scale Language Models for Improved and Generalized Action Generation. 31-36 - Tianhua Xu, Sheng-hua Zhong, Zhijiao Xiao:
Protecting Intellectual Property of EEG-based Model with Watermarking. 37-42 - Hanxiu Zhang, Guitao Cao, Xinyue Zhang, Jing Xiang, Chunwei Wu:
Making Adversarial Attack Imperceptible in Frequency Domain: A Watermark-based Framework. 43-48 - Jie Luo, Peisong He, Jiayong Liu, Hongxia Wang, Chunwang Wu, Yijing Chen, Wanjie Li, Jiangchuan Li:
Content-adaptive Adversarial Embedding for Image Steganography Using Deep Reinforcement Learning. 49-54 - Youqiang Sun, Jianyi Liu, Ru Zhang:
A Robust Generative Image Steganography Method based on Guidance Features in Image Synthesis. 55-60 - Shiqiang Wu, Jie Liu, Ying Huang, Hu Guan, Shuwu Zhang:
Adversarial Audio Watermarking: Embedding Watermark into Deep Feature. 61-66 - Tengjun Liu, Ying Chen, Wanxuan Gu:
Deniable Diffusion Generative Steganography. 67-71 - Songbin Li, Xiangzhi Yang, Jingang Wang:
Sea Surface Object Detection Based on Background Dynamic Perception and Cross-Layer Semantic Interaction. 72-77 - Guikun Chen, Lin Li, Yawei Luo, Jun Xiao:
Addressing Predicate Overlap in Scene Graph Generation with Semantic Granularity Controller. 78-83 - Shiqi Ren, Chao Zhu, Mengyin Liu, Xu-Cheng Yin:
Towards Discriminative Semantic Relationship for Fine-grained Crowd Counting. 84-89 - Jun Xie, Yixuan Zhou, Xing Xu, Guoqing Wang, Fumin Shen, Yang Yang:
Region-Aware Semantic Consistency for Unsupervised Domain-Adaptive Semantic Segmentation. 90-95 - Chuang Zhao, Hefei Ling, Yuxuan Shi, Chengxin Zhao, Jiazhong Chen, Qiang Cao:
Deep Unsupervised Hashing with Selective Semantic Mining. 96-101 - Qiaoqiao Wei, Hui Zhang, Jun-Hai Yong:
Boosting Interactive Image Segmentation by Exploiting Semantic Clues. 102-107 - Dafeng Li, Yingying Zhu:
Visual-Linguistic Alignment and Composition for Image Retrieval with Text Feedback. 108-113 - Xinyu Zhou, Anna Zhu, Huen Chen, Wei Pan:
Scene Text Involved "Text"-to-Image Retrieval through Logically Hierarchical Matching. 114-119 - Yi Li, Meihua Yu, Xin Xie, Haiyan Fu, Hao He, Yanqing Guo:
Federating Hashing Networks Adaptively for Privacy-Preserving Retrieval. 120-125 - Kangkang Lu, Yanhua Yu, Meiyu Liang, Min Zhang, Xiaowen Cao, Zehua Zhao, Mengran Yin, Zhe Xue:
Deep Unsupervised Momentum Contrastive Hashing for Cross-modal Retrieval. 126-131 - Yiyang Cai, Jiaming Lu, Jiewen Wang, Shuang Liang:
Uncertainty-Aware Cross-Modal Transfer Network for Sketch-Based 3D Shape Retrieval. 132-137 - Guoliang Wang, Yanlei Shang, Yong Chen, Chaoqi Zhen, Dequan Cheng:
Scene Graph based Fusion Network for Image-Text Retrieval. 138-143 - Yuchao Feng, Honghui Xu, Jiawei Jiang, Jianwei Zheng:
Compact Intertemporal Coupling Network for Remote Sensing Change Detection. 144-149 - Jueyu Chen, Guanyu Xing, Jingwei Liao, Housheng Wei, Yanli Liu:
Boundary-aware Shadow Detection via Mask Decoupling and Feature Correction. 150-155 - Yuzhong Zhao, Yuanqiang Cai, Weijia Wu, Weiqiang Wang:
Explore Faster Localization Learning For Scene Text Detection. 156-161 - Xiaofeng Ji, Jin Chen, Xinxiao Wu:
Counterfactual Inference for Visual Relationship Detection in Videos. 162-167 - Huayi Zhou, Fei Jiang, Hongtao Lu:
Body-Part Joint Detection and Association via Extended Object Representation. 168-173 - Jian Cui, Lin Li, Xiaohui Tao:
Be-or-Not Prompt Enhanced Hard Negatives Generating For Memes Category Detection. 174-179 - Yanni Wang, Gang Yang, Dayong Ding, Jianchun Zhao:
Automatic Retinal Nerve Fiber Trajectory Simulation and Quasi-polar Transformation for Detecting Retinal Nerve Fiber Layer Defect in Fundus Images. 180-185 - Jiawei Jiang, Jiacheng Chen, Honghui Xu, Yuchao Feng, Jianwei Zheng:
GA-HQS: MRI reconstruction via a generically accelerated unfolding approach. 186-191 - Yi Li, Baoyao Yang, Dan Pan, An Zeng, Long Wu, Yang Yang:
Early Diagnosis of Alzheimer's Disease Based on Multimodal Hypergraph Attention Network. 192-197 - Shanshan Huang, Qingsong Li, Lei Wang, Yuanhao Wang, Li Liu:
Score-based causal feature selection for cancer risk prediction. 198-203 - Wentian Cai, Yulin Cheng, Ying Gao, Weixiao Liu, Xinyan Xie, Xiongwen Luo, Weixian Yang, Zaiyi Liu, Changhong Liang:
A Dual-Path Supplemental Information Learning Architecture for Breast Cancer Ki-67 Status Prediction in T2w MRI. 210-215 - Hui Zhang, Shiqi Shen, Jinhua Xu:
Expression-Guided Attention GAN for Fine-Grained Facial Expression Editing. 216-221 - Yini Fang, Didan Deng, Liang Wu, Frederic Jumelle, Bertram E. Shi:
RMES: Real-Time Micro-Expression Spotting Using Phase From Riesz Pyramid. 222-227 - Shukang Yin, Shiwei Wu, Tong Xu, Shifeng Liu, Sirui Zhao, Enhong Chen:
AU-aware graph convolutional network for Macroand Micro-expression spotting. 228-233 - Hao Sun, Chenchen Pi, Wei Xie:
Semi-Supervised Facial Expression Recognition by Exploring False Pseudo-Labels. 234-239 - Jingning Xu, Benlai Tang, Mingjie Wang, Minghao Li, Meirong Ma:
CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation. 240-245 - David Anghelone, Sarah Lannes, Antitza Dantcheva:
ANYRES: Generating High-Resolution visible-face images from Low-Resolution thermal-face images. 246-251 - Yutong Li, Zhenyu Liu, Gang Li, Qiongqiong Chen, Zhijie Ding, Xiping Hu, Bin Hu:
A Visually Interpretable Convolutional-Transformer Model for Assessing Depression from Facial Images. 252-257 - Zhaowen Li, Xu Zhao, Peigeng Ding, Zongxing Gao, Yuting Yang, Ming Tang, Jinqiao Wang:
FreConv: Frequency Branch-and-Integration Convolutional Networks. 258-263 - Ruofan Wang, Jiayu Guo, Rui-Wei Zhao, Ling Su, Yingzi Ye, Xiaobo Zhang, Yuejie Zhang, Rui Feng:
Class-aware Variational Auto-encoder for Open Set Recognition. 264-269 - Mingyang Zhang, Xinyi Yu, Jingtao Rong, Linlin Ou:
Repnas: Searching for Efficient Re-Parameterizing Blocks. 270-275 - Bowen Zhao, Weidong Chen, Bo Hu, Hongtao Xie, Zhendong Mao:
Difference-Aware Iterative Reasoning Network for Key Relation Detection. 276-281 - Luying Li, Lizhuang Ma:
Injecting-Diffusion: Inject Domain-Independent Contents into Diffusion Models for Unpaired Image-to-Image Translation. 282-287 - Lei Xu, Rong Wang, Feiping Nie, Jun Wu, Xuelong Li:
Semi-Supervised Top-k Feature Selection with a General Optimization Framework. 288-293 - Yukun Zhang, Shengming Yuan, Jingkuan Song, Yixuan Zhou, Lin Zhang, Yulan He:
Towards Boosting Black-Box Attack Via Sharpness-Aware. 294-299 - Xiaolin Zhai, Zhengxi Hu, Dingye Yang, Shichao Wu, Jingtai Liu:
Learning Group Residual Representation for Group Activity Prediction*. 300-305 - Xuesong Guo, Shuo Wang, Jiahao Chang, Zehui Chen, Feng Zhao:
SAFE: Simultaneous Alignment of Features and Predictions for Dense Object Detectors. 306-311 - Xiaohong Xiang, Fuyuan Zhang, Xin Deng, Ke Hu:
MSG-CAM:Multi-scale inputs make a better visual interpretation of CNN networks. 312-317 - Peng Yan, Guodong Long:
Personalization Disentanglement for Federated Learning. 318-323 - Yuxin Shi, Zelei Liu, Zhuan Shi, Han Yu:
Fairness-Aware Client Selection for Federated Learning. 324-329 - Xiaoli Tang, Han Yu:
Utility-Maximizing Bidding Strategy for Data Consumers in Auction-Based Federated Learning. 330-335 - Zhiwei Xiong, Han Yu, Zhiqi Shen:
Federated Learning for Personalized Image Aesthetics Assessment. 336-341 - Yue Huang, Lanju Kong, Qingzhong Li, Baochen Zhang:
Decentralized Federated Learning Via Mutual Knowledge Distillation. 342-347 - Zekai Chen, Fuyi Wang, Zhiwei Zheng, Ximeng Liu, Yujie Lin:
Fedward: Flexible Federated Backdoor Defense Framework with Non-IID Data. 348-353 - Jialing He, Zhen Qin, Hangcheng Liu, Shangwei Guo, Biwen Chen, Ning Wang, Tao Xiang:
Contrastive Fusion Representation: Mitigating Adversarial Attacks on VQA Models. 354-359 - Zhengyu Wang, Yujie Zhang, Qi Yang, Yiling Xu, Yifei Zhou, Jun Sun, Shan Liu:
Improving Point Cloud Quality Metrics with Noticeable Possibility Maps. 360-365 - Haoning Wu, Liang Liao, Jingwen Hou, Chaofeng Chen, Erli Zhang, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin:
Exploring Opinion-Unaware Video Quality Assessment with Semantic Affinity Criterion. 366-371 - Lirong Huang, Rong Zhang, Miaohui Wang:
Just Noticeable Difference Estimation for Screen Content Images: A Content Uncertainty-guided Approach. 372-377 - Hui Wang, Xiguang Zheng, Yong Qin:
Intermediate-Task Learning with Pretrained Model for Synthesized Speech MOS Prediction. 378-383 - Zenan Xu, Wanjun Zhong, Qinliang Su, Fuwei Zhang:
Cross-Modal-Aware Representation Learning with Syntactic Hypergraph Convolutional Network for VideoQA. 384-389 - Hui Su, Yue Ye, Wei Hua, Lechao Cheng, Mingli Song:
SASFormer: Transformers for Sparsely Annotated Semantic Segmentation. 390-395 - Wujie Sun, Defang Chen, Can Wang, Deshi Ye, Yan Feng, Chun Chen:
Holistic Weighted Distillation for Semantic Segmentation. 396-401 - Feng Jiang, Heng Gao, Shoumeng Qiu, Haiqiang Zhang, Ru Wan, Jian Pu:
Knowledge Distillation from 3D to Bird's-Eye-View for LiDAR Semantic Segmentation. 402-407 - Huazheng Hao, Hui Xiao, Li Dong, Diqun Yan, Dongtai Liang, Jiayan Zhuang, Chengbin Peng:
A Pseudo-Dual Self-Rectification Framework for Semantic Segmentation. 408-413 - Feifei Ding, Jianjun Li, Wanyong Tian:
Dual-level Consistency Learning for Unsupervised Domain Adaptive Night-time Semantic Segmentation. 420-425 - Wenrui Li, Zhengyu Ma, Liang-Jian Deng, Hengyu Man, Xiaopeng Fan:
Modality-Fusion Spiking Transformer Network for Audio-Visual Zero-Shot Learning. 426-431 - Rui Gao, Fan Wan, Daniel Organisciak, Jiyao Pu, Haoran Duan, Peng Zhang, Xingsong Hou, Yang Long:
Privacy-Enhanced Zero-Shot Learning via Data-Free Knowledge Transfer. 432-437 - Ting Guo, Jiye Liang, Guo-Sen Xie:
Swap-Reconstruction Autoencoder for Compositional Zero-Shot Learning. 438-443 - Xinmiao Dai, Chong Wang, Haohe Li, Sunqi Lin, Li Dong, Jiafei Wu, Jun Wang:
Synthetic Feature Assessment for Zero-Shot Object Detection. 444-449 - Yapeng Li, Yong Luo, Bo Du:
Audio-Visual Generalized Zero-Shot Learning Based on Variational Information Bottleneck. 450-455 - Han Jiang, Xiaoshan Yang, Chaofan Chen, Changsheng Xu:
Fine-grained Primitive Representation Learning for Compositional Zero-shot Classification. 456-461 - Jingwei Wang, Peng Zhou, Xianjun Han, Yanming Chen:
Medical Image Super-Resolution via Diagnosis-Guided Attention. 462-467 - Hong Zhang, Shenglun Chen, Zhihui Wang, Haojie Li, Wanli Ouyang:
Denser is Better:cost distribution super-resolution network for more accurate sub-pixel disparity. 468-473 - Lin Sun, Chao Yang, Bin Jiang:
DSP-Net: Diverse Structure Prior Network for Image Inpainting. 474-479 - Zekun Ai, Xiaotong Luo, Yanyun Qu:
Joint Feature Aggregation for Stereo Image Super-resolution. 480-485 - Zijian Yuan, Kan Chang, Zhiquan Liu, Xinjie Wei, Boning Chen:
Joint Super-Resolution and Classification Based on Bidirectional Mapping and Multiple Constraints. 486-491 - Qichen Wei, Zijie Zuo, Jie Nie, Jiahao Du, Yaning Diao, Min Ye, Xinyue Liang:
Inpainting of Remote Sensing Sea Surface Temperature image with Multi-scale Physical Constraints. 492-497 - Lei Chen, Huhe Dai, Yuan Zheng:
ICANet: A Lightweight Increasing Context Aided Network for Real-Time Image Semantic Segmentation. 492-497 - Zhijie Huang, Tianyi Sun, Xiaopeng Guo, Yanze Wang, Jun Sun:
Generalized Compressed Video Restoration by Multi-Scale Temporal Fusion and Hierarchical Quality Score Estimation. 498-503 - Yuan Zou, Yinyao Ma:
Edgeformer: Edge-Enhanced Transformer for High-Quality Image Deblurring. 504-509 - Yubo Huang, Jia Wang, Peipei Li, Liuyu Xiang, Peigang Li, Zhaofeng He:
Generative Iris Prior Embedded Transformer for Iris Restoration. 510-515 - Zhongbao Yang, Jinshan Pan:
MBDFNet: Multi-scale Bidirectional Dynamic Feature Fusion Network for Efficient Image Deblurring. 522-527 - Minhua Liu, Yuanman Li, Rongqin Liang, Jiaxiang You, Xia Li:
Multiple degraded image restoration via degradation history estimation. 528-533 - Jintao Zhang, Guangyi Xiao:
Gradual Migration and Style Consistency for Unsupervised Domain Adaptation. 534-539 - Han Xie, Zhifeng Shen, Shicai Yang, Weijie Chen, Luojun Lin:
Adapt then Generalize: A Simple Two-Stage Framework for Semi-Supervised Domain Generalization. 540-545 - Hongjian Song, Jie Tang, Hongzhao Xiao, Juncheng Hu:
Rethinking Overfitting of Multiple Instance Learning for Whole Slide Image Classification. 546-551 - Qiang Chen, Dong Zhang, Shoushan Li, Guodong Zhou:
A Unified MRC Framework with Multi-Query for Multi-modal Relation Triplets Extraction. 552-557 - Jiaxin Yang, Xiaofei Li, Jun Zhang, Shuohao Li:
Feature Bias Correction: A Feature Augmentation Method for Long-tailed Recognition. 558-563 - Yuling Jiang, Yingyuan Zhao, Bing-Kun Bao:
Recombination Samples Training for Robust Natural Language Visual Reasoning. 564-569 - Yansong Qu, Yuze Wang, Yue Qi:
SG-NeRF: Semantic-guided Point-based Neural Radiance Fields. 570-575 - Hai Zhou, Zhe Xue, Ying Liu, Boang Li, Junping Du, Meiyu Liang:
RTMC: A Rubost Trusted Multi-View Classification Framework. 576-581 - Xinjiao Zhou, Bin Jiang, Chao Yang, Haotian Hu, Xiaofei Huo:
DF-CLIP: Towards Disentangled and Fine-grained Image Editing from Text. 582-587 - Changshuo Wang, Lei Wu, Xu Chen, Xiang Li, Lei Meng, Xiangxu Meng:
Letter Embedding Guidance Diffusion Model for Scene Text Editing. 588-593 - Rongyu Zhang, Yun Chen, Chenrui Wu, Fangxin Wang:
Cluster-driven GNN-based Federated Recommendation with Biased Message Dropout. 594-599 - Tianyu Huai, Shuwen Yang, Junhang Zhang, Guoan Wang, Xinru Yu, Tianlong Ma, Liang He:
SQT: Debiased Visual Question Answering via Shuffling Question Types. 600-605 - Shizhuo Deng, Chuangui Yang, Zhubao Guo, Boqian Lin, Dongyue Chen, Tong Jia, Botao Wang:
Fast Personalized Human Activity Recognition on Heuristic Parameter Estimation. 606-611 - Yaolong Ju, Chunyang Xu, Yichen Guo, Jinhu Li, Simon Lui:
Improving Automatic Singing Skill Evaluation with Timbral Features, Attention, and Singing Voice Separation. 612-617 - Han Guo, Yuanlong Yu, Yujie Wang, Xuelin Chen, Yixin Zhuang:
Learning High Frequency Surface Functions In Shells. 618-623 - Eli Lei, Jia Shao, Youfa Liu, Bo Du:
Multi-template Tracker Driven by Cache Manager Algorithm, Towards Multi-distractor Scenarios. 624-629 - Aoran Liu, Kun Hu, Wenxi Yue, Qiuxia Wu, Zhiyong Wang:
Material-Aware Self-Supervised Network for Dynamic 3D Garment Simulation. 630-635 - Yulin Wu, Ruimin Hu, Xiaochen Wang:
Multi-speaker Direction of Arrival Estimation Using Audio and Visual Modalities with Convolutional Neural Network. 636-641 - Jinxin Wang, Zhongwen Guo, Chao Yang, Xiaomei Li, Ziyuan Cui:
Multi-Scale Hybrid Fusion Network for Mandarin Audio-Visual Speech Recognition. 642-647 - Tianhan Liu, Zhuang Qi, Zitan Chen, Xiangxu Meng, Lei Meng:
Cross-Training with Prototypical Distillation for improving the generalization of Federated Learning. 648-653 - Mehdi Setayesh, Vincent W. S. Wong:
A Content-based Viewport Prediction Framework for 360° Video Using Personalized Federated Learning and Fusion Techniques. 654-659 - Chenrui Wu, Zexi Li, Fangxin Wang, Chao Wu:
Learning Cautiously in Federated Learning with Noisy and Heterogeneous Clients. 660-665 - Yulan Gao, Yansong Zhao, Han Yu:
Multi-Tier Client Selection for Mobile Federated Learning Networks. 666-671 - Chengyi Yang, Zhaoxiang Hou, Sheng Guo, Hui Chen, Zengxiang Li:
SWATM: Contribution-Aware Adaptive Federated Learning Framework Based on Augmented Shapley Values. 672-677 - Yiqiang Chen, Xiaodong Yang, Yuting He, Chunyan Miao, Piu Chan:
FedDBM: Federated Digital Biomarker for Detecting Parkinson's Disease Progress. 678-683 - Haihang Ruan, Feng Wang, Tongda Xu, Zhiyong Tan, Yan Wang:
MIXLIC: Mixing Global and Local Context Model for learned Image Compression. 684-689 - Ruoke Yan, Qian Yin, Xinfeng Zhang, Siwei Ma:
Model-Driven Compression for Digital Human Using Multi-Granularity Representations. 690-695