


38th AAAI 2024: Vancouver, Canada
- Michael J. Wooldridge, Jennifer G. Dy, Sriraam Natarajan:
Thirty-Eighth AAAI Conference on Artificial Intelligence, AAAI 2024, Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence, IAAI 2024, Fourteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2024, February 20-27, 2024, Vancouver, Canada. AAAI Press 2024
AAAI Technical Track on Application Domains
- Yongkang Wang, Xuan Liu, Feng Huang, Zhankun Xiong, Wen Zhang:
A Multi-Modal Contrastive Diffusion Model for Therapeutic Peptide Generation. 3-11 - Chen Bai, Jianwang Zhai, Yuzhe Ma, Bei Yu, Martin D. F. Wong:
Towards Automated RISC-V Microarchitecture Design with Reinforcement Learning. 12-20 - Shreyas Bhat Brahmavar, Ashwin Srinivasan, Tirtharaj Dash, Sowmya Ramaswamy Krishnan, Lovekesh Vig, Arijit Roy, Raviprasad Aduri:
Generating Novel Leads for Drug Discovery Using LLMs with Logical Feedback. 21-29 - Ying-Ying Chang, Wei-Yao Wang, Wen-Chih Peng:
SeGA: Preference-Aware Self-Contrastive Learning with Prompts for Anomalous User Detection on Twitter. 30-37 - Zhihao Chang, Linzhu Yu, Yanchao Xu, Wentao Hu:
Neural Embeddings for kNN Search in Biological Sequence. 38-45 - Haoyang Chen, Peiyan Sun, Qiyuan Song, Wanyuan Wang, Weiwei Wu, Wencan Zhang, Guanyu Gao, Yan Lyu:
i-Rebalance: Personalized Vehicle Repositioning for Supply Demand Balance. 46-54 - Le Cheng, Peican Zhu, Keke Tang, Chao Gao, Zhen Wang:
GIN-SD: Source Detection in Graphs with Incomplete Nodes via Positional Encoding and Attentive Fusion. 55-63 - Yoni Choukroun, Lior Wolf:
Deep Quantum Error Correction. 64-72 - Chaoqun Cui, Caiyan Jia:
Propagation Tree Is Not Deep: Adaptive Graph Contrastive Learning Approach for Rumor Detection. 73-81 - Longchao Da, Minquan Gao, Hao Mei, Hua Wei:
Prompt to Transfer: Sim-to-Real Transfer for Traffic Signal Control with Prompt Learning. 82-90 - Na Fan, Zeyue Tian, Amartansh Dubey, Samruddhi Deshmukh, Ross D. Murch, Qifeng Chen:
Multitarget Device-Free Localization via Cross-Domain Wi-Fi RSS Training Data and Attentional Prior Fusion. 91-99 - Haisong Gong, Weizhi Xu, Shu Wu, Qiang Liu, Liang Wang:
Heterogeneous Graph Reasoning for Fact Checking over Texts and Tables. 100-108 - Haisong Gong, Qiang Liu, Shu Wu, Liang Wang:
Text-Guided Molecule Generation with Diffusion Language Model. 109-117 - Jiazhi Guan, Yi Zhao, Zhuoer Xu, Changhua Meng, Ke Xu, Youjian Zhao:
Adversarial Robust Safeguard for Evading Deep Facial Manipulation. 118-126 - Dongyue Guo, Zheng Zhang, Zhen Yan, Jianwei Zhang, Yi Lin:
FlightBERT++: A Non-autoregressive Multi-Horizon Flight Trajectory Prediction Framework. 127-134 - Hongcheng Guo, Jian Yang, Jiaheng Liu, Jiaqi Bai, Boyang Wang, Zhoujun Li, Tieqiao Zheng, Bo Zhang, Junran Peng, Qi Tian:
LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection. 135-143 - Zhi Jin, Sheng Xu, Xiang Zhang, Tianze Ling, Nanqing Dong, Wanli Ouyang, Zhiqiang Gao, Cheng Chang, Siqi Sun:
ContraNovo: A Contrastive Learning Approach to Enhance De Novo Peptide Sequencing. 144-152 - Seungjun Lee, Taeil Oh:
Inducing Point Operator Transformer: A Flexible and Scalable Architecture for Solving PDEs. 153-161 - Tong Li, Zhaoyang Liu, Yanyan Shen, Xue Wang, Haokun Chen, Sen Huang:
MASTER: Market-Guided Stock Transformer for Stock Price Forecasting. 162-170 - Yanhong Li, Jack Xu, David C. Anastasiu:
Learning from Polar Representation: An Extreme-Adaptive Model for Long-Term Time Series Forecasting. 171-179 - Yijun Li, Cheuk Hang Leung, Xiangqian Sun, Chaoqun Wang, Yiyan Huang, Xing Yan, Qi Wu, Dongdong Wang, Zhixiang Huang:
The Causal Impact of Credit Lines on Spending Distributions. 180-187 - Zhengyi Li, Menglu Li, Lida Zhu, Wen Zhang:
Improving PTM Site Prediction by Coupling of Multi-Granularity Structure and Multi-Scale Sequence Representation. 188-196 - Minghui Liao, Guojia Wan, Bo Du:
Joint Learning Neuronal Skeleton and Brain Circuit Topology with Permutation Invariant Encoders for Neuron Classification. 197-205 - Cheng-Ming Lin, Ching Chang, Wei-Yao Wang, Kuang-Da Wang, Wen-Chih Peng:
Root Cause Analysis in Microservice Using Neural Granger Causal Discovery. 206-213 - Shengheng Liu, Xingkang Li, Zihuan Mao, Peng Liu, Yongming Huang:
Model-Driven Deep Neural Network for Enhanced AoA Estimation Using 5G gNB. 214-221 - Jesung Ryu, Seungyeon Rhyu, Hong-Gyu Yoon, Eunchong Kim, Ju Young Yang, Taehyun Kim:
MID-FiLD: MIDI Dataset for Fine-Level Dynamics. 222-230 - Rui She, Sijie Wang, Qiyu Kang, Kai Zhao, Yang Song, Wee Peng Tay, Tianyu Geng, Xingchao Jian:
PosDiffNet: Positional Neural Diffusion for Point Cloud Registration in a Large Field of View with Perturbations. 231-239 - Wenkang Su, Jiangqun Ni, Yiyan Sun:
StegaStyleGAN: Towards Generic and Practical Generative Image Steganography. 240-248 - Xiaorui Su, Pengwei Hu, Zhu-Hong You, Philip S. Yu, Lun Hu:
Dual-Channel Learning Framework for Drug-Drug Interaction Prediction via Relation-Aware Heterogeneous Graph Transformer. 249-256 - Sally Turutov, Kira Radinsky:
Molecular Optimization Model with Patentability Constraint. 257-264 - Jiquan Wang, Sha Zhao, Haiteng Jiang, Shijian Li, Tao Li, Gang Pan:
Generalizable Sleep Staging via Multi-Level Domain Alignment. 265-273 - Tong Wang, Yuan Yao, Feng Xu, Miao Xu, Shengwei An, Ting Wang:
Inspecting Prediction Confidence for Detecting Black-Box Backdoor Attacks. 274-282 - Yingheng Wang, Shufeng Kong, John M. Gregoire, Carla P. Gomes:
Conformal Crystal Graph Transformer with Robust Encoding of Periodic Invariance. 283-291 - Yu Wang, Xiaoye Wang, Zaiwang Gu, Weide Liu, Wee Siong Ng, Weimin Huang, Jun Cheng:
SuperJunction: Learning-Based Junction Detection for Retinal Image Registration. 292-300 - Zilin Wang, Haolin Zhuang, Lu Li, Yinmin Zhang, Junjie Zhong, Jun Chen, Yu Yang, Boshi Tang, Zhiyong Wu:
Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations. 301-309 - Lirong Wu, Yufei Huang, Cheng Tan, Zhangyang Gao, Bozhen Hu, Haitao Lin, Zicheng Liu, Stan Z. Li:
PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for Efficient and Generalizable Compound-Protein Interaction Prediction. 310-319 - Tailin Wu, Willie Neiswanger, Hongtao Zheng, Stefano Ermon, Jure Leskovec:
Uncertainty Quantification for Forward and Inverse Problems of PDEs via Latent Global Evolution. 320-328 - Zhousan Xie, Shikui Tu, Lei Xu:
Multilevel Attention Network with Semi-supervised Domain Adaptation for Drug-Target Prediction. 329-337 - Can Xu, Haosen Wang, Weigang Wang, Pengfei Zheng, Hongyang Chen:
Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generation. 338-346 - Shu Yin, Peican Zhu, Lianwei Wu, Chao Gao, Zhen Wang:
GAMC: An Unsupervised Method for Fake News Detection Using Graph Autoencoder with Masking. 347-355 - Jixiang Yu, Nanjun Chen, Ming Gao, Xiangtao Li, Ka-Chun Wong:
Unsupervised Gene-Cell Collective Representation Learning with Optimal Transport. 356-364 - Shuai Yu:
MCSSME: Multi-Task Contrastive Learning for Semi-supervised Singing Melody Extraction from Polyphonic Music. 365-373 - Yemin Yu, Luotian Yuan, Ying Wei, Hanyu Gao, Fei Wu, Zhihua Wang, Xinhai Ye:
RetroOOD: Understanding Out-of-Distribution Generalization in Retrosynthesis Prediction. 374-382 - Xi Zeng, Xiaotian Hao, Hongyao Tang, Zhentao Tang, Shaoqing Jiao, Dazhi Lu, Jiajie Peng:
Designing Biological Sequences without Prior Knowledge Using Evolutionary Reinforcement Learning. 383-391 - Xianghua Zeng, Hao Peng, Angsheng Li:
Adversarial Socialbots Modeling Based on Structural Information Principles. 392-400 - Hongbo Zhang, Guang Wang, Xu Wang, Zhengyang Zhou, Chen Zhang, Zheng Dong, Yang Wang:
NondBREM: Nondeterministic Offline Reinforcement Learning for Large-Scale Order Dispatching. 401-409 - Jialu Zhang, Xiaoying Yang, Wentao He, Jianfeng Ren, Qian Zhang, Yitian Zhao, Ruibin Bai, Xiangjian He, Jiang Liu:
Scale Optimization Using Evolutionary Reinforcement Learning for Object Detection on Drone Imagery. 410-418 - Rui-Xiao Zhang, Tianchi Huang:
Adversarial Attacks on Federated-Learned Adaptive Bitrate Algorithms. 419-427 - Jian Zhu, Congcong Liu, Xue Jiang, Changping Peng, Zhangang Lin, Jingping Shao:
Generalize for Future: Slow and Fast Trajectory Learning for CTR Prediction. 428-436 - Yuqi Zhu, Jia Li, Ge Li, Yunfei Zhao, Jia Li, Zhi Jin, Hong Mei:
Hot or Cold? Adaptive Temperature Sampling for Code Generation with Large Language Models. 437-445
AAAI Technical Track on Cognitive Modeling & Cognitive Systems
- Paul M. Bodily, Dan Ventura:
Operationalizing Essential Characteristics of Creativity in a Computational System for Music Composition. 447-455 - Matteo Bortoletto, Lei Shi, Andreas Bulling:
Neural Reasoning about Agents' Goals, Preferences, and Actions. 456-464 - Min Cao, Yang Bai, Ziyin Zeng, Mang Ye, Min Zhang:
An Empirical Study of CLIP for Text-Based Person Search. 465-473 - Hongyi Chen, Jingtao Ding, Yong Li, Yue Wang, Xiao-Ping Zhang:
Social Physics Informed Diffusion Model for Crowd Simulation. 474-482 - Yingjie Chen, Jiarui Zhang, Tao Wang, Yun Liang:
Trend-Aware Supervision: On Learning Invariance for Semi-supervised Facial Action Unit Intensity Estimation. 483-491 - Jianhao Ding, Zhaofei Yu, Tiejun Huang, Jian K. Liu:
Enhancing the Robustness of Spiking Neural Networks with Stochastic Gating Mechanisms. 492-502 - Hen Emuna, Nadav Borenstein, Xin Qian, Hyeonsu B. Kang, Joel Chan, Aniket Kittur, Dafna Shahaf:
Imitation of Life: A Search Engine for Biologically Inspired Design. 503-511 - Xiang He, Dongcheng Zhao, Yang Li, Guobin Shen, Qingqun Kong, Yi Zeng:
An Efficient Knowledge Transfer Strategy for Spiking Neural Networks from Static to Event Domain. 512-520 - Zhejing Hu, Yan Liu, Gong Chen, Xiao Ma, Shenghua Zhong, Qianwen Luo:
Responding to the Call: Exploring Automatic Music Composition Using a Knowledge-Enhanced Model. 521-529 - Kunal Jha, Tuan Anh Le, Chuanyang Jin, Yen-Ling Kuo, Joshua B. Tenenbaum, Tianmin Shu:
Neural Amortized Inference for Nested Multi-Agent Reasoning. 530-537 - Shu Li, Ruimin Hu, Suhui Li, Liang Liao:
Hidden Follower Detection: How Is the Gaze-Spacing Pattern Embodied in Frequency Domain? 538-546 - Sifei Li, Yuxin Zhang, Fan Tang, Chongyang Ma, Weiming Dong, Changsheng Xu:
Music Style Transfer with Time-Varying Inversion of Diffusion Models. 547-555 - Han Lu, Xiahai Zhuang, Qiang Luo:
A Brain-Inspired Way of Reducing the Network Complexity via Concept-Regularized Coding for Emotion Recognition. 556-564 - Bingjun Luo, Zewen Wang, Jinpeng Wang, Junjie Zhu, Xibin Zhao, Yue Gao:
Multi-Energy Guided Image Translation with Stochastic Differential Equations for Near-Infrared Facial Expression Recognition. 565-573 - Gehua Ma, He Wang, Jingyuan Zhao, Rui Yan, Huajin Tang:
Successive POI Recommendation via Brain-Inspired Spatiotemporal Aware Representation. 574-582 - Yuanyuan Mao, Xin Lin, Qin Ni, Liang He:
BDIQA: A New Dataset for Video Question Answering to Explore Cognitive Reasoning through Theory of Mind. 583-591 - Junseok Park, Yoonsung Kim, Hee bin Yoo, Min Whoo Lee, Kibeom Kim, Won-Seok Choi, Minsu Lee, Byoung-Tak Zhang:
Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning. 592-600 - Xuerui Qiu, Rui-Jie Zhu, Yuhong Chou, Zhaorui Wang, Liang-Jian Deng, Guoqi Li:
Gated Attention Coding for Training High-Performance and Efficient Spiking Neural Networks. 601-610 - Jiangrong Shen, Wenyao Ni, Qi Xu, Huajin Tang:
Efficient Spiking Neural Networks with Sparse Selective Activation for Continual Learning. 611-619 - Shanshan Wang, Zhen Zeng, Xun Yang, Ke Xu, Xingyi Zhang:
Boosting Neural Cognitive Diagnosis with Student's Affective State Modeling. 620-627 - Yiming Wang, Bin Zhang, Yujiao Tang:
DMMR: Cross-Subject Domain Generalization for EEG-Based Emotion Recognition via Denoising Mixed Mutual Reconstruction. 628-636 - Jiyuan Zhang, Shiyan Chen, Yajing Zheng, Zhaofei Yu, Tiejun Huang:
Transient Glimpses: Unveiling Occluded Backgrounds through the Spike Camera. 637-645 - Yuhang Zhang, Yue Yao, Xuannan Liu, Lixiong Qin, Wenjing Wang, Weihong Deng:
Open-Set Facial Expression Recognition. 646-654 - Feiyu Zhu, Reid G. Simmons:
Bootstrapping Cognitive Agents with a Large Language Model. 655-663 - Yangfu Zhu, Yue Xia, Meiling Li, Tingting Zhang, Bin Wu:
Data Augmented Graph Neural Networks for Personality Detection. 664-672
AAAI Technical Track on Computer Vision I
- Namhyuk Ahn, Junsoo Lee, Chunggi Lee, Kunhee Kim, Daesik Kim, Seung-Hun Nam, Kibeom Hong:
DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models. 674-681 - Seungjun An, Seonghoon Park, Gyeongnyeon Kim, Jeongyeol Baek, Byeongwon Lee, Seungryong Kim:
Context Enhanced Transformer for Single Image Object Detection in Video Data. 682-690 - Xiaoqi An, Lin Zhao, Chen Gong, Nannan Wang, Di Wang, Jian Yang:
SHaRPose: Sparse High-Resolution Representation for Human Pose Estimation. 691-699 - Anastasia Antsiferova, Khaled Abud, Aleksandr Gushchin, Ekaterina Shumitskaya, Sergey Lavrushkin, Dmitriy S. Vatolin:
Comparing the Robustness of Modern No-Reference Image- and Video-Quality Metrics to Adversarial Attacks. 700-708 - Srikar Appalaraju, Peng Tang, Qi Dong, Nishant Sankaran, Yichu Zhou, R. Manmatha:
DocFormerv2: Local Features for Document Understanding. 709-718 - Zhongjie Ba, Qingyu Liu, Zhenguang Liu, Shuang Wu, Feng Lin, Li Lu, Kui Ren:
Exposing the Deception: Uncovering More Forgery Clues for Deepfake Detection. 719-728 - Shuanghao Bai, Min Zhang, Wanqi Zhou, Siteng Huang, Zhirong Luan, Donglin Wang, Badong Chen:
Prompt-Based Distribution Alignment for Unsupervised Domain Adaptation. 729-737 - Peijun Bao, Yong Xia, Wenhan Yang, Boon Poh Ng, Meng Hwa Er, Alex C. Kot:
Local-Global Multi-Modal Distillation for Weakly-Supervised Temporal Video Grounding. 738-746 - Peijun Bao, Zihao Shao, Wenhan Yang, Boon Poh Ng, Meng Hwa Er, Alex C. Kot:
Omnipotent Distillation with LLMs for Weakly-Supervised Natural Language Video Localization: When Divergence Meets Consistency. 747-755 - Qiqi Bao, Zheng Hui, Rui Zhu, Peiran Ren, Xuansong Xie, Wenming Yang:
Improving Diffusion-Based Image Restoration with Error Contraction and Error Correction. 756-764 - Xiaoyi Bao, Jie Qin, Siyang Sun, Xingang Wang, Yun Zheng:
Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation. 765-773 - Mazal Bethany, Brandon Wherry, Nishant Vishwamitra, Peyman Najafirad:
Image Safeguarding: Reasoning with Conditional Vision Language Model and Obfuscating Unsafe Content Counterfactually. 774-782 - Aneesh Bhattacharya, Manas Paranjape, Uttaran Bhattacharya, Aniket Bera:
DanceAnyWay: Synthesizing Beat-Guided 3D Dances with Randomized Temporal Contrastive Learning. 783-791 - Swapnil Bhosale, Sauradip Nag, Diptesh Kanojia, Jiankang Deng, Xiatian Zhu:
DiffSED: Sound Event Detection with Denoising Diffusion. 792-800 - Qi Bi, Shaodi You, Theo Gevers:
Learning Generalized Segmentation for Foggy-Scenes by Bi-directional Wavelet Guidance. 801-809 - Qi Bi, Jingjun Yi, Hao Zheng, Wei Ji, Yawen Huang, Yuexiang Li, Yefeng Zheng:
Learning Generalized Medical Image Segmentation from Decoupled Feature Queries. 810-818 - Qi Bi, Shaodi You, Theo Gevers:
Learning Content-Enhanced Mask Transformer for Domain Generalized Urban-Scene Segmentation. 819-827 - Siyuan Bian, Jiefeng Li, Jiasheng Tang, Cewu Lu:
ShapeBoost: Boosting Human Shape Estimation with Part-Based Parameterization and Clothing-Preserving Augmentation. 828-836 - Yequan Bie, Luyang Luo, Hao Chen:
MICA: Towards Explainable Skin Lesion Diagnosis via Multi-Level Image-Concept Alignment. 837-845 - Alexander Black, Jing Shi, Yifei Fan, Tu Bui, John P. Collomosse:
VIXEN: Visual Text Comparison Network for Image Difference Captioning. 846-854 - Qingwen Bu, Sungrae Park, Minsoo Khang, Yichuan Cheng:
SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression. 855-863 - Pingping Cai, Deja Scott, Xiaoguang Li, Song Wang:
Orthogonal Dictionary Guided Shape Completion Network for Point Cloud. 864-872 - Qing Cai, Mu Li, Dongwei Ren, Jun Lyu, Haiyong Zheng, Junyu Dong, Yee-Hong Yang:
Spherical Pseudo-Cylindrical Representation for Omnidirectional Image Super-resolution. 873-881 - Qingyuan Cai, Xuecai Hu, Saihui Hou, Li Yao, Yongzhen Huang:
Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser. 882-890 - Xiuding Cai, Yaoyao Zhu, Dong Miao, Linjie Fu, Yu Yao:
Rethinking the Paradigm of Content Constraints in Unpaired Image-to-Image Translation. 891-899 - Yanlu Cai, Weizhong Zhang, Yuan Wu, Cheng Jin:
FusionFormer: A Concise Unified Feature Fusion Transformer for 3D Pose Estimation. 900-908 - Yufei Cai, Yuxiang Wei, Zhilong Ji, Jinfeng Bai, Hu Han, Wangmeng Zuo:
Decoupled Textual Embeddings for Customized Image Generation. 909-917 - Zikui Cai, Zhongpai Gao, Benjamin Planche, Meng Zheng, Terrence Chen, M. Salman Asif, Ziyan Wu:
Disguise without Disruption: Utility-Preserving Face De-identification. 918-926 - Bing Cao, Junliang Guo, Pengfei Zhu, Qinghua Hu:
Bi-directional Adapter for Multimodal Tracking. 927-935 - Qinglong Cao, Zhengqin Xu, Yuntian Chen, Chao Ma, Xiaokang Yang:
Domain-Controlled Prompt Learning. 936-944 - Yuxin Cao, Ziyu Zhao, Xi Xiao, Derui Wang, Minhui Xue, Jin Lu:
LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer. 945-953 - Junghun Cha, Ali Haider, Seoyun Yang, Hoeyeong Jin, Subin Yang, A. F. M. Shahab Uddin, Jaehyoung Kim, Soo Ye Kim, Sung-Ho Bae:
Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model. 954-963 - Kennard Yanting Chan, Fayao Liu, Guosheng Lin, Chuan Sheng Foo, Weisi Lin:
Fine Structure-Aware Sampling: A New Sampling Training Scheme for Pixel-Aligned Implicit Models in Single-View Human Reconstruction. 964-971 - Gyusam Chang, Wonseok Roh, Sujin Jang, Dongwook Lee, Daehyun Ji, Gyeongrok Oh, Jinsun Park, Jinkyu Kim, Sangpil Kim:
CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D Object Detection. 972-980 - Qing Chang, Yifei Tong:
A Hybrid Global-Local Perception Network for Lane Detection. 981-989 - Bo-Yu Chen, Wei-Chen Chiu, Yu-Lun Liu:
Improving Robustness for Joint Optimization of Camera Pose and Decomposed Low-Rank Tensorial Radiance Fields. 990-1000 - Chao Chen, Jie Liu, Chang Zhou, Jie Tang, Gangshan Wu:
Sketch and Refine: Towards Fast and Accurate Lane Detection. 1001-1009 - Chaofeng Chen, Shangchen Zhou, Liang Liao, Haoning Wu, Wenxiu Sun, Qiong Yan, Weisi Lin:
Iterative Token Evaluation and Refinement for Real-World Super-resolution. 1010-1018 - Dalong Chen, Jianjia Zhang, Wei-Shi Zheng, Ruixuan Wang:
FeatWalk: Enhancing Few-Shot Classification through Local View Leveraging. 1019-1027 - Dengsheng Chen, Jie Hu, Xiaoming Wei, Enhua Wu:
Real3D: The Curious Case of Neural Scene Degeneration. 1028-1036 - Honghao Chen, Xiangwen Kong, Xiangyu Zhang, Xin Zhao, Kaiqi Huang:
DDAE: Towards Deep Dynamic Vision BERT Pretraining. 1037-1045 - Hongming Chen, Xiang Chen, Jiyang Lu, Yufeng Li:
Rethinking Multi-Scale Representations in Deep Deraining Transformer. 1046-1053 - Hongxu Chen, Quan Zhang, Jian-Huang Lai, Xiaohua Xie:
Unsupervised Group Re-identification via Adaptive Clustering-Driven Progressive Learning. 1054-1062 - Hongyang Chen, Hung-Shuo Tai, Kaisheng Ma:
Guiding a Harsh-Environments Robust Detector via RAW Data Characteristic Mining. 1063-1071 - Hongyang Chen, Kaisheng Ma:
CutFreq: Cut-and-Swap Frequency Components for Low-Level Vision Augmentation. 1072-1080 - Jiacheng Chen, Jiawei Jiang, Fei Wu, Jianwei Zheng:
Null Space Matters: Range-Null Decomposition for Consistent Multi-Contrast MRI Reconstruction. 1081-1090 - Jiafu Chen, Wei Xing, Jiakai Sun, Tianyi Chu, Yiling Huang, Boyan Ji, Lei Zhao, Huaizhong Lin, Haibo Chen, Zhizhong Wang:
PNeSM: Arbitrary 3D Scene Stylization via Prompt-Based Neural Style Mapping. 1091-1099 - Jiankang Chen, Tong Zhang, Wei-Shi Zheng, Ruixuan Wang:
TagFog: Textual Anchor Guidance and Fake Outlier Generation for Visual Out-of-Distribution Detection. 1100-1109 - Junyi Chen, Longteng Guo, Jia Sun, Shuai Shao, Zehuan Yuan, Liang Lin, Dongyu Zhang:
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE. 1110-1119 - Kaitao Chen, Shiliang Sun, Jing Zhao:
CaMIL: Causal Multiple Instance Learning for Whole Slide Image Classification. 1120-1128 - Lianggangxu Chen, Youqi Song, Yiqing Cai, Jiale Lu, Yang Li, Yuan Xie, Changbo Wang, Gaoqi He:
Multi-Prototype Space Learning for Commonsense-Based Scene Graph Generation. 1129-1137 - Lianggangxu Chen, Youqi Song, Shaohui Lin, Changbo Wang, Gaoqi He:
Kumaraswamy Wavelet for Heterophilic Scene Graph Generation. 1138-1146 - Lin Chen, Zhijie Jia, Lechao Cheng, Yang Gao, Jie Lei, Yijun Bei, Zunlei Feng:
ViT-Calibrator: Decision Stream Calibration for Vision Transformer. 1147-1155 - Linsheng Chen, Guangrun Wang, Liuchun Yuan, Keze Wang, Ken Deng, Philip H. S. Torr:
NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning. 1156-1164 - Qi Chen, Dileepa Pitawela, Chongyang Zhao, Gengze Zhou, Hsiang-Ting Chen, Qi Wu:
WebVLN: Vision-and-Language Navigation on Websites. 1165-1173 - Qihua Chen, Xuejin Chen, Chenxuan Wang, Yixiong Liu, Zhiwei Xiong, Feng Wu:
Learning Multimodal Volumetric Features for Large-Scale Neuron Tracing. 1174-1182 - Siran Chen, Yue Ma, Yu Qiao, Yali Wang:
M-BEV: Masked BEV Perception for Robust Autonomous Driving. 1183-1191 - Taiyan Chen, Xianghua Ying, Jinfa Yang, Ruibin Wang, Ruohao Guo, Bowei Xing, Ji Shi:
VPDETR: End-to-End Vanishing Point DEtection TRansformers. 1192-1200 - Tianxiang Chen, Zhentao Tan, Qi Chu, Yue Wu, Bin Liu, Nenghai Yu:
TCI-Former: Thermal Conduction-Inspired Transformer for Infrared Small Target Detection. 1201-1209 - Xuanhong Chen, Hang Wang, Jialiang Chen, Kairui Feng, Jinfan Liu, Xiaohang Wang, Weimin Zhang, Bingbing Ni:
Intrinsic Phase-Preserving Networks for Depth Super Resolution. 1210-1218 - Xuyang Chen, Dong Wang, Konrad Schindler, Mingwei Sun, Yongliang Wang, Nicoló Savioli, Liqiu Meng:
Box2Poly: Memory-Efficient Polygon Prediction of Arbitrarily Shaped and Rotated Text. 1219-1227 - Yanzhe Chen, Huasong Zhong, Xiangteng He, Yuxin Peng, Jiahuan Zhou, Lele Cheng:
FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval. 1228-1236 - Yiwen Chen, Chi Zhang, Xiaofeng Yang, Zhongang Cai, Gang Yu, Lei Yang, Guosheng Lin:
IT3D: Improved Text-to-3D Generation with Explicit View Synthesis. 1237-1244 - Yujun Chen, Xin Tan, Zhizhong Zhang, Yanyun Qu, Yuan Xie:
Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation. 1245-1253 - Zhenfang Chen, Qinhong Zhou, Yikang Shen, Yining Hong, Zhiqing Sun, Dan Gutfreund, Chuang Gan:
Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning. 1254-1262 - Zhengrui Chen, Liying Lu, Ziyang Yuan, Yiming Zhu, Yu Li, Chun Yuan, Weihong Deng:
Blind Face Restoration under Extreme Conditions: Leveraging 3D-2D Prior Fusion for Superior Structural and Texture Recovery. 1263-1271 - Zhongxi Chen, Ke Sun, Xianming Lin:
CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models. 1272-1280 - Zhuowei Chen, Shancheng Fang, Wei Liu, Qian He, Mengqi Huang, Zhendong Mao:
DreamIdentity: Enhanced Editability for Efficient Face-Identity Preserved Image Generation. 1281-1289 - Zida Chen, Ziran Zhang, Haoying Li, Menghao Li, Yueting Chen, Qi Li, Huajun Feng, Zhihai Xu, Shiqi Chen:
Deep Linear Array Pushbroom Image Restoration: A Degradation Pipeline and Jitter-Aware Restoration Network. 1290-1298 - Ri Cheng, Ruian He, Xuhao Jiang, Shili Zhou, Weimin Tan, Bo Yan:
Context-Aware Iteration Policy Network for Efficient Optical Flow Estimation. 1299-1307 - Weihao Cheng, Yan-Pei Cao, Ying Shan:
SparseGNV: Generating Novel Views of Indoor Scenes with Sparse RGB-D Images. 1308-1316 - Yean Cheng, Renjie Wan, Shuchen Weng, Chengxuan Zhu, Yakun Chang, Boxin Shi:
Colorizing Monochromatic Radiance Fields. 1317-1325 - Zesen Cheng, Kehan Li, Peng Jin, Siheng Li, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen:
Parallel Vertex Diffusion for Unified Visual Grounding. 1326-1334 - Dongmin Choi, Wonwoo Cho, Kangyeol Kim, Jaegul Choo:
iDet3D: Towards Efficient Interactive Object Detection for LiDAR Point Clouds. 1335-1343 - Jae-Ho Choi, Ki-Bong Kang, Kyung-Tae Kim:
Fusion-Vital: Video-RF Fusion Transformer for Advanced Remote Physiological Measurement. 1344-1352 - Ernie Chu, Tzuhsuan Huang, Shuo-Yen Lin, Jun-Cheng Chen:
MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance. 1353-1361 - Tianyi Chu, Wei Xing, Jiafu Chen, Zhizhong Wang, Jiakai Sun, Lei Zhao, Haibo Chen, Huaizhong Lin:
Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation. 1362-1370 - Marcos V. Conde, Javier Vazquez-Corral, Michael S. Brown, Radu Timofte:
NILUT: Conditional Neural Implicit 3D Lookup Tables for Image Enhancement. 1371-1379 - Cong Cong, Shiyu Xuan, Sidong Liu, Shiliang Zhang, Maurice Pagnucco, Yang Song:
Decoupled Optimisation for Long-Tailed Visual Recognition. 1380-1388 - Xiaofeng Cong, Jie Gui, Junming Hou:
Underwater Organism Color Fine-Tuning via Decomposition and Guidance. 1389-1398 - Mengyao Cui, Zhigang Wang, Dong Wang, Bin Zhao, Xuelong Li:
Color Event Enhanced Single-Exposure HDR Imaging. 1399-1407 - Wenting Cui, Runzhao Yao, Shaoyi Du:
PHFormer: Multi-Fragment Assembly Using Proxy-Level Hybrid Transformer. 1408-1416 - Xiaohan Cui, Long Ma, Tengyu Ma, Jinyuan Liu, Xin Fan, Risheng Liu:
Trash to Treasure: Low-Light Object Detection via Decomposition-and-Aggregation. 1417-1425 - Yuning Cui, Wenqi Ren, Alois Knoll:
Omni-Kernel Network for Image Restoration. 1426-1434 - Ziteng Cui, Lin Gu, Xiao Sun, Xianzheng Ma, Yu Qiao, Tatsuya Harada:
Aleth-NeRF: Illumination Adaptive NeRF with Concealing Field Assumption. 1435-1444 - Qian Dai, Dong Wei, Hong Liu, Jinghan Sun, Liansheng Wang, Yefeng Zheng:
Federated Modality-Specific Encoders and Multimodal Anchors for Personalized Brain Tumor Segmentation. 1445-1453 - Songmin Dai, Yifan Wu, Xiaoqiang Li, Xiangyang Xue:
Generating and Reweighting Dense Contrastive Patterns for Unsupervised Anomaly Detection. 1454-1462 - Zhuohang Dang, Minnan Luo, Chengyou Jia, Guang Dai, Xiaojun Chang, Jingdong Wang:
Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation. 1463-1471 - Duolikun Danier, Fan Zhang, David Bull:
LDMVFI: Video Frame Interpolation with Latent Diffusion Models. 1472-1480 - Ishan Rajendrakumar Dave, Simon Jenni, Mubarak Shah:
No More Shortcuts: Realizing the Potential of Temporal Self-Supervision. 1481-1491 - Yongjian Deng, Hao Chen, Youfu Li:
A Dynamic GCN with Cross-Representation Distillation for Event-Based Learning. 1492-1500 - Yuxin Deng, Kaining Zhang, Shihua Zhang, Yansheng Li, Jiayi Ma:
ResMatch: Residual Attention Learning for Feature Matching. 1501-1509 - Yuxin Deng, Jiayi Ma:
SDGMNet: Statistic-Based Dynamic Gradient Modulation for Local Descriptor Learning. 1510-1518 - Shanding Diao, Yuan Chen, Yang Zhao, Wei Jia, Zhao Zhang, Ronggang Wang:
Stereo Vision Conversion from Planar Videos Based on Temporal Multiplane Images. 1519-1527 - Kun Ding, Haojian Zhang, Qiang Yu, Ying Wang, Shiming Xiang, Chunhong Pan:
Weak Distribution Detectors Lead to Stronger Generalizability of Vision-Language Prompt Tuning. 1528-1536 - Pengxiang Ding, Qiongjie Cui, Haofan Wang, Min Zhang, Mengyuan Liu, Donglin Wang:
Expressive Forecasting of 3D Whole-Body Human Motions. 1537-1545 - Xinlong Ding, Jiansheng Chen, Hongwei Yu, Yu Shang, Yining Qin, Huimin Ma:
Transferable Adversarial Attacks for Object Detection Using Object-Aware Significant Feature Distortion. 1546-1554 - Thang Doan, Xin Li, Sima Behpour, Wenbin He, Liang Gou, Liu Ren:
Hyp-OW: Exploiting Hierarchical Structure Learning with Hyperbolic Distance Enhances Open World Object Detection. 1555-1563 - Wen Dong, Haiyang Mei, Ziqi Wei, Ao Jin, Sen Qiu, Qiang Zhang, Xin Yang:
Exploiting Polarized Material Cues for Robust Car Detection. 1564-1572 - Wenqian Dong, Yang Xu, Jiahui Qu, Shaoxiong Hou:
Learning Multi-Modal Cross-Scale Deformable Transformer Network for Unregistered Hyperspectral Image Super-resolution. 1573-1581 - Yanchen Dong, Ruiqin Xiong, Jing Zhao, Jian Zhang, Xiaopeng Fan, Shuyuan Zhu, Tiejun Huang:
Joint Demosaicing and Denoising for Spike Camera. 1582-1590 - Yi Dong, Yuxi Wang, Ruoxi Fan, Wenqi Ouyang, Zhiqi Shen, Peiran Ren, Xuansong Xie:
ChromaFusionNet (CFNet): Natural Fusion of Fine-Grained Color Editing. 1591-1599 - Yilan Dong, Chunlin Yu, Ruiyang Ha, Ye Shi, Yuexin Ma, Lan Xu, Yanwei Fu, Jingya Wang:
HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid Explorations. 1600-1608 - Yue-Jiang Dong, Yuan-Chen Guo, Ying-Tian Liu, Fang-Lue Zhang, Song-Hai Zhang:
PPEA-Depth: Progressive Parameter-Efficient Adaptation for Self-Supervised Monocular Depth Estimation. 1609-1617 - Chenghu Du, Junyin Wang, Yi Rong, Shuqing Liu, Kai Liu, Shengwu Xiong:
CycleVTON: A Cycle Mapping Framework for Parser-Free Virtual Try-On. 1618-1625 - Hang Du, Xuejun Yan, Jingjing Wang, Di Xie, Shiliang Pu:
Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning. 1626-1634 - Zhenjiang Du, Jiale Dou, Zhitao Liu, Jiwei Wei, Guan Wang, Ning Xie, Yang Yang:
CDPNet: Cross-Modal Dual Phases Network for Point Cloud Completion. 1635-1643 - Xiaoyue Duan, Shuhao Cui, Guoliang Kang, Baochang Zhang, Zhengcong Fei, Mingyuan Fan, Junshi Huang:
Tuning-Free Inversion-Enhanced Control for Consistent Image Editing. 1644-1652 - Yuxuan Duan, Li Niu, Yan Hong, Liqing Zhang:
WeditGAN: Few-Shot Image Generation via Latent Space Relocation. 1653-1661 - Chao Fan, Jingzhe Ma, Dongyang Jin, Chuanfu Shen, Shiqi Yu:
SkeletonGait: Gait Recognition Using Skeleton Maps. 1662-1669 - Yang Fan, Xiangping Wu, Qingcai Chen, Heng Li, Yan Huang, Zhixiang Cai, Qitian Wu:
TDeLTA: A Light-Weight and Robust Table Detection Method Based on Learning Text Arrangement. 1670-1678 - Yeying Fan, Guangshun Wei, Chen Wang, Shaojie Zhuang, Wenping Wang, Yuanfeng Zhou:
Collaborative Tooth Motion Diffusion Model in Digital Orthodontics. 1679-1687 - Zhaoxin Fan, Longbin Ji, Pengxin Xu, Fan Shen, Kai Chen:
Everything2Motion: Synchronizing Diverse Inputs via a Unified Framework for Human Motion Synthesis. 1688-1697 - Chaowei Fang, Ziyin Zhou, Junye Chen, Hanjing Su, Qingyao Wu, Guanbin Li:
Variance-Insensitive and Target-Preserving Mask Refinement for Interactive Image Segmentation. 1698-1706 - Qihang Fang, Yafei Song, Keqiang Li, Li Shen, Huaiyu Wu, Gang Xiong, Liefeng Bo:
Evaluate Geometry of Radiance Fields with Low-Frequency Color Prior. 1707-1715 - Ruohuan Fang, Guansong Pang, Xiao Bai:
Simple Image-Level Classification Improves Open-Vocabulary Object Detection. 1716-1725 - Shaoheng Fang, Zuhong Liu, Mingyu Wang, Chenxin Xu, Yiqi Zhong, Siheng Chen:
Self-Supervised Bird's Eye View Motion Prediction with Cross-Modality Signals. 1726-1734 - Xiang Fang, Daizong Liu, Wanlong Fang, Pan Zhou, Zichuan Xu, Wenzheng Xu, Junyang Chen, Renfu Li:
Fewer Steps, Better Performance: Efficient Cross-Modal Clip Trimming for Video Moment Retrieval Using Language. 1735-1743 - Zhixue Fang, Xinrong Guo, Jingyin Lin, Huisi Wu, Jing Qin:
An Embedding-Unleashing Video Polyp Segmentation Framework via Region Linking and Scale Alignment. 1744-1752 - Juexiao Feng, Yuhong Yang, Yanchun Xie, Yaqian Li, Yandong Guo, Yuchen Guo, Yuwei He, Liuyu Xiang, Guiguang Ding:
Debiased Novel Category Discovering and Localization. 1753-1760 - Tuo Feng, Ruijie Quan, Xiaohan Wang, Wenguan Wang, Yi Yang:
Interpretable3D: An Ad-Hoc Interpretable Classifier for 3D Point Clouds. 1761-1769 - Hui Fu, Zeqing Wang, Ke Gong, Keze Wang, Tianshui Chen, Haojie Li, Haifeng Zeng, Wenxiong Kang:
Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation. 1770-1777
AAAI Technical Track on Computer Vision II
- Qijun Gan, Wentong Li, Jinwei Ren, Jianke Zhu:
Fine-Grained Multi-View Hand Reconstruction Using Inverse Rendering. 1779-1787 - Chenxing Gao, Hang Zhou, Junqing Yu, Yuteng Ye, Jiale Cai, Junle Wang, Wei Yang:
Attacking Transformers with Feature Diversity Adversarial Perturbation. 1788-1796 - Hongzhi Gao, Zheng Chen, Zehui Chen, Lin Chen, Jiaming Liu, Shanghang Zhang, Feng Zhao:
Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection. 1797-1805 - Jiayi Gao, Kongming Liang, Tao Wei, Wei Chen, Zhanyu Ma, Jun Guo:
Dual-Prior Augmented Decoding Network for Long Tail Distribution in HOI Detection. 1806-1814 - Jingsheng Gao, Jiacheng Ruan, Suncheng Xiang, Zefang Yu, Ke Ji, Mingye Xie, Ting Liu, Yuzhuo Fu:
LAMM: Label Alignment for Multi-Modal Prompt Learning. 1815-1823 - Xiang Gao, Zhengbo Xu, Junhan Zhao, Jiaying Liu:
Frequency-Controlled Diffusion Model for Versatile Text-Guided Image-to-Image Translation. 1824-1832 - Xinyu Gao, Ziyi Yang, Yunlu Zhao, Yuxiang Sun, Xiaogang Jin, Changqing Zou:
A General Implicit Framework for Fast NeRF Composition and Rendering. 1833-1841 - Yan Gao, Haojun Xu, Jie Li, Nannan Wang, Xinbo Gao:
Multi-Scene Generalized Trajectory Global Graph Solver with Composite Nodes for Multiple Object Tracking. 1842-1850 - Yudong Gao, Honglong Chen, Peng Sun, Junjian Li, Anqing Zhang, Zhibo Wang, Weifeng Liu:
A Dual Stealthy Backdoor: From Both Spatial and Frequency Perspectives. 1851-1859 - Yuting Gao, Jinfeng Liu, Zihan Xu, Tong Wu, Enwei Zhang, Ke Li, Jie Yang, Wei Liu, Xing Sun:
SoftCLIP: Softer Cross-Modal Alignment Makes CLIP Stronger. 1860-1868 - Prajwal Gatti, Kshitij Parikh, Dhriti Prasanna Paul, Manish Gupta, Anand Mishra:
Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions. 1869-1877 - Chengjie Ge, Xueyang Fu, Peng He, Kunyu Wang, Chengzhi Cao, Zheng-Jun Zha:
Neuromorphic Event Signal-Driven Network for Video De-raining. 1878-1886 - Yanqi Ge, Qiang Nie, Ye Huang, Yong Liu, Chengjie Wang, Feng Zheng, Wen Li, Lixin Duan:
Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning. 1887-1895 - Wenjia Geng, Yong Liu, Lei Chen, Sujia Wang, Jie Zhou, Yansong Tang:
Learning Multi-Scale Video-Text Correspondence for Weakly Supervised Temporal Article Grounding. 1896-1904 - Mohsen Gholami, Rabab Ward, Z. Jane Wang:
PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF. 1905-1913 - Lei Gong, Yu Zhang, Yingqing Xia, Yanyong Zhang, Jianmin Ji:
SDAC: A Multimodal Synthetic Dataset for Anomaly and Corner Case Detection in Autonomous Driving. 1914-1922 - Dongjun Gu, Jaehyeok Shim, Jaehoon Jang, Changwoo Kang, Kyungdon Joo:
ContactGen: Contact-Guided Interactive 3D Human Generation for Partners. 1923-1931 - Zhaopeng Gu, Bingke Zhu, Guibo Zhu, Yingying Chen, Ming Tang, Jinqiao Wang:
AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models. 1932-1940 - Huankang Guan, Rynson W. H. Lau:
SeqRank: Sequential Ranking of Salient Objects. 1941-1949 - Yong Guan, Freddy Lécué, Jiaoyan Chen, Ru Li, Jeff Z. Pan:
Knowledge-Aware Neuron Interpretation for Scene Classification. 1950-1958 - Huijie Guo, Ying Ba, Jie Hu, Lingyu Si, Wenwen Qiang, Lei Shi:
Self-Supervised Representation Learning with Meta Comprehensive Regularization. 1959-1967 - Junwen Guo, Guobao Xiao, Shiping Wang, Jun Yu:
Graph Context Transformation Learning for Progressive Correspondence Pruning. 1968-1975 - Shuai Guo, Qiuwen Wang, Yijie Gao, Rong Xie, Li Song:
Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views. 1976-1984 - Tianyu Guo, Haowei Wang, Yiwei Ma, Jiayi Ji, Xiaoshuai Sun:
Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual Confirmation. 1985-1993 - Wei Guo, Yuqi Zhang, De Ma, Qian Zheng:
Learning to Manipulate Artistic Images. 1994-2002 - Wengang Guo, Jiayi Yang, Huilin Yin, Qijun Chen, Wei Ye:
PICNN: A Pathway towards Interpretable Convolutional Neural Networks. 2003-2012 - Vinayak Gupta, Rahul Goel, Dhawal Sirikonda, P. J. Narayanan:
GSN: Generalisable Segmentation in Neural Radiance Field. 2013-2021 - Bo Han, Hao Peng, Minjing Dong, Yi Ren, Yixuan Shen, Chang Xu:
AMD: Autoregressive Motion Diffusion. 2022-2030 - Gaoge Han, Shaoli Huang, Mingming Gong, Jinglei Tang:
HuTuMotion: Human-Tuned Navigation of Latent Motion Diffusion Models with Minimal Feedback. 2031-2039 - Mengqiao Han, Liyuan Pan, Xiabi Liu:
MA-Net: Rethinking Neural Unit in the Light of Astrocytes. 2040-2048 - Yucheng Han, Na Zhao, Weiling Chen, Keng Teck Ma, Hanwang Zhang:
Dual-Perspective Knowledge Enrichment for Semi-supervised 3D Object Detection. 2049-2057 - Yudong Han, Yupeng Hu, Xuemeng Song, Haoyu Tang, Mingzhu Xu, Liqiang Nie:
Exploiting the Social-Like Prior in Transformer for Visual Reasoning. 2058-2066 - Dawei Hao, Yuxin Mao, Bowen He, Xiaodong Han, Yuchao Dai, Yiran Zhong:
Improving Audio-Visual Segmentation with Bidirectional Generation. 2067-2075 - Yuze Hao, Jianrong Zhang, Tao Zhuo, Fuan Wen, Hehe Fan:
Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling. 2076-2084 - Jingxuan He, Lechao Cheng, Chaowei Fang, Zunlei Feng, Tingting Mu, Mingli Song:
Progressive Feature Self-Reinforcement for Weakly Supervised Semantic Segmentation. 2085-2093 - Qibin He:
Prompting Multi-Modal Image Segmentation with Semantic Grouping. 2094-2102 - Ruian He, Shili Zhou, Yuqi Sun, Ri Cheng, Weimin Tan, Bo Yan:
Low-Latency Space-Time Supersampling for Real-Time Rendering. 2103-2111 - Tianyao He, Huabin Liu, Yuxi Li, Xiao Ma, Cheng Zhong, Yang Zhang, Weiyao Lin:
Collaborative Weakly Supervised Video Correlation Learning for Procedure-Aware Instructional Video Analysis. 2112-2120 - Xuanhua He, Keyu Yan, Rui Li, Chengjun Xie, Jie Zhang, Man Zhou:
Frequency-Adaptive Pan-Sharpening with Mixture of Experts. 2121-2129 - Xuanhua He, Tao Hu, Guoli Wang, Zejin Wang, Run Wang, Qian Zhang, Keyu Yan, Ziyi Chen, Rui Li, Chengjun Xie, Jie Zhang, Man Zhou:
Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain. 2130-2138 - Nailei Hei, Qianyu Guo, Zihao Wang, Yan Wang, Haofen Wang, Wenqiang Zhang:
A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis. 2139-2147 - Or Hirschorn, Amir Jevnisek, Shai Avidan:
Optimize & Reduce: A Top-Down Approach for Image Vectorization. 2148-2156 - Nhat M. Hoang, Kehong Gong, Chuan Guo, Michael Bi Mi:
MotionMix: Weakly-Supervised Diffusion for Controllable Motion Generation. 2157-2165 - Meghana Holla, Ismini Lourentzou:
Commonsense for Zero-Shot Natural Language Video Localization. 2166-2174 - James Hong, Lu Yuan, Michaël Gharbi, Matthew Fisher, Kayvon Fatahalian:
Learning Subject-Aware Cropping by Outpainting Professional Photos. 2175-2183 - Chen Hou, Guoqiang Wei, Zhibo Chen:
High-Fidelity Diffusion-Based Image Editing. 2184-2192 - Chengyang Hu, Ke-Yue Zhang, Taiping Yao, Shice Liu, Shouhong Ding, Xin Tan, Lizhuang Ma:
Domain-Hallucinated Updating for Multi-Domain Face Anti-spoofing. 2193-2201 - Chunyu Hu, Hong Zhang, Chao Liang, Hao Huang:
QI-IRA: Quantum-Inspired Interactive Ranking Aggregation for Person Re-identification. 2202-2210 - Haoxiang Hu, Cangjun Gao, Yaokun Li, Xiaoming Deng, Yu-Kun Lai, Cuixia Ma, Yong-Jin Liu, Hongan Wang:
SpaceGTN: A Time-Agnostic Graph Transformer Network for Handwritten Diagram Recognition and Segmentation. 2211-2219 - Junxing Hu, Hongwen Zhang, Zerui Chen, Mengcheng Li, Yunlong Wang, Yebin Liu, Zhenan Sun:
Learning Explicit Contact for Implicit Reconstruction of Hand-Held Objects from Monocular Images. 2220-2228 - Ke Hu, Tongbo Cao, Yuan Li, Song Chen, Yi Kang:
DALDet: Depth-Aware Learning Based Object Detection for Autonomous Driving. 2229-2237 - Lianyu Hu, Liqing Gao, Zekang Liu, Chi-Man Pun, Wei Feng:
COMMA: Co-articulated Multi-Modal Learning. 2238-2246 - Vincent Tao Hu, Wei Zhang, Meng Tang, Pascal Mettes, Deli Zhao, Cees Snoek:
Latent Space Editing in Transformer-Based Flow Matching. 2247-2255 - Wenbo Hu, Yifan Xu, Yi Li, Weiyue Li, Zeyuan Chen, Zhuowen Tu:
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions. 2256-2264 - Xiaoming Hu, Zilei Wang:
A Dynamic Learning Method towards Realistic Compositional Zero-Shot Learning. 2265-2273 - Youbing Hu, Yun Cheng, Anqi Lu, Zhiqiang Cao, Dawei Wei, Jie Liu, Zhijun Li:
LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition. 2274-2284 - Yubin Hu, Sheng Ye, Wang Zhao, Matthieu Lin, Yuze He, Yu-Hui Wen, Ying He, Yong-Jin Liu:
O²-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model. 2285-2293 - Cong Huang, Jiahao Li, Lei Chu, Dong Liu, Yan Lu:
Arbitrary-Scale Video Super-resolution Guided by Dynamic Context. 2294-2302 - Fuxiang Huang, Lei Zhang, Xiaowei Fu, Suqi Song:
Dynamic Weighted Combiner for Mixed-Modal Image Retrieval. 2303-2311 - Han Huang, Yulun Wu, Junsheng Zhou, Ge Gao, Ming Gu, Yu-Shen Liu:
NeuSurf: On-Surface Priors for Neural Surface Reconstruction from Sparse Input Views. 2312-2320 - Haofeng Huang, Wenhan Yang, Lingyu Duan, Jiaying Liu:
Seeing Dark Videos via Self-Learned Bottleneck Neural Representation. 2321-2329 - Huimin Huang, Yawen Huang, Shiao Xie, Lanfen Lin, Ruofeng Tong, Yen-Wei Chen, Yuexiang Li, Yefeng Zheng:
Combinatorial CNN-Transformer Learning with Manifold Constraints for Semi-supervised Medical Image Segmentation. 2330-2338 - Jiaxin Huang, Qi Wu, Yazhou Ren, Fan Yang, Aodi Yang, Qianqian Yang, Xiaorong Pu:
Sparse Bayesian Deep Learning for Cross Domain Medical Image Reconstruction. 2339-2347 - Junjia Huang, Haofeng Li, Xiang Wan, Guanbin Li:
UniCell: Universal Cell Nucleus Classification via Prompt Learning. 2348-2356 - Shi-Sheng Huang, Zi-Xin Zou, Yichi Zhang, Yan-Pei Cao, Ying Shan:
SC-NeuS: Consistent Neural Surface Reconstruction from Sparse and Noisy Views. 2357-2365 - Shuying Huang, Ge Chen, Yong Yang, Xiaozheng Wang, Chenbin Liang:
MFTN: Multi-Level Feature Transfer Network Based on MRI-Transformer for MR Image Super-resolution. 2366-2373 - Wenmin Huang, Weiqi Luo, Jiwu Huang, Xiaochun Cao:
SDGAN: Disentangling Semantic Manipulation for Facial Attribute Editing. 2374-2381 - Xiaoshui Huang, Zhou Huang, Sheng Li, Wentao Qu, Tong He, Yuenan Hou, Yifan Zuo, Wanli Ouyang:
Frozen CLIP Transformer Is an Efficient Point Cloud Encoder. 2382-2390 - Xin Huang, Yunfeng Bai, Dong Liang, Feng Tian, Jinyuan Jia:
G2L-CariGAN: Caricature Generation from Global Structure to Local Features. 2391-2399 - Xuan Huang, Hanhui Li, Zejun Yang, Zhisheng Wang, Xiaodan Liang:
3D Visibility-Aware Generalizable Neural Radiance Fields for Interacting Hands. 2400-2408 - Xun Huang, Hai Wu, Xin Li, Xiaoliang Fan, Chenglu Wen, Cheng Wang:
Sunshine to Rainstorm: Cross-Weather Knowledge Distillation for Robust 3D Object Detection. 2409-2416 - Yufeng Huang, Jiji Tang, Zhuo Chen, Rongsheng Zhang, Xinfeng Zhang, Weijie Chen, Zeng Zhao, Zhou Zhao, Tangjie Lv, Zhipeng Hu, Wen Zhang:
Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-Modal Structured Representations. 2417-2425 - Yuhao Huang, Sanping Zhou, Junjie Zhang, Jinpeng Dong, Nanning Zheng:
Voxel or Pillar: Exploring Efficient Point Cloud Representation for 3D Object Detection. 2426-2435 - Tran Huynh, Dang Nguyen, Tung Pham, Anh Tran:
COMBAT: Alternated Training for Effective Clean-Label Backdoor Attacks. 2436-2444 - Junha Hyung, Jaeyo Shin, Jaegul Choo:
MagiCapture: High-Resolution Multi-Concept Portrait Customization. 2445-2453 - Jinhyeok Jang, Chan-Hyun Youn, Minsu Jeon, Changha Lee:
Rethinking Peculiar Images by Diffusion Models: Revealing Local Minima's Role. 2454-2461 - Joonhyun Jeong, Geondo Park, Jayeon Yoo, Hyungsik Jung, Heesu Kim:
ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection. 2462-2470 - Liya Ji, Zhefan Rao, Sinno Jialin Pan, Chenyang Lei, Qifeng Chen:
A Diffusion Model with State Estimation for Degradation-Blind Inverse Imaging. 2471-2479 - Chengyou Jia, Minnan Luo, Zhuohang Dang, Guang Dai, Xiaojun Chang, Mengmeng Wang, Jingdong Wang:
SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-Form Layout-to-Image Generation. 2480-2488 - Chaoya Jiang, Wei Ye, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Shikun Zhang:
TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training. 2489-2497 - Chenyi Jiang, Haofeng Zhang:
Revealing the Proximate Long-Tail Distribution in Compositional Zero-Shot Learning. 2498-2506 - Guangfeng Jiang, Jun Liu, Yuzhi Wu, Wenlong Liao, Tao He, Pai Peng:
MWSIS: Multimodal Weakly Supervised Instance Segmentation with 2D Box Annotations for Autonomous Driving. 2507-2515 - Hao Jiang, Yang Yizhang, Yadong Mu:
Transferable Video Moment Localization by Moment-Guided Query Prompting. 2516-2524 - Shijian Jiang, Qi Ye, Rengan Xie, Yuchi Huo, Xiang Li, Yang Zhou, Jiming Chen:
In-Hand 3D Object Reconstruction from a Monocular RGB Video. 2525-2533 - Shiqi Jiang, Ning Li, Chen Shi, Liping Guo, Changbo Wang, Chenhui Li:
AACP: Aesthetics Assessment of Children's Paintings Based on Self-Supervised Learning. 2534-2542 - Weibo Jiang, Weihong Ren, Jiandong Tian, Liangqiong Qu, Zhiyong Wang, Honghai Liu:
Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection. 2543-2551 - Wenhui Jiang, Yibo Cheng, Linxin Liu, Yuming Fang, Yuxin Peng, Yang Liu:
Comprehensive Visual Grounding for Video Description. 2552-2560 - Xiaohui Jiang, Shuailin Li, Yingfei Liu, Shihao Wang, Fan Jia, Tiancai Wang, Lijin Han, Xiangyu Zhang:
Far3D: Expanding the Horizon for Surround-View 3D Object Detection. 2561-2569 - Xin Jiang, Hao Tang, Junyao Gao, Xiaoyu Du, Shengfeng He, Zechao Li:
Delving into Multimodal Prompting for Fine-Grained Visual Classification. 2570-2578 - Yangbo Jiang, Zhiwei Jiang, Le Han, Zenan Huang, Nenggan Zheng:
MCA: Moment Channel Attention Networks. 2579-2588 - Zhiying Jiang, Xingyuan Li, Jinyuan Liu, Xin Fan, Risheng Liu:
Towards Robust Image Stitching: An Adaptive Resistance Learning against Compatible Attacks. 2589-2597 - Yang Jiao, Zequn Jie, Shaoxiang Chen, Lechao Cheng, Jingjing Chen, Lin Ma, Yu-Gang Jiang:
Instance-Aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning. 2598-2606 - Haibo Jin, Haoxuan Che, Yi Lin, Hao Chen:
PromptMRG: Diagnosis-Driven Prompts for Medical Report Generation. 2607-2615 - Jianlong Jin, Lei Shen, Ruixin Zhang, Chenglong Zhao, Ge Jin, Jingyun Zhang, Shouhong Ding, Yang Zhao, Wei Jia:
PCE-Palm: Palm Crease Energy Based Two-Stage Realistic Pseudo-Palmprint Generation. 2616-2624 - Xin Jin, Kai Liu, Cong Ma, Ruining Yang, Fei Hui, Wei Wu:
SwiftPillars: High-Efficiency Pillar Encoder for Lidar-Based 3D Detection. 2625-2633 - Yeying Jin, Wei Ye, Wenhan Yang, Yuan Yuan, Robby T. Tan:
DeS3: Adaptive Attention-Driven Self and Soft Shadow Removal Using ViT Similarity. 2634-2642 - Beibei Jing, Youjia Zhang, Zikai Song, Junqing Yu, Wei Yang:
AMD: Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion. 2643-2651 - Chenchen Jing, Yukun Li, Hao Chen, Chunhua Shen:
Retrieval-Augmented Primitive Representations for Compositional Zero-Shot Learning. 2652-2660 - Linglin Jing, Sheng Xu, Yifan Wang, Yuzhe Zhou, Tao Shen, Zhigang Ji, Hui Fang, Zhen Li, Siqi Sun:
CrossBind: Collaborative Cross-Modal Identification of Protein Nucleic-Acid-Binding Residues. 2661-2669 - Linglin Jing, Ying Xue, Xu Yan, Chaoda Zheng, Dong Wang, Ruimao Zhang, Zhigang Wang, Hui Fang, Bin Zhao, Zhen Li:
X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-Modal Knowledge Transfer. 2670-2678 - Won Jo, Geuntaek Lim, Gwangjin Lee, Hyunwoo Kim, Byungsoo Ko, Yukyung Choi:
VVS: Video-to-Video Retrieval with Irrelevant Frame Suppression. 2679-2687 - Sandesh Kamath, Sankalp Mittal, Amit Deshpande, Vineeth N. Balasubramanian:
Rethinking Robustness of Model Attributions. 2688-2696 - Zhehan Kan, Xueting Hu, Zihan Liao, Ke Yu, Zhihai He:
Cross-Constrained Progressive Inference for 3D Hand Pose Estimation with Dynamic Observer-Decision-Adjuster Networks. 2697-2704 - Minsoo Kang, Minkoo Kang, Suhyun Kim:
Catch-Up Mix: Catch-Up Class for Struggling Filters in CNN. 2705-2713 - Seunggu Kang, WonJun Moon, Euiyeon Kim, Jae-Pil Heo:
VLCounter: Text-Aware Visual Representation for Zero-Shot Object Counting. 2714-2722 - Xiao Ke, Huanqi Wu, Wenzhong Guo:
StegFormer: Rebuilding the Glory of Autoencoder-Based Steganography. 2723-2731 - Bumsoo Kim, Jinhyung Kim, Yeonsik Jo, Seung Hwan Kim:
Expediting Contrastive Language-Image Pretraining via Self-Distilled Encoders. 2732-2740 - Dongseob Kim, Seungho Lee, Junsuk Choe, Hyunjung Shim:
Weakly Supervised Semantic Segmentation for Driving Scenes. 2741-2749 - GeonU Kim, Kim Youwang, Tae-Hyun Oh:
FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields. 2750-2758 - Ji-Hoon Kim, Jaehun Kim, Joon Son Chung:
Let There Be Sound: Reconstructing High Quality Speech from Silent Videos. 2759-2767 - Jiyoung Kim, Kyuhong Shim, Insu Lee, Byonghyo Shim:
Expand-and-Quantize: Unsupervised Semantic Segmentation Using High-Dimensional Space and Product Quantization. 2768-2776 - Seoha Kim, Jeongmin Bae, Youngsik Yun, Hahyun Lee, Gun Bang, Youngjung Uh:
Sync-NeRF: Generalizing Dynamic NeRFs to Unsynchronized Videos. 2777-2785 - Seongyeop Kim, Hyung-Il Kim, Yong Man Ro:
Improving Open Set Recognition via Visual Prompts Distilled from Common-Sense Knowledge. 2786-2794 - Sunoh Kim, Jungchan Cho, Joonsang Yu, Youngjoon Yoo, Jin Young Choi:
Gaussian Mixture Proposals with Pull-Push Learning Scheme to Capture Diverse Events for Weakly Supervised Temporal Video Grounding. 2795-2803 - Florian Kluger, Bodo Rosenhahn:
PARSAC: Accelerating Robust Multi-Model Fitting with Parallel Sample Consensus. 2804-2812 - Dimitrios Kollias, Viktoriia Sharmanska, Stefanos Zafeiriou:
Distribution Matching for Multi-Task Learning of Classification Tasks: A Large-Scale Study on Faces & Beyond. 2813-2821 - Xiaoyu Kong, Yongyong Chen, Feng Zheng, Zhenyu He:
Block Image Compressive Sensing with Local and Global Information Interaction. 2822-2830 - Yogesh Kumar, Saswat Mallick, Anand Mishra, Sowmya Rasipuram, Anutosh Maitra, Roshni R. Ramnani:
QDETRv: Query-Guided DETR for One-Shot Object Localization in Videos. 2831-2839 - Nilakshan Kunananthaseelan, Jing Zhang, Mehrtash Harandi:
LaViP: Language-Grounded Visual Prompting. 2840-2848 - Chengen Lai, Shengli Song, Shiqi Meng, Jingyang Li, Sitong Yan, Guangneng Hu:
Towards More Faithful Natural Language Explanation Using Multi-Level Contrastive Learning in VQA. 2849-2857 - Jinxiang Lai, Wenlong Wu, Bin-Bin Gao, Jun Liu, Jiawei Zhan, Congchong Nie, Yi Zeng, Chengjie Wang:
MatchDet: A Collaborative Framework for Image Matching and Object Detection. 2858-2865 - Danning Lao, Qi Liu, Jiazi Bu, Junchi Yan, Wei Shen:
ViTree: Single-Path Neural Tree for Step-Wise Interpretable Fine-Grained Visual Categorization. 2866-2873 - Minh-Quan Le, Tam V. Nguyen, Trung-Nghia Le, Thanh-Toan Do, Minh N. Do, Minh-Triet Tran:
MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation. 2874-2881
AAAI Technical Track on Computer Vision III
- Chanho Lee, Jinsu Son, Hyounguk Shon, Yunho Jeon, Junmo Kim:
FRED: Towards a Full Rotation-Equivariance in Aerial Image Object Detection. 2883-2891 - Ingyun Lee, Wooju Lee, Hyun Myung:
Domain Generalization with Vital Phase Augmentation. 2892-2900 - Jae Young Lee, Woonghyun Ka, Jaehyun Choi, Junmo Kim:
Modeling Stereo-Confidence out of the End-to-End Stereo-Matching Network via Disparity Plane Sweep. 2901-2910 - JongMin Lee, Yohann Cabon, Romain Brégier, Sungjoo Yoo, Jérôme Revaud:
MFOS: Model-Free & One-Shot Object Pose Estimation. 2911-2919 - Minkyu Lee, Jae-Pil Heo:
Noise-Free Optimization in Early Training Steps for Image Super-resolution. 2920-2928 - Seokjun Lee, Seung-Won Jung, Hyunseok Seo:
Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile. 2929-2937 - SeokYeong Lee, Junyong Choi, Seungryong Kim, Ig-Jae Kim, Junghyun Cho:
Few-Shot Neural Radiance Fields under Unconstrained Illumination. 2938-2946 - Wooju Lee, Dasol Hong, Hyungtae Lim, Hyun Myung:
Object-Aware Domain Generalization for Object Detection. 2947-2955 - Saebom Leem, Hyunseok Seo:
Attention Guided CAM: Visual Explanations of Vision Transformer Guided by Self-Attention. 2956-2964 - Johannes Lehner, Benedikt Alkin, Andreas Fürst, Elisabeth Rumetshofer, Lukas Miklautz, Sepp Hochreiter:
Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget. 2965-2973 - Qinqian Lei, Bo Wang, Robby T. Tan:
Few-Shot Learning from Augmented Label-Uncertain Queries in Bongard-HOI. 2974-2982 - Yicheng Leng, Chaowei Fang, Gen Li, Yixiang Fang, Guanbin Li:
Removing Interference and Recovering Content Imaginatively for Visible Watermark Removal. 2983-2990 - Matan Levy, Rami Ben-Ari, Nir Darshan, Dani Lischinski:
Data Roaming and Quality Assessment for Composed Image Retrieval. 2991-2999 - Bao Li, Zhenyu Liu, Lizhi Shao, Bensheng Qiu, Hong Bu, Jie Tian:
Point Transformer with Federated Learning for Predicting Breast Cancer HER2 Status from Hematoxylin and Eosin-Stained Whole Slide Images. 3000-3008 - Bin Li, Ye Shi, Qian Yu, Jingya Wang:
Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport. 3009-3017 - Bohan Li, Xiao Xu, Xinghao Wang, Yutai Hou, Yunlong Feng, Feng Wang, Xuanliang Zhang, Qingfu Zhu, Wanxiang Che:
Semantic-Guided Generative Image Augmentation Method with Diffusion Models for Image Classification. 3018-3027 - Bohan Li, Yasheng Sun, Jingxin Dong, Zheng Zhu, Jinming Liu, Xin Jin, Wenjun Zeng:
One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene Perception. 3028-3036 - Dongze Li, Kang Zhao, Wei Wang, Bo Peng, Yingya Zhang, Jing Dong, Tieniu Tan:
AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis. 3037-3045 - Hanhui Li, Xiaojian Lin, Xuan Huang, Zejun Yang, Zhisheng Wang, Xiaodan Liang:
Monocular 3D Hand Mesh Recovery via Dual Noise Estimation. 3046-3054 - Hanxuan Li, Bin Fu, Ruiping Wang, Xilin Chen:
Point2Real: Bridging the Gap between Point Cloud and Realistic Image for Open-World 3D Recognition. 3055-3063 - Hao Li, Mengqi Huang, Lei Zhang, Bo Hu, Yi Liu, Zhendong Mao:
Gradual Residuals Alignment: A Dual-Stream Framework for GAN Inversion and Image Attribute Editing. 3064-3072 - Haolong Li, Chenghao Du, Ziheng Jiang, Yifan Zhang, Jiawei Ma, Chen Ye:
Towards Automated Chinese Ancient Character Restoration: A Diffusion-Based Method with a New Dataset. 3073-3081 - Hongjie Li, Yao Guo, Xianwei Zheng, Hanjiang Xiong:
Learning Deformable Hypothesis Sampling for Accurate PatchMatch Multi-View Stereo. 3082-3090 - Huafeng Li, Qingsong Hu, Zhanxuan Hu:
Catalyst for Clustering-Based Unsupervised Object Re-identification: Feature Calibration. 3091-3099 - Jiafeng Li, Zelin Li, Ying Wen:
EAN: An Efficient Attention Module Guided by Normalization for Deep Neural Networks. 3100-3108 - Jianwu Li, Kaiyue Shi, Guo-Sen Xie, Xiaofeng Liu, Jian Zhang, Tianfei Zhou:
Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training. 3109-3117 - Jichang Li, Guanbin Li, Hui Cheng, Zicheng Liao, Yizhou Yu:
FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels. 3118-3126 - Jing Li, Junsong Fan, Yuran Yang, Shuqi Mei, Jun Xiao, Zhaoxiang Zhang:
Fully Data-Driven Pseudo Label Estimation for Pointly-Supervised Panoptic Segmentation. 3127-3135 - Kailin Li, Lixin Yang, Zenan Lin, Jian Xu, Xinyu Zhan, Yifei Zhao, Pengxiang Zhu, Wenxiong Kang, Kejian Wu, Cewu Lu:
FAVOR: Full-Body AR-Driven Virtual Object Rearrangement Guided by Instruction Text. 3136-3144 - Li Li, Wei Ji, Yiming Wu, Mengze Li, You Qin, Lina Wei, Roger Zimmermann:
Panoptic Scene Graph Generation with Semantics-Prototype Learning. 3145-3153 - Ru Li, Jia Liu, Guanghui Liu, Shengping Zhang, Bing Zeng, Shuaicheng Liu:
SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance Field. 3154-3162 - Shengtao Li, Ge Gao, Yudong Liu, Yu-Shen Liu, Ming Gu:
GridFormer: Point-Grid Transformer for Surface Reconstruction. 3163-3171 - Shenshen Li, Chen He, Xing Xu, Fumin Shen, Yang Yang, Heng Tao Shen:
Adaptive Uncertainty-Based Learning for Text-Based Person Retrieval. 3172-3180 - Shujuan Li, Junsheng Zhou, Baorui Ma, Yu-Shen Liu, Zhizhong Han:
Learning Continuous Implicit Field with Local Distance Indicator for Arbitrary-Scale Point Cloud Upsampling. 3181-3189 - Weiqi Li, Fan Lyu, Fanhua Shang, Liang Wan, Wei Feng:
Long-Tailed Learning as Multi-Objective Optimization. 3190-3198 - Xi Li, Songhe Wang, Ruiquan Huang, Mahanth Gowda, George Kesidis:
Temporal-Distributed Backdoor Attack against Video Based Action Recognition. 3199-3207 - Xiang Li, Junbo Yin, Wei Li, Chengzhong Xu, Ruigang Yang, Jianbing Shen:
DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection. 3208-3215 - Xiawei Li, Qingyuan Xu, Jing Zhang, Tianyi Zhang, Qian Yu, Lu Sheng, Dong Xu:
Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation. 3216-3224 - Ximeng Li, Chen Zhang, Wanjuan Su, Wenbing Tao:
IINet: Implicit Intra-inter Information Fusion for Real-Time Stereo Matching. 3225-3233 - Xiutian Li, Siqi Sun, Rui Feng:
Causal Representation Learning via Counterfactual Intervention. 3234-3242 - Yanjing Li, Sheng Xu, Mingbao Lin, Xianbin Cao, Chuanjian Liu, Xiao Sun, Baochang Zhang:
Bi-ViT: Pushing the Limit of Vision Transformer Quantization. 3243-3251 - Yanxi Li, Chengbin Du, Chang Xu:
Harnessing Edge Information for Improved Robustness in Vision Transformers. 3252-3260 - Yiming Li, Peng Zhou, Jun Sun, Yi Xu:
Multi-Region Text-Driven Manipulation of Diffusion Imagery. 3261-3269 - Yuelong Li, Tengfei Xiao, Lei Geng, Jianming Wang:
Direct May Not Be the Best: An Incremental Evolution View of Pose Generation. 3270-3278 - Yuhan Li, Yishun Dou, Yue Shi, Yu Lei, Xuanhong Chen, Yi Zhang, Peng Zhou, Bingbing Ni:
FocalDreamer: Text-Driven 3D Editing via Focal-Fusion Assembly. 3279-3287 - Zekun Li, Hongying Liu, Fanhua Shang, Yuanyuan Liu, Liang Wan, Wei Feng:
SAVSR: Arbitrary-Scale Video Super-Resolution via a Learned Scale-Adaptive Network. 3288-3296 - Zepeng Li, Dongxiang Zhang, Sai Wu, Mingli Song, Gang Chen:
Sampling-Resilient Multi-Object Tracking. 3297-3305 - Zhangbin Li, Dan Guo, Jinxing Zhou, Jing Zhang, Meng Wang:
Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering. 3306-3314 - Sen Liang, Kai Zhu, Wei Zhai, Zhiheng Liu, Yang Cao:
Hypercorrelation Evolution for Video Class-Incremental Learning. 3315-3323 - Yaoyuan Liang, Xiao Liang, Yansong Tang, Zhao Yang, Ziran Li, Jingang Wang, Wenbo Ding, Shao-Lun Huang:
CoSTA: End-to-End Comprehensive Space-Time Entanglement for Spatio-Temporal Video Grounding. 3324-3332 - Zhaohuai Liang, Changhe Li:
Any-Stereo: Arbitrary Scale Disparity Estimation for Iterative Stereo Matching. 3333-3341 - Dongping Liao, Xitong Gao, Chengzhong Xu:
Impartial Adversarial Distillation: Addressing Biased Data-Free Knowledge Distillation via Adaptive Constrained Optimization. 3342-3350 - Guibiao Liao, Jiankun Li, Xiaoqing Ye:
VLM2Scene: Self-Supervised Image-Text-LiDAR Learning with Foundation Models for Autonomous Driving Scene Understanding. 3351-3359 - Jiayi Liao, Xu Chen, Qiang Fu, Lun Du, Xiangnan He, Xiang Wang, Shi Han, Dongmei Zhang:
Text-to-Image Generation for Abstract Concepts. 3360-3368 - Tangfei Liao, Xiaoqin Zhang, Li Zhao, Tao Wang, Guobao Xiao:
VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning. 3369-3377 - Beibei Lin, Yeying Jin, Wending Yan, Wei Ye, Yuan Yuan, Shunli Zhang, Robby T. Tan:
NightRain: Nighttime Video Deraining via Adaptive-Rain-Removal and Adaptive-Correction. 3378-3385 - Huangxing Lin, Yuhang Dong, Xinghao Ding, Tianpeng Liu, Yongxiang Liu:
Unsupervised Pan-Sharpening via Mutually Guided Detail Restoration. 3386-3394 - Hui Lin, Zhiheng Ma, Xiaopeng Hong, Qinnan Shangguan, Deyu Meng:
Gramformer: Learning Crowd Counting via Graph-Modulated Transformer. 3395-3403 - Jianghang Lin, Yunhang Shen, Bingquan Wang, Shaohui Lin, Ke Li, Liujuan Cao:
Weakly Supervised Open-Vocabulary Object Detection. 3404-3412 - Jieru Lin, Danqing Huang, Tiejun Zhao, Dechen Zhan, Chin-Yew Lin:
Spot the Error: Non-autoregressive Graphic Layout Generation with Wireframe Locator. 3413-3421 - Jinhao Lin, Ziheng Wu, Weifeng Lin, Jun Huang, Ronghua Luo:
M2SD: Multiple Mixing Self-Distillation for Few-Shot Class-Incremental Learning. 3422-3431 - Longzhong Lin, Xuewu Lin, Tianwei Lin, Lichao Huang, Rong Xiong, Yue Wang:
EDA: Evolving and Distinct Anchors for Multimodal Motion Prediction. 3432-3440 - Luoyang Lin, Zutao Jiang, Xiaodan Liang, Liqian Ma, Michael C. Kampffmeyer, Xiaochun Cao:
PTUS: Photo-Realistic Talking Upper-Body Synthesis via 3D-Aware Motion Decomposition Warping. 3441-3449 - Matthieu Lin, Jenny Sheng, Yubin Hu, Yangguang Li, Lu Qi, Andrew Zhao, Gao Huang, Yong-Jin Liu:
Exploring Temporal Feature Correlation for Efficient and Stable Video Semantic Segmentation. 3450-3458 - Qinliang Lin, Cheng Luo, Zenghao Niu, Xilin He, Weicheng Xie, Yuanbo Hou, Linlin Shen, Siyang Song:
Boosting Adversarial Transferability across Model Genus by Deformation-Constrained Warping. 3459-3467 - Wei Lin, Antoni B. Chan:
A Fixed-Point Approach to Unified Prompt-Based Counting. 3468-3476 - Weiping Lin, Zhenfeng Zhuang, Lequan Yu, Liansheng Wang:
Boosting Multiple Instance Learning Models for Whole Slide Image Classification: A Model-Agnostic Framework Based on Counterfactual Inference. 3477-3485 - Wenbin Lin, Chengwei Zheng, Jun-Hai Yong, Feng Xu:
Relightable and Animatable Neural Avatars from Videos. 3486-3494 - Xin Lin, Chong Shi, Yibing Zhan, Zuopeng Yang, Yaqi Wu, Dacheng Tao:
TD²-Net: Toward Denoising and Debiasing for Video Scene Graph Generation. 3495-3503 - Youtian Lin:
Ced-NeRF: A Compact and Efficient Method for Dynamic Neural Radiance Fields. 3504-3512 - Yuqi Lin, Minghao Chen, Kaipeng Zhang, Hengjia Li, Mingming Li, Zheng Yang, Dongqin Lv, Binbin Lin, Haifeng Liu, Deng Cai:
TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP without Training. 3513-3521 - Zhenkai Lin, Yanli Ji, Yang Yang:
Independency Adversarial Learning for Cross-Modal Sound Separation. 3522-3530 - Zhiwei Lin, Yongtao Wang, Shengxiang Qi, Nan Dong, Ming-Hsuan Yang:
BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios. 3531-3539 - Bo Liu, Bin Hu, Xiuli Bi, Weisheng Li, Bin Xiao:
Focus Stacking with High Fidelity and Superior Visual Effects. 3540-3547 - Chao Liu, Ting Zhao, Nenggan Zheng:
DeepBranchTracer: A Generally-Applicable Approach to Curvilinear Structure Reconstruction Using Multi-Feature Learning. 3548-3557 - Chengxu Liu, Xuan Wang, Yuanting Fan, Shuai Li, Xueming Qian:
Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera. 3558-3566 - Daizong Liu, Xiang Fang, Xiaoye Qu, Jianfeng Dong, He Yan, Yang Yang, Pan Zhou, Yu Cheng:
Unsupervised Domain Adaptative Temporal Sentence Localization with Mutual Information Maximization. 3567-3575 - Daizong Liu, Wei Hu:
Explicitly Perceiving and Preserving the Local Geometric Structures for 3D Point Cloud Attack. 3576-3584 - Decheng Liu, Xijun Wang, Chunlei Peng, Nannan Wang, Ruimin Hu, Xinbo Gao:
Adv-Diffusion: Imperceptible Adversarial Face Identity Attack via Latent Diffusion Model. 3585-3593 - Fang Liu, Yuhao Liu, Jiaying Lin, Ke Xu, Rynson W. H. Lau:
Multi-View Dynamic Reflection Prior for Video Glass Surface Detection. 3594-3602 - Hao Liu, Xin Li, Mingming Gong, Bing Liu, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Xing Sun:
Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation. 3603-3611 - Haoran Liu, Ying Ma, Ming Yan, Yingke Chen, Dezhong Peng, Xu Wang:
DiDA: Disambiguated Domain Alignment for Cross-Domain Retrieval with Partial Labels. 3612-3620 - Huan Liu, Julia Qi, Zhenhao Li, Mohammad Hassanpour, Yang Wang, Konstantinos N. Plataniotis, Yuanhao Yu:
Test-Time Personalization with Meta Prompt for Gaze Estimation. 3621-3629 - Jiaming Liu, Yue Wu, Maoguo Gong, Qiguang Miao, Wenping Ma, Cai Xu, Can Qin:
M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object Tracking. 3630-3638 - Jiaqi Liu, Kai Wu, Qiang Nie, Ying Chen, Bin-Bin Gao, Yong Liu, Jinbao Wang, Chengjie Wang, Feng Zheng:
Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt. 3639-3647 - Jin Liu, Huiyuan Fu, Chuanming Wang, Huadong Ma:
Region-Aware Exposure Consistency Network for Mixed Exposure Correction. 3648-3656 - Jinxiu Liu, Qi Liu:
R3CD: Scene Graph to Image Generation with Relation-Aware Compositional Contrastive Control Diffusion. 3657-3665 - Jun Liu, Jiantao Zhou, Jiandian Zeng, Jinyu Tian:
DifAttack: Query-Efficient Black-Box Adversarial Attack via Disentangled Feature Space. 3666-3674 - Lijun Liu, Rui Wang, Yuan Wang, Lihua Jing, Chuan Wang:
Frequency Shuffling and Enhancement for Open Set Recognition. 3675-3683 - Liu Liu, Anran Huang, Qi Wu, Dan Guo, Xun Yang, Meng Wang:
KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking. 3684-3692 - Ruicong Liu, Feng Lu:
UVAGaze: Unsupervised 1-to-2 Views Adaptation for Gaze Estimation. 3693-3701 - Ruixin Liu, Zejian Yuan:
Compact HD Map Construction via Douglas-Peucker Point Transformer. 3702-3710 - Siqi Liu, Yong-Lu Li, Zhou Fang, Xinpeng Liu, Yang You, Cewu Lu:
Primitive-Based 3D Human-Object Interaction Modelling and Programming. 3711-3719 - Wang Liu, Wei Gao, Xingming Mu:
Fast Inter-frame Motion Prediction for Compressed Dynamic Point Cloud Attribute Enhancement. 3720-3728 - Wensi Liu, Xiao-Yu Tang, Chong Yang, Chunjie Yang:
RWMS: Reliable Weighted Multi-Phase for Semi-supervised Segmentation. 3729-3737 - Xiaohui Liu, Zhilu Zhang, Xiaohe Wu, Chaoyu Feng, Xiaotao Wang, Lei Lei, Wangmeng Zuo:
Learning Real-World Image De-weathering with Imperfect Supervision. 3738-3746 - Xingyu Liu, Xu Cheng, Haoyu Chen, Hao Yu, Guoying Zhao:
Differentiable Auxiliary Learning for Sketch Re-Identification. 3747-3755 - Xingyu Liu, Pengfei Ren, Yuanyuan Gao, Jingyu Wang, Haifeng Sun, Qi Qi, Zirui Zhuang, Jianxin Liao:
Keypoint Fusion for RGB-D Based 3D Hand Pose Estimation. 3756-3764 - Xiulong Liu, Sudipta Paul, Moitreya Chatterjee, Anoop Cherian:
CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments. 3765-3773 - Yitian Liu, Zhouhui Lian:
DeepCalliFont: Few-Shot Chinese Calligraphy Font Synthesis by Integrating Dual-Modality Generative Models. 3774-3782 - Yixin Liu, Kaidi Xu, Xun Chen, Lichao Sun:
Stable Unlearnable Example: Enhancing the Robustness of Unlearnable Examples via Stable Error-Minimizing Noise. 3783-3791 - Yongxu Liu, Yinghui Quan, Guoyao Xiao, Aobo Li, Jinjian Wu:
Scaling and Masking: A New Paradigm of Data Sampling for Image and Video Quality Assessment. 3792-3801 - Yuchun Liu, Benjamin Planche, Meng Zheng, Zhongpai Gao, Pierre Sibut-Bourde, Fan Yang, Terrence Chen, Ziyan Wu:
Implicit Modeling of Non-rigid Objects with Cross-Category Signals. 3802-3809 - Yuhao Liu, Zhanghan Ke, Ke Xu, Fang Liu, Zhenwei Wang, Rynson W. H. Lau:
Recasting Regional Lighting for Shadow Removal. 3810-3818 - Yutong Liu, Haijiang Zhu, Mengting Liu, Huaiyuan Yu, Zihan Chen, Jie Gao:
Rolling-Unet: Revitalizing MLP's Ability to Efficiently Extract Long-Distance Dependencies for Medical Image Segmentation. 3819-3827 - Yuxuan Liu, Haizhou Ai, Junliang Xing, Xuri Li, Xiaoyi Wang, Pin Tao:
Advancing Video Synchronization with Fractional Frame Analysis: Introducing a Novel Dataset and Model. 3828-3836 - Yuzhi Liu, Huisi Wu, Jing Qin:
FedCD: Federated Semi-Supervised Learning with Class Awareness Balance via Dual Teachers. 3837-3845 - Zhaochen Liu, Zhixuan Li, Tingting Jiang:
BLADE: Box-Level Supervised Amodal Segmentation through Directed Expansion. 3846-3854 - Zhihang Liu, Jun Li, Hongtao Xie, Pandeng Li, Jiannan Ge, Sun'ao Liu, Guoqing Jin:
Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval. 3855-3863 - Zhiyue Liu, Jinyuan Liu, Fanrong Ma:
Improving Cross-Modal Alignment with Synthetic Pairs for Text-Only Image Captioning. 3864-3872 - Wei Lou, Guanbin Li, Xiang Wan, Haofeng Li:
Cell Graph Transformer for Nuclei Classification. 3873-3881 - Changsheng Lu, Piotr Koniusz:
Detect Any Keypoints: An Efficient Light-Weight Few-Shot Keypoint Detector. 3882-3890 - Hui Lu, Albert Ali Salah, Ronald Poppe:
TCNet: Continuous Sign Language Recognition from Trajectories and Correlated Regions. 3891-3899 - Yanzuo Lu, Meng Shen, Andy J. Ma, Xiaohua Xie, Jian-Huang Lai:
MLNet: Mutual Learning Network with Neighborhood Invariance for Universal Domain Adaptation. 3900-3908 - Yifan Lu, Ziqi Zhang, Chunfeng Yuan, Peng Li, Yan Wang, Bing Li, Weiming Hu:
Set Prediction Guided by Semantic Concepts for Diverse Video Captioning. 3909-3917 - Yiheng Lu, Ziyu Guan, Yaming Yang, Wei Zhao, Maoguo Gong, Cai Xu:
Entropy Induced Pruning Framework for Convolutional Neural Networks. 3918-3926 - Zhan Lu, Qian Zheng, Boxin Shi, Xudong Jiang:
Pano-NeRF: Synthesizing High Dynamic Range Novel Views with Geometry from Sparse Low Dynamic Range Panoramic Images. 3927-3935 - Ziyang Lu, Yunqiang Pei, Guoqing Wang, Peiwei Li, Yang Yang, Yinjie Lei, Heng Tao Shen:
ScanERU: Interactive 3D Visual Grounding Based on Embodied Reference Understanding. 3936-3944 - Luanyuan Dai, Xiaoyu Du, Hanwang Zhang, Jinhui Tang:
MGNet: Learning Correspondences via Multiple Graphs. 3945-3953 - Ao Luo, Linxin Song, Keisuke Nonaka, Kyohei Unno, Heming Sun, Masayuki Goto, Jiro Katto:
SCP: Spherical-Coordinate-Based Learned Point Cloud Compression. 3954-3962 - Chunjie Luo, Fei Luo, Yusen Wang, Enxu Zhao, Chunxia Xiao:
DLCA-Recon: Dynamic Loose Clothing Avatar Reconstruction from Monocular Videos. 3963-3971 - Fulin Luo, Xi Chen, Xiuwen Gong, Weiwen Wu, Tan Guo:
Dual-Window Multiscale Transformer for Hyperspectral Snapshot Compressive Imaging. 3972-3980 - Naisong Luo, Rui Sun, Yuwen Pan, Tianzhu Zhang, Feng Wu:
Electron Microscopy Images as Set of Fragments for Mitochondrial Segmentation. 3981-3989
AAAI Technical Track on Computer Vision IV
- Run Luo, Zikai Song, Lintao Ma, Jinlin Wei, Wei Yang, Min Yang:
DiffusionTrack: Diffusion Model for Multi-Object Tracking. 3991-3999 - Shenghong Luo, Xuhang Chen, Weiwen Chen, Zinuo Li, Shuqiang Wang, Chi-Man Pun:
Devignet: High-Resolution Vignetting Removal via a Dual Aggregated Fusion Transformer with Adaptive Channel Expansion. 4000-4008 - Xiaotong Luo, Zekun Ai, Qiuyuan Liang, Ding Liu, Yuan Xie, Yanyun Qu, Yun Fu:
AdaFormer: Efficient Transformer with Adaptive Token Sparsification for Image Super-resolution. 4009-4016 - Xiaotong Luo, Yuan Xie, Yanyun Qu, Yun Fu:
SkipDiff: Adaptive Skip Diffusion Model for High-Fidelity Perceptual Image Super-resolution. 4017-4025 - Zhipeng Luo, Gongjie Zhang, Changqing Zhou, Zhonghua Wu, Qingyi Tao, Lewei Lu, Shijian Lu:
Modeling Continuous Motion for 3D Point Cloud Object Tracking. 4026-4034 - Changsheng Lv, Mengshi Qi, Xia Li, Zhengyuan Yang, Huadong Ma:
SGFormer: Semantic Graph Transformer for Point Cloud-Based 3D Scene Graph Generation. 4035-4043 - Cheng Lyu, Jiake Xie, Bo Xu, Cheng Lu, Han Huang, Xin Huang, Ming Wu, Chuang Zhang, Yong Tang:
Privileged Prior Information Distillation for Image Matting. 4044-4052 - Boyuan Ma, Xiang Yin, Jing Tan, Yongfeng Chen, Haiyou Huang, Hao Wang, Weihua Xue, Xiaojuan Ban:
FedST: Federated Style Transfer Learning for Non-IID Image Segmentation. 4053-4061 - Chen Ma, Ningfei Wang, Qi Alfred Chen, Chao Shen:
SlowTrack: Increasing the Latency of Camera-Based Perception in Autonomous Driving Using Adversarial Examples. 4062-4070 - Chenxi Ma:
Uncertainty-Aware GAN for Single Image Super Resolution. 4071-4079 - Fan Ma, Xiaojie Jin, Heng Wang, Jingjia Huang, Linchao Zhu, Yi Yang:
Stitching Segments and Sentences towards Generalization in Video-Text Pre-training. 4080-4088 - Feipeng Ma, Yizhou Zhou, Fengyun Rao, Yueyi Zhang, Xiaoyan Sun:
Image Captioning with Multi-Context Synthetic Data. 4089-4097 - Wan-Duo Kurt Ma, Avisek Lahiri, John P. Lewis, Thomas Leung, W. Bastiaan Kleijn:
Directed Diffusion: Direct Control of Object Placement through Attention Guidance. 4098-4106 - Yinchao Ma, Yuyang Tang, Wenfei Yang, Tianzhu Zhang, Jinpeng Zhang, Mengxue Kang:
Unifying Visual and Vision-Language Tracking via Contrastive Learning. 4107-4116 - Yue Ma, Yingqing He, Xiaodong Cun, Xintao Wang, Siran Chen, Xiu Li, Qifeng Chen:
Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos. 4117-4125 - Zhe Ma, Jianfeng Dong, Shouling Ji, Zhenguang Liu, Xuhong Zhang, Zonghui Wang, Sifeng He, Feng Qian, Xiaobo Zhang, Lei Yang:
Let All Be Whitened: Multi-Teacher Distillation for Efficient Visual Retrieval. 4126-4135 - Zhen-Xiang Ma, Zhen-Duo Chen, Li-Jun Zhao, Zi-Chao Zhang, Xin Luo, Xin-Shun Xu:
Cross-Layer and Cross-Sample Feature Optimization Network for Few-Shot Fine-Grained Image Classification. 4136-4144 - Zhiyuan Ma, Zhihuan Yu, Jianjun Li, Bowen Zhou:
LMD: Faster Image Reconstruction with Latent Masking Diffusion. 4145-4153 - Zhiyuan Ma, Guoli Jia, Bowen Zhou:
AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing. 4154-4161 - Huayu Mai, Rui Sun, Yuan Wang, Tianzhu Zhang, Feng Wu:
Pay Attention to Target: Relation-Aware Temporal Consistency for Domain Adaptive Video Semantic Segmentation. 4162-4170 - Oscar Mañas, Benno Krojer, Aishwarya Agrawal:
Improving Automatic VQA Evaluation Using Large Language Models. 4171-4179 - Ruiyu Mao, Ouyang Xu, Yunhui Guo:
Inconsistency-Based Data-Centric Active Open-Set Annotation. 4180-4188 - Ge Meng, Jingjia Huang, Yingying Wang, Zhenqi Fu, Xinghao Ding, Yue Huang:
Progressive High-Frequency Reconstruction for Pan-Sharpening with Implicit Neural Representation. 4189-4197 - Runqi Meng, Xiao Zhang, Shijie Huang, Yuning Gu, Guiqin Liu, Guangyu Wu, Nizhuan Wang, Kaicong Sun, Dinggang Shen:
NaMa: Neighbor-Aware Multi-Modal Adaptive Learning for Prostate Tumor Segmentation on Anisotropic MR Images. 4198-4206 - Li Mi, Syrielle Montariol, Javiera Castillo Navarro, Xianjie Dai, Antoine Bosselut, Devis Tuia:
ConVQG: Contrastive Visual Question Generation with Multimodal Guidance. 4207-4215 - Wenjun Miao, Guansong Pang, Xiao Bai, Tianqi Li, Jin Zheng:
Out-of-Distribution Detection in Long-Tailed Recognition with Calibrated Outlier Class Learning. 4216-4224 - Xiangyang Miao, Guobao Xiao, Shiping Wang, Jun Yu:
BCLNet: Bilateral Consensus Learning for Two-View Correspondence Pruning. 4225-4232 - Roy Miles, Krystian Mikolajczyk:
Understanding the Role of the Projector in Knowledge Distillation. 4233-4241 - Zijian Min, Gundu Mohamed Hassan, Geun-Sik Jo:
Robust Blind Text Image Deblurring via Maximum Consensus Framework. 4242-4250 - Shankhanil Mitra, Rajiv Soundararajan:
Knowledge Guided Semi-supervised Learning for Quality Assessment of User Generated Videos. 4251-4260 - Wentao Mo, Yang Liu:
Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA. 4261-4268 - Bahram Mohammadi, Yicong Hong, Yuankai Qi, Qi Wu, Shirui Pan, Javen Qinfeng Shi:
Augmented Commonsense Knowledge for Remote Object Grounding. 4269-4277 - Henrique Morimitsu, Xiaobin Zhu, Xiangyang Ji, Xu-Cheng Yin:
Recurrent Partial Kernel Network for Efficient Optical Flow Estimation. 4278-4286 - Andrey Moskalenko, Vlad Shakhuro, Anna Vorontsova, Anton Konushin, Anton Antonov, Alexander Krapukhin, Denis Shepelev, Konstantin Soshin:
TETRIS: Towards Exploring the Robustness of Interactive Segmentation. 4287-4295 - Chong Mou, Xintao Wang, Liangbin Xie, Yanze Wu, Jian Zhang, Zhongang Qi, Ying Shan:
T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models. 4296-4304 - Sahal Shaji Mullappilly, Abhishek Singh Gehlot, Rao Muhammad Anwer, Fahad Shahbaz Khan, Hisham Cholakkal:
Semi-supervised Open-World Object Detection. 4305-4314 - Géraldin Nanfack, Alexander Fulleringer, Jonathan Marty, Michael Eickenberg, Eugene Belilovsky:
Adversarial Attacks on the Interpretation of Neuron Activation Maximization. 4315-4324 - Zhangkai Ni, Peiqi Yang, Wenhan Yang, Hanli Wang, Lin Ma, Sam Kwong:
ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field. 4325-4333 - Xuesong Nie, Yunfeng Yan, Siyuan Li, Cheng Tan, Xi Chen, Haoyuan Jin, Zhihang Zhu, Stan Z. Li, Donglian Qi:
Wavelet-Driven Spatiotemporal Predictive Learning: Bridging Frequency and Time Variations. 4334-4342 - Li Niu, Junyan Cao, Yan Hong, Liqing Zhang:
Painterly Image Harmonization by Learning from Painterly Objects. 4343-4351 - Li Niu, Yan Hong, Junyan Cao, Liqing Zhang:
Progressive Painterly Image Harmonization from Low-Level Styles to High-Level Styles. 4352-4360 - Minyoung Oh, Duhyun Kim, Jae-Young Sim:
Domain Generalizable Person Search Using Unreal Dataset. 4361-4368 - Wenzhe Ouyang, Xiaolin Song, Bailan Feng, Zenglin Xu:
OctOcc: High-Resolution 3D Occupancy Prediction with Octree. 4369-4377 - Parth Padalkar, Huaduo Wang, Gopal Gupta:
NeSyFOLD: A Framework for Interpretable Image Classification. 4378-4387 - Wensheng Pan, Timin Gao, Yan Zhang, Xiawu Zheng, Yunhang Shen, Ke Li, Runze Hu, Yutao Liu, Pingyang Dai:
Semi-Supervised Blind Image Quality Assessment through Knowledge Distillation and Incremental Learning. 4388-4396 - Zhiyi Pan, Nan Zhang, Wei Gao, Shan Liu, Ge Li:
Less Is More: Label Recommendation for Weakly Supervised Point Cloud Semantic Segmentation. 4397-4405 - Zirui Pan, Mengbai Xiao, Xu Han, Dongxiao Yu, Guanghui Zhang, Yao Liu:
patchDPCC: A Patchwise Deep Compression Framework for Dynamic Point Clouds. 4406-4414 - Atharva Pandey, Vishal Yadav, Rajendra Nagar, Santanu Chaudhury:
LISR: Learning Linear 3D Implicit Surface Representation Using Compactly Supported Radial Basis Functions. 4415-4423 - Changsong Pang, Xieyuanli Chen, Yimin Liu, Huimin Lu, Yuwei Cheng:
RadarMOSEVE: A Spatial-Temporal Transformer Network for Radar-Only Moving Object Segmentation and Ego-Velocity Estimation. 4424-4432 - Sihwa Park, Seongjun Kim, Doeyoung Kwon, Yohan Jang, In-Seok Song, Seung Jun Baek:
NeBLa: Neural Beer-Lambert for 3D Reconstruction of Oral Structures from Panoramic Radiographs. 4433-4441 - Suho Park, Su Been Lee, Sangeek Hyun, Hyun Seok Seong, Jae-Pil Heo:
Task-Disruptive Background Suppression for Few-Shot Segmentation. 4442-4449 - Wenjie Pei, Tongqi Xia, Fanglin Chen, Jinsong Li, Jiandong Tian, Guangming Lu:
SA²VP: Spatially Aligned-and-Adapted Visual Prompt. 4450-4458 - Bo Peng, Xinyuan Chen, Yaohui Wang, Chaochao Lu, Yu Qiao:
ConditionVideo: Training-Free Condition-Guided Video Generation. 4459-4467 - Dezhi Peng, Chongyu Liu, Yuliang Liu, Lianwen Jin:
ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining. 4468-4477 - Jinlong Peng, Zekun Luo, Liang Liu, Boshen Zhang:
FRIH: Fine-Grained Region-Aware Image Harmonization. 4478-4486 - Kunyu Peng, Cheng Yin, Junwei Zheng, Ruiping Liu, David Schneider, Jiaming Zhang, Kailun Yang, M. Saquib Sarfraz, Rainer Stiefelhagen, Alina Roitberg:
Navigating Open Set Scenarios for Skeleton-Based Action Recognition. 4487-4496 - Renyuan Peng, Xinyue Cai, Hang Xu, Jiachen Lu, Feng Wen, Wei Zhang, Li Zhang:
LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement. 4497-4505 - Wenshuo Peng, Kaipeng Zhang, Yue Yang, Hao Zhang, Yu Qiao:
Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification. 4506-4514 - Zelin Peng, Zhengqin Xu, Zhilin Zeng, Xiaokang Yang, Wei Shen:
SAM-PARSER: Fine-Tuning SAM Efficiently by Parameter Space Reconstruction. 4515-4523 - Yayun Qi, Wentian Zhao, Xinxiao Wu:
Relational Distant Supervision for Image Captioning without Image-Text Pairs. 4524-4532 - Zhaobo Qi, Yibo Yuan, Xiaowen Ruan, Shuhui Wang, Weigang Zhang, Qingming Huang:
Bias-Conflict Sample Synthesis and Adversarial Removal Debias Strategy for Temporal Sentence Grounding in Video. 4533-4541 - Tianwen Qian, Jingjing Chen, Linhai Zhuo, Yang Jiao, Yu-Gang Jiang:
NuScenes-QA: A Multi-Modal Visual Question Answering Benchmark for Autonomous Driving Scenario. 4542-4550 - Zhipeng Qian, Yiwei Ma, Jiayi Ji, Xiaoshuai Sun:
X-RefSeg3D: Enhancing Referring 3D Instance Segmentation via Structured Cross-Modal Graph Neural Networks. 4551-4559 - Yuming Qiao, Fanyi Wang, Jingwen Su, Yanhao Zhang, Yunjie Yu, Siyu Wu, Guo-Jun Qi:
BARET: Balanced Attention Based Real Image Editing Driven by Target-Text Inversion. 4560-4568 - Minghan Qin, Yifan Liu, Yuelang Xu, Xiaochen Zhao, Yebin Liu, Haoqian Wang:
High-Fidelity 3D Head Avatars Reconstruction through Spatially-Varying Expression Conditioned Neural Radiance Field. 4569-4577 - Yiming Qin, Nanxuan Zhao, Bin Sheng, Rynson W. H. Lau:
Text2City: One-Stage Text-Driven Urban Layout Regeneration. 4578-4586 - Changqing Qiu, Fusheng Jin, Yining Zhang:
Empowering CAM-Based Methods with Capability to Generate Fine-Grained and High-Faithfulness Explanations. 4587-4595 - Liuxiang Qiu, Si Chen, Yan Yan, Jing-Hao Xue, Da-Han Wang, Shunzhi Zhu:
High-Order Structure Based Middle-Feature Learning for Visible-Infrared Person Re-identification. 4596-4604 - Longtian Qiu, Shan Ning, Xuming He:
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training. 4605-4613 - Zexuan Qiu, Jiahong Liu, Yankai Chen, Irwin King:
HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval. 4614-4622 - Jiahui Qu, Jie He, Wenqian Dong, Jingyu Zhao:
S2CycleDiff: Spatial-Spectral-Bilateral Cycle-Diffusion Framework for Hyperspectral Image Super-resolution. 4623-4631 - Qiang Qu, Yiran Shen, Xiaoming Chen, Yuk Ying Chung, Tongliang Liu:
E2HQV: High-Quality Video Generation from Event Camera via Theory-Inspired Model-Aided Deep Learning. 4632-4640 - Sameera Ramasinghe, Violetta Shevchenko, Gil Avraham, Anton van den Hengel:
BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling. 4641-4649 - Qi Rao, Ke Sun, Xiaohan Wang, Qi Wang, Bang Zhang:
Cross-Sentence Gloss Consistency for Continuous Sign Language Recognition. 4650-4658 - Haziq Razali, Yiannis Demiris:
Forecasting Bimanual Object Manipulation Sequences from Unimanual Observations. 4659-4666 - Zhiyao Ren, Yibing Zhan, Liang Ding, Gaoang Wang, Chaoyue Wang, Zhongyi Fan, Dacheng Tao:
Multi-Step Denoising Scheduled Sampling: Towards Alleviating Exposure Bias for Diffusion Models. 4667-4675 - Yi Rong, Haoran Zhou, Lixin Yuan, Cheng Mei, Jiahao Wang, Tong Lu:
CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers. 4676-4685 - Bardia Safaei, Vibashan VS, Celso M. de Melo, Vishal M. Patel:
Entropic Open-Set Active Learning. 4686-4694 - Dvir Samuel, Rami Ben-Ari, Simon Raviv, Nir Darshan, Gal Chechik:
Generating Images of Rare Concepts Using Pre-trained Diffusion Models. 4695-4703 - Divya Saxena, Jiannong Cao, Jiahao Xu, Tarun Kulshrestha:
RG-GAN: Dynamic Regenerative Pruning for Data-Efficient Generative Adversarial Networks. 4704-4712 - Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger, Michael Felsberg:
SeTformer Is What You Need for Vision and Language. 4713-4721 - Kai Shang, Mingwen Shao, Chao Wang, Yuanshuo Cheng, Shuigen Wang:
Multi-Domain Multi-Scale Diffusion Model for Low-Light Image Enhancement. 4722-4730 - Hao Shao, Yang Zhang, Qibin Hou:
Polyper: Boundary Sensitive Polyp Segmentation. 4731-4739 - Shuai Shao, Yu Bai, Yan Wang, Baodi Liu, Bin Liu:
Collaborative Consortium of Foundation Models for Open-World Few-Shot Learning. 4740-4747 - Gil Shapira, Yosi Keller:
FaceCoresetNet: Differentiable Coresets for Face Set Recognition. 4748-4756 - Cuifeng Shen, Yulu Gan, Chen Chen, Xiongwei Zhu, Lele Cheng, Tingting Gao, Jinzhi Wang:
Decouple Content and Motion for Conditional Image-to-Video Generation. 4757-4765 - Haozhan Shen, Tiancheng Zhao, Mingwei Zhu, Jianwei Yin:
GroundVLP: Harnessing Zero-Shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection. 4766-4775 - Hongyu Shen, Mingtao Pei, Juncai Liu, Zhaoxing Tian:
Automatic Radiology Reports Generation via Memory Alignment Network. 4776-4783 - Junao Shen, Kun Kuang, Jiaheng Wang, Xinyu Wang, Tian Feng, Wei Zhang:
CGMGM: A Cross-Gaussian Mixture Generative Model for Few-Shot Semantic Segmentation. 4784-4792 - Lingdong Shen, Chunlei Huo, Nuo Xu, Chaowei Han, Zichen Wang:
Learn How to See: Collaborative Embodied Learning for Object Detection and Camera Adjusting. 4793-4801 - Xiaobo Shen, Peizhuo Song, Yun-Hao Yuan, Yuhui Zheng:
Distributed Manifold Hashing for Image Set Classification and Retrieval. 4802-4810 - Xiaolong Shen, Jianxin Ma, Chang Zhou, Zongxin Yang:
Controllable 3D Face Generation with Conditional Style Code Diffusion. 4811-4819 - Mengmeng Sheng, Zeren Sun, Zhenhuang Cai, Tao Chen, Yichao Zhou, Yazhou Yao:
Adaptive Integration of Partial Label Learning and Negative Learning for Enhanced Noisy Label Learning. 4820-4828 - Jinsong Shi, Pan Gao, Jie Qin:
Transformer-Based No-Reference Image Quality Assessment via Supervised Contrastive Learning. 4829-4837 - Liangtao Shi, Bineng Zhong, Qihua Liang, Ning Li, Shengping Zhang, Xianxian Li:
Explicit Visual Prompts for Visual Object Tracking. 4838-4846 - Ruohua Shi, Lingyu Duan, Tiejun Huang, Tingting Jiang:
Evidential Uncertainty-Guided Mitochondria Segmentation for 3D EM Images. 4847-4855 - Sang-Heon Shim, Jiwoo Chung, Jae-Pil Heo:
Towards Squeezing-Averse Virtual Try-On via Sequential Deformation. 4856-4863 - Zhongyi Shui, Sunyi Zheng, Chenglu Zhu, Shichuan Zhang, Xiaoxuan Yu, Honglin Li, Jingxiong Li, Pingyi Chen, Lin Yang:
DPA-P2PNet: Deformable Proposal-Aware P2PNet for Accurate Point-Based Cell Detection. 4864-4872 - Nyle Siddiqui, Praveen Tirupattur, Mubarak Shah:
DVANet: Disentangling View and Action Features for Multi-View Action Recognition. 4873-4881 - Jaeyoon Sim, Sooyeon Jeon, Injun Choi, Guorong Wu, Won Hwa Kim:
Learning to Approximate Adaptive Kernel Convolution on Graphs. 4882-4890 - Ayush Singh, Aayush J. Rana, Akash Kumar, Shruti Vyas, Yogesh Singh Rawat:
Semi-supervised Active Learning for Video Action Detection. 4891-4899 - Chen Song, Chandrajit Bajaj, Qixing Huang:
DeblurSR: Event-Based Motion Deblurring under the Spiking Representation. 4900-4908 - Heping Song, Jingyao Gong, Hongying Meng, Yuping Lai:
Multi-Cross Sampling and Frequency-Division Reconstruction for Image Compressed Sensing. 4909-4917 - Huihui Song, Tiankang Su, Yuhui Zheng, Kaihua Zhang, Bo Liu, Dong Liu:
Generalizable Fourier Augmentation for Unsupervised Video Object Segmentation. 4918-4924 - Kaiyou Song, Shan Zhang, Tong Wang:
Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning. 4925-4933 - Mingchen Song, Huiqiang Wang, Guoqiang Zhong:
Self-Prompt Mechanism for Few-Shot Image Recognition. 4934-4942 - Zifan Song, Guosheng Hu, Cairong Zhao:
Diverse Person: Customize Your Own Dataset for Text-Based Person Search. 4943-4951 - Kun Su, Judith Yue Li, Qingqing Huang, Dima Kuzmin, Joonseok Lee, Chris Donahue, Fei Sha, Aren Jansen, Yu Wang, Mauro Verzetti, Timo I. Denk:
V2Meow: Meowing to the Visual Beat via Video-to-Music Generation. 4952-4960 - Sitong Su, Jianzhi Liu, Lianli Gao, Jingkuan Song:
F³-Pruning: A Training-Free and Generalized Pruning Strategy towards Faster and Finer Text-to-Video Synthesis. 4961-4969 - Yuchao Su, Yuanman Li, Wei Wang, Jiantao Zhou, Xia Li:
A Unified Environmental Network for Pedestrian Trajectory Prediction. 4970-4978 - Yuchen Su, Zhineng Chen, Zhiwen Shao, Yuning Du, Zhilong Ji, Jinfeng Bai, Yong Zhou, Yu-Gang Jiang:
LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation Network. 4979-4987 - Yukun Su, Yiwen Cao, Jingliang Deng, Fengyun Rao, Qingyao Wu:
Spatial-Semantic Collaborative Cropping for User Generated Content. 4988-4997 - Hao Sun, Mingyao Zhou, Wenjing Chen, Wei Xie:
TR-DETR: Task-Reciprocal Transformer for Joint Moment Retrieval and Highlight Detection. 4998-5007 - Meiqi Sun, Zhonghan Zhao, Wenhao Chai, Hanjun Luo, Shidong Cao, Yanting Zhang, Jenq-Neng Hwang, Gaoang Wang:
UniAP: Towards Universal Animal Perception in Vision via Few-Shot Learning. 5008-5016 - Shoukun Sun, Min Xian, Fei Xu, Luca Capriotti, Tiankai Yao:
CFR-ICL: Cascade-Forward Refinement with Iterative Click Loss for Interactive Image Segmentation. 5017-5024 - Xinyu Sun, Zhikun Zhao, Lili Wei, Congyan Lang, Mingxuan Cai, Longfei Han, Juan Wang, Bing Li, Yuxuan Guo:
RL-SeqISP: Reinforcement Learning-Based Sequential Optimization for Image Signal Processing. 5025-5033 - Yuxuan Sun, Chenglu Zhu, Sunyi Zheng, Kai Zhang, Lin Sun, Zhongyi Shui, Yunlong Zhang, Honglin Li, Lin Yang:
PathAsst: A Generative Foundation AI Assistant towards Artificial General Intelligence of Pathology. 5034-5042 - Zhaoxu Sun, Yuze Xuan, Fang Liu, Yang Xiang:
FG-EmoTalk: Talking Head Video Generation with Fine-Grained Controllable Facial Expressions. 5043-5051 - Chuangchuang Tan, Yao Zhao, Shikui Wei, Guanghua Gu, Ping Liu, Yunchao Wei:
Frequency-Aware Deepfake Detection: Improving Generalizability through Frequency Space Domain Learning. 5052-5060 - Hao Tan, Jun Li, Yizhuang Zhou, Jun Wan, Zhen Lei, Xiangyu Zhang:
Compound Text-Guided Prompt Tuning via Image-Adaptive Cues. 5061-5069 - Lei Tan, Jiaer Xia, Wenfeng Liu, Pingyang Dai, Yongjian Wu, Liujuan Cao:
Occluded Person Re-identification via Saliency-Guided Patch Transfer. 5070-5078 - Shuai Tan, Bin Ji, Ye Pan:
Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style. 5079-5087 - Shuai Tan, Bin Ji, Yu Ding, Ye Pan:
Say Anything with Any Style. 5088-5096
AAAI Technical Track on Computer Vision V
- Zhaorui Tan, Xi Yang, Kaizhu Huang:
Semantic-Aware Data Augmentation for Text-to-Image Synthesis. 5098-5107 - Bowen Tang, Jing Zhang, Long Yan, Qian Yu, Lu Sheng, Dong Xu:
Data-Free Generalized Zero-Shot Learning. 5108-5117 - Chuanbo Tang, Xihua Sheng, Zhuoyuan Li, Haotian Zhang, Li Li, Dong Liu:
Offline and Online Optical Flow Enhancement for Deep Video Compression. 5118-5126 - Keke Tang, Xu He, Weilong Peng, Jianpeng Wu, Yawen Shi, Daizong Liu, Pan Zhou, Wenping Wang, Zhihong Tian:
Manifold Constraints for Imperceptible Adversarial Attacks on Point Clouds. 5127-5135 - Long Tang, Dengpan Ye, Yunna Lv, Chuanxi Chen, Yunming Zhang:
Once and for All: Universal Transferable Adversarial Perturbation against Deep Hashing-Based Facial Image Retrieval. 5136-5144 - Peng Tang, Zhiqiang Xu, Chunlai Zhou, Pengfei Wei, Peng Han, Xin Cao, Tobias Lasser:
Prior and Prediction Inverse Kernel Transformer for Single Image Defocus Deblurring. 5145-5153 - Qi Tang, Yao Zhao, Meiqin Liu, Jian Jin, Chao Yao:
Semantic Lens: Instance-Centric Semantic Alignment for Video Super-resolution. 5154-5161 - Shengji Tang, Peng Ye, Baopu Li, Weihao Lin, Tao Chen, Tong He, Chong Yu, Wanli Ouyang:
Boosting Residual Networks with Group Knowledge. 5162-5170 - Yiwen Tang, Ray Zhang, Zoey Guo, Xianzheng Ma, Bin Zhao, Zhigang Wang, Dong Wang, Xuelong Li:
Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models. 5171-5179 - Yuanmin Tang, Jing Yu, Keke Gai, Jiamin Zhuang, Gang Xiong, Yue Hu, Qi Wu:
Context-I2W: Mapping Images to Context-Dependent Words for Accurate Zero-Shot Composed Image Retrieval. 5180-5188 - Zhangyong Tang, Tianyang Xu, Xiaojun Wu, Xuefeng Zhu, Josef Kittler:
Generative-Based Fusion Mechanism for Multi-Modal Tracking. 5189-5197 - Xinhao Tao, Junyan Cao, Yan Hong, Li Niu:
Shadow Generation with Decomposed Mask Prediction and Attentive Shadow Filling. 5198-5206 - Kaibin Tian, Yanhua Cheng, Yi Liu, Xinglin Hou, Quan Chen, Han Li:
Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning. 5207-5214 - Wentao Tian, Zheng Wang, Yuqian Fu, Jingjing Chen, Lechao Cheng:
Open-Vocabulary Video Relation Extraction. 5215-5223 - Yanling Tian, Di Chen, Yunan Liu, Jian Yang, Shanshan Zhang:
Divide and Conquer: Hybrid Pre-training for Person Search. 5224-5232 - Kun Tong, Chengze Jiang, Jie Gui, Yuan Cao:
Taxonomy Driven Fast Adversarial Training. 5233-5242 - Xin Tong, Shi Peng, Yufei Guo, Xuhui Huang:
End-to-End Real-Time Vanishing Point Detection with Transformer. 5243-5251 - Siddharth Tourani, Muhammad Haris Khan, Carsten Rother, Bogdan Savchynskyy:
Discrete Cycle-Consistency Based Unsupervised Deep Graph Matching. 5252-5260 - Esteve Valls Mascaro, Hyemin Ahn, Dongheui Lee:
A Unified Masked Autoencoder with Patchified Skeletons for Motion Synthesis. 5261-5269 - Lucas Ventura, Antoine Yang, Cordelia Schmid, Gül Varol:
CoVR: Learning Composed Video Retrieval from Web Video Captions. 5270-5279 - Thanh Vu, Baochen Sun, Bodi Yuan, Alex Ngai, Yueqi Li, Jan-Michael Frahm:
Supervision Interpolation via LossMix: Generalizing Mixup for Object Detection and Beyond. 5280-5288 - Chase Walker, Sumit Kumar Jha, Kenny Chen, Rickard Ewetz:
Integrated Decision Gradients: Compute Your Attributions Where the Model Makes Its Decision. 5289-5297 - Angtian Wang, Yuanlu Xu, Nikolaos Sarafianos, Robert Maier, Edmond Boyer, Alan L. Yuille, Tony Tung:
HISR: Hybrid Implicit Surface Representation for Photorealistic 3D Human Reconstruction. 5298-5308 - Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He:
VIGC: Visual Instruction Generation and Correction. 5309-5317 - Chenyang Wang, Junjun Jiang, Kui Jiang, Xianming Liu:
Low-Light Face Super-resolution via Illumination, Structure, and Texture Associated Representation. 5318-5326 - Cong Wang, Jinshan Pan, Wanyu Lin, Jiangxin Dong, Wei Wang, Xiao-Ming Wu:
SelfPromer: Self-Prompt Dehazing Transformers with Depth-Consistency. 5327-5335 - Cong Wang, Jinshan Pan, Wei Wang, Gang Fu, Siyuan Liang, Mengzhu Wang, Xiao-Ming Wu, Jun Liu:
Correlation Matching Transformation Transformers for UHD Image Restoration. 5336-5344 - Fei Wang, Dan Guo, Kun Li, Meng Wang:
EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer. 5345-5353 - Fengxiang Wang, Wanrong Huang, Shaowu Yang, Qi Fan, Long Lan:
Learning to Learn Better Visual Prompts. 5354-5363 - Guanjie Wang, Zehua Ma, Chang Liu, Xi Yang, Han Fang, Weiming Zhang, Nenghai Yu:
MuST: Robust Image Watermarking for Multi-Source Tracing. 5364-5371 - Haixin Wang, Jianlong Chang, Yihang Zhai, Xiao Luo, Jinan Sun, Zhouchen Lin, Qi Tian:
LION: Implicit Vision Prompt Tuning. 5372-5380 - Hao Wang, Qiang Song, Ruofeng Yin, Rui Ma:
B-spine: Learning B-spline Curve Representation for Robust and Interpretable Spinal Curvature Estimation. 5381-5389 - Hao Wang, Fang Liu, Licheng Jiao, Jiahao Wang, Zehua Hao, Shuo Li, Lingling Li, Puhua Chen, Xu Liu:
ViLT-CLIP: Video and Language Tuning CLIP with Multimodal Prompt Learning and Scenario-Guided Optimization. 5390-5400 - Haoan Wang, Shilong Jia, Tieyong Zeng, Guixu Zhang, Zhi Li:
Triple Feature Disentanglement for One-Stage Adaptive Object Detection. 5401-5409 - Haoxiang Wang, Tao Yu, Tianwei Yang, Hui Qiao, Qionghai Dai:
Neural Physical Simulation with Multi-Resolution Hash Grid Encoding. 5410-5418 - Hebaixu Wang, Meiqi Gong, Xiaoguang Mei, Hao Zhang, Jiayi Ma:
Deep Unfolded Network with Intrinsic Supervision for Pan-Sharpening. 5419-5426 - Hexiang Wang, Fengqi Liu, Qianyu Zhou, Ran Yi, Xin Tan, Lizhuang Ma:
Continuous Piecewise-Affine Based Motion Model for Image Animation. 5427-5435 - Hongyu Wang, Xiaotao Liu, Yifan Li, Meng Sun, Dian Yuan, Jing Liu:
Temporal Adaptive RGBT Tracking with Modality Prompt. 5436-5444 - Jiahao Wang, Caixia Yan, Weizhan Zhang, Huan Liu, Hao Sun, Qinghua Zheng:
SAUI: Scale-Aware Unseen Imagineer for Zero-Shot Object Detection. 5445-5453 - Jiangang Wang, Yuning Cui, Yawen Li, Wenqi Ren, Xiaochun Cao:
Omnidirectional Image Super-resolution via Bi-projection Fusion. 5454-5462 - Jing Wang, Jiangyun Li, Chen Chen, Yisi Zhang, Haoran Shen, Tianxiang Zhang:
Adaptive FSS: A Novel Few-Shot Segmentation Framework via Prototype Enhancement. 5463-5471 - Jun Wang, Ying Cui, Dongyan Guo, Junxia Li, Qingshan Liu, Chunhua Shen:
PointAttN: You Only Need Attention for Point Cloud Completion. 5472-5480 - Junjue Wang, Zhuo Zheng, Zihang Chen, Ailong Ma, Yanfei Zhong:
EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering. 5481-5489 - Kewei Wang, Yizheng Wu, Zhiyu Pan, Xingyi Li, Ke Xian, Zhe Wang, Zhiguo Cao, Guosheng Lin:
Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix. 5490-5498 - Keyao Wang, Guosheng Zhang, Haixiao Yue, Ajian Liu, Gang Zhang, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang:
Multi-Domain Incremental Learning for Face Presentation Attack Detection. 5499-5507 - Kun Wang, Zhiqiang Yan, Huang Tian, Zhenyu Zhang, Xiang Li, Jun Li, Jian Yang:
AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization. 5508-5516 - Mengmeng Wang, Jiazheng Xing, Boyuan Jiang, Jun Chen, Jianbiao Mei, Xingxing Zuo, Guang Dai, Jingdong Wang, Yong Liu:
A Multimodal, Multi-Task Adapting Framework for Video Action Recognition. 5517-5525 - Miaohui Wang, Runnan Huang, Hengjin Dong, Di Lin, Yun Song, Wuyuan Xie:
msLPCC: A Multimodal-Driven Scalable Framework for Deep LiDAR Point Cloud Compression. 5526-5534 - Ning Wang, Jiajun Deng, Mingbo Jia:
Cycle-Consistency Learning for Captioning and Grounding. 5535-5543 - Ruichen Wang, Zekang Chen, Chen Chen, Jian Ma, Haonan Lu, Xiaodong Lin:
Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models. 5544-5552