


default search action
39th AAAI 2025: Philadelphia, PA, USA
- Toby Walsh, Julie Shah, Zico Kolter:

AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25 - March 4, 2025, Philadelphia, PA, USA. AAAI Press 2025, ISBN 978-1-57735-897-8
Technical Tracks 1
- Seokho Ahn, Hyungjin Kim

, Sungbok Shin
, Young-Duk Seo:
Real-Time Calibration Model for Low-Cost Sensor in Fine-Grained Time Series. 3-11 - Randy Ardywibowo, Rakesh Sunki, Shin Tsz Lucy Kuo, Sankalp Nayak:

BayesCNS: A Unified Bayesian Approach to Address Cold Start and Non-Stationarity in Search Systems at Scale. 12-20 - Feiyang Cai, Chuchu Fan, Stanley Bak:

Scalable Surrogate Verification of Image-Based Neural Network Control Systems Using Composition and Unrolling. 21-30 - Biwei Cao, Qihang Wu, Jiuxin Cao, Bo Liu, Jie Gui:

External Reliable Information-enhanced Multimodal Contrastive Learning for Fake News Detection. 31-39 - Ji Cao

, Tongya Zheng, Qinghong Guo, Yu Wang, Junshu Dai, Shunyu Liu, Jie Yang, Jie Song, Mingli Song:
Holistic Semantic Representation for Navigational Trajectory Generation. 40-48 - Jipeng Cen, Jiaxin Liu, Zhixu Li, Jingjing Wang:

SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent Collaboration. 49-57 - Geng Chen

, Wuyuan Xie, Di Lin, Ye Liu, Miaohui Wang:
mmFAS: Multimodal Face Anti-Spoofing Using Multi-Level Alignment and Switch-Attention Fusion. 58-66 - Jie Chen

, Liangmin Wang, Huijuan Zhu, Victor S. Sheng:
CLEP: A Novel Contrastive Learning Method for Evolutionary Reentrancy Vulnerability Detection. 67-74 - Xiaocan Chen, Qilin Yin, Jiarui Liu, Wei Lu, Xiangyang Luo, Jiantao Zhou:

GLCF: A Global-Local Multimodal Coherence Analysis Framework for Talking Face Generation Detection. 75-83 - Zhe Chen, Zhe Fang, Wenhao Tian, Zhaoguang Long, Changzhi Sun, Yuefeng Chen, Hao Yuan, Honglin Li, Man Lan:

ReactGPT: Understanding of Chemical Reactions via In-Context Tuning. 84-92 - Kaihui Cheng, Ce Liu, Qingkun Su, Jun Wang, Liwei Zhang, Yining Tang, Yao Yao, Siyu Zhu, Yuan Qi:

4D Diffusion for Dynamic Protein Structure Prediction with Reference and Motion Guidance. 93-101 - Xiaoxia Cheng, Zeqi Tan, Zhe Zheng, Weiming Lu:

G2LDetect: A Global-to-Local Approach for Hallucination Detection. 102-109 - Sungjun Cho, Dae-Woong Jeong, Sung Moon Ko, Jinwoo Kim, Sehui Han, Seunghoon Hong, Honglak Lee, Moontae Lee:

3D Denoisers Are Good 2D Teachers: Molecular Pretraining via Denoising and Cross-Modal Distillation. 110-118 - Muzhi Dai, Zhuoer Dong, Weining Fu, Kui Xu, Qiangfeng Cliff Zhang:

CryoDomain: Sequence-free Protein Domain Identification from Low-resolution Cryo-EM Density Maps. 119-127 - Zhenlong Dai, Bingrui Chen, Zhuoluo Zhao, Xiu Tang, Sai Wu, Chang Yao, Zhipeng Gao, Jingyuan Chen:

Less Is More: Adaptive Program Repair with Bug Localization and Preference Learning. 128-136 - Chao Deng, Hongdong Li, Jianxin Wang:

Improving Cancer Gene Prediction by Enhancing Common Information Between the PPI Network and Gene Functional Association. 137-145 - Saaketh Desai, Sadhvikas Addamane, Jeffrey Y. Tsao, Igal Brener, Laura P. Swiler, Rémi Dingreville, Prasad P. Iyer

:
AutoSciLab: A Self-Driving Laboratory for Interpretable Scientific Discovery. 146-154 - Zhihao Ding, Ting Zhang, Yiran Li, Jieming Shi

, Chen Jason Zhang:
RingFormer: A Ring-Enhanced Graph Transformer for Organic Solar Cell Property Prediction. 155-163 - Zhiang Dong, Jingyuan Chen, Fei Wu:

Knowledge Is Power: Harnessing Large Language Models for Enhanced Cognitive Diagnosis. 164-172 - Yitong Duan, Weiran Wang, Jian Li:

FactorGCL: A Hypergraph-Based Factor Model with Temporal Residual Contrastive Learning for Stock Returns Prediction. 173-181 - Haodong Feng, Yue Wang, Dixia Fan:

How to Re-enable PDE Loss for Physical Systems Modeling Under Partial Observation. 182-190 - Myles Foley

, Sergio Maffeis:
APIRL: Deep Reinforcement Learning for REST API Fuzzing. 191-199 - Daniel Freedman, Eyal Rozenberg, Alex M. Bronstein:

A Theoretical Framework for an Efficient Normalizing Flow-Based Solution to the Electronic Schrödinger Equation. 200-209 - Lihao Gan, Xin Man, Chenghong Zhang

, Jie Shao:
EWMoE: An Effective Model for Global Weather Forecasting with Mixture-of-Experts. 210-218 - Zhangyang Gao, Cheng Tan, Jue Wang, Yufei Huang, Lirong Wu, Stan Z. Li:

FoldToken: Learning Protein Language via Vector Quantization and Beyond. 219-227 - Hao Guo, Zihan Ma, Zhi Zeng, Minnan Luo, Weixin Zeng, Jiuyang Tang, Xiang Zhao:

Each Fake News Is Fake in Its Own Way: An Attribution Multi-Granularity Benchmark for Multimodal Fake News Detection. 228-236 - Rong Han, Wenbing Huang, Lingxiao Luo, Xinyan Han, Jiaming Shen, Zhiqiang Zhang, Jun Zhou, Ting Chen:

HeMeNet: Heterogeneous Multichannel Equivariant Network for Protein Multi-task Learning. 237-245 - Rong Han, Xiaohong Liu, Tong Pan

, Jing Xu, Xiaoyu Wang, Wuyang Lan, Zhenyu Li, Zixuan Wang, Jiangning Song, Guangyu Wang, Ting Chen:
CoPRA: Bridging Cross-domain Pretrained Sequence Models with Complex Structures for Protein-RNA Binding Affinity Prediction. 246-254 - Xiao Han

, Zijian Zhang, Xiangyu Zhao
, Yuanshao Zhu
, Guojiang Shen, Xiangjie Kong, Xuetao Wei, Liqiang Nie, Jieping Ye:
GARLIC: GPT-Augmented Reinforcement Learning with Intelligent Control for Vehicle Dispatching. 255-263 - Zehua Han, Jing Xiao, Qirui Zhao, Zhexuan Cui, Yufeng Wang, Duona Zhang, Wenrui Ding:

Open-world Radio Frequency Fingerprint Identification via Augmented Semi-supervised Learning. 264-272 - Harish Haresamudram, Apoorva Beedu, Mashfiqui Rabbi, Sankalita Saha, Irfan Essa, Thomas Ploetz:

Limitations in Employing Natural Language Supervision for Sensor-Based Human Activity Recognition - And Ways to Overcome Them. 273-281 - Meixia He, Peican Zhu, Keke Tang, Yangming Guo:

Hypergraph Attacks via Injecting Homogeneous Nodes into Elite Hyperedges. 282-290 - Qiang He, Yunting Bao, Hui Fang, Yuting Lin, Hao Sun:

HHAN: Comprehensive Infectious Disease Source Tracing via Heterogeneous Hypergraph Neural Network. 291-299 - Chia-Tung Ho, Haoxing Ren, Brucek Khailany:

VerilogCoder: Autonomous Verilog Coding Agents with Graph-based Planning and Abstract Syntax Tree (AST)-based Waveform Tracing Tool. 300-307 - Tran Thai Hoa, Tran Quang Duy, Khanh Quoc Tran, Kiet Van Nguyen:

ViFactCheck: A New Benchmark Dataset and Methods for Multi-Domain News Fact-Checking In Vietnamese. 308-316 - Jingjing Hu

, Dan Guo, Zhan Si, Deguang Liu
, Yunfeng Diao, Jing Zhang, Jinxing Zhou, Meng Wang:
MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights. 317-325 - Xinlei Huang, Zhiqi Ma, Dian Meng

, Yanran Liu, Shiwei Ruan, Qingqiang Sun, Xubin Zheng, Ziyue Qiao:
PRAGA: Prototype-aware Graph Adaptive Aggregation for Spatial Multi-modal Omics Analysis. 326-333 - Yinxuan Huang, Ke Liang, Yanyi Huang, Xiang Zeng, Kai Chen

, Bin Zhou:
Social Recommendation via Graph-Level Counterfactual Augmentation. 334-342 - Zhiheng Huang, Yannan Liu, Daojing He, Yu Li:

DF-MIA: A Distribution-Free Membership Inference Attack on Fine-Tuned Large Language Models. 343-351 - Pengcheng Jiang, Cao Xiao, Tianfan Fu, Parminder Bhatia, Taha A. Kass-Hout, Jimeng Sun, Jiawei Han:

Bi-level Contrastive Learning for Knowledge-Enhanced Molecule Representations. 352-360 - Sizhuo Jin, Shuo Chen, Jianjun Qian, Ying Tai, Jun Li:

Learning Generalized Residual Exchange-Correlation-Uncertain Functional for Density Functional Theory. 361-369 - Feifei Kou, Yuhan Yao, Siyuan Yao

, Jiahao Wang, Lei Shi, Yawen Li, Xuejing Kang:
IWRN: A Robust Blind Watermarking Method for Artwork Image Copyright Protection Against Noise Attack. 370-378 - Yao Lai

, Sungyoung Lee, Guojin Chen, Souradip Poddar, Mengkang Hu, David Z. Pan, Ping Luo:
AnalogCoder: Analog Circuit Design via Training-Free Code Generation. 379-387 - Hao Li

, Ruoyuan Gong, Hao Jiang:
Political Actor Agent: Simulating Legislative System for Roll Call Votes Prediction with Large Language Models. 388-396 - Haoran Li, Yulin Chen, Zihao Zheng, Qi Hu, Chunkit Chan, Heshan Liu, Yangqiu Song:

Simulate and Eliminate: Revoke Backdoors for Generative Large Language Models. 397-405 - Haoran Li

, Xingjian Li, Jiahua Shi
, Huaming Chen, Bo Du
, Daisuke Kihara, Johan Barthélemy, Jun Shen
, Min Xu
:
Vox-UDA: Voxel-wise Unsupervised Domain Adaptation for Cryo-Electron Subtomogram Segmentation with Denoised Pseudo-Labeling. 406-414 - Junxian Li, Di Zhang, Xunzhi Wang, Zeying Hao, Jingdi Lei, Qian Tan, Cai Zhou, Wei Liu, Yaotian Yang, Xinrui Xiong, Weiyun Wang

, Zhe Chen, Wenhai Wang, Wei Li, Mao Su, Shufei Zhang, Wanli Ouyang, Yuqiang Li, Dongzhan Zhou:
ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area. 415-423 - Kai Li, Wenqi Ren, Jianshu Li, Wei Wang

, Xiaochun Cao:
Critical Forgetting-Based Multi-Scale Disentanglement for Deepfake Detection. 424-432 - Mingxin Li, Yuchen Zhang, Haowei Xu, Xianghua Li, Chao Gao

, Zhen Wang:
Learning Complex Heterogeneous Multimodal Fake News via Social Latent Network Inference. 433-441 - Tian Li, Xiao-Yue Xu, Chen Ding, Tian-Ci Tian, Wei-You Liao, Shuo Zhang, He-Liang Huang

:
AI-Powered Algorithm-Centric Quantum Processor Topology Design. 442-450 - Zhiting Li, Shibai Yin, Tai-Xiang Jiang, Yexun Hu, Jia-Mian Wu, Guowei Yang, Guisong Liu:

Enhancing the Adversarial Robustness via Manifold Projection. 451-459 - Zhufeng Li, Sandeep Suresh Cranganore, Nicholas D. Youngblut, Niki Kilbertus:

Whole Genome Transformer for Gene Interaction Effects in Microbiome Habitat Specificity. 460-469 - Zongwei Li

, Xiaoqi Li
, Wenkai Li, Xin Wang:
SCALM: Detecting Bad Practices in Smart Contracts Through LLMs. 470-477 - Yuqi Liang, Jun Luo, Xiaoxi Guo, Jianqi Bi:

An Evaluation Framework for Product Images Background Inpainting Based on Human Feedback and Product Consistency. 478-486 - Panfeng Liu, Guoliang Qiu, Biaoshuai Tao, Kuan Yang:

A Thorough Comparison Between Independent Cascade and Susceptible-Infected-Recovered Models. 487-495 - Runxin Liu, Tian Xie, Jiaming Li, Lingyun Yu, Hongtao Xie:

IDseq: Decoupled and Sequentially Detecting and Grounding Multi-Modal Media Manipulation. 496-504 - Xiangyu Liu, Yi Liu, Silei Chen, Wei Hu:

Controllable Protein Sequence Generation with LLM Preference Optimization. 505-513 - Xiyao Liu, Junxing Ma, Xinda Wang, Qianyu Lin, Jian Zhang, Gerald Schaefer

, Cagatay Turkay
, Hui Fang:
Recoverable Facial Identity Protection via Adaptive Makeup Transfer Adversarial Attacks. 514-522 - Xuan Liu

, Menglu Li:
Knowledge-Guided Domain Adaptation Model for Transferring Drug Response Prediction from Cell Lines to Patients. 523-531 - Yupei Liu, Yanting Wang, Jinyuan Jia:

TrojanDec: Data-free Detection of Trojan Inputs in Self-supervised Learning. 532-540 - Yuxuan Liu, Hongda Sun, Wenya Guo, Xinyan Xiao, Cunli Mao, Zhengtao Yu, Rui Yan:

BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking. 541-549 - Zhendong Liu, Le Zhang, Bing Li, Yingjie Zhou, Zhenghua Chen, Ce Zhu:

WiFi CSI Based Temporal Activity Detection via Dual Pyramid Network. 550-558 - Weihai Lu

, Yu Tong
, Zhiqiu Ye:
DAMMFND: Domain-Aware Multimodal Multi-view Fake News Detection. 559-567 - Bingjun Luo, Jinpeng Wang, Zewen Wang, Junjie Zhu, Xibin Zhao:

Graph-Based Cross-Domain Knowledge Distillation for Cross-Dataset Text-to-Image Person Retrieval. 568-576 - Rui Lv, Qi Liu, Weibo Gao, Haotian Zhang

, Junyu Lu, Linbo Zhu:
GenAL: Generative Agent for Adaptive Learning. 577-585 - Tianxu Lv, Jie Zhu, Jinyi Liu, Shiyun Nie, Hongnian Tian, Yang Xiao, Yuan Liu, Lihua Li, Xiang Pan:

M²N: A Progressive Macro-to-Micro 3D Modeling Scheme for Unveiling Drug-Target Affinity. 586-594 - Takashi Matsubara, Takaharu Yaguchi:

Number Theoretic Accelerated Learning of Physics-Informed Neural Networks. 595-603 - Jian-Ping Mei, Weibin Zhang, Jie Chen, Xuyun Zhang

, Tiantian Zhu:
Defense Against Model Stealing Based on Account-Aware Distribution Discrepancy. 604-611 - Hui Miao, Yuanfang Guo, Zeming Liu, Yunhong Wang:

Multi-modal Deepfake Detection via Multi-task Audio-Visual Prompt Learning. 612-621 - Yuwei Miao, Yuzhi Guo, Hehuan Ma, Jingquan Yan, Feng Jiang, Rui Liao, Junzhou Huang

:
GoBERT: Gene Ontology Graph Informed BERT for Universal Gene Function Prediction. 622-630 - Li Ni, Rui Ye, Wenjian Luo, Yiwen Zhang, Lei Zhang, Victor S. Sheng:

SLRL: Semi-Supervised Local Community Detection Based on Reinforcement Learning. 631-639 - Zhibin Ni, Chang Liu, Hai Wan, Xibin Zhao:

Robust Heterogeneous Graph Classification for Molecular Property Prediction with Information Bottleneck. 640-648 - Pedro Orvalho

, Mikolás Janota, Vasco M. Manquinho:
Counterexample Guided Program Repair Using Zero-Shot Learning and MaxSAT-based Fault Localization. 649-657 - Jimin Park, AHyun Ji, Minji Park, Mohammad Saidur Rahman, Se Eun Oh:

MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification. 658-666 - Gaozheng Pei, Shaojie Lyu, Ke Ma, Pinci Yang, Qianqian Xu, Yingfei Sun:

Exploring Query Efficient Data Generation Towards Data-Free Model Stealing in Hard Label Setting. 667-675 - Jiaxin Qi, Yan Cui, Kailei Guo, Xiaomin Zhang, Jianqiang Huang, Gaogang Xie:

A Simple and Comprehensive Benchmark for Single-Cell Transcriptomics. 676-684 - Xing Qiu

, Guang Cheng, Weizhou Zhu, Dandan Niu, Nan Fu:
Dual-Channel Interactive Graph Transformer for Traffic Classification with Message-Aware Flow Representation. 685-693 - Chenfan Qu, Yiwu Zhong, Fengjun Guo, Lianwen Jin:

Revisiting Tampered Scene Text Detection in the Era of Generative AI. 694-702 - Huiru Shao, Kaizhu Huang, Wei Wang

, Xiaowei Huang, Qiufeng Wang:
Towards Better Robustness Against Natural Corruptions in Document Tampering Localization. 703-710 - Guobin Shen, Dongcheng Zhao, Aorigele Bao, Xiang He, Yiting Dong, Yi Zeng:

StressPrompt: Does Stress Impact Large Language Models and Human Performance Similarly? 711-719 - Ziqi Sheng, Wei Lu, Xiangyang Luo, Jiantao Zhou, Xiaochun Cao:

SUMI-IFL: An Information-Theoretic Framework for Image Forgery Localization with Sufficiency and Minimality Constraints. 720-728 - Yi Shi, Yun-Kai Wang, Xu-Peng Tian, Tie-Yi Zhang, Bing Yao, Hui Wang, Yong Shao, Cen-Cen Wang, Rong Zeng:

SpeHeaTal: A Cluster-Enhanced Segmentation Method for Sperm Morphology Analysis. 729-737 - Yiwei Shi, Muning Wen, Qi Zhang, Weinan Zhang, Cunjia Liu, Weiru Liu

:
Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation. 738-745 - Shibo Feng, Peilin Zhao, Liu Liu, Pengcheng Wu, Zhiqi Shen:

HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting. 746-754 - Xiaozhuang Song, Yuzhao Tu, Hangting Ye, Wei Fan, Qingquan Zhang, Xiaoxue Wang, Tianshu Yu:

Enhancing Generalizability in Molecular Conformation Generation with METRIZATION-Informed Geometric Diffusion Pretraining. 755-763 - Yunpeng Song, Jiawei Li, Yiheng Bian, Zhongmin Cai:

Predicting User Behavior in Smart Spaces with LLM-Enhanced Logs and Personalized Prompts. 764-772 - Nan Sun, Han Fang, Yuxing Lu

, Chengxin Zhao, Hefei Ling:
END^2: Robust Dual-Decoder Watermarking Framework Against Non-Differentiable Distortions. 773-781 - Cheng Tan, Yijie Zhang, Zhangyang Gao, Yufei Huang, Haitao Lin, Lirong Wu, Fandi Wu, Mathieu Blanchette, Stan Z. Li:

dyAb: Flow Matching for Flexible Antibody Design with AlphaFold-driven Pre-binding Antigen. 782-790 - Lei Tan, Yuliang Xue, Guobiao Li, Zhenxing Qian, Sheng Li, Chunlei Bao:

Embedding Robust Watermarking into Pattern to Protect the Copyright of Ceramic Artifacts. 791-798 - Renshuai Tao, Manyi Le, Chuangchuang Tan, Huan Liu, Haotong Qin, Yao Zhao:

ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks. 799-807 - Chengyue Wang, Haicheng Liao, Bonan Wang

, Yanchen Guan, Bin Rao, Ziyuan Pu, Zhiyong Cui, Cheng-Zhong Xu, Zhenning Li
:
NEST: A Neuromodulated Small-world Hypergraph Trajectory Prediction Model for Autonomous Driving. 808-816 - Jia Wang, Liyan Zhu, Zhe Wang, Chenqiu Zhang, Yaoxing Wu, Jun Cui, Jianqiang Li:

PScalpel: A Machine Learning-based Guider for Protein Phase-Separating Behaviour Alteration. 817-825 - Jiabao Wang, Zepeng Wu, Qian Dong, Lingzhong Meng, Yunzhi Xue, Yukuan Yang:

Hybrid-Driving: An Autonomous Driving Decision Framework Integrating Large Language Models, Knowledge Graphs and Driving Rules. 826-833 - Jingyuan Wang, Yujing Lin, Yudong Li

:
GTG: Generalizable Trajectory Generation Model for Urban Mobility. 834-842 - Lingzhi Wang, Xingshan Zeng, Jinsong Guo, Kam-Fai Wong, Georg Gottlob:

Selective Forgetting: Advancing Machine Unlearning Techniques and Evaluation in Language Models. 843-851 - Ruoqi Wang, Haitao Wang, Qiong Luo, Feng Wang, Hejun Wu:

VisRec: A Semi-Supervised Approach to Visibility Data Reconstruction in Radio Astronomy. 852-860 - Xiaozheng Wang, Yong Yang, Shuying Huang, Hangyuan Lu, Weiguo Wan, Aoqi Zhao:

FMPM-DNet: Hyperspectral Pansharpening Dynamic Network Based on Feature Modulation and Probability Mask. 861-868 - Yueqing Wang, Peng Zhang, Yushuang Liu, Jianing Zhao, Jie Lin, Yi Chen

:
Aerodynamic Coefficients Prediction via Cross-Attention Fusion and Physical-Informed Training. 869-876 - Fang Wu, Bozhen Hu, Stan Z. Li:

Generalized Implicit Neural Representations for Dynamic Molecular Surface Modeling. 877-885 - Juntao Wu, Ziyu Song, Xiaoyu Zhang, Shujun Xie, Longxin Lin, Ke Wang:

Vision Transformers Beat WideResNets on Small Scale Datasets Adversarial Robustness. 886-894 - Lirong Wu, Haitao Lin, Yufei Huang, Zhangyang Gao, Cheng Tan, Yunfan Liu, Tailin Wu, Stan Z. Li:

Relation-Aware Equivariant Graph Networks for Epitope-Unknown Antibody Design and Specificity Optimization. 895-904 - Zhihao Wu, Yushi Cheng, Tianyang Sun, Xiaoyu Ji, Wenyuan Xu:

MYOPIA: Protecting Face Privacy from Malicious Personalized Text-to-Image Synthesis via Unlearnable Examples. 905-913 - Zeke Xia, Ming Hu, Dengke Yan, Ruixuan Liu, Anran Li, Xiaofei Xie

, Mingsong Chen:
MultiSFL: Towards Accurate Split Federated Learning via Multi-Model Aggregation and Knowledge Replay. 914-922 - Di Xiong

, Shuoyuan Wang, Lei Zhang, Wenbo Huang
, Chaolei Han:
Generalizable Sensor-Based Activity Recognition via Categorical Concept Invariant Learning. 923-931 - Xovee Xu

, Yifan Zhang, Fan Zhou, Jingkuan Song:
Improving Multimodal Social Media Popularity Prediction via Selective Retrieval Knowledge Augmentation. 932-940 - Yongxin Xu, Xinke Jiang, Xu Chu, Rihong Qiu, Yujie Feng, Hongxin Ding, Junfeng Zhao, Yasha Wang, Bing Xie:

DearLLM: Enhancing Personalized Healthcare via Large Language Models-Deduced Feature Correlations. 941-949 - Chenchen Yang, Hao Wu, Tao Shen, Kai Zou, Siqi Sun:

PriFold: Biological Priors Improve RNA Secondary Structure Predictions. 950-958 - Xinyu Yang, Yu Sun, Xinyang Chen, Ying Zhang, Xiaojie Yuan:

Graph Structure Learning for Spatial-Temporal Imputation: Adapting to Node and Feature Scales. 959-967 - Tong Ye, Yangkai Du, Tengfei Ma, Lingfei Wu, Xuhong Zhang, Shouling Ji, Wenhai Wang:

Uncovering LLM-Generated Code: A Zero-Shot Synthetic Code Detector via Code Rewriting. 968-976 - Na Yu, Yutong Deng, Shunyu Liu, Kaixuan Chen, Tongya Zheng, Mingli Song:

Disentangled Table-Graph Representation for Interpretable Transmission Line Fault Location. 977-985 - Xinquan Yu, Ziqi Sheng, Wei Lu, Xiangyang Luo, Jiantao Zhou:

RaCMC: Residual-Aware Compensation Network with Multi-Granularity Constraints for Fake News Detection. 986-994 - Zeqin Yu, Jiangqun Ni, Jian Zhang, Haoyi Deng, Yuzhen Lin:

Reinforced Multi-teacher Knowledge Distillation for Efficient General Image Forgery Detection and Localization. 995-1003 - Wenwu Zeng, Liangrui Pan, Boya Ji, Liwen Xu, Shaoliang Peng:

Accurate Nucleic Acid-Binding Residue Identification Based Domain-Adaptive Protein Language Model and Explainable Geometric Deep Learning. 1004-1012 - Xi Zeng, Fei Ni, Shaoqing Jiao

, Dazhi Lu, Jianye Hao, Jiajie Peng:
SWAMamba: A Sliding Window Attention Mamba Framework for Predicting Translation Elongation Rates. 1013-1021 - Jiangou Zhan, Wenhui Zhang, Zheng Zhang, Huanran Xue, Yao Zhang, Ye Wu:

Portcullis: A Scalable and Verifiable Privacy Gateway for Third-Party LLM Inference. 1022-1030 - Chaowei Zhang, Zongling Feng, Zewei Zhang

, Jipeng Qiang, Guandong Xu, Yun Li:
Is LLMs Hallucination Usable? LLM-based Negative Reasoning for Fake News Detection. 1031-1039 - Chongyu Zhang, Qiping Tao, Liangyu Chen, Min Zhang:

BERT-Based Code Learning for Exception Localization and Type Prediction. 1040-1047 - Haozhen Zhang, Haodong Yue, Xi Xiao, Le Yu, Qing Li, Zhen Ling, Ye Zhang:

Revolutionizing Encrypted Traffic Classification with MH-Net: A Multi-View Heterogeneous Graph Model. 1048-1056 - Honggen Zhang, Xiangrui Gao, June Zhang, Lipeng Lai:

mRNA2vec: mRNA Embedding with Language Model in the 5'UTR-CDS for mRNA Design. 1057-1065 - Kuiyuan Zhang, Zhongyun Hua, Rushi Lan, Yushu Zhang, Yifang Guo:

Phoneme-Level Feature Discrepancies: A Key to Detecting Sophisticated Speech Deepfakes. 1066-1074 - Kuiyuan Zhang, Zhongyun Hua, Rushi Lan, Yifang Guo, Yushu Zhang, Guoai Xu:

Multi-View Collaborative Learning Network for Speech Deepfake Detection. 1075-1083 - Lei Zhang, Guanyu Gao, Haiyan Yin, Huaizheng Zhang:

Multi-Edge Reinforced Collaborative Data Acquisition for Continuous Video Analytics by Prioritizing Quality over Quantity. 1084-1092 - Qianru Zhang, Xinyi Gao, Haixin Wang, Siu Ming Yiu, Hongzhi Yin

:
Efficient Traffic Prediction Through Spatio-Temporal Distillation. 1093-1101 - Ran Zhang, Xuezhi Wang, Guannan Liu, Pengyang Wang, Yuanchun Zhou, Pengfei Wang:

Motif-Oriented Representation Learning with Topology Refinement for Drug-Drug Interaction Prediction. 1102-1110 - Rongchao Zhang, Yu Huang, Yiwei Lou, Yi Xin, Haixu Chen, Yongzhi Cao, Hanpin Wang:

Exploit Your Latents: Coarse-Grained Protein Backmapping with Latent Diffusion Models. 1111-1119 - Shiqi Zhang, Pan Mu, Cheng Huang, Jinglin Zhang, Cong Bai:

TC-Diffuser: Bi-Condition Multi-Modal Diffusion for Tropical Cyclone Forecasting. 1120-1128 - Xin Zhang, Peiliang Zhang, Jingling Yuan, Lin Li:

Zero-Shot Learning for Materials Science Texts: Leveraging Duck Typing Principles. 1129-1137 - Xiongqi Zhang, Junwei Xu, Yang Wang, Dongming Xiang, Wang Lin, Zuohua Ding:

Formal Synthesis of Barrier Certificates Using Fourier Kolmogorov-Arnold Network. 1138-1146 - Yudong Zhang, Xu Wang, Xuan Yu, Zhaoyang Sun, Kai Wang, Yang Wang:

Drawing Informative Gradients from Sources: A One-stage Transfer Learning Framework for Cross-city Spatiotemporal Forecasting. 1147-1155 - Zhenbang Zhang, Hongjia Li

, Zhiqiang Xu, Wenjia Meng, Renmin Han:
A Gaussian Filter-Based 3D Registration Method for Series Section Electron Microscopy. 1156-1164 - Ziyang Zhang

, Yang Zhao, Ming-Ching Chang, Changyao Lin, Jie Liu:
E4: Energy-Efficient DNN Inference for Edge Video Analytics via Early Exiting and DVFS. 1165-1173 - Guanhao Zhao, Zhenya Huang, Cheng Cheng, Yan Zhuang, Qingyang Mao, Xin Li, Shijin Wang

, Enhong Chen:
Multi-Perspective Consolidation Enhanced Cognitive Diagnosis via Conditional Diffusion Model. 1174-1182 - Penghai Zhao, Qinghua Xing, Kairan Dou, Jinyu Tian, Ying Tai, Jian Yang, Ming-Ming Cheng, Xiang Li:

From Words to Worth: Newborn Article Impact Prediction with LLM. 1183-1191 - Qihua Zhou, Ruibin Li, Jingcai Guo, Yaodong Huang, Zhenda Xu, Laizhong Cui, Song Guo:

DeNC: Unleash Neural Codecs in Video Streaming with Diffusion Enhancement. 1192-1200 - Ziqi Zhou, Bowen Li, Yufei Song, Zhifei Yu, Shengshan Hu, Wei Wan

, Leo Yu Zhang, Dezhong Yao, Hai Jin:
NumbOD: A Spatial-Frequency Fusion Attack Against Object Detectors. 1201-1209 - Ziyi Zhou, Xiaoming Zhang, Shenghan Tan, Litian Zhang, Chaozhuo Li:

Collaborative Evolution: Multi-Round Learning Between Large and Small Language Models for Emergent Fake News Detection. 1210-1218 - Jun Zhu

, Yifu Li, Zhenchao Tang, Cheng Chang:
DUSTED: Dual-Attention Enhanced Spatial Transcriptomics Denoiser. 1219-1227 - Yu Zhu, Bo Lei, Chunfeng Song, Wanli Ouyang, Shan Yu, Tiejun Huang:

Multi-Modal Latent Variables for Cross-Individual Primary Visual Cortex Modeling and Analysis. 1228-1236 - Linlin Zong, Wenmin Lin, Jiahui Zhou, Xinyue Liu, Xianchao Zhang, Bo Xu, Shimin Wu:

Text-Guided Fine-grained Counterfactual Inference for Short Video Fake News Detection. 1237-1245
Technical Tracks 2
- Qing Chang, Yao-Xiang Ding, Kun Zhou:

Enhancing Identity-Deformation Disentanglement in StyleGAN for One-Shot Face Video Re-Enactment. 1247-1255 - Xuping Chen, Wuzhen Shi:

Dynamic Interactive Bimodal Hypergraph Networks for Emotion Recognition in Conversations. 1256-1264 - Yuhong Chen

, Ailin Song, Huifeng Yin, Shuai Zhong, Fuhai Chen, Qi Xu, Shiping Wang, Mingkun Xu:
Multi-View Incremental Learning with Structured Hebbian Plasticity for Enhanced Fusion Efficiency. 1265-1273 - Zhuang Chen, Yaru Cao, Guanqun Bi, Jincenzi Wu, Jinfeng Zhou, Xiyao Xiao, Si Chen, Hongning Wang, Minlie Huang:

SocialSim: Towards Socialized Simulation of Emotional Support Conversation. 1274-1282 - Mateus de Oliveira Oliveira, Wim Van den Broeck

:
Symbolic Functional Decomposition: A Reconfiguration Approach. 1283-1290 - Yiting Dong, Xiang He, Guobin Shen, Dongcheng Zhao, Yang Li, Yi Zeng:

EventZoom: A Progressive Approach to Event-Based Data Augmentation for Enhanced Neuromorphic Vision. 1291-1299 - Yi Feng, Mingyang Song, Jiaqi Wang, Zhuang Chen, Guanqun Bi, Minlie Huang, Liping Jing, Jian Yu:

SS-GEN: A Social Story Generation Framework with Large Language Models. 1300-1308 - Xilin He, Haijian Liang, Boyi Peng, Weicheng Xie, Muhammad Haris Khan, Siyang Song, Zitong Yu:

MSAmba: Exploring Multimodal Sentiment Analysis with State Space Models. 1309-1317 - Jinbing Hou, Youpeng Zhao, Jian Zhao:

CraftFactory: A Conditioned Control Policy Benchmark for Compositional Generalization. 1318-1326 - Zhejing Hu, Yan Liu, Gong Chen, Bruce X. B. Yu:

Compose with Me: Collaborative Music Inpainter for Symbolic Music Infilling. 1327-1335 - Zihan Ji, Xuetao Tian, Ye Liu:

AFFAKT: A Hierarchical Optimal Transport Based Method for Affective Facial Knowledge Transfer in Video Deception Detection. 1336-1344 - Md Rysul Kabir, James Mochizuki-Freeman, Zoran Tiganj:

Deep Reinforcement Learning with Time-Scale Invariant Memory. 1345-1354 - Lucio La Cava, Andrea Tagarelli:

Open Models, Closed Minds? On Agents Capabilities in Mimicking Human Personalities through Open Large Language Models. 1355-1363 - Zhenxin Lei, Man Yao, Jiakui Hu, Xinhao Luo, Yanye Lu, Bo Xu, Guoqi Li:

Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation. 1364-1372 - Chengtai Li, Yee Yang Tan, Yuting He

, Jianfeng Ren
, Ruibin Bai
, Yitian Zhao, Heng Yu, Xudong Jiang:
DARR: A Dual-Branch Arithmetic Regression Reasoning Framework for Solving Machine Number Reasoning. 1373-1382 - Jingmeng Li, Lukang Fu, Surun Yang, Hui Wei:

MI-CAPTCHA: Enhance the Security of CAPTCHA Using Mooney Images. 1383-1391 - Yinan Li, Jun Long, Zhan Yang

:
Asymmetric Cross-Modal Hashing Based on Formal Concept Analysis. 1392-1401 - Yu Liang, Wenjie Wei, Ammar Belatreche, Honglin Cao, Zijian Zhou, Shuai Wang, Malu Zhang, Yang Yang:

Towards Accurate Binary Spiking Neural Networks: Learning with Adaptive Gradient Modulation Mechanism. 1402-1410 - Jinhao Lin, Yifei Wang, Yanwu Xu, Qi Liu:

Semi-IIN: Semi-Supervised Intra-Inter Modal Interaction Learning Network for Multimodal Sentiment Analysis. 1411-1419 - Wei Liu, Li Yang, Mingxuan Zhao

, Dengfeng Xue, Shuxun Wang, Boyu Cai, Jin Gao, Wenjuan Li, Bing Li, Weiming Hu:
Towards More Discriminative Feature Learning in SNNs with Temporal-Self-Erasing Supervision. 1420-1428 - Xiaochuan Liu, Xin Cheng

, Yuchong Sun, Xiaoxue Wu, Ruihua Song, Hao Sun, Denghao Zhang:
EyEar: Learning Audio Synchronized Human Gaze Trajectory Based on Physics-Informed Dynamics. 1429-1437 - Yan-Kai Liu, Jinyu Cai, Bao-Liang Lu, Wei-Long Zheng:

Multi-to-Single: Reducing Multimodal Dependency in Emotion Recognition Through Contrastive Learning. 1438-1446 - Haifeng Lu, Jiuyi Chen, Feng Liang

, Mingkui Tan, Runhao Zeng, Xiping Hu:
Understanding Emotional Body Expressions via Large Language Models. 1447-1455 - Ryo Masumura, Shota Orihashi, Mana Ihori, Tomohiro Tanaka, Naoki Makishima, Satoshi Suzuki, Saki Mizuno, Nobukatsu Hojo:

Multimodal Fine-Grained Apparent Personality Trait Recognition: Joint Modeling of Big Five and Questionnaire Item-level Scores. 1456-1464 - Wei Miao, Jiangrong Shen, Qi Xu, Timo Hämäläinen

, Yi Xu, Fengyu Cong:
SpikingYOLOX: Improved YOLOX Object Detection with Fast Fourier Convolution and Spiking Neural Networks. 1465-1473 - Philippe Pasquier, Jeff Ens, Nathan Fradet

, Paul Triana, Davide Rizzotti, Jean-Baptiste Rolland, Maryam Safi:
MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition. 1474-1482 - Lang Qin, Ziming Wang, Runhao Jiang, Rui Yan, Huajin Tang:

GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL. 1483-1491 - Mirabel Reid, Santosh S. Vempala:

Does GPT Really Get It? A Hierarchical Scale to Quantify Human and AI's Understanding of Algorithms. 1492-1500 - Yimeng Shan, Malu Zhang, Ruijie Zhu, Xuerui Qiu, Jason K. Eshraghian, Haicheng Qu:

Advancing Spiking Neural Networks Towards Multiscale Spatiotemporal Interaction Learning. 1501-1509 - Haojun Shi, Suyu Ye, Xinyu Fang, Chuanyang Jin, Leyla Isik, Yen-Ling Kuo, Tianmin Shu:

MuMA-ToM: Multi-modal Multi-Agent Theory of Mind. 1510-1519 - Kazutoshi Shinoda, Nobukatsu Hojo, Kyosuke Nishida, Saki Mizuno, Keita Suzuki, Ryo Masumura, Hiroaki Sugiyama, Kuniko Saito:

ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind. 1520-1528 - Yuxuan Song, Qiudan Li, Yilin Wu, David Jingjun Xu

, Daniel Dajun Zeng:
Knowledge-Enhanced Hierarchical Heterogeneous Graph for Personality Identification with Limited Training Data. 1529-1537 - Bin Tang

, Keqi Pan, Miao Zheng, Ning Zhou, Jialu Sui, Dandan Zhu, Cheng-Long Deng
, Shu-Guang Kuai:
Pose as a Modality: A Psychology-Inspired Network for Personality Recognition with a New Multimodal Dataset. 1538-1546 - Chuanqi Tao, Jiaming Li, Tianzi Zang, Peng Gao:

A Multi-Focus-Driven Multi-Branch Network for Robust Multimodal Sentiment Analysis. 1547-1555 - Neha Upadhyay, Vijay Marupudi, Kamala Varma, Sashank Varma:

Alignment of CNN and Human Judgments of Geometric and Topological Concepts. 1556-1564 - Miaohui Wang, Zhenming Li, Wuyuan Xie:

DDJND: Dual Domain Just Noticeable Difference in Multi-Source Content Images with Structural Discrepancy. 1565-1573 - Yusong Wang, Xuanye Fang, Huifeng Yin, Dongyuan Li, Guoqi Li, Qi Xu, Yi Xu, Shuai Zhong, Mingkun Xu:

BIG-FUSION: Brain-Inspired Global-Local Context Fusion Framework for Multimodal Emotion Recognition in Conversations. 1574-1582 - Ziqing Wang, Yuetong Fang, Jiahang Cao, Hongwei Ren, Renjing Xu:

Adaptive Calibration: A Unified Conversion Framework of Spiking Neural Networks. 1583-1591 - Zachary Wojtowicz, Simon DeDeo:

Undermining Mental Proof: How AI Can Make Cooperation Harder by Making Thinking Easier. 1592-1600 - Sheng Wu, Dongxiao He, Xiaobao Wang, Longbiao Wang, Jianwu Dang:

Enriching Multimodal Sentiment Analysis Through Textual Emotional Descriptions of Visual-Audio Content. 1601-1609 - Zijian Wu

, Leijing Zhou, Shuanglin Li, Changzeng Fu, Jun Lu, Jing Han, Yi Zhang, Zhuang Zhao, Siyang Song:
DepMGNN: Matrixial Graph Neural Network for Video-based Automatic Depression Assessment. 1610-1619 - Dingyi Zeng, Yuchen Wang, Honglin Cao, Wanlong Liu, Yichen Xiao, Chengzhuo Lu, Wenyu Chen, Malu Zhang, Guoqing Wang, Yang Yang:

Leveraging Asynchronous Spiking Neural Networks for Ultra Efficient Event-Based Visual Processing. 1620-1628 - Dengming Zhang

, Weitao You, Ziheng Liu, Lingyun Sun, Pei Chen:
Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning. 1629-1637 - Miao Zhang, Jiawei Wang

, Kui Xiao, Shihui Wang, Yan Zhang, Hao Chen, Zhifei Li:
Learning Concept Prerequisite Relation via Global Knowledge Relation Optimization. 1638-1646 - Chunyu Zhao

, Wentao Mu, Xian Zhou, Wenbo Liu, Fei Yan, Tao Deng:
SalM²: An Extremely Lightweight Saliency Mamba Model for Real-Time Cognitive Awareness of Driver Attention. 1647-1655 - Shiyi Zheng

, Peizhi Zhao, Zhilong Zheng, Peihang He, Haonan Cheng, Yi Cai, Qingbao Huang:
Look Around Before Locating: Considering Content and Structure Information for Visual Grounding. 1656-1664 - Hengde Zhu

, Xiangyu Kong, Weicheng Xie, Xin Huang, Xilin He, Lu Liu, Linlin Shen, Wei Zhang, Hatice Gunes, Siyang Song:
PerReactor: Offline Personalised Multiple Appropriate Facial Reaction Generation. 1665-1673 - Jiankun Zhu, Sicheng Zhao, Jing Jiang, Wenbo Tang, Zhaopan Xu, Tingting Han, Pengfei Xu, Hongxun Yao:

Bridge Then Begin Anew: Generating Target-Relevant Intermediate Model for Source-Free Visual Emotion Adaptation. 1674-1682 - Linlin Zhu, Heli Sun, Qunshu Gao, Yuze Liu, Liang He:

Aspect Enhancement and Text Simplification in Multimodal Aspect-Based Sentiment Analysis for Multi-Aspect and Multi-Sentiment Scenarios. 1683-1691 - Yaohui Zhu, Kaiming Sun, Zhengdong Luo, Lingfeng Wang:

Progressive Self-Learning for Domain Adaptation on Symbolic Regression of Integer Sequences. 1692-1699 - Han Yang, Chuanguang Yang, Zhulin An, Libo Huang, Yongjun Xu:

HSRDiff: A Hierarchical Self-Regulation Diffusion Model for Stochastic Semantic Segmentation. 1701-1709 - Yihao, Limei Hu, Feng Chen, Sen Zhao

, Shukai Duan:
GRICP: Granular-Ball Iterative Closest Point with Multikernel Correntropy for Point Cloud Fine Registration. 1710-1718 - Shivang Agarwal

, Jyoti Chaudhary, Sadiq Siraj Ebrahim, Mayank Vatsa, Richa Singh, Shyam Prasad Adhikari, Sangeeth Reddy Battu:
AQUAFace: Age-Invariant Quality Adaptive Face Recognition for Unconstrained Selfie vs ID Verification. 1719-1727 - Daechul Ahn, Yura Choi, San Kim, Youngjae Yu, Dongyeop Kang, Jonghyun Choi:

ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO. 1728-1736 - Tim Alpherts, Sennay Ghebreab, Nanne van Noord:

EMPLACE: Self-Supervised Urban Scene Change Detection. 1737-1745 - Jingkun An, Yinghao Zhu, Zongjian Li, Enshen Zhou, Haoran Feng, Xijie Huang, Bohua Chen, Yemin Shi, Chengwei Pan:

AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation. 1746-1754 - Xiaoqi An, Lin Zhao, Chen Gong, Jun Li, Jian Yang:

Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation. 1755-1763 - Yajun An, Jiale Chen, Huan Lin, Zhenbing Liu, Siyang Feng, Hualong Zhang

, Rushi Lan, Zaiyi Liu, Xipeng Pan:
CA-MLIF: Cross-Attention and Multimodal Low-Rank Interaction Fusion Framework for Tumor Prognostic Prediction. 1764-1772 - Kazi Hasan Ibn Arif, JinYi Yoon, Dimitrios S. Nikolopoulos, Hans Vandierendonck, Deepu John

, Bo Ji:
HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models. 1773-1781 - Sithu Aung

, Min-Cheol Sagong, Junghyun Cho:
Multi-View Pedestrian Occupancy Prediction with a Novel Synthetic Dataset. 1782-1790 - Hamed Ayoobi

, Nico Potyka, Francesca Toni:
ProtoArgNet: Interpretable Image Classification with Super-Prototypes and Argumentation. 1791-1799 - Sana Ayromlou, Vahid Reza Khazaie, Fereshteh Forghani, Arash Afkanpour:

Can Generative Models Improve Self-Supervised Representation Learning? 1800-1808 - Zahra Babaiee, Peyman M. Kiasari, Daniela Rus, Radu Grosu:

The Master Key Filters Hypothesis: Deep Filters Are General. 1809-1816 - Lichen Bai, Zixuan Xiong, Hai Lin, Guangwei Xu, Xiangjin Xie, Ruijie Guo, Zhanhui Kang, Haitao Zheng, Hong-Gee Kim:

Frozen Language Models Are Gradient Coherence Rectifiers in Vision Transformers. 1817-1825 - Jingwei Bao

, Jinhua Hao, Pengcheng Xu, Ming Sun, Chao Zhou, Shuyuan Zhu:
Plug-and-Play Tri-Branch Invertible Block for Image Rescaling. 1826-1834 - Oren Barkan, Yehonatan Elisha, Jonathan Weill, Noam Koenigstein

:
BEE: Metric-Adapted Explanations via Baseline Exploration-Exploitation. 1835-1843 - Jian Bi, Qianliang Wu, Jianjun Qian, Lei Luo, Jian Yang:

Dual Manifold Regularization Steered Robust Representation Learning for Point Cloud Analysis. 1844-1852 - Qi Bi, Jingjun Yi, Haolan Zhan, Wei Ji, Gui-Song Xia:

Learning Fine-grained Domain Generalization via Hyperbolic State Space Hallucination. 1853-1861 - Qi Bi, Jingjun Yi, Hao Zheng, Haolan Zhan, Wei Ji, Yawen Huang, Yuexiang Li:

DGFamba: Learning Flow Factorized State Space for Visual Domain Generalization. 1862-1870 - Xiuli Bi, Jian Lu, Bo Liu, Xiaodong Cun, Yong Zhang, Weisheng Li, Bin Xiao:

CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training. 1871-1879 - Yuxuan Bian, Ailing Zeng, Xuan Ju, Xian Liu, Zhaoyang Zhang, Wei Liu, Qiang Xu:

MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls. 1880-1888 - Yuntian Bo

, Yazhou Zhu, Lunbo Li, Haofeng Zhang:
FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation. 1889-1897 - Lingling Cai, Kang Zhao, Hangjie Yuan, Yingya Zhang, Shiwei Zhang, Kejie Huang:

FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing. 1898-1906 - Rui Cai, Zhiyu Dong, Jianfeng Dong, Xun Wang:

Dynamic Adapter with Semantics Disentangling for Cross-lingual Cross-modal Retrieval. 1907-1916 - Shuo Cai, Xinzhe Han, Shuhui Wang:

Divide-and-Conquer: Tree-structured Strategy with Answer Distribution Estimator for Goal-Oriented Visual Dialogue. 1917-1925 - Wenxiao Cai, Wankou Yang:

Object-level Geometric Structure Preserving for Natural Image Stitching. 1926-1934 - Cong Cao, Huanjing Yue, Xin Liu, Jingyu Yang:

Zero-shot Video Restoration and Enhancement Using Pre-Trained Image Diffusion Model. 1935-1943 - Qihang Cao, Huangxun Chen:

ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects. 1944-1952 - Yuan Cao, Xiangru Chen, Zifan Liu, Wenzhe Jia, Fanlei Meng, Jie Gui:

Deep Graph Online Hashing for Multi-Label Image Retrieval. 1953-1961 - Angela Castillo, Jonas Kohler, Juan C. Pérez, Juan Pablo Pérez, Albert Pumarola, Bernard Ghanem

, Pablo Arbeláez, Ali K. Thabet:
Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models. 1962-1970 - Jiazhong Cen, Jiemin Fang, Chen Yang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian:

Segment Any 3D Gaussians. 1971-1979 - Junuk Cha, Mengwei Ren, Krishna Kumar Singh, He Zhang, Yannick Hold-Geoffroy, Seunghyun Yoon, Hyunjoon Jung, Jae Shin Yoon, Seungryul Baek:

Text2Relight: Creative Portrait Relighting with Text Guidance. 1980-1988 - Keng Wei Chang, Zi-Ming Wang, Shang-Hong Lai:

KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences. 1989-1997 - Laibin Chang, Yunke Wang, Longxiang Deng, Bo Du

, Chang Xu:
WaterDiffusion: Learning a Prior-involved Unrolling Diffusion for Joint Underwater Saliency Detection and Visual Restoration. 1998-2006 - Qikai Chang, Mingjun Chen, Changpeng Pi, Pengfei Hu, Zhenrong Zhang, Jiefeng Ma, Jun Du, Baocai Yin, Jinshui Hu:

RFL: Simplifying Chemical Structure Recognition with Ring-Free Language. 2007-2015 - Changgu Chen, Junwei Shu, Gaoqi He, Changbo Wang, Yang Li:

Motion-Zero: A Zero-Shot Trajectory Control Framework of Moving Object for Diffusion-Based Video Generation. 2016-2024 - Chao Chen, Yu-Shen Liu, Zhizhong Han:

Sharpening Neural Implicit Functions with Frequency Consolidation Priors. 2025-2033 - Dongpan Chen, Dehui Kong, Jinghua Li, Baocai Yin:

MaskPrompt: Open-Vocabulary Affordance Segmentation with Object Shape Mask Prompts. 2034-2042 - Haipeng Chen, Yuheng Yang

, Yingda Lyu:
Skeleton-based Action Recognition with Non-linear Dependency Modeling and Hilbert-Schmidt Independence Criterion. 2043-2051 - Haipeng Chen, Sifan Wu, Zhigang Wang, Yifang Yin, Yingying Jiao, Yingda Lyu, Zhenguang Liu:

Causal-Inspired Multitask Learning for Video-Based Human Pose Estimation. 2052-2060 - Jiahao Chen

, Zhou Feng
, Rui Zeng, Yuwen Pu, Chunyi Zhou, Yi Jiang, Yuyou Gan, Jinbao Li, Shouling Ji:
Enhancing Adversarial Transferability with Adversarial Weight Tuning. 2061-2069 - Jie Chen, Xinyuan Liu, Xintong Liu, Jianqiang Li:

Adversarial Learning Under Hybrid Perturbations for Robust Acute Lymphoblastic Leukemia Classification. 2070-2078 - Jingyuan Chen, Fuchen Long, Jie An, Zhaofan Qiu, Ting Yao, Jiebo Luo

, Tao Mei:
Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion. 2079-2087 - Junyi Chen, Weicai Ye, Yifan Wang, Danpeng Chen, Di Huang, Wanli Ouyang, Guofeng Zhang, Yu Qiao, Tong He:

GigaGS: 3D Gaussian Based Planar Representation for Large-Scene Surface Reconstruction. 2088-2096 - Kang Chen, Yajing Zheng, Tiejun Huang, Zhaofei Yu:

Rethinking High-speed Image Reconstruction Framework with Spike Camera. 2097-2104 - Kehua Chen, Zhenlong Yuan, Tianlu Mao, Zhaoqi Wang:

Dual-Level Precision Edges Guided Multi-View Stereo with Accurate Planarization. 2105-2113 - Lu Chen, Shaofeng Li, Benhao Huang, Fan Yang, Zheng Li, Jie Li, Yuan Luo:

Contrasting Adversarial Perturbations: The Space of Harmless Perturbations. 2114-2122 - Nan Chen, Mengqi Huang, Zhuowei Chen, Yang Zheng, Lei Zhang, Zhendong Mao:

CustomContrast: A Multilevel Contrastive Perspective for Subject-Driven Text-to-Image Customization. 2123-2131 - Qi Chen, Changli Wu, Jiayi Ji, Yiwei Ma, Danni Yang, Xiaoshuai Sun:

IPDN: Image-enhanced Prompt Decoding Network for 3D Referring Expression Segmentation. 2132-2140 - Qibo Chen, Weizhong Jin, Jianyue Ge, Mengdi Liu, Yuchao Yan, Jian Jiang, Li Yu, Xuanjiang Guo, Shuchang Li, Jianzhong Chen:

CP-DETR: Concept Prompt Guide DETR Toward Stronger Universal Object Detection. 2141-2149 - Qihua Chen, Yue Ma, Hongfa Wang, Junkun Yuan, Wenzhe Zhao, Qi Tian

, Hongmei Wang, Shaobo Min, Qifeng Chen, Wei Liu:
Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation. 2150-2158 - Qirui Chen, Shangzhe Di, Weidi Xie:

Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos. 2159-2167 - Qizhou Chen

, Taolin Zhang, Chengyu Wang, Xiaofeng He, Dakan Wang, Tingting Liu:
Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit. 2168-2176 - Sen Chen, Hongying Liu, Chaowei Fang, Fanhua Shang, Yuanyuan Liu, Liang Wan, Dongmei Jiang, Yaowei Wang:

Unsupervised Degradation Representation Aware Transform for Real-World Blind Image Super-Resolution. 2177-2185 - Shengjia Chen

, Luping Ji
, Weiwei Duan, Shuang Peng, Mao Ye:
Motion Prior Knowledge Learning with Homogeneous Language Descriptions for Moving Infrared Small Target Detection. 2186-2194 - Shunxin Chen, Ajian Liu, Junze Zheng, Jun Wan, Kailai Peng, Sergio Escalera, Zhen Lei:

Mixture-of-Attack-Experts with Class Regularization for Unified Physical-Digital Face Attack Detection. 2195-2203 - Sijia Chen

, En Yu, Wenbing Tao:
Cross-View Referring Multi-Object Tracking. 2204-2211 - Siran Chen, Yuxiao Luo

, Yue Ma, Yu Qiao, Yali Wang:
H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving. 2212-2220 - Wei Chen

, Jianwei Niu, Xuefeng Liu, Zhendong Wang, Shaojie Tang, Guogang Zhu:
DiffDVC: Accurate Event Detection for Dense Video Captioning via Diffusion Models. 2221-2229 - Xiao Chen

, Xudong Jiang, Yunkang Tao, Zhen Lei, Qing Li, Chenyang Lei, Zhaoxiang Zhang:
FIRM: Flexible Interactive Reflection ReMoval. 2230-2238 - Xin Chen, Ben Kang, Wanting Geng, Jiawen Zhu, Yi Liu, Dong Wang, Huchuan Lu:

SUTrack: Towards Simple and Unified Single Object Tracking. 2239-2247 - Xingchi Chen, Zhuoran Zheng, Xuerui Li, Yuying Chen

, Shu Wang, Wenqi Ren:
Ultra-High-Definition Dynamic Multi-Exposure Image Fusion via Infinite Pixel Learning. 2248-2255 - Xinyue Chen, Miaojing Shi, Zijian Zhou

, Lianghua He, Sophia Tsoka:
Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer. 2256-2265 - Xiongren Chen, Jiuyong Li

, Jixue Liu, Lin Liu, Stefan Peters, Thuc Duy Le
, Wentao Gao, Xiaojing Du, Anthony Walsh:
Diffusion Models for Attribution. 2266-2274 - Xuesong Chen, Shaoshuai Shi, Tao Ma, Jingqiu Zhou, Simon See, Ka Chun Cheung, Hongsheng Li:

M3Net: Multimodal Multi-task Learning for 3D Detection, Segmentation, and Occupancy Prediction in Autonomous Driving. 2275-2283 - Yi Chen, Muyoung Son, Chuanbo Hua, Joo-Young Kim:

AoP-SAM: Automation of Prompts for Efficient Segmentation. 2284-2292 - Yi Chen

, Jian Xu, Xu-Yao Zhang, Wen-Zhuo Liu, Yang-Yang Liu, Cheng-Lin Liu:
Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information. 2293-2301 - Yiliang Chen

, Steven SC Ho, Cheng Xu, Yao Jie Xie, Wing-Fai Yeung, Shengfeng He
, Jing Qin:
Dr. Tongue: Sign-Oriented Multi-label Detection for Remote Tongue Diagnosis. 2302-2310 - Yirui Chen, Xudong Huang, Quan Zhang, Wei Li, Mingjian Zhu, Qiangyu Yan, Simiao Li, Hanting Chen, Hailin Hu, Jie Yang, Wei Liu, Jie Hu:

GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization. 2311-2319 - Yitong Chen, Wenhao Yao

, Lingchen Meng, Sihong Wu, Zuxuan Wu, Yu-Gang Jiang:
Comprehensive Multi-Modal Prototypes Are Simple and Effective Classifiers for Vast-Vocabulary Object Detection. 2320-2328 - Yuchong Chen

, Jian Yu
, Shaoyan Gai, Zeyu Cai, Feipeng Da:
3D Measurement of Complex Textured Objects Based on Bidirectional Fringe Projection. 2329-2338 - Yujia Chen, Rui Sun, Wangkai Li, Huayu Mai, Naisong Luo, Yuwen Pan, Tianzhu Zhang:

Alleviate and Mining: Rethinking Unsupervised Domain Adaptation for Mitochondria Segmentation from Pseudo-Label Perspective. 2339-2347 - Yuying Chen

, Mingde Yao, Wenbo Li, Renjing Pei, Jinjing Zhao, Wenqi Ren:
Unsupervised Diffusion-Based Degradation Modeling for Real-World Super-Resolution. 2348-2356
Technical Tracks 3
- Zehao Chen, Rong Pan:

SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers. 2358-2366 - Zehao Chen, Zhan Lu, De Ma, Huajin Tang, Xudong Jiang, Qian Zheng, Gang Pan:

EvHDR-GS: Event-guided HDR Video Reconstruction with 3D Gaussian Splatting. 2367-2375 - Zehao Chen, Zhanfeng Liao, De Ma, Huajin Tang, Qian Zheng, Gang Pan:

EvHDR-NeRF: Building High Dynamic Range Radiance Fields with Single Exposure Images and Events. 2376-2384 - Zheng Chen, Yu Zeng, Zehui Chen, Hongzhi Gao, Lin Chen, Jiaming Liu, Feng Zhao:

VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping. 2385-2393 - Zhipeng Chen

, Lan Yang, Yonggang Qi, Honggang Zhang, Kaiyue Pang, Ke Li, Yi-Zhe Song:
VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis. 2394-2402 - Zhiyuan Chen, Jiajiong Cao, Zhiquan Chen, Yuming Li, Chenguang Ma

:
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditions. 2403-2410 - Zikang Chen, Tao Jiang, Xiaowan Hu, Wang Zhang, Huaqiu Li, Haoqian Wang:

Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising. 2411-2419 - Zining Chen, Xingshuang Luo, Weiqiu Wang, Zhicheng Zhao

, Fei Su, Aidong Men:
Filter or Compensate: Towards Invariant Representation from Distribution Shift for Anomaly Detection. 2420-2428 - Ziyang Chen, Yiwen Ye, Yongsheng Pan, Yong Xia:

Gradient Alignment Improves Test-Time Adaptation for Medical Image Segmentation. 2429-2437 - Jiaxiang Cheng, Pan Xie, Xin Xia, Jiashi Li, Jie Wu, Yuxi Ren, Huixia Li, Xuefeng Xiao, Shilei Wen, Lean Fu:

ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models. 2438-2446 - Junfeng Cheng, Yingkai Yang, Tania Stathaki:

3DPGS: 3D Probabilistic Graph Search for Archaeological Piece Grouping. 2447-2454 - Kun Cheng, Lei Yu

, Zhijun Tu, Xiao He, Liyu Chen, Yong Guo, Mingrui Zhu, Nannan Wang, Xinbo Gao, Jie Hu:
Effective Diffusion Transformer Architecture for Image Super-Resolution. 2455-2463 - Yongkang Cheng, Shaoli Huang, Xuelin Chen, Jifeng Ning, Mingming Gong:

DIDiffGes: Decoupled Semi-Implicit Diffusion Models for Real-time Gesture Generation from Speech. 2464-2472 - Yutao Cheng, Zhao Zhang, Maoke Yang, Hui Nie, Chunyuan Li, Xinglong Wu, Jie Shao:

Graphic Design with Large Multimodal Model. 2473-2481 - Zesen Cheng, Kehan Li, Hao Li, Peng Jin, Xiawu Zheng, Chang Liu, Jie Chen:

Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation. 2482-2490 - Zhixin Cheng, Jiacheng Deng, Xinjun Li, Baoqun Yin, Tianzhu Zhang:

Bridge 2D-3D: Uncertainty-aware Hierarchical Registration Network with Domain Alignment. 2491-2499 - Cheol-Ho Cho, WonJun Moon, Woojin Jun, Minseok Jung, Jae-Pil Heo:

Ambiguity-Restrained Text-Video Representation Learning for Partially Relevant Video Retrieval. 2500-2508 - Kyusik Cho

, Dong Yeop Kim, Euntai Kim:
Zero-Shot Scene Change Detection. 2509-2517 - Seungju Cho, Hongsin Lee, Changick Kim:

Enhancing Robustness in Incremental Learning with Adversarial Training. 2518-2526 - Suhwan Cho, Seoung Wug Oh, Sangyoun Lee, Joon-Young Lee:

Elevating Flow-Guided Video Inpainting with Reference Generation. 2527-2535 - Dasol Choi, Dongbin Na:

Distribution-Level Feature Distancing for Machine Unlearning: Towards a Better Trade-off Between Model Utility and Forgetting. 2536-2544 - Sooyoung Choi, Sungyong Park, Heewon Kim:

SIDL: A Real-World Dataset for Restoring Smartphone Images with Dirty Lenses. 2545-2554 - Wonhyeok Choi, Kyumin Hwang, Minwoo Choi, Kiljoon Han, Wonjoon Choi, Mingyu Shin, Sunghoon Im:

Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces. 2555-2563 - Yongjin Choi, Chanhun Park, Seung Jun Baek:

DynASyn: Multi-Subject Personalization Enabling Dynamic Action Synthesis. 2564-2572 - Jisheng Chu, Wenrui Li, Xingtao Wang, Kanglin Ning, Yidan Lu, Xiaopeng Fan

:
Digging into Intrinsic Contextual Information for High-fidelity 3D Point Cloud Completion. 2573-2581 - Chaeyeon Chung, Sunghyun Park, Jeongho Kim, Jaegul Choo:

What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer. 2582-2590 - Jiwan Chung, Seungwon Lim, Sangkyu Lee, Youngjae Yu:

MASS: Overcoming Language Bias in Image-Text Matching. 2591-2599 - Antonio Emanuele Cinà

, Jérôme Rony, Maura Pintor
, Luca Demetrio, Ambra Demontis, Battista Biggio, Ismail Ben Ayed, Fabio Roli:
AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples. 2600-2608 - Yubo Cui, Zhiheng Li, Jiaqiang Wang, Zheng Fang:

LOMA: Language-assisted Semantic Occupancy Network via Triplane Mamba. 2609-2617 - Ming Dai, Jian Li, Jiedong Zhuang, Xian Zhang, Wankou Yang:

Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints. 2618-2626 - Tao Dai, Yang Lin, Hang Guo, Jinbao Wang, Zexuan Zhu

:
DCSF-KD: Dynamic Channel-wise Spatial Feature Knowledge Distillation for Object Detection. 2627-2635 - Tao Dai, Yanzi Wang, Jianyu Xiong, Yaohua Zha, Shu-Tao Xia, Zexuan Zhu

:
GCD-Sampling: A General Cross-scale Decoupled Sampling for Point Cloud. 2636-2644 - Yuqin Dai, Wanlu Zhu, Ronghui Li, Zeping Ren, Xiangzheng Zhou, Jixuan Ying, Jun Li, Jian Yang:

Harmonious Music-driven Group Choreography with Trajectory-Controllable Diffusion. 2645-2653 - Quan Dao, Hao Phung, Trung Tuan Dao, Dimitris N. Metaxas, Anh Tuan Tran:

Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Image Generation. 2654-2662 - Shristi Das Biswas, Matthew Shreve, Xuelu Li, Prateek Singhal, Kaushik Roy:

PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery. 2663-2671 - Gabriel della Maggiora, Luis Alberto Croquevielle, Harry Horsley, Thomas Heinis, Artur Yakimovich:

Single Exposure Quantitative Phase Imaging with a Conventional Microscope Using Diffusion Models. 2672-2680 - Hui Deng, Jiawei Shi, Zhen Qin, Yiran Zhong, Yuchao Dai:

Deep Non-Rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling. 2681-2689 - Jiacheng Deng, Jiahao Lu, Zhixin Cheng, Wenfei Yang:

DiffCorr: Conditional Diffusion Model with Reliable Pseudo-Label Guidance for Unsupervised Point Cloud Shape Correspondence. 2690-2698 - Jiacheng Deng, Jiahao Lu:

Adaptive Siamese Masked Autoencoder with Global Optimization for Unsupervised Point Cloud Shape Correspondence. 2699-2707 - Shangqi Deng, Jun Ma, Liang-Jian Deng, Ping Wei

:
OTIAS: OcTree Implicit Adaptive Sampling for Multispectral and Hyperspectral Image Fusion. 2708-2716 - Xiongwen Deng, Haoyu Tang, Han Jiang

, Qinghai Zheng, Jihua Zhu:
Boundary-Aware Temporal Dynamic Pseudo-Supervision Pairs Generation for Zero-Shot Natural Language Video Localization. 2717-2725 - Yuhui Deng, Yuqin Lu, Yangyang Xu, Yongwei Nie, Shengfeng He

:
Occlusion-Insensitive Talking Head Video Generation via Facelet Compensation. 2726-2734 - Bonan Ding, Jin Xie

, Jing Nie, Jiale Cao:
SSLFusion: Scale and Space Aligned Latent Fusion Model for Multimodal 3D Object Detection. 2735-2743 - Guanqi Ding, Chengyu Yang, Shuhui Wang, Xincheng Li, Jinzhe Zhang, Xin Jin, Qingming Huang:

Dis²Booth: Learning Image Distribution with Disentangled Features for Text-to-Image Diffusion Models. 2744-2752 - Yanbo Ding, Shaobin Zhuang, Kunchang Li, Zhengrong Yue, Yu Qiao, Yali Wang:

Muses: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration. 2753-2761 - Ziheng Ding, Xiaze Zhang, Qi Jing, Ying Cheng, Rui Feng:

AS-Det: Active Sampling for Adaptive 3D Object Detection in Point Clouds. 2762-2770 - Chenghu Du

, Junyin Wang, Yi Rong, Feng Yu
, Shengwu Xiong:
GarFast: Realistic and Fast Garment Transfer with a Simplified Parser-Free Approach. 2771-2779 - Chenghu Du

, Junyin Wang, Feng Yu
, Shengwu Xiong:
Latent Diffusion-Enhanced Virtual Try-On via Optimized Pseudo-Label Generation. 2780-2788 - Keyu Du

, Hao Xu, Haipeng Li, Hong Qu, Chi-Wing Fu, Shuaicheng Liu:
HybridReg: Robust 3D Point Cloud Registration with Hybrid Motions. 2789-2797 - Yongkun Du, Zhineng Chen, Caiyan Jia, Xieping Gao, Yu-Gang Jiang:

Out of Length Text Recognition with Sub-String Matching. 2798-2806 - Chen Duan, Qianyi Jiang, Pei Fu, Jiamin Chen, Shengxi Li, Zining Wang, Shan Guo, Junfeng Luo:

InstructOCR: Instruction Boosting Scene Text Spotting. 2807-2815 - Zheng-Peng Duan, Jiawei Zhang, Siyu Liu, Zheng Lin, Chun-Le Guo, Dongqing Zou, Jimmy S. J. Ren, Chongyi Li:

A Diffusion-Based Framework for Occluded Object Movement. 2816-2824 - Zheng-Peng Duan, Jiawei Zhang, Zheng Lin, Xin Jin, Xundong Wang, Dongqing Zou, Chun-Le Guo, Chongyi Li:

DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts. 2825-2833 - Guodong Fan, Zishu Yao, Guang-Yong Chen, Jian-Nan Su

, Min Gan
:
IniRetinex: Rethinking Retinex-type Low-Light Image Enhancer via Initialization Perspective. 2834-2842 - Haozhi Fan, Yuan Cao:

Vision-guided Text Mining for Unsupervised Cross-modal Hashing with Community Similarity Quantization. 2843-2851 - Junkai Fan, Kun Wang, Zhiqiang Yan

, Xiang Chen, Shangbing Gao, Jun Li, Jian Yang:
Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video. 2852-2860 - Rui Fan, Weidong Hao, Juntao Guan, Lai Rui, Lin Gu

, Tong Wu, Fanhong Zeng, Zhangming Zhu:
EventPillars: Pillar-based Efficient Representations for Event Data. 2861-2869 - Wenxiao Fan, Kan Li:

Combating Semantic Contamination in Learning with Label Noise. 2870-2878 - Zhen Fan, Peng Dai, Zhuo Su, Xu Gao, Zheng Lv, Jiarui Zhang, Tianyuan Du, Guidong Wang, Yang Zhang:

EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs. 2879-2887 - Han Fang, Kejiang Chen, Zijin Yang, Bosen Cui, Weiming Zhang, Ee-Chien Chang:

CoSDA: Enhancing the Robustness of Inversion-based Generative Image Watermarking Framework. 2888-2896 - Shijie Fang, Hongping Gan:

SSUN-Net: Spatial-Spectral Prior-Aware Unfolding Network for Pan-Sharpening. 2897-2905 - Wenxuan Fang, Junkai Fan, Yu Zheng, Jiangwei Weng, Ying Tai, Jun Li:

Guided Real Image Dehazing Using YCbCr Color Space. 2906-2914 - Xiang Fang, Wanlong Fang, Changshuo Wang, Daizong Liu, Keke Tang, Jianfeng Dong, Pan Zhou, Beibei Li:

Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network. 2915-2923 - Chaoran Feng, Wangbo Yu, Xinhua Cheng, Zhenyu Tang, Junwu Zhang, Li Yuan, Yonghong Tian:

AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scenes. 2924-2932 - Chen Feng, Ziquan Liu, Zhuo Zhi, Ilija Bogunovic, Carsten Gerner-Beuerle, Miguel Rodrigues:

PROSAC: Provably Safe Certification for Machine Learning Models under Adversarial Attacks. 2933-2941 - Chun-Mei Feng, Yang Bai, Tao Luo, Zhen Li, Salman H. Khan, Wangmeng Zuo, Rick Siow Mong Goh, Yong Liu:

VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering. 2942-2950 - Dong Feng

, Ping Guo, Encheng Peng, Mingmin Zhu, Wenhao Yu, Peng Wang:
PoseLLaVA: Pose Centric Multimodal LLM for Fine-Grained 3D Pose Manipulation. 2951-2959 - Haoxuan Feng, Haohui Zhou, Tian Ye, Sixiang Chen, Lei Zhu:

Residual Diffusion Deblurring Model for Single Image Defocus Deblurring. 2960-2968 - Kunyu Feng, Yue Ma, Bingyuan Wang, Chenyang Qi, Haozhe Chen, Qifeng Chen, Zeyu Wang:

DiT4Edit: Diffusion Transformer for Image Editing. 2969-2977 - Mingtao Feng, Fenghao Tian, Jianqiao Luo, Zijie Wu, Weisheng Dong, Yaonan Wang, Ajmal Saeed Mian

:
Semantic Ambiguity Modeling and Propagation for Fine-Grained Visual Cross View Geo-Localization. 2978-2986 - Siyang Feng

, Huadeng Wang, Chu Han, Zhenbing Liu, Hualong Zhang
, Rushi Lan, Xipeng Pan:
Weakly Supervised Gland Segmentation with Class Semantic Consistency and Purified Labels Filtration. 2987-2995 - Tonghui Feng, Chunsheng Yan, Qianru Wang, Jiangtao Cui, Xiaotian Qiao:

HDLayout: Hierarchical and Directional Layout Planning for Arbitrary Shaped Visual Text Generation. 2996-3003 - Yi Feng, Yu Han, Xijing Zhang, Tanghui Li, Yanting Zhang, Rui Fan:

ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction. 3004-3012 - Zhida Feng, Li Chen, Yuenan Sun, Jiaxiang Liu, Shikun Feng:

Simplifying Control Mechanism in Text-to-Image Diffusion Models. 3013-3021 - Chenlin Fu, Yingying Zhu:

BGHR: Bridging the Gap Between HBox-Supervised and RBox-Supervised Oriented Object Detection via Adaptive Fine-Grained Sample Mining. 3022-3030 - Teng Fu

, Haiyang Yu, Ke Niu, Bin Li, Xiangyang Xue:
Foundation Model Driven Appearance Extraction for Robust Multiple Object Tracking. 3031-3039 - Xinghe Fu, Zhiyuan Yan, Taiping Yao, Shen Chen, Xi Li:

Exploring Unbiased Deepfake Detection via Token-Level Shuffling and Mixing. 3040-3048 - Keke Gai, Dongjue Wang, Jing Yu, Mohan Wang, Liehuang Zhu, Qi Wu:

MFL-Owner: Ownership Protection for Multi-modal Federated Learning via Orthogonal Transform Watermark. 3049-3058 - Lianqiang Gan, Junyu Lai, Jingze Ju, Lianli Gao, Yi Bin:

DFDNet: Disentangling and Filtering Dynamics for Enhanced Video Prediction. 3059-3067 - Ge Gao, Ho Man Kwan, Fan Zhang, David Bull:

PNVC: Towards Practical INR-based Video Compression. 3068-3076 - Jun Gao, Qian Qiao, Tianxiang Wu, Zili Wang, Ziqiang Cao

, Wenjie Li:
AIM: Let Any Multimodal Large Language Models Embrace Efficient In-Context Learning. 3077-3085 - Mingze Gao, Jingyu Liu, Mingda Li, Jiangtao Xie, Qingbin Liu, Kevin Zhao, Xi Chen, Hui Xiong:

TC-LLaVA: Rethinking the Transfer of LLava from Image to Video Understanding with Temporal Considerations. 3086-3094 - Xianqiang Gao, Pingrui Zhang, Delin Qu, Dong Wang, Zhigang Wang, Yan Ding, Bin Zhao:

Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding. 3095-3103 - Chengjie Ge, Xueyang Fu, Peng He, Kunyu Wang

, Chengzhi Cao, Zheng-Jun Zha:
EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction. 3104-3112 - Shiping Ge, Qiang Chen, Zhiwei Jiang, Yafeng Yin, Liu Qin, Ziyao Chen, Qing Gu:

Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning. 3113-3121 - Xinyu Geng, Jiaming Wang, Xiaolin Huang, Fanglin Chen, Jun Xu:

ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis. 3122-3130 - Zichen Geng, Zeeshan Hayder, Wei Liu

, Ajmal Saeed Mian
:
Auto-Regressive Diffusion for Generating 3D Human-Object Interactions. 3131-3139 - Haifan Gong, Yu Lu, Xiang Wan, Haofeng Li:

Domain Generalized Medical Landmark Detection via Robust Boundary-Aware Pre-Training. 3140-3148 - Tao Gong, Qi Chu, Bin Liu, Nenghai Yu:

Rethinking Masked Data Reconstruction Pretraining for Strong 3D Action Representation Learning. 3149-3157 - Jiaxiang Gou, Luping Ji

, Pei Liu, Mao Ye:
Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification. 3158-3166 - Anna Grim, Jayaram Chandrashekar, Uygar Sümbül:

Efficient Connectivity-Preserving Instance Segmentation with Supervoxel-Based Loss Function. 3167-3175 - Shengbo Gu, Yu-Kun Qiu, Yu-Ming Tang, Ancong Wu, Weishi Zheng:

MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning. 3176-3184 - Zijian Gu, Jianwei Ma, Yan Huang, Honghao Wei, Zhanye Chen, Hui Zhang, Wei Hong:

HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection. 3185-3193 - Xianchao Guan, Yifeng Wang, Ye Zhang

, Zheng Zhang, Yongbing Zhang:
OT-StainNet: Optimal Transport Driven Semantic Matching for Weakly Paired H&E-to-IHC Stain Transfer. 3194-3202 - Ming Gui, Johannes Schusterbauer, Ulrich Prestel, Pingchuan Ma, Dmytro Kotovenko, Olga Grebenkova, Stefan Andreas Baumann, Vincent Tao Hu, Björn Ommer:

DepthFM: Fast Generative Monocular Depth Estimation with Flow Matching. 3203-3211 - Chuchen Guo, Weijie Zhou, Zheng Liu, Ying He:

You Should Learn to Stop Denoising on Point Clouds in Advance. 3212-3219 - Diandian Guo, Weixin Si, Zhixi Li, Jialun Pei, Pheng-Ann Heng:

Surgical Workflow Recognition and Blocking Effectiveness Detection in Laparoscopic Liver Resection with Pringle Maneuver. 3220-3228 - Haipeng Guo, Huanyu Liu, Jiazheng Wen

, Junbao Li:
Cross-Spectral Gaussian Splatting with Spatial Occupancy Consistency. 3229-3237 - Haojie Guo, Junyu Gao, Yuan Yuan:

Enhancing Low-Rank Adaptation with Recoverability-Based Reinforcement Pruning for Object Counting. 3238-3246 - Heng Guo, Jianfeng Zhang, Jiaxing Huang

, Tony C. W. Mok, Dazhou Guo, Ke Yan, Le Lu, Dakai Jin, Minfeng Xu:
Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model Using 3D Whole-Body CT Scans. 3247-3256 - Jialong Guo, Ke Liu, Jiangchao Yao, Zhihua Wang, Jiajun Bu, Haishuai Wang:

MetaNeRV: Meta Neural Representations for Videos with Spatial-Temporal Guidance. 3257-3265 - Kun Guo, Qiang Ling:

PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts. 3266-3274 - Pinxue Guo, Hao Huang, Peiyang He, Xuefeng Liu, Tianjun Xiao, Wenqiang Zhang:

OpenVIS: Open-vocabulary Video Instance Segmentation. 3275-3283 - Puyuan Guo, Tuo Hao, Wenxin Fu, Yingming Gao, Ya Li:

Controllable 3D Dance Generation Using Diffusion-Based Transformer U-Net. 3284-3292 - Yijia Guo, Liwen Hu, Yuanxi Bai, Jiawei Yao, Lei Ma, Tiejun Huang:

SpikeGS: Reconstruct 3D Scene Captured by a Fast-Moving Bio-Inspired Camera. 3293-3301 - Yongxin Guo, Jingyu Liu, Mingda Li, Dingxin Cheng, Xiaoying Tang, Dianbo Sui, Qingbin Liu, Xi Chen, Kevin Zhao:

VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding. 3302-3310 - Ameer Hamza, Abdullah, Yong Hyun Ahn, Sungyoung Lee, Seong Tae Kim:

LLaVA Needs More Knowledge: Retrieval Augmented Natural Language Generation with Knowledge Graph for Explaining Thoracic Pathologies. 3311-3319 - Feng Han, Kai Chen, Chao Gong, Zhipeng Wei, Jingjing Chen, Yu-Gang Jiang:

DuMo: Dual Encoder Modulation Network for Precise Concept Erasure. 3320-3328 - Huasong Han, Kaixuan Zhou, Xiaoxiao Long, Yusen Wang, Chunxia Xiao:

GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving. 3329-3337 - Jumin Han, Jun-Hee Kim, Seong-Whan Lee:

ProPose: Probabilistic 3D Human Pose Estimation with Instance-Level Distribution and Normalizing Flow. 3338-3346 - Wencheng Han, Dongqian Guo, Cheng-Zhong Xu, Jianbing Shen:

DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving. 3347-3355 - Xumeng Han, Longhui Wei, Xuehui Yu, Zhiyang Dou, Xin He, Kuiran Wang, Yingfei Sun, Zhenjun Han, Qi Tian:

Boosting Segment Anything Model Towards Open-Vocabulary Learning. 3356-3365 - Yushan Han, Hui Zhang, Honglei Zhang, Jing Wang, Yidong Li:

CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework. 3366-3373 - Zihao Han, Baoquan Zhang, Lisai Zhang, Shanshan Feng, Kenghong Lin, Guotao Liang, Yunming Ye, Joeq, Kola Ye:

AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting. 3374-3382 - Jinkun Hao, Junshu Tang, Jiangning Zhang

, Ran Yi, Yijia Hong, Moran Li, Weijian Cao, Yating Wang, Chengjie Wang
, Lizhuang Ma:
ID-Sculpt: ID-aware 3D Head Generation from Single In-the-wild Portrait Image. 3383-3391 - Gang He, Guancheng Quan, Chang Wu, Shihao Wang, Dajiang Zhou, Yunsong Li:

Multi-Frame Deformable Look-Up Table for Compressed Video Quality Enhancement. 3392-3400 - Hangzhou He, Lei Zhu, Xinliang Zhang, Shuang Zeng, Qian Chen, Yanye Lu:

V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer. 3401-3409 - Qingdong He, Jiangning Zhang

, Jinlong Peng, Haoyang He, Xiangtai Li, Yabiao Wang
, Chengjie Wang
:
PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud Learning. 3410-3418 - Ruian He, Ri Cheng, Xinkai Lyu, Weimin Tan, Bo Yan:

Efficient Online Training for Zero-Shot Time-Lapse Microscopy Denoising and Super-Resolution. 3419-3427 - Xiankang He, Guangkai Xu, Bo Zhang, Hao Chen, Ying Cui, Dongyan Guo:

DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation. 3428-3436 - Xu He, Zhiyong Wu, Xiaoyu Li, Di Kang, Chaopeng Zhang, Jiangnan Ye, Liyang Chen, Xiangjun Gao

, Han Zhang, Haolin Zhuang:
MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement. 3437-3445 - Yina He, Lei Peng, Yongcun Zhang, Juanjuan Weng, Shaozi Li, Zhiming Luo:

Long-Tailed Out-of-Distribution Detection: Prioritizing Attention to Tail. 3446-3454 - Yulin He

, Wei Chen, Siqi Wang, Tianci Xun, Yusong Tan:
Achieving Speed-Accuracy Balance in Vision-based 3D Occupancy Prediction via Geometric-Semantic Disentanglement. 3455-3463 - Yuwen He, Wei Wang, Wanyu Wu

, Kui Jiang:
Disentangle Nighttime Lens Flares: Self-supervised Generation-based Lens Flare Removal. 3464-3472
Technical Tracks 4
- Zihao He, Shengchuan Zhang, Runze Hu, Yunhang Shen, Yan Zhang:

BUFF: Bayesian Uncertainty Guided Diffusion Probabilistic Model for Single Image Super-Resolution. 3474-3482 - Miran Heo, Seoung Wug Oh, Seon Joo Kim, Joon-Young Lee:

Robust and Consistent Online Video Instance Segmentation via Instance Mask Propagation. 3483-3490 - Cuong Manh Hoang, Yeejin Lee, Byeongkeun Kang:

Generalized Class Discovery in Instance Segmentation. 3491-3499 - Yan Hong, Jianming Feng, Haoxing Chen, Jun Lan, Huijia Zhu, Weiqiang Wang, Jianfu Zhang:

WildFake: A Large-Scale and Hierarchical Dataset for AI-Generated Images Detection. 3500-3508 - Jie Hou, Jianghong Ma, Xiangyu Mu

, Haijun Zhang, Zhao Zhang:
FashionTailor: Controllable Clothing Editing for Human Images with Appearance Preserving. 3509-3517 - Shiyu Hou, Tianfei Zhou, Shuai Zhang, Ye Yuan, Guoren Wang:

Prompt Tuning In a Compact Attribute Space. 3518-3526 - Wenjin Hou, Dingjie Fu, Kun Li, Shiming Chen, Hehe Fan, Yi Yang:

ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning. 3527-3535 - Xiaolu Hou, Mingcheng Li, Dingkang Yang, Jiawei Chen, Ziyun Qian, Xiao Zhao, Yue Jiang, Jinjie Wei, Qingyao Xu, Lihua Zhang:

BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation. 3536-3544 - Teng-Fang Hsiao, Bo-Kai Ruan, Hong-Han Shuai:

Training-and-Prompt-Free General Painterly Harmonization via Zero-Shot Disentenglement on Style and Content References. 3545-3553 - Jintong Hu, Bin Xia, Bin Chen, Wenming Yang, Lei Zhang:

GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution. 3554-3562 - Qiang Hu, Houqiang Zhong

, Zihan Zheng, Xiaoyun Zhang, Zhengxue Cheng, Li Song, Guangtao Zhai, Yanfeng Wang:
VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression. 3563-3571 - Qiang Hu, Zhenyu Yi, Ying Zhou, Fan Huang, Mei Liu, Qiang Li, Zhiwei Wang:

MonoBox: Tightness-Free Box-Supervised Polyp Segmentation Using Monotonicity Constraint. 3572-3580 - Xiantao Hu, Ying Tai, Xu Zhao, Chen Zhao, Zhenyu Zhang, Jun Li, Bineng Zhong, Jian Yang:

Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking. 3581-3589 - Xiao Hu, Libo Long, Jochen Lang:

Motion Decoupled 3D Gaussian Splatting for Dynamic Object Representation. 3590-3598 - Hang Hua, Yunlong Tang

, Chenliang Xu, Jiebo Luo
:
V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning. 3599-3607 - Bin Huang, Xin Wang, Hong Chen, Houlun Chen, Yaofei Wu, Wenwu Zhu:

Identity-Text Video Corpus Grounding. 3608-3616 - Binyuan Huang, Yuqing Wen, Yucheng Zhao, Yaosi Hu, Yingfei Liu, Fan Jia, Weixin Mao, Tiancai Wang, Chi Zhang, Chang Wen Chen, Zhenzhong Chen, Xiangyu Zhang:

SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control. 3617-3625 - Chihan Huang, Xiaobo Shen:

HUANG: A Robust Diffusion Model-based Targeted Adversarial Attack Against Deep Hashing Retrieval. 3626-3634 - Dongshuo Huang, Xiaoshui Huang, Chengdong Zhang, Yilei Shi:

LPCG: A Self-conditional Architecture for Labeled Point Cloud Generation. 3635-3643 - Han Huang, Yulun Wu

, Chao Deng, Ge Gao, Ming Gu, Yu-Shen Liu:
FatesGS: Fast and Accurate Sparse-View Surface Reconstruction Using Gaussian Splatting with Depth-Feature Consistency. 3644-3652 - Jiaqi Huang

, Zunnan Xu, Ting Liu, Yong Liu, Haonan Han, Kehong Yuan, Xiu Li:
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation. 3653-3661 - Jie Huang, Rui Huang

, Jinghao Xu, Siran Peng, Yule Duan, Liang-Jian Deng
:
Wavelet-Assisted Multi-Frequency Attention Network for Pansharpening. 3662-3670 - Lifeng Huang, Tian Su, Chengying Gao, Ning Liu, Qiong Huang:

AUTE: Peer-Alignment and Self-Unlearning Boost Adversarial Robustness for Training Ensemble Models. 3671-3679 - Muye Huang, Han Lai, Xinyu Zhang, Wenjun Wu, Jie Ma, Lingling Zhang, Jun Liu:

EvoChart: A Benchmark and a Self-Training Approach Towards Real-World Chart Understanding. 3680-3688 - Muye Huang, Lingling Zhang, Han Lai, Wenjun Wu, Xinyu Zhang, Jun Liu:

VProChart: Answering Chart Question Through Visual Perception Alignment Agent and Programmatic Solution Reasoning. 3689-3696 - Pei-Kai Huang, Jun-Xiong Chong, Cheng-Hsuan Chiang, Tzu-Hsien Chen, Tyng-Luh Liu, Chiou-Ting Hsu:

SLIP: Spoof-Aware One-Class Face Anti-Spoofing with Language Image Pretraining. 3697-3706 - Qihan Huang, Siming Fu, Jinlong Liu, Hao Jiang, Yipeng Yu, Jie Song:

Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation. 3707-3714 - Shaofei Huang, Rui Ling, Hongyu Li, Tianrui Hui, Zongheng Tang, Xiaoming Wei, Jizhong Han

, Si Liu:
Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation. 3715-3723 - Shiqi Huang, Shuting He, Bihan Wen

:
ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation. 3724-3732 - Tianyu Huang, Haoze Zhang, Yihan Zeng, Zhilu Zhang, Hui Li, Wangmeng Zuo, Rynson W. H. Lau:

DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors. 3733-3741 - Tingxuan Huang, Jiacheng Miao, Shizhuo Deng, Tong Jia, Dongyue Chen:

Efficient Indoor Depth Completion Network Using Mask-adaptive Gated Convolution. 3742-3750 - Wenbo Huang

, Jinghui Zhang, Guang Li, Lei Zhang, Shuoyuan Wang, Fang Dong, Jiahui Jin, Takahiro Ogawa, Miki Haseyama:
Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence. 3751-3759 - Xiang Huang, Qing Zhang, Jian-Fang Hu, Wei-Shi Zheng:

CLIP-RestoreX: Restore Image Structure and Perception in Exposure Correction. 3760-3768 - Xiaofei Huang, Wenting Chen

, Jie Liu
, Qisheng Lu, Xiaoling Luo, Linlin Shen:
DAMPER: A Dual-Stage Medical Report Generation Framework with Coarse-Grained MeSH Alignment and Fine-Grained Hypergraph Matching. 3769-3778 - Xiaoshuang Huang, Lingdong Shen, Jia Liu, Fangxin Shang, Hongxiang Li, Haifeng Huang, Yehui Yang:

Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine. 3779-3787 - Xiaoshui Huang, Zhou Huang, Yifan Zuo, Yongshun Gong, Chengdong Zhang, Deyang Liu, Yuming Fang:

PSReg: Prior-guided Sparse Mixture of Experts for Point Cloud Registration. 3788-3796 - Xijie Huang, Xinyuan Wang, Hantao Zhang, Yinghao Zhu, Jiawen Xi, Jingkun An, Hao Wang, Hao Liang, Chengwei Pan:

Medical MLLM Is Vulnerable: Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models. 3797-3805 - Xun Huang, Ziyu Xu, Hai Wu, Jinlong Wang, Qiming Xia, Yan Xia, Jonathan Li, Kyle Gao, Chenglu Wen, Cheng Wang:

L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection. 3806-3814 - Yan Huang, Xiaoshan Liao, Jinxiu Liang, Yuhui Quan, Boxin Shi

, Yong Xu:
Zero-Shot Low-Light Image Enhancement via Latent Diffusion Models. 3815-3823 - Yanglin Huang

, Kai Hu, Yuan Zhang, Zhineng Chen, Xieping Gao:
Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation. 3824-3832 - Yongle Huang, Haodong Chen, Zhenbang Xu, Zihan Jia, Haozhou Sun, Dian Shao:

SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization. 3833-3841 - Yunlong Huang, Junshuo Liu, Ke Xian, Robert Caiming Qiu:

PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model. 3842-3850 - Jiayu Huo

, Xi Ouyang, Sébastien Ourselin, Rachel Sparks:
Generative Medical Segmentation. 3851-3859 - Yixiong Huo, Guangfeng Jiang, Hongyang Wei, Ji Liu, Song Zhang, Han Liu, Xingliang Huang, Mingjie Lu, Jinzhang Peng, Dong Li, Lu Tian, Emad Barsoum:

EGSRAL: An Enhanced 3D Gaussian Splatting Based Renderer with Automated Labeling for Large-Scale Driving Scene. 3860-3867 - Junhwa Hur, Charles Herrmann, Saurabh Saxena, Janne Kontkanen, Wei-Sheng Lai, Yichang Shih, Michael Rubinstein, David J. Fleet, Deqing Sun:

High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion. 3868-3876 - Hyoseok Lee, Kyeong Seon Kim, Byung-Ki Kwon, Tae-Hyun Oh:

Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior. 3877-3885 - Muhammet Furkan Ilaslan, Ali Köksal

, Kevin Qinghong Lin, Burak Satar, Mike Zheng Shou, Qianli Xu:
VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting. 3886-3894 - Elkhan Ismayilzada

, MD Khalequzzaman Chowdhury Sayem
, Yihalem Yimolal Tiruneh, Mubarrat Tajoar Chowdhury, Muhammadjon Boboev, Seungryul Baek:
QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects. 3895-3903 - Alexander Jaus, Constantin Marc Seibold, Simon Reiß

, Zdravko Marinov, Keyi Li, Zeling Ye, Stefan Krieg, Jens Kleesiek, Rainer Stiefelhagen:
Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks. 3904-3912 - Yuxiang Ji, Boyong He, Zhuoyue Tan, Liaoni Wu:

Game4Loc: A UAV Geo-Localization Benchmark from Game Data. 3913-3921 - Yuzhou Ji, He Zhu, Junshu Tang, Wuyi Liu, Zhizhong Zhang, Xin Tan, Yuan Xie:

FastLGS: Speeding Up Language Embedded Gaussians with Feature Grid Mapping. 3922-3930 - Mingda Jia, Liming Zhao, Ge Li, Yun Zheng:

ContextHOI: Spatial Context Learning for Human-Object Interaction Detection. 3931-3939 - Mingda Jia, Liming Zhao, Ge Li, Yun Zheng:

Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection. 3940-3948 - Yizhen Jia

, Rong Quan, Yue Feng, Haiyan Chen, Jie Qin:
Doubly Contrastive Learning for Source-Free Domain Adaptive Person Search. 3949-3957 - Yueru Jia, Aosong Cheng, Yuhui Yuan, Chuke Wang, Ji Li, Huizhu Jia, Shanghang Zhang:

DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework. 3958-3966 - Dadong Jiang, Xianghui Yang, Zibo Zhao, Sheng Zhang, Jiaao Yu, Zeqiang Lai, Shaoxiong Yang, Chunchao Guo, Xiaobo Zhou, Zhihui Ke:

FlexiTex: Enhancing Texture Generation via Visual Guidance. 3967-3975 - Hao Jiang, Yang Jin, Zhicheng Sun, Kun Xu, Kun Xu, Liwei Chen, Yang Song, Kun Gai, Yadong Mu:

Granularity-Adaptive Spatial Evidence Tokenization for Video Question Answering. 3976-3984 - Jianan Jiang

, Hao Tang, Zhilin Jiang, Weiren Yu, Di Wu:
ARNet: Self-Supervised FG-SBIR with Unified Sample Feature Alignment and Multi-Scale Token Recycling. 3985-3993 - Jianfei Jiang

, Liyong Wang, Haochen Yu, Tianyu Hu, Jiansheng Chen, Huimin Ma:
RRT-MVS: Recurrent Regularization Transformer for Multi-View Stereo. 3994-4002 - Jimao Jiang

, Diya Sun, Tianbing Wang, Yuru Pei:
SCCS: Deep Neural Spectral Clustering for Self-Supervised Subcellular Structure Segmentation. 4003-4011 - Liyao Jiang, Negar Hassanpour, Mohammad Salameh, Mohammadreza Samadi, Jiao He, Fengyu Sun, Di Niu:

PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation. 4012-4020 - Luoqian Jiang, Yong Guo, Bingna Xu, Haolin Pan, Jiezhang Cao, Wenbo Li, Jian Chen:

Restabilizing Diffusion Models with Predictive Noise Fusion Strategy for Image Super-Resolution. 4021-4029 - Nan Jiang, Shanchao Liang, Chengxiao Wang, Jiannan Wang, Lin Tan

:
LATTE: Improving Latex Recognition for Tables and Formulae with Iterative Refinement. 4030-4038 - Pengfei Jiang, Mingbao Lin, Fei Chao:

Move and Act: Enhanced Object Manipulation and Background Integrity for Image Editing. 4039-4047 - Rui Jiang, Xinghe Fu, Guangcong Zheng, Teng Li, Taiping Yao, Xi Li:

Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models. 4048-4056 - Sijia Jiang, Jing Hua, Zhizhong Han:

Query Quantized Neural SLAM. 4057-4065 - Sijia Jiang, Tong Wu, Jing Hua, Zhizhong Han:

Sensing Surface Patches in Volume Rendering for Inferring Signed Distance Functions. 4066-4074 - Yutao Jiang, Qiong Wu, Wenhao Lin, Wei Yu, Yiyi Zhou:

What Kind of Visual Tokens Do We Need? Training-Free Visual Token Pruning for Multi-Modal Large Language Models from the Perspective of Graph. 4075-4083 - Xianhe Jiao, Chenlei Lv, Junli Zhao, Ran Yi, Yu-Hui Wen, Zhenkuan Pan, Zhongke Wu, Yong-Jin Liu:

Weighted Poisson-disk Resampling on Large-Scale Point Clouds. 4084-4092 - Yingying Jiao, Zhigang Wang, Sifan Wu, Shaojing Fan, Zhenguang Liu, Zhuoyue Xu, Zheqi Wu:

SpatioTemporal Learning for Human Pose Estimation in Sparsely-Labeled Videos. 4093-4101 - Yingying Jiao, Zhigang Wang, Zhenguang Liu, Shaojing Fan, Sifan Wu, Zheqi Wu, Zhuoyue Xu:

Optimizing Human Pose Estimation Through Focused Human and Joint Regions. 4102-4110 - Can Jin, Tianjin Huang, Yihua Zhang, Mykola Pechenizkiy, Sijia Liu, Shiwei Liu

, Tianlong Chen:
Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective. 4111-4119 - Dongyang Jin, Chao Fan, Weihua Chen, Shiqi Yu:

Exploring More from Multiple Gait Modalities for Human Identification. 4120-4128 - Er Jin, Qihui Feng, Yongli Mou, Gerhard Lakemeyer, Stefan Decker, Oliver Simons, Johannes Stegmaier:

LogicAD: Explainable Anomaly Detection via VLM-based Text Feature Extraction. 4129-4137 - Jiandong Jin, Xiao Wang, Qian Zhu, Haiyang Wang, Chenglong Li:

Pedestrian Attribute Recognition: A New Benchmark Dataset and a Large Language Model Augmented Framework. 4138-4146 - Long Jin, Han Nong, Liangming Chen, Zhenming Su:

A Method for Enhancing Generalization of Adam by Multiple Integrations. 4147-4155 - Hyungjun Joo, Hyeonggeun Han, Sehwan Kim, Sangwoo Hong, Jungwoo Lee:

Constructing Fair Latent Space for Intersection of Fairness and Explainability. 4156-4165 - Woojin Jun, WonJun Moon, Cheol-Ho Cho, Minseok Jung, Jae-Pil Heo:

Bridging the Semantic Granularity Gap Between Text and Frame Representations for Partially Relevant Video Retrieval. 4166-4174 - Dachun Kai, Yueyi Zhang, Jin Wang, Zeyu Xiao, Zhiwei Xiong, Xiaoyan Sun:

Event-Enhanced Blurry Video Super-Resolution. 4175-4183 - Danial Kamali, Elham J. Barezi, Parisa Kordjamshidi:

NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization. 4184-4193 - Ben Kang, Xin Chen, Simiao Lai, Yang Liu, Yi Liu, Dong Wang:

Exploring Enhanced Contextual Information for Video-Level Object Tracking. 4194-4202 - Gyeongjin Kang

, Younggeun Lee, Seungjun Oh, Eunbyung Park:
CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis. 4203-4211 - Jiahui Kang, Qing Cai, Runqing Tan, Yimei Liu, Zhi Liu:

C2PD: Continuity-Constrained Pixelwise Deformation for Guided Depth Super-Resolution. 4212-4220 - Jingcheng Ke, Waikeung Wong, Jia Wang, Mu Li, Lunke Fei, Jie Wen:

DiffusionREC: Diffusion Model with Adaptive Condition for Referring Expression Comprehension. 4221-4229 - Muhammad Uzair Khattak, Muhammad Ferjad Naeem, Muzammal Naseer, Luc Van Gool, Federico Tombari:

Learning to Prompt with Text Only Supervision for Vision-Language Models. 4230-4238 - Donghyun Kim, Hyeonkyeong Kwon, Yumin Kim, Seong Jae Hwang:

PLATYPUS: Progressive Local Surface Estimator for Arbitrary-Scale Point Cloud Upsampling. 4239-4247 - Hyeonseok Kim, Byeongkeun Kang, Yeejin Lee:

Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic Calibration. 4248-4256 - Hyunjun Kim, Nam Ik Cho:

APR-RD: Complemental Two Steps for Self-Supervised Real Image Denoising. 4257-4265 - Jihwan Kim, Miso Lee, Cheol-Ho Cho, Jihyun Lee, Jae-Pil Heo:

Prediction-Feedback DETR for Temporal Action Detection. 4266-4274 - Jisoo Kim, Jungbin Cho, Joonho Park, Soonmin Hwang, Da Eun Kim, Geon Kim, Youngjae Yu:

DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation. 4275-4283 - Jungho Kim, Changwon Kang, Dongyoung Lee, Sehwan Choi, Jun Won Choi:

ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query Decoder. 4284-4292 - Minkuk Kim, Hyeon Bae Kim, Jinyoung Moon, Jinwoo Choi, Seong Tae Kim:

HiCM²: Hierarchical Compact Memory Modeling for Dense Video Captioning. 4293-4301 - Seyeon Kim, Siyoon Jin, Jihye Park, Kihong Kim, Jiyoung Kim, Jisu Nam, Seungryong Kim:

MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation. 4302-4310 - Soowoong Kim, Minseong Kwon, Junho Choi, Gun Bang, Seungjoon Yang:

TSDF-Based Efficient Motion-Compensated Temporal Interpolation for 3D Dynamic Sequences. 4311-4319 - Taewhan Kim

, Soeun Lee, Si-Woo Kim
, Dong-Jin Kim:
ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning. 4320-4328 - Taewoong Kim, Byeonghwi Kim, Jonghyun Choi:

Multi-Modal Grounded Planning and Efficient Replanning for Learning Embodied Agents with a Few Examples. 4329-4337 - Younghyun Kim, Geunmin Hwang, Junyu Zhang, Eunbyung Park:

DiffuseHigh: Training-Free Progressive High-Resolution Image Synthesis Through Structure Guidance. 4338-4346 - Konstantin Klemmer, Esther Rolf, Caleb Robinson, Lester Mackey, Marc Rußwurm

:
SatCLIP: Global, General-Purpose Location Embeddings with Satellite Imagery. 4347-4355 - Hyun-kyu Ko, Dongheok Park, Youngin Park, Byeonghyeon Lee, Juhee Han, Eunbyung Park:

Sequence Matters: Harnessing Video Models in 3D Super-Resolution. 4356-4364 - Maksim Kolodiazhnyi, Anna Vorontsova, Matvey Skripkin, Danila Rukhovich, Anton Konushin:

UniDet3D: Multi-dataset Indoor 3D Object Detection. 4365-4373 - Hanyang Kong, Xingyi Yang, Xinchao Wang:

Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling. 4374-4382 - Jiayi Kong, Xurui Song, Shuo Huai, Baixin Xu, Jun Luo, Ying He:

Do Not DeepFake Me: Privacy-Preserving Neural 3D Head Reconstruction Without Sensitive Images. 4383-4391 - Mengxun Kong, Jie Guo, Chen Wang, Ye Yuan, Yanwen Guo:

Real-Time Neural Denoising with Render-Aware Knowledge Distillation. 4392-4400 - Ming Kong

, Xianzhou Zeng, Luyuan Chen, Yadong Li, Bo Yan, Qiang Zhu:
MHBench: Demystifying Motion Hallucination in VideoLLMs. 4401-4409 - Koen Kraaijveld, Yifan Jiang, Kaixin Ma, Filip Ilievski

:
COLUMBUS: Evaluating COgnitive Lateral Understanding Through Multiple-Choice reBUSes. 4410-4418 - Akash Kumar, Sirshapan Mitra, Yogesh Singh Rawat:

Stable Mean Teacher for Semi-supervised Video Action Detection. 4419-4427 - Suruchi Kumari, Pravendra Singh:

A Unified Degradation-Robust Approach to SSL and UDA for 3D Medical Images. 4428-4436 - Myung-Joon Kwon, Wonjun Lee

, Seung-Hun Nam, Minji Son, Changick Kim:
SAFIRE: Segment Any Forged Image Region. 4437-4445 - Jian Lan, Diego Frassinelli, Barbara Plank:

Mind the Uncertainty in Human Disagreement: Evaluating Discrepancies Between Model Predictions and Human Responses in VQA. 4446-4454 - Yunwei Lan, Zhigao Cui

, Chang Liu, Jialun Peng, Nian Wang, Xin Luo, Dong Liu:
Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired Training. 4455-4463 - Maria A. Larchenko, Alexander Lobashev, Dmitry Guskov, Vladimir Vladimirovich Palyulin:

Color Transfer with Modulated Flows. 4464-4472 - Quang-Hung Le, Long Hoang Dang, Ngan Hoang Le, Truyen Tran, Thao Minh Le

:
Progressive Multi-granular Alignments for Grounded Reasoning in Large Vision-Language Models. 4473-4481 - Chan Lee, Seungho Shin

, Gyeong-Moon Park, Jung Uk Kim:
Multispectral Pedestrian Detection with Sparsely Annotated Label. 4482-4490 - Hyunjee Lee, Youngsik Yun, Jeongmin Bae, Seoha Kim, Youngjung Uh:

Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space. 4491-4498 - Ji Soo Lee, Jongha Kim, Jeehye Na, Jinyoung Park, Hyunwoo J. Kim:

VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning. 4499-4507 - Jooyoung Lee, Jaeyoon Lee, Jongwon Choi:

NBA3D: Neighbor-Based Confidence Adjustment for 3D Rare Object Detection Using LiDAR. 4508-4516 - JunGyu Lee, Yeji Choi, Haksub Kim, Ig-Jae Kim, Gi Pyo Nam:

Navigating Label Ambiguity for Facial Expression Recognition in the Wild. 4517-4525 - Minhyeok Lee, Suhwan Cho, Chajin Shin, Jungho Lee, Sunghun Yang, Sangyoun Lee:

Video Diffusion Models Are Strong Video Inpainter. 4526-4533
Technical Tracks 5
- Sangho Lee, Il Yong Chun, Hogun Park:

MAMS: Model-Agnostic Module Selection Framework for Video Captioning. 4535-4543 - Sanghyeon Lee, Jooyeol Yun, Jaegul Choo:

Enabling Region-Specific Control via Lassos in Point-Based Colorization. 4544-4552 - Subeen Lee, Jiyeon Han

, Soyeon Kim, Jaesik Choi:
Diverse Rare Sample Generation with Pretrained GANs. 4553-4561 - Yuxiao Lee, Xiaofeng Cao, Jingcai Guo, Wei Ye, Qing Guo, Yi Chang:

Concept Matching with Agent for Out-of-Distribution Detection. 4562-4570 - Mengqi Lei, Haochen Wu, Xinhua Lv, Xin Wang:

ConDSeg: A General Medical Image Segmentation Framework via Contrast-Driven Feature Enhancement. 4571-4579 - Jiaqi Leng, Yakun Ju, Yuanxu Duan, Jiangnan Zhang, Qingxuan Lv, Zuxuan Wu, Hao Fan:

FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-from-gradients. 4580-4588 - Yicheng Leng, Chaowei Fang, Junye Chen, Yixiang Fang, Sheng Li, Guanbin Li:

Bridging Knowledge Gap Between Image Inpainting and Large-Area Visible Watermark Removal. 4589-4597 - Yarin Yerushalmi Levi, Edita Grolman, Idan Yankelev

, Amit Giloni, Omer Hofman, Toshiya Shimizu, Asaf Shabtai, Yuval Elovici:
KDAT: Inherent Adversarial Robustness via Knowledge Distillation with Adversarial Tuning for Object Detection Models. 4598-4606 - Jaihyun Lew, Jooyoung Choi, Chaehun Shin, Dahuin Jung, Sungroh Yoon:

Disentangled Motion Modeling for Video Frame Interpolation. 4607-4615 - Bingliang Li, Fengyu Yang, Yuxin Mao, Qingwen Ye, Hongkai Chen, Yiran Zhong:

Tri-Ergon: Fine-Grained Video-to-Audio Generation with Multi-Modal Conditions and LUFS Control. 4616-4624 - Bonan Li, Zicheng Zhang, Xuecheng Nie, Congying Han, Yinhan Hu, Xinmin Qiu, Tiande Guo:

StyO: Stylize Your Face in Only One-Shot. 4625-4633 - Chade Li

, Pengju Zhang, Bo Liu, Hao Wei, Yihong Wu:
FEAST-Mamba: FEAture and SpaTial Aware Mamba Network with Bidirectional Orthogonal Fusion for Cross-Modal Point Cloud Segmentation. 4634-4642 - Chen Li, Rui Zhao, Zeyu Wang, Huiying Xu, Xinzhong Zhu:

RemDet: Rethinking Efficient Model Design for UAV Object Detection. 4643-4651 - Chenxin Li, Xinyu Liu, Wuyang Li, Cheng Wang, Hengyu Liu, Yifan Liu, Zhen Chen, Yixuan Yuan:

U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation. 4652-4660 - Chuanhao Li, Zhen Li, Chenchen Jing, Xiaomeng Fan, Wenbo Ye, Yuwei Wu, Yunde Jia:

Consistency of Compositional Generalization Across Multiple Levels. 4661-4669 - Chunxiao Li, Xiaoxiao Wang, Boming Miao, Chuanlong Xie, Zizhe Wang, Yao Zhu:

An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques. 4670-4678 - Daxin Li, Yuanchao Bai, Kai Wang, Junjun Jiang, Xianming Liu, Wen Gao:

CALLIC: Content Adaptive Learning for Lossless Image Compression. 4679-4688 - Guangyuan Li, Yongkang Wang, Junsheng Luan, Lei Zhao, Wei Xing, Huaizhong Lin, Binkai Ou:

Cascaded Diffusion Models for Virtual Try-On: Improving Control and Resolution. 4689-4697 - Guoqiu Li, Jin Song, Yiyun Fei:

HomeDiffusion: Zero-Shot Object Customization with Multi-View Representation Learning for Indoor Scenes. 4698-4706 - Hao Li, Hao Fei, Zechao Hu, Zhengwei Yang, Zheng Wang:

VEGAS: Towards Visually Explainable and Grounded Artificial Social Intelligence. 4707-4715 - Haojin Li, Heng Li, Jianyu Chen, Rihan Zhong, Ke Niu, Huazhu Fu, Jiang Liu:

AIF-SFDA: Autonomous Information Filter Driven Source-Free Domain Adaptation for Medical Image Segmentation. 4716-4724 - Huafeng Li, Dayong Su, Qing Cai, Yafei Zhang:

BSAFusion: A Bidirectional Stepwise Feature Alignment Network for Unaligned Medical Image Fusion. 4725-4733 - Huaqiu Li, Wang Zhang, Xiaowan Hu, Tao Jiang, Zikang Chen, Haoqian Wang:

Prompt-SID: Learning Structural Representation Prompt via Latent Diffusion for Single Image Denoising. 4734-4742 - Jiafeng Li, Ying Wen, Lianghua He:

M²RL-Net: Multi-View and Multi-Level Relation Learning Network for Weakly-Supervised Image Forgery Detection. 4743-4751 - Jiahao Li, Yang Lu, Yuan Xie, Yanyun Qu:

MaskViM: Domain Generalized Semantic Segmentation with State Space Models. 4752-4760 - Jian Li, Siwang Zhou:

Block-Based Multi-Scale Image Rescaling. 4761-4769 - Jiawei Li

, Hongwei Yu, Jiansheng Chen, Xinlong Ding
, Jinlong Wang, Jinyuan Liu, Bochao Zou, Huimin Ma:
A²RNet: Adversarial Attack Resilient Network for Robust Infrared and Visible Image Fusion. 4770-4778 - Jiaxing Li, Lin Jiang, Zeqi Ma, Kaihang Jiang, Xiaozhao Fang, Jie Wen:

Lightweight Contrastive Distilled Hashing for Online Cross-modal Retrieval. 4779-4787 - Junyi Li, Zhilu Zhang, Wangmeng Zuo:

Rethinking Transformer-Based Blind-Spot Network for Self-Supervised Image Denoising. 4788-4796 - Ke Li, Di Wang, Zhangyuan Hu, Shaofeng Li, Weiping Ni, Lin Zhao, Quan Wang:

FD2-Net: Frequency-Driven Feature Decomposition Network for Infrared-Visible Object Detection. 4797-4805 - Ke Li, Gengyu Lyu, Hao Chen, Bochen Xie

, Zhen Yang, Youfu Li
, Yongjian Deng
:
Know Where You Are From: Event-Based Segmentation via Spatio-Temporal Propagation. 4806-4814 - Kun Li

, Dan Guo, Guoliang Chen, Chunxiao Fan, Jingyuan Xu, Zhiliang Wu, Hehe Fan, Meng Wang:
Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition. 4815-4823 - Kunxi Li, Tianyu Zhan

, Kairui Fu, Shengyu Zhang, Kun Kuang, Jiwei Li, Zhou Zhao, Fan Wu, Fei Wu:
MergeNet: Knowledge Migration Across Heterogeneous Models, Tasks, and Modalities. 4824-4832 - Ling Li, Ruiwen Gu, Chongyang Wang, Junliang Xing, Xinchun Yu, Xiao-Ping Zhang:

Multi-View 3D Human Pose Estimation with Weakly Synchronized Images. 4833-4841 - Maodong Li, Chao Zheng, Jian Wang, Bing Li:

Similar Modality Enhancement and Action Consistency Learning for Weakly Supervised Temporal Action Localization. 4842-4850 - Peize Li, Qingyi Si, Peng Fu, Zheng Lin, Yan Wang:

Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering. 4851-4859 - Pengna Li, Kangyi Wu, Jingwen Fu, Sanping Zhou:

REGNav: Room Expert Guided Image-Goal Navigation. 4860-4868 - Pu Li, Wenhao Zhang

, Jianwei Guo, Jinglu Chen, Dong-Ming Yan:
Revisiting CAD Model Generation by Learning Raster Sketch. 4869-4877 - Qiang Li, Di Liu, Jun Kong, Sen Li, Hui Xu, Jianzhong Wang:

Temporal Action Localization with Cross Layer Task Decoupling and Refinement. 4878-4886 - Rong Li, Liang Li, Jiehua Zhang, Qiang Zhao, Hongkui Wang, Chenggang Yan:

Region-aware Difference Distilling with Attribute-guided Contrastive Regularization for Change Captioning. 4887-4895 - Ruihang Li, Tao Li, Shanding Ye, Kaikai Xiao, Huangnan Zheng, Zhe Yin, Zhijie Pan:

Enhancing Generalizability via Utilization of Unlabeled Data for Occupancy Perception. 4896-4904 - Ruihuang Li, Liyi Chen, Zhengqiang Zhang

, Varun Jampani, Vishal M. Patel, Lei Zhang:
SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing. 4905-4913 - Ruoran Li, Runzhao Yang, Wenxin Xiang, Yuxiao Cheng, Tingxiong Xiao, Lu Yang, Jinli Suo:

A Compact Implicit Neural Representation for Efficient Storage of Massive 4D Functional Magnetic Resonance Imaging. 4914-4922 - Shijie Li, Weijun Lin, Qingyuan Xiang, Yunbin Tu, Shitan Asu, Zheng Li:

Unsupervised Photometric-Consistent Depth Estimation from Endoscopic Monocular Video. 4923-4931 - Shiyu Li, Pengxu Wei, Pengchong Qiao, Chang Liu, Jie Chen:

DigitalLLaVA: Incorporating Digital Cognition Capability for Physical World Comprehension in Multimodal LLMs. 4932-4940 - Teng Li, Xingjun Ma, Yu-Gang Jiang:

AIM: Additional Image Guided Generation of Transferable Adversarial Attacks. 4941-4949 - Tengpeng Li, Hanli Wang, Xianfei Li, Wenlong Liao, Tao He, Pai Peng:

Generative Planning with 3D-Vision Language Pre-training for End-to-End Autonomous Driving. 4950-4958 - Wenrui Li, Zhe Yang, Wei Han, Hengyu Man, Xingtao Wang, Xiaopeng Fan

:
Hyperbolic-Constraint Point Cloud Reconstruction from Single RGB-D Images. 4959-4967 - Wenxue Li

, Lie Ju, Feilong Tang, Peng Xia
, Xinyu Xiong, Ming Hu, Lei Zhu, Zongyuan Ge:
Towards Realistic Semi-supervised Medical Image Classification. 4968-4976 - Wenyun Li, Zheng Zhang, Xiangyuan Lan, Dongmei Jiang:

Transferable Adversarial Face Attack with Text Controlled Attribute. 4977-4985 - Xiaohai Li, Bineng Zhong, Qihua Liang, Guorong Li, Zhiyi Mo, Shuxiang Song:

MambaLCT: Boosting Tracking via Long-term Context State Space Model. 4986-4994 - Xinzhe Li, Jiahui Zhan, Shengfeng He

, Yangyang Xu, Junyu Dong, Huaidong Zhang, Yong Du:
PersonaMagic: Stage-Regulated High-Fidelity Face Customization with Tandem Equilibrium. 4995-5003 - Xudong Li, Yan Zhang, Yunhang Shen, Ke Li, Runze Hu, Xiawu Zheng, Sicheng Zhao:

Feature Denoising Diffusion Model for Blind Image Quality Assessment. 5004-5012 - Xueyang Li, Yunzhong Lou, Yu Song, Xiangdong Zhou:

Mamba-CAD: State Space Model for 3D Computer-Aided Design Generative Modeling. 5013-5021 - Yachao Li, Dong Liang, Tianyu Ding, Sheng-Jun Huang:

StructSR: Refuse Spurious Details in Real-World Image Super-Resolution. 5022-5030 - Yaowei Li, Xintao Wang, Zhaoyang Zhang, Zhouxia Wang, Ziyang Yuan, Liangbin Xie, Ying Shan, Yuexian Zou:

Image Conductor: Precision Control for Interactive Video Synthesis. 5031-5038 - Yayuan Li, Jintao Guo, Lei Qi, Wenbin Li, Yinghuan Shi:

Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP. 5039-5047 - Yiheng Li

, Yang Yang, Zhen Lei:
RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object Detection. 5048-5056 - Yihui Li, Chengxin Lv, Hongyu Yang, Di Huang:

Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images. 5057-5065 - Yinghui Li, Qianyu Zhou, Jingyu Gong, Ye Zhu, Richard Dazeley, Xinkui Zhao, Xuequan Lu:

DAPoinTr: Domain Adaptive Point Transformer for Point Cloud Completion. 5066-5074 - Zhangbin Li, Jinxing Zhou, Jing Zhang, Shengeng Tang, Kun Li, Dan Guo:

Patch-level Sounding Object Tracking for Audio-Visual Question Answering. 5075-5083 - Zhangheng Li, Tianlong Chen, Linyi Li, Bo Li, Zhangyang Wang:

Sparse Transfer Learning Accelerates and Enhances Certified Robustness: A Comprehensive Study. 5084-5091 - Zhuoyuan Li, Yubo Ai, Jiahao Lu, Chuxin Wang, Jiacheng Deng, Hanzhi Chang, Yanzhe Liang, Wenfei Yang, Shifeng Zhang, Tianzhu Zhang:

Pamba: Enhancing Global Interaction in Point Clouds via State Space Model. 5092-5100 - Zixu Li, Zhiwei Chen, Haokun Wen

, Zhiheng Fu, Yupeng Hu, Weili Guan:
ENCODER: Entity Mining and Modification Relation Binding for Composed Image Retrieval. 5101-5109 - Zonglin Li, Xiaoqian Lv, Qinglin Liu, Quanling Meng, Xin Sun, Shengping Zhang:

ProsodyTalker: 3D Visual Speech Animation via Prosody Decomposition. 5110-5118 - Zongyi Li, Jianbo Li, Yuxuan Shi, Jiazhong Chen, Shijuan Huang, Linnan Tu, Fei Shen, Hefei Ling:

Exploring the Potential of Large Vision-Language Models for Unsupervised Text-Based Person Retrieval. 5119-5127 - Baoyu Liang

, Qile Su, Shoutai Zhu, Yuchen Liang, Chao Tong:
VidEvent: A Large Dataset for Understanding Dynamic Evolution of Events in Videos. 5128-5136 - Guoyan Liang, Qin Zhou, Zhe Wang, Jingyuan Chen, Lin Gu

, Chang Yao, Sai Wu, Bingcang Huang, Kai Chen:
Semantic-guided Masked Mutual Learning for Multi-modal Brain Tumor Segmentation with Arbitrary Missing Modalities. 5137-5145 - Hanzhe Liang

, Guoyang Xie, Chengbin Hou
, Bingshu Wang, Can Gao, Jinbao Wang:
Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly Detection. 5146-5154 - Li Liang

, Naveed Akhtar, Jordan Vice
, Xiangrui Kong
, Ajmal Saeed Mian
:
Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion. 5155-5163 - Yiyuan Liang, Zhiying Yan, Liqun Chen, Jiahuan Zhou, Luxin Yan, Sheng Zhong, Xu Zou:

DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes. 5164-5172 - Zixi Liang, Guowei Xu, Haifeng Wu, Ye Huang, Wen Li, Lixin Duan:

S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field. 5173-5181 - Bencheng Liao, Xinggang Wang, Lianghui Zhu, Qian Zhang, Chang Huang:

ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention. 5182-5190 - Dongping Liao, Xitong Gao, Yabo Xu, Cheng-Zhong Xu:

Progressive Distribution Matching for Federated Semi-Supervised Learning. 5191-5199 - Sangbeom Lim, Seongchan Kim, Seungjun An, Seokju Cho, Paul Hongsuck Seo, Seungryong Kim:

Multi-Granularity Video Object Segmentation. 5200-5208 - Beibei Lin, Yeying Jin, Wending Yan, Wei Ye, Yuan Yuan, Robby T. Tan:

NightHaze: Nighttime Image Dehazing via Self-Prior Learning. 5209-5217 - Ente Lin, Xujie Zhang, Fuwei Zhao, Yuxuan Luo, Xin Dong, Long Zeng, Xiaodan Liang:

DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder. 5218-5226 - Guixu Lin, Muyao Niu, Qingtian Zhu, Zhengwei Yin, Zhuoxiao Li, Shengfeng He

, Yinqiang Zheng:
Adversarial Attacks on Event-Based Pedestrian Detectors: A Physical Approach. 5227-5235 - Jiaqi Lin, Zhihao Li, Binxiao Huang, Xiao Tang, Jianzhuang Liu, Shiyong Liu, Xiaofei Wu, Fenglong Song, Wenming Yang:

Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting. 5236-5244 - Jiayi Lin, Jiabo Huang, Jian Hu, Shaogang Gong:

InvSeg: Test-Time Prompt Inversion for Semantic Segmentation. 5245-5253 - Jiaying Lin

, Yuen Hei Yeung, Shuquan Ye
, Rynson W. H. Lau:
Leveraging RGB-D Data with Cross-Modal Context Mining for Glass Surface Detection. 5254-5261 - Kaiqing Lin, Yuzhen Lin, Weixiang Li

, Taiping Yao, Bin Li:
Standing on the Shoulders of Giants: Reprogramming Visual-Language Model for General Deepfake Detection. 5262-5270 - Min Lin, Gangwei Xu, Yun Wang, Xianqi Wang, Xin Yang:

FlowMamba: Learning Point Cloud Scene Flow with Global Motion Propagation. 5271-5279 - Pei Lin:

HandDiffuse: Generative Controllers for Two-Hand Interactions via Diffusion Models. 5280-5288 - Yangkai Lin, Jiabao Lei, Kui Jia:

Multi-StyleGS: Stylized Gaussian Splatting with Multiple Styles. 5289-5297 - Yiheng Lin, Yihan Hu, Chenyi Zhang, Ting Liu, Xiaochao Qu, Luoqi Liu, Yao Zhao, Yunchao Wei:

Memory Efficient Matting with Adaptive Token Routing. 5298-5306 - Yunlong Lin, Tian Ye, Sixiang Chen, Zhenqi Fu, Yingying Wang, Wenhao Chai, Zhaohu Xing, Wenxue Li

, Lei Zhu, Xinghao Ding:
AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement. 5307-5315 - Yunlong Lin, Zhenqi Fu, Kairun Wen, Tian Ye, Sixiang Chen, Ge Meng, Yingying Wang, Chui Kong, Yue Huang, Xiaotong Tu, Xinghao Ding:

DPLUT: Unsupervised Low-light Image Enhancement with Lookup Tables and Diffusion Priors. 5316-5324 - Yuxin Lin

, Wei Wang, Xiaoling Luo, Zhihao Wu, Chengliang Liu, Jie Wen, Yong Xu:
Deep Hierarchies and Invariant Disease-Indicative Feature Learning for Computer Aided Diagnosis of Multiple Fundus Diseases. 5325-5333 - Zhihang Lin, Mingbao Lin, Luxi Lin, Rongrong Ji:

Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference. 5334-5342 - Peng Ling, Tiao Tan

, Jiaqi Lin, Wenming Yang:
SOVGaussian: Sparse-View 3D Gaussian Splatting for Open-Vocabulary Scene Understanding. 5343-5351 - Baolong Liu, Ruiqing Yang, Roukai Huang, Wenhao Xu, Xin Pan, Chuanhuang Li, Bin Wang, Xun Wang, Jianfeng Dong:

Towards Ship License Plate Recognition in the Wild: A Large Benchmark and Strong Baseline. 5352-5360 - Chengzhi Liu, Zile Huang, Zhe Chen, Feilong Tang, Yu Tian, Zhongxing Xu, Zihong Luo, Yalin Zheng

, Yanda Meng
:
Incomplete Modality Disentangled Representation for Ophthalmic Disease Grading and Diagnosis. 5361-5369 - Chuang Liu, Yichao Cao, YingYing Zhang, Xiu Su, Haogang Zhu:

Perturbating, Tuning, and Collaborating: Harnessing Vision Foundation Models for Single Domain Generalization on Medical Imaging. 5370-5378 - Decheng Liu, Zongqi Wang, Chunlei Peng, Nannan Wang, Ruimin Hu, Xinbo Gao:

Thinking Racial Bias in Fair Forgery Detection: Models, Datasets and Evaluations. 5379-5387 - Delong Liu, Zhaohui Hou, Mingjie Zhan, Shihao Han, Zhicheng Zhao

, Fei Su:
UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer. 5388-5396 - Dunqiang Liu, Shujun Huang, Wen Li, Siqi Shen, Cheng Wang:

Text to Point Cloud Localization with Multi-Level Negative Contrastive Learning. 5397-5405 - Duo Liu, Yiqi Shi, Guoyin Zhang, Sizhao Li, Liguo Zhang:

Zero-Shot Noise2Mean: Gap Minimization for Efficient Denoising from a Single Noisy Image. 5406-5414 - Fan Liu, Wenwen Cai, Jian Huo, Chuanyi Zhang, Delong Chen, Jun Zhou:

Making Large Vision Language Models to Be Good Few-Shot Learners. 5415-5423 - Gaofeng Liu, Zhiyuan Ma, Tao Fang:

DreamAlign: Dynamic Text-to-3D Optimization with Human Preference Alignment. 5424-5432 - Han Liu, Yuanyuan Wang, Xiaotong Zhang, Feng Zhang,



Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID