


CVPR 2024: Seattle, WA, USA
- IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024. IEEE 2024, ISBN 979-8-3503-5300-6
- Saurabh Saini, P. J. Narayanan:
Specularity Factorization for Low-Light Enhancement. 1-12 - Yuyi Liu, Xinhang Song, Weijie Li, Xiaohan Wang, Shuqiang Jiang:
A Category Agnostic Model for Visual Rearrangement. 1-10 - Yixuan Zhu, Wenliang Zhao, Ao Li, Yansong Tang, Jie Zhou, Jiwen Lu:
FlowIE: Efficient Image Enhancement via Rectified Flow. 13-22 - Guoqiang Liang, Kanghao Chen, Hangyu Li, Yunfan Lu, Lin Wang:
Towards Robust Event-guided Low-Light Image Enhancement: A Large-Scale Real-World Event-Image Dataset and Novel Approach. 23-33 - Zhilin Huang, Quanmin Liang, Yijie Yu, Chujun Qin, Xiawu Zheng, Kai Huang, Zikun Zhou, Wenming Yang:
Bilateral Event Mining and Complementary for Event Stream Super-Resolution. 34-43 - Geunhyuk Youk, Jihyong Oh, Munchurl Kim:
FMA-Net: Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring. 44-55 - Yuan Dong, Qi Zuo, Xiaodong Gu, Weihao Yuan, Zhengyi Zhao, Zilong Dong, Liefeng Bo, Qixing Huang:
GPLD3D: Latent Diffusion of 3D Shape Generative Models by Enforcing Geometric and Physical Priors. 56-66 - Daichi Horita, Naoto Inoue, Kotaro Kikuchi, Kota Yamaguchi, Kiyoharu Aizawa:
Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation. 67-76 - Dor Verbin, Ben Mildenhall, Peter Hedman, Jonathan T. Barron, Todd E. Zickler, Pratul P. Srinivasan:
Eclipse: Disambiguating Illumination and Materials Using Unintended Shadows. 77-86 - Bailey Miller, Hanyu Chen, Alice Lai, Ioannis Gkioulekas:
Objects as Volumes: A Stochastic Geometry View of Opaque Solids. 87-97 - Pakkapon Phongthawee, Worameth Chinchuthakun, Nontaphat Sinsunthithet, Varun Jampani, Amit Raj, Pramook Khungurn, Supasorn Suwajanakorn:
DiffusionLight: Light Probes for Free by Painting a Chrome Ball. 98-108 - Zeren Jiang, Chen Guo, Manuel Kaufmann, Tianjian Jiang, Julien Valentin, Otmar Hilliges, Jie Song:
MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild. 109-118 - Zhaoxi Chen, Gyeongsik Moon, Kaiwen Guo, Chen Cao, Stanislav Pidhorskyi, Tomas Simon, Rohan Joshi, Yuan Dong, Yichen Xu, Bernardo Pires, He Wen, Lucas Evans, Bo Peng, Julia Buffalini, Autumn Trimble, Kevyn McPhail, Melissa Schoeller, Shoou-I Yu, Javier Romero, Michael Zollhöfer, Yaser Sheikh, Ziwei Liu, Shunsuke Saito:
URHand: Universal Relightable Hands. 119-129 - Shunsuke Saito, Gabriel Schwartz, Tomas Simon, Junxuan Li, Giljoo Nam:
Relightable Gaussian Codec Avatars. 130-141 - Xiaoyu Zhan, Jianxin Yang, Yuanqi Li, Jie Guo, Yanwen Guo, Wenping Wang:
Semantic Human Mesh Reconstruction with Textures. 142-152 - Han Feng, Wenchao Ma, Quankai Gao, Xianwei Zheng, Nan Xue, Huijuan Xu:
Stratified Avatar Generation from Sparse Observations. 153-163 - Haidong Zhu, Pranav Budhwant, Zhaoheng Zheng, Ram Nevatia:
SEAS: ShapE-Aligned Supervision for Person Re-Identification. 164-174 - Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Xuequan Lu, Shouhong Ding, Lizhuang Ma:
Test-Time Domain Generalization for Face Anti-Spoofing. 175-187 - Binh Minh Le, Simon S. Woo:
Gradient Alignment for Cross-Domain Face Anti-Spoofing. 188-199 - Dingqiang Ye, Chao Fan, Jingzhe Ma, Xiaoming Liu, Shiqi Yu:
BigGait: Learning Gait Representation You Want by Large Vision Models. 200-210 - Xun Lin, Shuai Wang, Rizhao Cai, Yizhong Liu, Ying Fu, Wenzhong Tang, Zitong Yu, Alex C. Kot:
Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing. 211-221 - Ajian Liu, Shuai Xue, Jianwen Gan, Jun Wan, Yanyan Liang, Jiankang Deng, Sergio Escalera, Zhen Lei:
CFPL-FAS: Class Free Prompt Learning for Generalizable Face Anti-Spoofing. 222-232 - Ruijie Quan, Wenguan Wang, Zhibo Tian, Fan Ma, Yi Yang:
Psychometry: An Omnifit Model for Image Reconstruction from Human Brain Activity. 233-243 - Minchul Kim, Yiyang Su, Feng Liu, Anil Jain, Xiaoming Liu:
KeyPoint Relative Position Encoding for Face Recognition. 244-255 - Feng Liu, Minchul Kim, Zhiyuan Ren, Xiaoming Liu:
Distilling CLIP with Dual Guidance for Learning Discriminative Human Body Shape Representation. 256-266 - Leslie Ching Ow Tiong, Dick Sigmund, Chen-Hui Chan, Andrew Beng Jin Teoh:
Flexible Biometrics Recognition: Bridging the Multimodality Gap Through Attention, Alignment and Prompt Tuning. 267-276 - Pei-Kai Huang, Cheng-Hsuan Chiang, Tzu-Hsien Chen, Jun-Xiong Chong, Tyng-Luh Liu, Chiou-Ting Hsu:
One-Class Face Anti-Spoofing via Spoof Cue Map-Guided Feature Learning. 277-286 - Shehreen Azad, Yogesh Singh Rawat:
Activity-Biometrics: Person Identification from Daily Activities. 287-296 - Yuxi Mi, Zhizhou Zhong, Yuge Huang, Jiazhen Ji, Jianqing Xu, Jun Wang, Shaoming Wang, Shouhong Ding, Shuigeng Zhou:
Privacy-Preserving Face Recognition Using Trainable Feature Subtraction. 297-307 - Xin Juan, Kaixiong Zhou, Ninghao Liu, Tianlong Chen, Xin Wang:
Molecular Data Programming: Towards Molecule Pseudo-labeling with Systematic Weak Supervision. 308-318 - Ruijie Quan, Wenguan Wang, Fan Ma, Hehe Fan, Yi Yang:
Clustering for Protein Representation Learning. 319-329 - Nathan Mankovich, Gustau Camps-Valls, Tolga Birdal:
Fun with Flags: Robust Principal Directions via Flag Manifolds. 330-340 - Shunsuke Yasuki, Masato Taki:
CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object Localization Perspective. 341-351 - Tsu-Ching Hsiao, Hao-Wei Chen, Hsuan-Kung Yang, Chun-Yi Lee:
Confronting Ambiguity in 6D Object Pose Estimation via Score-Based Diffusion on SE(3). 352-362 - Wooseong Jeong, Kuk-Jin Yoon:
Quantifying Task Priority for Multi-Task Optimization. 363-372 - Chaehyeon Song, Jaeho Shin, Myung-Hwan Jeon, Jongwoo Lim, Ayoung Kim:
Unbiased Estimator for Distorted Conics in Camera Calibration. 373-381 - Xinzhe Wang, Kang Ma, Qiankun Liu, Yunhao Zou, Ying Fu:
Multi-Object Tracking in the Dark. 382-392 - Kaijie Ren, Lei Zhang:
Implicit Discriminative Knowledge Learning for Visible-Infrared Person Re-Identification. 393-402 - Javier Tirado-Garín, Javier Civera:
From Correspondences to Pose: Non-Minimal Certifiably Optimal Relative Pose Without Disambiguation. 403-412 - Hemanth Saratchandran, Sameera Ramasinghe, Simon Lucey:
From Activation to Initialization: Scaling Insights for Optimizing Neural Fields. 413-422 - Ammar Ali, Georgii Gaikov, Denis Rybalchenko, Alexander Chigorin, Ivan Laptev, Sergey Zagoruyko:
PairDETR : Joint Detection and Association of Human Bodies and Faces. 423-432 - Zan Wang, Yixin Chen, Baoxiong Jia, Puhao Li, Jinlu Zhang, Jingze Zhang, Tengyu Liu, Yixin Zhu, Wei Liang, Siyuan Huang:
Move as you Say, Interact as you can: Language-Guided Human Motion Generation with Scene Affordance. 433-444 - Xinyu Zhan, Lixin Yang, Yifei Zhao, Kangrui Mao, Hanlin Xu, Zenan Lin, Kailin Li, Cewu Lu:
OakInk2 : A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion. 445-456 - Germán Barquero, Sergio Escalera, Cristina Palmero:
Seamless Human Motion Composition with Blended Positional Encodings. 457-469 - Liao Wang, Kaixin Yao, Chengcheng Guo, Zhirui Zhang, Qiang Hu, Jingyi Yu, Lan Xu, Minye Wu:
VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams. 470-481 - Han Liang, Jiacheng Bao, Ruichi Zhang, Sihan Ren, Yuecheng Xu, Sibei Yang, Xin Chen, Jingyi Yu, Lan Xu:
OMG: Towards Open-vocabulary Motion Generation via Mixture of Controllers. 482-493 - Zicong Fan, Maria Parelli, Maria Eleni Kadoglou, Xu Chen, Muhammed Kocabas, Michael J. Black, Otmar Hilliges:
HOLD: Category-Agnostic 3D Reconstruction of Interacting Hands and Objects from Video. 494-504 - Muhammed Kocabas, Jen-Hao Rick Chang, James Gabriel, Oncel Tuzel, Anurag Ranjan:
HUGS: Human Gaussian Splats. 505-515 - Juze Zhang, Jingyan Zhang, Zining Song, Zhanhe Shi, Chengfeng Zhao, Ye Shi, Jingyi Yu, Lan Xu, Jingya Wang:
HOI-M3: Capture Multiple Humans and Objects Interaction within Contextual Environment. 516-526 - Jihyun Lee, Shunsuke Saito, Giljoo Nam, Minhyuk Sung, Tae-Kyun Kim:
InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion. 527-537 - Hsuan-I Ho, Jie Song, Otmar Hilliges:
SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion. 538-549 - Wenbo Wang, Hsuan-I Ho, Chen Guo, Boxiang Rong, Artur Grigorev, Jie Song, Juan Jose Zarate, Otmar Hilliges:
4D-DRESS: A 4D Dataset of Real-World Human Clothing with Semantic Annotations. 550-560 - Jinglin Xu, Yijie Guo, Yuxin Peng:
FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models. 561-570 - Zhengyi Luo, Jinkun Cao, Rawal Khirodkar, Alexander Winkler, Jing Huang, Kris Kitani, Weipeng Xu:
Real-Time Simulated Avatar from Head-Mounted Sensors. 571-581 - Zhongang Cai, Jianping Jiang, Zhongfei Qing, Xinying Guo, Mingyuan Zhang, Zhengyu Lin, Haiyi Mei, Chen Wei, Ruisi Wang, Wanqi Yin, Liang Pan, Xiangyu Fan, Han Du, Peng Gao, Zhitao Yang, Yang Gao, Jiaqi Li, Tianxiang Ren, Yukun Wei, Xiaogang Wang, Chen Change Loy, Lei Yang, Ziwei Liu:
Digital Life Project: Autonomous 3D Characters with Social Intelligence. 582-592 - Kang Ma, Ying Fu, Chunshui Cao, Saihui Hou, Yongzhen Huang, Dezhi Zheng:
Learning Visual Prompt for Gait Recognition. 593-603 - Wenhao Li, Mengyuan Liu, Hong Liu, Pichao Wang, Jialun Cai, Nicu Sebe:
Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation. 604-613 - Dongkai Wang, Shiyu Xuan, Shiliang Zhang:
LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model. 614-623 - Dongkai Wang, Shiliang Zhang:
Spatial-Aware Regression for Keypoint Localization. 624-633 - Liangxiao Hu, Hongwen Zhang, Yuxiang Zhang, Boyao Zhou, Boning Liu, Shengping Zhang, Liqiang Nie:
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians. 634-644 - Mengcheng Li, Hongwen Zhang, Yuxiang Zhang, Ruizhi Shao, Tao Yu, Yebin Liu:
HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models. 645-654 - Qi Fang, Yinghui Fan, Yanjun Li, Junting Dong, Dingwei Wu, Weidong Zhang, Kang Chen:
Capturing Closely Interacted Two-Person Motions with Reaction Priors. 655-665 - Ziqiao Peng, Wentao Hu, Yue Shi, Xiangyu Zhu, Xiaomei Zhang, Hao Zhao, Jun He, Hongyan Liu, Zhaoxin Fan:
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis. 666-676 - Ruicong Liu, Takehiko Ohkawa, Mingfang Zhang, Yoichi Sato:
Single-to-Dual-View Adaptation for Egocentric 3D Hand Pose Estimation. 677-686 - Canyu Zhang, Youbao Tang, Ning Zhang, Ruei-Sung Lin, Mei Han, Jing Xiao, Song Wang:
Bidirectional Autoregressive Diffusion Model for Dance Generation. 687-696 - Yuxuan Han, Junfeng Lyu, Feng Xu:
High-Quality Facial Geometry and Appearance Capture at Home. 697-707 - Ziwei Liao, Jialiang Zhu, Chunyu Wang, Han Hu, Steven L. Waslander:
Multiple View Geometry Transformers for 3D Human Pose Estimation. 708-717 - Jingbo Wang, Zhengyi Luo, Ye Yuan, Yixuan Li, Bo Dai:
PACER+: On-Demand Pedestrian Animation Controller in Driving Scenarios. 718-728 - Chengfeng Zhao, Juze Zhang, Jiashen Du, Ziwei Shan, Junye Wang, Jingyi Yu, Jingya Wang, Lan Xu:
I'M HOI: Inertia-Aware Monocular Capture of 3D Human-Object Interactions. 729-741 - Xihe Yang, Xingyu Chen, Daiheng Gao, Shaohui Wang, Xiaoguang Han, Baoyuan Wang:
HAVE-FUN: Human Avatar Reconstruction from Few-Shot Unconstrained Images. 742-752 - Inhwan Bae, Junoh Lee, Hae-Gon Jeon:
Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction. 753-766 - Hiroyasu Akada, Jian Wang, Vladislav Golyanik, Christian Theobalt:
3D Human Pose Perception from Egocentric Stereo Videos. 767-776 - Jian Wang, Zhe Cao, Diogo C. Luvizon, Lingjie Liu, Kripasindhu Sarkar, Danhang Tang, Thabo Beeler, Christian Theobalt:
Egocentric Whole-Body Motion Capture with FisheyeViT and Diffusion-Based Motion Refinement. 777-787 - Arthur Moreau, Jifei Song, Helisa Dhamo, Richard Shaw, Yiren Zhou, Eduardo Pérez-Pellitero:
Human Gaussian Splatting: Real-Time Rendering of Animatable Avatars. 788-798 - Xiaozheng Zheng, Chao Wen, Zhuo Su, Zeran Xu, Zhaohu Li, Yang Zhao, Zhou Xue:
OHTA: One-shot Hand Avatar via Data-driven Implicit Priors. 799-810 - Wenfeng Song, Xinyu Zhang, Shuai Li, Yang Gao, Aimin Hao, Xia Hou, Chenglizhao Chen, Ning Li, Hong Qin:
HOIAnimator: Generating Text-Prompt Human-Object Animations Using Novel Perceptive Diffusion Models. 811-820 - Wenfeng Song, Xingliang Jin, Shuai Li, Chenglizhao Chen, Aimin Hao, Xia Hou, Ning Li, Hong Qin:
Arbitrary Motion Style Transfer with Multi-Condition Motion Latent Diffusion Model. 821-830 - Yan-Kang Wang, Chengyi Xing, Yi-Lin Wei, Xiao-Ming Wu, Wei-Shi Zheng:
Single-View Scene Point Cloud Human Grasp Generation. 831-841 - Taeho Kang, Youngki Lee:
Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting. 842-851 - Jieming Cui, Tengyu Liu, Nian Liu, Yaodong Yang, Yixin Zhu, Siyuan Huang:
AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents. 852-862 - Zekun Qian, Ruize Han, Wei Feng, Song Wang:
From a Bird's Eye View to See: Joint Camera and Subject Registration without the Camera Calibration. 863-873 - Peng Dai, Yang Zhang, Tao Liu, Zhen Fan, Tianyuan Du, Zhuo Su, Xiaozheng Zheng, Zeming Li:
HMD-Poser: On-Device Real-time Human Motion Tracking from Scalable Sparse Observations. 874-884 - Xingyu Ren, Jiankang Deng, Yuhao Cheng, Jia Guo, Chao Ma, Yichao Yan, Wenhan Zhu, Xiaokang Yang:
Monocular Identity-Conditioned Facial Reflectance Reconstruction. 885-895 - Ye Yuan, Xueting Li, Yangyi Huang, Shalini De Mello, Koki Nagano, Jan Kautz, Umar Iqbal:
GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning. 896-905 - Anastasis Stathopoulos, Ligong Han, Dimitris N. Metaxas:
Score-Guided Diffusion for 3D Human Recovery. 906-915 - Yuhao Cheng, Zhuo Chen, Xingyu Ren, Wenhan Zhu, Zhengqin Xu, Di Xu, Changpeng Yang, Yichao Yan:
3D-Aware Face Editing via Warping-Guided Latent Direction Learning. 916-926 - Markos Diomataris, Nikos Athanasiou, Omid Taheri, Xi Wang, Otmar Hilliges, Michael J. Black:
WANDR: Intention-guided Human Motion Generation. 927-936 - Qing Yu, Mikihiro Tanaka, Kent Fujiwara:
Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches. 937-946 - Nilesh Kulkarni, Davis Rempe, Kyle Genova, Abhijit Kundu, Justin Johnson, David Fouhey, Leonidas J. Guibas:
NIFTY: Neural Object Interaction Fields for Guided Human Motion Synthesis. 947-957 - Yukang Cao, Yan-Pei Cao, Kai Han, Ying Shan, Kwan-Yee K. Wong:
DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models. 958-968 - Kangwei Yan, Fei Wang, Bo Qian, Han Ding, Jinsong Han, Xing Wei:
Person-in-WiFi 3D: End-to-End Multi-Person 3D Pose Estimation with Wi-Fi. 969-978 - Yuan Xu, Xiaoxuan Ma, Jiajun Su, Wentao Zhu, Yu Qiao, Yizhou Wang:
ScoreHypo: Probabilistic Human Mesh Estimation with Hypothesis Scoring. 979-989 - Zhen Xu, Sida Peng, Chen Geng, Linzhan Mou, Zihan Yan, Jiaming Sun, Hujun Bao, Xiaowei Zhou:
Relightable and Animatable Neural Avatar from Sparse-View Video. 990-1000 - Evonne Ng, Javier Romero, Timur M. Bagautdinov, Shaojie Bai, Trevor Darrell, Angjoo Kanazawa, Alexander Richard:
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations. 1001-1010 - Buzhen Huang, Chen Li, Chongyang Xu, Liang Pan, Yangang Wang, Gim Hee Lee:
Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption. 1011-1021 - Jijie He, Wenwu Yang:
Video-Based Human Pose Regression via Decoupled Space-Time Aggregation. 1022-1031 - Chengyang Hu, Ke-Yue Zhang, Taiping Yao, Shouhong Ding, Lizhuang Ma:
Rethinking Generalizable Face Anti-Spoofing via Hierarchical Prototype-Guided Distribution Refinement in Hyperbolic Space. 1032-1041 - Xiaoning Sun, Huaijiang Sun, Bin Li, Dong Wei, Weiqing Li, Jianfeng Lu:
MoML: Online Meta Adaptation for 3D Human Motion Prediction. 1042-1051 - Fengyuan Yang, Kerui Gu, Angela Yao:
KITRO: Refining Human Mesh by 2D Clues and Kinematic-tree Rotation. 1052-1061 - Inhee Lee, Byungjun Kim, Hanbyul Joo:
Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses. 1062-1071 - Hyunsoo Cha, Byungjun Kim, Hanbyul Joo:
PEGASUS: Personalized Generative 3D Avatars with Composable Attributes. 1072-1081 - Sichen Chen, Yingyi Zhang, Siming Huang, Ran Yi, Ke Fan, Ruixin Zhang, Peixian Chen, Jun Wang, Shouhong Ding, Lizhuang Ma:
SDPose: Tokenized Pose Estimation via Circulation-Guide Self-Distillation. 1082-1090 - Jiye Lee, Hanbyul Joo:
Mocap Everyone Everywhere: Lightweight Motion Capture with Smartwatches and a Head-Mounted Camera. 1091-1100 - Yixuan Zhu, Ao Li, Yansong Tang, Wenliang Zhao, Jie Zhou, Jiwen Lu:
DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery. 1101-1110 - Jiapeng Tang, Angela Dai, Yinyu Nie, Lev Markhasin, Justus Thies, Matthias Nießner:
DPHMs: Diffusion Parametric Head Models for Depth-Based Tracking. 1111-1122 - Jihua Peng, Yanghong Zhou, P. Y. Mok:
KTPFormer: Kinematics and Trajectory Prior Knowledge-Enhanced Transformer for 3D Human Pose Estimation. 1123-1132 - Jongwook Choi, Taehoon Kim, Yonghyun Jeong, Seungryul Baek, Jongwon Choi:
Exploiting Style Latent Flows for Generalizing Deepfake Video Detection. 1133-1143 - Haiyang Liu, Zihao Zhu, Giorgio Becherini, Yichen Peng, Mingyang Su, You Zhou, Xuefei Zhe, Naoya Iwamoto, Bo Zheng, Michael J. Black:
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling. 1144-1154 - Yiteng Xu, Kecheng Ye, Xiao Han, Yiming Ren, Xinge Zhu, Yuexin Ma:
A Unified Framework for Human-centric Point Cloud Video Understanding. 1155-1164 - Haokai Pang, Heming Zhu, Adam Kortylewski, Christian Theobalt, Marc Habermann:
ASH: Animatable Gaussian Splats for Efficient and Photoreal Human Rendering. 1165-1175 - Andrey Davydov, Martin Engilberge, Mathieu Salzmann, Pascal Fua:
CLOAF: CoLlisiOn-Aware Human Flow. 1176-1185 - Christen Millerdurai, Hiroyasu Akada, Jian Wang, Diogo C. Luvizon, Christian Theobalt, Vladislav Golyanik:
EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams. 1186-1195 - Jakub Paplhám, Vojtech Franc:
A Call to Reflect on Evaluation Practices for Age Estimation: Comparative Analysis of the State-of-the-Art and a Unified Benchmark. 1196-1205 - Ashwath Shetty, Marc Habermann, Guoxing Sun, Diogo C. Luvizon, Vladislav Golyanik, Christian Theobalt:
Holoported Characters: Real-Time Free-Viewpoint Rendering of Humans from Sparse RGB Cameras. 1206-1215 - Yizhou Zhao, Tuanfeng Yang Wang, Bhiksha Raj, Min Xu, Jimei Yang, Chun-Hao Paul Huang:
Synergistic Global-Space Camera and Human Reconstruction from Videos. 1216-1226 - Felix Taubner, Prashant Raina, Mathieu Tuli, Eu Wern Teh, Chul Lee, Jinmiao Huang:
3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow. 1227-1237 - Mingyuan Zhou, Rakib Hyder, Ziwei Xuan, Guojun Qi:
UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures. 1238-1248 - Zhangsihao Yang, Mingyuan Zhou, Mengyi Shan, Bingbing Wen, Ziwei Xuan, Mitch Hill, Junjie Bai, Guo-Jun Qi, Yalin Wang:
OmniMotionGPT: Animal Motion Generation with Limited Data. 1249-1259 - Yunjie Wu, Yapeng Meng, Zhipeng Hu, Lincheng Li, Haoqian Wu, Kun Zhou, Weiwei Xu, Xin Yu:
Text-Guided 3D Face Synthesis - From Generation to Editing. 1260-1269 - Zihan Wang, Siyang Song, Cheng Luo, Songhe Deng, Weicheng Xie, Linlin Shen:
Multi-Scale Dynamic and Hierarchical Relationship Modeling for Facial Action Units Recognition. 1270-1280 - Yiming Ren, Xiao Han, Chengfeng Zhao, Jingya Wang, Lan Xu, Jingyi Yu, Yuexin Ma:
LiveHPS: LiDAR-Based Scene-Level Human Pose and Shape Estimation in Free Environment. 1281-1291 - Chao Xu, Yang Liu, Jiazheng Xing, Weida Wang, Mingze Sun, Jun Dan, Tianxin Huang, Siyuan Li, Zhi-Qi Cheng, Ying Tai, Baigui Sun:
FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio. 1292-1302 - Yuchen Pan, Junjun Jiang, Kui Jiang, Zhihao Wu, Keyuan Yu, Xianming Liu:
OpticalDR: A Deep Optical Imaging Model for Privacy-Protective Depression Recognition. 1303-1312 - Kejia Yin, Varshanth S. Rao, Ruowei Jiang, Xudong Liu, Parham Aarabi, David B. Lindell:
SCE-MAE: Selective Correspondence Enhancement with Masked Autoencoder for Self-Supervised Landmark Estimation. 1313-1322 - Sai Kumar Dwivedi, Yu Sun, Priyanka Patel, Yao Feng, Michael J. Black:
TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation. 1323-1333 - Korrawe Karunratanakul, Konpat Preechakul, Emre Aksan, Thabo Beeler, Supasorn Suwajanakorn, Siyu Tang:
Optimizing Diffusion Noise Can Serve As Universal Motion Priors. 1334-1345 - Luyang Zhu, Yingwei Li, Nan Liu, Hao Peng, Dawei Yang, Ira Kemelmacher-Shlizerman:
M&M VTO: Multi-Garment Virtual Try-On and Editing. 1346-1356 - Zixiang Zhou, Yu Wan, Baoyuan Wang:
AvatarGPT: All-in-One Framework for Motion Understanding, Planning, Generation and Beyond. 1357-1366 - Zhishan Zhou, Shihao Zhou, Zhi Lv, Minqiang Zou, Yao Tang, Jiajun Liang:
A Simple Baseline for Efficient Hand Mesh Reconstruction. 1367-1376 - Zhouyingcheng Liao, Vladislav Golyanik, Marc Habermann, Christian Theobalt:
VINECS: Video-based Neural Character Skinning. 1377-1387 - Muhammad Hamza Mughal, Rishabh Dabral, Ikhsanul Habibie, Lucia Donatelli, Marc Habermann, Christian Theobalt:
ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis. 1388-1398 - Hanchao Liu, Xiaohang Zhan, Shaoli Huang, Tai-Jiang Mu, Ying Shan:
Programmable Motion Generation for Open-Set Motion Control Tasks. 1399-1408 - Yiwei Bao, Feng Lu:
From Feature to Gaze: A Generalizable Replacement of Linear Layer for Gaze Estimation. 1409-1418 - Yiwei Bao, Feng Lu:
Unsupervised Gaze Representation Learning from Multi-view Face Images. 1419-1428 - Muxin Zhang, Qiao Feng, Zhuo Su, Chao Wen, Zhou Xue, Kun Li:
Joint2Human: High-quality 3D Human Generation via Compact Spherical Embedding of 3D Joints. 1429-1438 - Akash Sengupta, Thiemo Alldieck, Nikos Kolotouros, Enric Corona, Andrei Zanfir, Cristian Sminchisescu:
DiffHuman: Probabilistic Photorealistic 3D Reconstruction of Humans. 1439-1449 - Youliang Zhang, Wenxuan Liu, Danni Xu, Zhuo Zhou, Zheng Wang:
Bi-Causal: Group Activity Recognition via Bidirectional Causality. 1450-1459 - Caoyuan Ma, Yu-Lun Liu, Zhixiang Wang, Wu Liu, Xinchen Liu, Zheng Wang:
HumanNeRF-SE: A Simple yet Effective Approach to Animate HumanNeRF with Diverse Poses. 1460-1470 - Haoyang Ge, Qiao Feng, Hailong Jia, Xiongzheng Li, Xiangjun Yin, You Zhou, Jingyu Yang, Kun Li:
LPSNet: End-to-End Human Pose and Shape Estimation with Lensless Imaging. 1471-1480 - Zhongcong Xu, Jianfeng Zhang, Jun Hao Liew, Hanshu Yan, Jia-Wei Liu, Chenxu Zhang, Jiashi Feng, Mike Zheng Shou:
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model. 1481-1490 - Peng Lu, Tao Jiang, Yining Li, Xiangtai Li, Kai Chen, Wenming Yang:
RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation. 1491-1500 - Jiangbei Yue, Baiyi Li, Julien Pettré, Armin Seyfried, He Wang:
Human Motion Prediction Under Unexpected Perturbation. 1501-1511 - Matthieu Armando, Salma Galaaoui, Fabien Baradel, Thomas Lucas, Vincent Leroy, Romain Brégier, Philippe Weinzaepfel, Grégory Rogez:
Cross-View and Cross-Pose Completion for 3D Human Understanding. 1512-1523 - Ronghui Li, Yuxiang Zhang, Yachao Zhang, Hongwen Zhang, Jie Guo, Yan Zhang, Yebin Liu, Xiu Li:
Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives. 1524-1534 - Taeksoo Kim, Byungjun Kim, Shunsuke Saito, Hanbyul Joo:
GALA: Generating Animatable Layered Assets from a Single Scan. 1535-1545 - Ekkasit Pinyoanuntapong, Pu Wang, Minwoo Lee, Chen Chen:
MMM: Generative Masked Motion Model. 1546-1555 - Yihua Cheng, Yaning Zhu, Zongji Wang, Hongquan Hao, Yongwei Liu, Shiqing Cheng, Xi Wang, Hyung Jin Chang:
What Do You See in Vehicle? Comprehensive Vision Solution for In-Vehicle Gaze Estimation. 1556-1565 - Yifei Liu, Qiong Cao, Yandong Wen, Huaiguang Jiang, Changxing Ding:
Towards Variable and Coordinated Holistic Co-Speech Motion Generation. 1566-1576 - Junuk Cha, Jihyeon Kim, Jae Shin Yoon, Seungryul Baek:
Text2HOI: Text-Guided 3D Motion Generation for Hand-Object Interaction. 1577-1585 - Ren Li, Corentin Dumery, Benoît Guillard, Pascal Fua:
Garment Recovery with Shape and Deformation Priors. 1586-1595 - Kangning Yin, Shihao Zou, Yuxuan Ge, Zheng Tian:
Tri-Modal Motion Retrieval by Learning a Joint Embedding Space. 1596-1605 - Zhijing Shao, Zhaolong Wang, Zhuang Li, Duotun Wang, Xiangru Lin, Yu Zhang, Mingming Fan, Zeyu Wang:
SplattingAvatar: Realistic Real-Time Human Avatars With Mesh-Embedded Gaussian Splatting. 1606-1616 - Jiaqi Liao, Chuanchen Luo, Yinuo Du, Yuxi Wang, Xucheng Yin, Man Zhang, Zhaoxiang Zhang, Junran Peng:
HardMo: A Large-Scale Hardcase Dataset for Motion Capture. 1629-1638 - Zhonglin Sun, Chen Feng, Ioannis Patras, Georgios Tzimiropoulos:
LAFS: Landmark-Based Facial Self-Supervised Learning for Face Recognition. 1639-1649 - Hee Jae Kim, Eshed Ohn-Bar:
Motion Diversification Networks. 1650-1660 - Yannan He, Garvita Tiwari, Tolga Birdal, Jan Eric Lenssen, Gerard Pons-Moll:
NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors. 1661-1671 - Zidu Wang, Xiangyu Zhu, Tianshuo Zhang, Baiqin Wang, Zhen Lei:
3D Face Reconstruction with the Geometric Guidance of Facial Part Segmentation. 1672-1682 - Zhibo Yang, Sounak Mondal, Seoyoung Ahn, Ruoyu Xue, Gregory J. Zelinsky, Minh Hoai, Dimitris Samaras:
Unifying Top-Down and Bottom-Up Scanpath Prediction Using Transformers. 1683-1693 - Fu-Zhao Ou, Chongyi Li, Shiqi Wang, Sam Kwong:
CLIB-FIQA: Face Image Quality Assessment with Confidence Calibration. 1694-1704 - Boeun Kim, Jungho Kim, Hyung Jin Chang, Jin Young Choi:
MoST: Motion Style Transformer Between Diverse Action Contents. 1705-1714 - Yuxiao Liu, Zhe Li, Yebin Liu, Haoqian Wang:
TexVocab: Texture Vocabulary-Conditioned Human Avatars. 1715-1725 - Haitao Yan, Qiongjie Cui, Jiexin Xie, Shijie Guo:
Forecasting of 3D Whole-Body Human Poses with Grasping Objects. 1726-1736 - Nan Jiang, Zhiyuan Zhang, Hongjie Li, Xiaoxuan Ma, Zan Wang, Yixin Chen, Tengyu Liu, Yixin Zhu, Siyuan Huang:
Scaling Up Dynamic Human-Scene Interaction Modeling. 1737-1747 - Jiali Zheng, Rolandos Alexandros Potamias, Stefanos Zafeiriou:
Design2Cloth: 3D Cloth Generation from 2D Masks. 1748-1758 - Liang Xu, Yizhou Zhou, Yichao Yan, Xin Jin, Wenhan Zhu, Fengyun Rao, Xiaokang Yang, Wenjun Zeng:
ReGenNet: Towards Human Action-Reaction Synthesis. 1759-1769 - Abdallah Dib, Luiz Gustavo Hafemann, Emeline Got, Trevor Anderson, Amin Fadaeinejad, Rafael M. O. Cruz, Marc-André Carbonneau:
MoSAR: Monocular Semi-Supervised Model for Avatar Reconstruction using Differentiable Shading. 1770-1780 - David Ferman, Pablo Garrido, Gaurav Bharaj:
FaceLift: Semi-Supervised 3D Facial Landmark Localization. 1781-1791 - Shengxiang Hu, Huaijiang Sun, Bin Li, Dong Wei, Weiqing Li, Jianfeng Lu:
Fast Adaptation for Human Pose Estimation via Meta-Optimization. 1792-1801 - Jun Xiang, Xuan Gao, Yudong Guo, Juyong Zhang:
FlashAvatar: High-Fidelity Head Avatar with Efficient Gaussian Embedding. 1802-1812 - Tianyu Li, Calvin Qiao, Guanqiao Ren, KangKang Yin, Sehoon Ha:
AAMDM: Accelerated Auto-Regressive Motion Diffusion Model. 1813-1823 - Tao Wang, Lei Jin, Zheng Wang, Jianshu Li, Liang Li, Fang Zhao, Yu Cheng, Li Yuan, Li Zhou, Junliang Xing, Jian Zhao:
SynSP: Synergy of Smoothness and Precision in Pose Sequences Refinement. 1824-1833 - Qingping Sun, Yanjun Wang, Ailing Zeng, Wanqi Yin, Chen Wei, Wenjia Wang, Haiyi Mei, Chi-Sing Leung, Ziwei Liu, Lei Yang, Zhongang Cai:
AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation. 1834-1843 - Jingbo Zhang, Xiaoyu Li, Qi Zhang, Yanpei Cao, Ying Shan, Jing Liao:
HumanRef: Single Image to 3D Human Generation via Reference-Guided Diffusion. 1844-1854 - Zhi Cen, Huaijin Pi, Sida Peng, Zehong Shen, Minghui Yang, Shuai Zhu, Hujun Bao, Xiaowei Zhou:
Generating Human Motion in 3D Scenes from Text Descriptions. 1855-1866 - Michail Tarasiou, Rolandos Alexandros Potamias, Eimear O'Sullivan, Stylianos Ploumpis, Stefanos Zafeiriou:
Locally Adaptive Neural 3D Morphable Models. 1867-1876 - Shaofei Wang, Bozidar Antic, Andreas Geiger, Siyu Tang:
IntrinsicAvatar: Physically Based Inverse Rendering of Dynamic Humans from Monocular Videos via Explicit Ray Tracing. 1877-1888 - Yu Zhang, Songpengcheng Xia, Lei Chu, Jiarui Yang, Qi Wu, Ling Pei:
Dynamic Inertial Poser (DynaIP): Part-Based Motion Dynamics Learning for Enhanced Human Pose Estimation with Sparse Inertial Sensors. 1889-1899 - Chuan Guo, Yuxuan Mu, Muhammad Gohar Javed, Sen Wang, Li Cheng:
MoMask: Generative Masked Modeling of 3D Human Motions. 1900-1910 - Yufei Ye, Abhinav Gupta, Kris Kitani, Shubham Tulsiani:
G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis. 1911-1920 - Pengfei Ren, Yuanyuan Gao, Haifeng Sun, Qi Qi, Jingyu Wang, Jianxin Liao:
Dynamic Support Information Mining for Category-Agnostic Pose Estimation. 1921-1930 - Yuelang Xu, Bengwang Chen, Zhe Li, Hongwen Zhang, Lizhen Wang, Zerong Zheng, Yebin Liu:
Gaussian Head Avatar: Ultra High-Fidelity Head Avatar via Dynamic Gaussians. 1931-1941 - Kiran Chhatre, Radek Danecek, Nikos Athanasiou, Giorgio Becherini, Christopher Peters, Michael J. Black, Timo Bolkart:
Emotional Speech-Driven 3D Body Animation via Disentangled Latent Diffusion. 1942-1953 - Yuxiang Zhang, Hongwen Zhang, Liangxiao Hu, Jiajun Zhang, Hongwei Yi, Shengping Zhang, Yebin Liu:
ProxyCap: Real-Time Monocular Full-Body Capture in World Space via Human-Centric Proxy-to-Motion Learning. 1954-1964 - Roy Kapon, Guy Tevet, Daniel Cohen-Or, Amit H. Bermano:
MAS: Multi-view Ancestral Sampling for 3D Motion Generation Using 2D Diffusion. 1965-1974 - Ziqian Bai, Feitong Tan, Sean Fanello, Rohit Pandey, Mingsong Dou, Shichen Liu, Ping Tan, Yinda Zhang:
Efficient 3D Implicit Head Avatar With Mesh-Anchored Hash Table Blendshapes. 1975-1984 - Vasileios Baltatzis, Rolandos Alexandros Potamias, Evangelos Ververas, Guanxiong Sun, Jiankang Deng, Stefanos Zafeiriou:
Neural Sign Actors: A diffusion model for 3D sign language production from text. 1985-1995 - Xiang Deng, Zerong Zheng, Yuxiang Zhang, Jingxiang Sun, Chao Xu, Xiaodong Yang, Lizhen Wang, Yebin Liu:
RAM-Avatar: Real-time Photo-Realistic Avatar from Monocular Videos with Full-body Control. 1996-2007 - Samy Tafasca, Anshul Gupta, Jean-Marc Odobez:
Sharingan: A Transformer Architecture for Multi-Person Gaze Following. 2008-2017 - Yan Zhang, Sergey Prokudin, Marko Mihajlovic, Qianli Ma, Siyu Tang:
Degrees of Freedom Matter: Inferring Dynamics from Point Trajectories. 2018-2028 - Gyeongsik Moon, Weipeng Xu, Rohan Joshi, Chenglei Wu, Takaaki Shiratori:
Authentic Hand Avatar from a Phone Scan via Universal Hand Model. 2029-2038 - Nannan Li, Qing Liu, Krishna Kumar Singh, Yilin Wang, Jianming Zhang, Bryan A. Plummer, Zhe Lin:
UniHuman: A Unified Model For Editing Human Images in the Wild. 2039-2048 - Yuxuan Zhou, Xudong Yan, Zhi-Qi Cheng, Yan Yan, Qi Dai, Xian-Sheng Hua:
BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition. 2049-2058 - Jing Wen, Xiaoming Zhao, Zhongzheng Ren, Alexander G. Schwing, Shenlong Wang:
GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh. 2059-2069 - Soyong Shin, Juyong Kim, Eni Halilaj, Michael J. Black:
WHAM: Reconstructing World-Grounded Humans with Accurate 3D Motion. 2070-2080 - Zheng Gao, Ioannis Patras:
Self-Supervised Facial Representation Learning with Facial Region Awareness. 2081-2092 - Yao Feng, Jing Lin, Sai Kumar Dwivedi, Yu Sun, Priyanka Patel, Michael J. Black:
ChatPose: Chatting about 3D Human Pose. 2093-2103 - Shiwei Jin, Zhen Wang, Lei Wang, Peng Liu, Ning Bi, Truong Nguyen:
AUEditNet: Dual-Branch Facial Action Unit Intensity Manipulation with Implicit Disentanglement. 2104-2113 - Renshuai Liu, Bowen Ma, Wei Zhang, Zhipeng Hu, Changjie Fan, Tangjie Lv, Yu Ding, Xuan Cheng:
Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation. 2114-2123 - Yanlu Cai, Weizhong Zhang, Yuan Wu, Cheng Jin:
PoseIRM: Enhance 3D Human Pose Estimation on Unseen Camera Settings via Invariant Risk Minimization. 2124-2133 - Haipeng Chen, Kedi Lyu, Zhenguang Liu, Yifang Yin, Xun Yang, Yingda Lyu:
Rethinking Human Motion Prediction with Symplectic Integral. 2134-2143 - Zhenyu Lou, Qiongjie Cui, Haofan Wang, Xu Tang, Hong Zhou:
Multimodal Sense-Informed Forecasting of 3D Human Motions. 2144-2154 - Haodong Zhang, Zhike Chen, Haocheng Xu, Lei Hao, Xiaofei Wu, Songcen Xu, Zhensong Zhang, Yue Wang, Rong Xiong:
Semantics-Aware Motion Retargeting with Vision-Language Models. 2155-2164 - Xingchao Yang, Takafumi Taketomi, Yuki Endo, Yoshihiro Kanamori:
Makeup Prior Models for 3D Facial Makeup Estimation and Applications. 2165-2175 - Yinglong Li, Hongyu Wu, Xiaogang Wang, Qingzhao Qin, Yijiao Zhao, Yong Wang, Aimin Hao:
FaceCom: Towards High-fidelity 3D Facial Shape Completion via Optimization and Inpainting Guidance. 2177-2186 - Xiaoming Li, Xinyu Hou, Chen Change Loy:
When StyleGAN Meets Stable Diffusion: a $\mathcal{W}_{+}$ Adapter for Personalized Image Generation. 2187-2196 - Chandradeep Pokhariya, Ishaan Nikhil Shah, Angela Xing, Zekun Li, Kefan Chen, Avinash Sharma, Srinath Sridhar:
MANUS: Markerless Grasp Capture Using Articulated 3D Gaussians. 2197-2208 - Chengxu Zuo, Yiming Wang, Lishuang Zhan, Shihui Guo, Xinyu Yi, Feng Xu, Yipeng Qin:
Loose Inertial Poser: Motion Capture with IMU-attached Loose-Wear Jacket. 2209-2219 - Prashanth Chandran, Gaspard Zoss:
Anatomically Constrained Implicit Face Models. 2220-2229 - Dayi Tan, Hansheng Chen, Wei Tian, Lu Xiong:
DiffusionRegPose: Enhancing Multi-Person Pose Estimation Using a Diffusion-Based End-to-End Regression Approach. 2230-2239 - Qucheng Peng, Ce Zheng, Chen Chen:
A Dual-Augmentor Framework for Domain Generalization in 3D Human Pose Estimation. 2240-2249 - Ming Yan, Yan Zhang, Shuqiang Cai, Shuqi Fan, Xincheng Lin, Yudi Dai, Siqi Shen, Chenglu Wen, Lan Xu, Yuexin Ma, Cheng Wang:
RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method. 2250-2262 - Xu He, Qiaochu Huang, Zhensong Zhang, Zhiwei Lin, Zhiyong Wu, Sicheng Yang, Minglei Li, Zhiyi Chen, Songcen Xu, Xiaofei Wu:
Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model. 2263-2273 - Wencan Cheng, Hao Tang, Luc Van Gool, Jong Hwan Ko:
HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud. 2274-2284 - Olaf Dünkel, Tim Salzmann, Florian Pfaff:
Normalizing Flows on the Product Space of SO(3) Manifolds for Probabilistic Human Pose Modeling. 2285-2294 - Haoyu Chen, Hao Tang, Ehsan Adeli, Guoying Zhao:
Towards Robust 3D Pose Transfer with Adversarial Learning. 2295-2304 - Yufei Zhang, Jeffrey O. Kephart, Zijun Cui, Qiang Ji:
PhysPT: Physics-aware Pretrained Transformer for Estimating Human Dynamics from Monocular Videos. 2305-2317 - Arnab Kumar Mondal, Stefano Alletto, Denis Tomè:
HumMUSS: Human Motion Understanding Using State Space Models. 2318-2330 - Nicolas Ugrinovic, Boxiao Pan, Georgios Pavlakos, Despoina Paschalidou, Bokui Shen, Jordi Sanchez-Riera, Francesc Moreno-Noguer, Leonidas J. Guibas:
MultiPhys: Multi-Person Physics-Aware 3D Motion Estimation. 2331-2340 - Haowen Luo, Yunze Liu, Li Yi:
Physics-Aware Hand-Object Interaction Denoising. 2341-2350 - Supreeth Narasimhaswamy, Huy Anh Nguyen, Lihan Huang, Minh Hoai:
HOIST-Former: Hand-Held Objects Identification, Segmentation, and Tracking in the Wild. 2351-2361 - Soubhik Sanyal, Partha Ghosh, Jinlong Yang, Michael J. Black, Justus Thies, Timo Bolkart:
SCULPT: Shape-Conditioned Unpaired Learning of Pose-dependent Clothed and Textured Human Meshes. 2362-2371 - Tuomas Varanka, Tapani Toivonen, Soumya Tripathy, Guoying Zhao, Erman Acar:
PFStorer: Personalized Face Restoration and Super-Resolution. 2372-2381 - Pengfei Xie, Wenqiang Xu, Tutian Tang, Zhenjun Yu, Cewu Lu:
MS-MANO: Enabling Hand Pose Tracking with Biomechanical Constraints. 2382-2392 - Wenqian Zhang, Molin Huang, Yuxuan Zhou, Juze Zhang, Jingyi Yu, Jingya Wang, Lan Xu:
BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics. 2393-2404 - Eric-Tuan Le, Antonis Kakolyris, Petros Koutras, Himmy Tam, Efstratios Skordos, George Papandreou, Riza Alp Güler, Iasonas Kokkinos:
MeshPose: Unifying DensePose and 3D Body Mesh reconstruction. 2405-2414 - Xi Liu, Ying Guo, Cheng Zhen, Tong Li, Yingying Ao, Pengfei Yan:
CustomListener: Text-Guided Responsive Interaction for User-Friendly Listening Head Generation. 2415-2424 - Jiayi Liang, Haotian Liu, Hongteng Xu, Dixin Luo:
Generalizable Face Landmarking Guided by Conditional Face Warping. 2425-2435 - Xinshun Wang, Zhongbin Fang, Xia Li, Xiangtai Li, Chen Chen, Mengyuan Liu:
Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning. 2436-2446 - Reni Paskaleva, Mykyta Holubakha, Andela Ilic, Saman Motamed, Luc Van Gool, Danda Pani Paudel:
A Unified and Interpretable Emotion Representation and Expression Generation. 2447-2456 - Yingyan Xu, Prashanth Chandran, Sebastian Weiss, Markus Gross, Gaspard Zoss, Derek Bradley:
Artist-Friendly Relightable and Animatable Neural Heads. 2457-2467 - Supreeth Narasimhaswamy, Uttaran Bhattacharya, Xiang Chen, Ishita Dasgupta, Saayan Mitra, Minh Hoai:
HanDiffuser: Text-to-Image Generation with Realistic Hand Appearances. 2468-2479 - Abhishek Tandon, Anujraaj Goyal, Henry M. Clever, Zackory Erickson:
BodyMAP - Jointly Predicting Body Mesh and 3D Applied Pressure Map for People in Bed. 2480-2489 - George Retsinas, Panagiotis Paraskevas Filntisis, Radek Danecek, Victoria Fernández Abrevaya, Anastasios Roussos, Timo Bolkart, Petros Maragos:
3D Facial Expressions through Analysis-by-Neural-Synthesis. 2490-2501 - Vinkle Srivastav, Keqi Chen, Nicolas Padoy:
SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation. 2502-2512 - Tom Van Wouwe, Seunghwan Lee, Antoine Falisse, Scott L. Delp, C. Karen Liu:
DiffusionPoser: Real-Time Human Motion Reconstruction From Arbitrary Sparse Sensors Using Autoregressive Diffusion. 2513-2523 - Tian Ye, Sixiang Chen, Wenhao Chai, Zhaohu Xing, Jing Qin, Ge Lin, Lei Zhu:
Learning Diffusion Texture Priors for Image Restoration. 2524-2534 - Shangchen Zhou, Peiqing Yang, Jianyi Wang, Yihang Luo, Chen Change Loy:
Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution. 2535-2545 - Kai Xu, Ziwei Yu, Xin Wang, Michael Bi Mi, Angela Yao:
Enhancing Video Super-Resolution via Implicit Resampling-based Alignment. 2546-2555 - Xinjie Zhang, Ren Yang, Dailan He, Xingtong Ge, Tongda Xu, Yan Wang, Hongwei Qin, Jun Zhang:
Boosting Neural Representations for Videos with a Conditional Decoder. 2556-2566 - Zheng Ding, Xuaner Zhang, Zhuowen Tu, Zhihao Xia:
Restoration by Generation with Constrained Priors. 2567-2577 - Pingping Zhang, Tianyu Yan, Yang Liu, Huchuan Lu:
Fantastic Animals and Where to Find Them: Segment Any Marine Animal with Dual SAM. 2578-2587 - Shay Dekel, Yosi Keller, Martin Cadík:
Estimating Extreme 3D Image Rotations using Cascaded Attention. 2588-2598 - Kanglong Fan, Wen Wen, Mu Li, Yifan Peng, Kede Ma:
Learned Scanpaths Aid Blind Panoramic Video Quality Assessment. 2599-2608 - Xiaoyan Cong, Yue Wu, Qifeng Chen, Chenyang Lei:
Automatic Controllable Colorization via Imagination. 2609-2619 - Chenxi Qiu, Tao Yue, Xuemei Hu:
Reconstruction-free Cascaded Adaptive Compressive Sensing. 2620-2630 - Xiaofeng Cong, Jie Gui, Jing Zhang, Junming Hou, Hao Shen:
A Semi-Supervised Nighttime Dehazing Baseline with Spatial-Frequency Aware and Realistic Brightness Constraint. 2631-2640 - Cheeun Hong, Kyoung Mu Lee:
AdaBM: On-the-Fly Adaptive Bit Mapping for Image Super-Resolution. 2641-2650 - Jaeha Kim, Junghun Oh, Kyoung Mu Lee:
Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss. 2651-2661 - Kangmin Xu, Liang Liao, Jing Xiao, Chaofeng Chen, Haoning Wu, Qiong Yan, Weisi Lin:
Boosting Image Quality Assessment Through Efficient Transformer Adaptation with Local Feature Enhancement. 2662-2672 - Guilherme A. Potje, Felipe Cadar, André Araújo, Renato Martins, Erickson R. Nascimento:
XFeat: Accelerated Features for Lightweight Image Matching. 2682-2691 - Tianhao Zhou, Haipeng Li, Ziyi Wang, Ao Luo, Chen-Lin Zhang, Jiajun Li, Bing Zeng, Shuaicheng Liu:
RecDiffusion: Rectangling for Image Stitching with Diffusion Models. 2692-2701 - Xin Tian, Ke Xu, Rynson W. H. Lau:
Unsupervised Salient Instance Detection. 2702-2712 - Zhen Liu, Hao Zhu, Qi Zhang, Jingde Fu, Weibing Deng, Zhan Ma, Yanwen Guo, Xun Cao:
FINER: Flexible Spectral-Bias Tuning in Implicit NEural Representation by Variable-periodic Activation Functions. 2713-2722 - Donghun Ryou, Inju Ha, Hyewon Yoo, Dongwan Kim, Bohyung Han:
Robust Image Denoising Through Adversarial Frequency Mixup. 2723-2732 - Xin Gao, Tianheng Qiu, Xinyu Zhang, Hanlin Bai, Kang Liu, Xuan Huang, Hu Wei, Guoying Zhang, Huaping Liu:
Efficient Multi-Scale Network with Learnable Discrete Wavelet Transform for Blind Motion Deblurring. 2733-2742 - Zhongyu Li, Lei Zhang:
Efficient Scene Recovery Using Luminous Flux Prior. 2743-2752 - Guangyang Wu, Xin Tao, Changlin Li, Wenyi Wang, Xiaohong Liu, Qingqing Zheng:
Perception-Oriented Video Frame Interpolation via Asymmetric Blending. 2753-2762 - Wen Wen, Mu Li, Yabin Zhang, Yiting Liao, Junlin Li, Li Zhang, Kede Ma:
Modular Blind Video Quality Assessment. 2763-2772 - Jiawei Liu, Qiang Wang, Huijie Fan, Yinong Wang, Yandong Tang, Liangqiong Qu:
Residual Denoising Diffusion Models. 2773-2783 - Woo Kyoung Han, Sunghoon Im, Jaedeok Kim, Kyong Hwan Jin:
JDEC: JPEG Decoding via Enhanced Continuous Cosine Coefficients. 2784-2793 - Agneet Chatterjee, Tejas Gokhale, Chitta Baral, Yezhou Yang:
On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation. 2794-2803 - Bang-Dang Pham, Phong Tran, Anh Tuan Tran, Cuong Pham, Rang Nguyen, Minh Hoai:
Blur2Blur: Blur Conversion for Unsupervised Image Deblurring on Unknown Domains. 2804-2813 - Shiyan Chen, Jiyuan Zhang, Zhaofei Yu, Tiejun Huang:
Exploring Efficient Asymmetric Blind-Spots for Self-Supervised Denoising in Real-World Scenarios. 2814-2823 - Jiezhang Cao, Yue Shi, Kai Zhang, Yulun Zhang, Radu Timofte, Luc Van Gool:
Deep Equilibrium Diffusion Restoration with Parallel Sampling. 2824-2834 - Kun Yuan, Hongbo Liu, Mading Li, Muyi Sun, Ming Sun, Jiachao Gong, Jinhua Hao, Chao Zhou, Yansong Tang:
PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild. 2835-2845 - Yafei Zhang, Shen Zhou, Huafeng Li:
Depth Information Assisted Collaborative Mutual Promotion Network for Single Image Dehazing. 2846-2855 - Leheng Zhang, Yawei Li, Xingyu Zhou, Xiaorui Zhao, Shuhang Gu:
Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token Dictionary. 2856-2865 - Jingbo Lin, Zhilu Zhang, Yuxiang Wei, Dongwei Ren, Dongsheng Jiang, Qi Tian, Wangmeng Zuo:
Improving Image Restoration Through Removing Degradations in Textual Representations. 2866-2878 - Yong Shu, Liquan Shen, Xiangyu Hu, Mengyao Li, Zihao Zhou:
Towards Real-World HDR Video Reconstruction: A Large-Scale Benchmark Dataset and A Two-Stage Alignment Network. 2879-2888 - Xingguang Zhang, Nicholas Chimitt, Yiheng Chi, Zhiyuan Mao, Stanley H. Chan:
Spatio-Temporal Turbulence Mitigation: A Translational Perspective. 2889-2899 - Xiaogang Xu, Shu Kong, Tao Hu, Zhe Liu, Hujun Bao:
Boosting Image Restoration via Priors from Pre-Trained Models. 2900-2909 - Zhangkai Ni, Juncheng Wu, Zian Wang, Wenhan Yang, Hanli Wang, Lin Ma:
Misalignment-Robust Frequency Distribution Loss for Image Transformation. 2910-2919 - Enxuan Gu, Hongwei Ge, Yong Guo:
CoDe: An Explicit Content Decoupling Framework for Image Restoration. 2920-2930 - Wei-Ting Chen, Gurunandan Krishnan, Qiang Gao, Sy-Yen Kuo, Sizhuo Ma, Jian Wang:
DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer. 2931-2941 - Hyeongmin Lee, Kyoungkook Kang, Jungseul Ok, Sunghyun Cho:
CLIPtone: Unsupervised Learning for Text-Based Image Tone Adjustment. 2942-2951 - Shihao Zhou, Duosheng Chen, Jinshan Pan, Jinglei Shi, Jufeng Yang:
Adapt or Perish: Adaptive Sparse Transformer with Attentive Feature Refinement for Image Restoration. 2952-2963 - Qiang Zhu, Jinhua Hao, Yukang Ding, Yu Liu, Qiao Mo, Ming Sun, Chao Zhou, Shuyuan Zhu:
CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement. 2964-2974 - Kyunghyun Lee, Ukcheol Shin, Byeong-Uk Lee:
Learning to Control Camera Exposure via Reinforcement Learning. 2975-2983 - Ziwen Li, Feng Zhang, Meng Cao, Jinpu Zhang, Yuanjie Shao, Yuehuan Wang, Nong Sang:
Real-Time Exposure Correction via Collaborative Transformations and Adaptive Sampling. 2984-2994 - Jun Xiao, Zihang Lyu, Cong Zhang, Yakun Ju, Changjian Shui, Kin-Man Lam:
Towards Progressive Multi-Frequency Representation for Image Warping. 2995-3004 - Li Pang, Xiangyu Rui, Long Cui, Hongzhong Wang, Deyu Meng, Xiangyong Cao:
HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models. 3005-3014 - Yiqi Shi, Duo Liu, Liguo Zhang, Ye Tian, Xuezhi Xia, Xiaojing Fu:
ZERO-IG: Zero-Shot Illumination-Guided Joint Denoising and Adaptive Enhancement for Low-Light Images. 3015-3024 - Hamadi Chihaoui, Paolo Favaro:
Masked and Shuffled Blind Spot Denoising for Real-World Images. 3025-3034 - Huiyuan Fu, Fei Peng, Xianwei Li, Yejun Li, Xin Wang, Huadong Ma:
Continuous Optical Zooming: A Benchmark for Arbitrary-Scale Image Super-Resolution in Real World. 3035-3044 - Atefeh Khoshkhahtinat, Ali Zafari, Piyush M. Mehta, Nasser M. Nasrabadi:
Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis. 3045-3054 - Yuan Gao, Yuqing Zhu, Xinjun Li, Yimin Du, Tianzhu Zhang:
SD2Event: Self-Supervised Learning of Dynamic Detectors and Contextual Descriptors for Event Cameras. 3055-3064 - Lanyun Zhu, Tianrun Chen, Deyi Ji, Jieping Ye, Jun Liu:
LLaFS: When Large Language Models Meet Few-Shot Segmentation. 3065-3075 - Junyi Zhang, Charles Herrmann, Junhwa Hur, Eric Chen, Varun Jampani, Deqing Sun, Ming-Hsuan Yang:
Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence. 3076-3085 - Gen Li, Deqing Sun, Laura Sevilla-Lara, Varun Jampani:
One-Shot Open Affordance Learning with Foundation Models. 3086-3096 - Boyuan Sun, Yuqi Yang, Le Zhang, Ming-Ming Cheng, Qibin Hou:
CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation. 3097-3107 - Yasser Benigmim, Subhankar Roy, Slim Essid, Vicky Kalogeiton, Stéphane Lathuilière:
Collaborating Foundation Models for Domain Generalized Semantic Segmentation. 3108-3119 - You Huang, Zongyu Lan, Liujuan Cao, Xianming Lin, Shengchuan Zhang, Guannan Jiang, Rongrong Ji:
FocSAM: Delving Deeply into Focused Objects in Segmenting Anything. 3120-3130 - Simon Weber, Thomas Dagès, Maolin Gao, Daniel Cremers:
Finsler-Laplace-Beltrami Operators with Application to Shape Analysis. 3131-3140 - Yijia Weng, Bowen Wen, Jonathan Tremblay, Valts Blukis, Dieter Fox, Leonidas J. Guibas, Stan Birchfield:
Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects. 3141-3150 - Ho Kei Cheng, Seoung Wug Oh, Brian L. Price, Joon-Young Lee, Alexander G. Schwing:
Putting the Object Back into Video Object Segmentation. 3151-3161 - Yiran Song, Qianyu Zhou, Xiangtai Li, Deng-Ping Fan, Xuequan Lu, Lizhuang Ma:
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model. 3162-3173 - Daan de Geus, Gijs Dubbelman:
Task-Aligned Part-Aware Panoptic Segmentation Through Joint Object-Part Representations. 3174-3183 - Matteo Sodano, Federico Magistri, Lucas Nunes, Jens Behley, Cyrill Stachniss:
Open-World Semantic Segmentation Including Class Similarity. 3184-3194 - Thomas V. Chang, Simon Seibt, Bartosz von Rymon Lipinski:
Hierarchical Histogram Threshold Segmentation - Auto-terminating High-detail Oversegmentation. 3195-3204 - Duojun Huang, Xinyu Xiong, Jie Ma, Jichang Li, Zequn Jie, Lin Ma, Guanbin Li:
AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning. 3205-3215 - Yichen Liu, Benran Hu, Chi-Keung Tang, Yu-Wing Tai:
SANeRF-HQ: Segment Anything for NeRF in High Quality. 3216-3226 - Minghan Li, Shuai Li, Xindong Zhang, Lei Zhang:
UniVS: Unified and Universal Video Segmentation with Prompts as Queries. 3227-3238 - Bedrettin Cetinkaya, Sinan Kalkan, Emre Akbas:
RankED: Addressing Imbalance and Uncertainty in Edge Detection Using Ranking-based Losses. 3239-3249 - Hebei Li, Jin Wang, Jiahui Yuan, Yue Li, Wenming Weng, Yansong Peng, Yueyi Zhang, Zhiwei Xiong, Xiaoyan Sun:
Event-Assisted Low-Light Video Object Segmentation. 3250-3259 - Jianan Li, Qiulei Dong:
Density-Guided Semi-Supervised 3D Semantic Segmentation with Dual-Space Hardness Sampling. 3260-3269 - Yi Zhang, Meng-Hao Guo, Miao Wang, Shi-Min Hu:
Exploring Regional Clues in CLIP for Zero-Shot Semantic Segmentation. 3270-3280 - Yichen Li, Kaichun Mo, Yueqi Duan, He Wang, Jiequan Zhang, Lin Shao, Wojciech Matusik, Leonidas J. Guibas:
Category-Level Multi-Part Multi-Joint 3D Shape Assembly. 3281-3291 - Yingda Yin, Yuzheng Liu, Yang Xiao, Daniel Cohen-Or, Jingwei Huang, Baoquan Chen:
SAI3D: Segment any Instance in 3D Scenes. 3292-3302 - Xiaoyang Wang, Huihui Bai, Limin Yu, Yao Zhao, Jimin Xiao:
Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation. 3303-3312 - Lennart Bastian, Yizheng Xie, Nassir Navab, Zorah Lähner:
Hybrid Functional Maps for Crease-Aware Non-Isometric Shape Matching. 3313-3323 - Feilong Tang, Zhongxing Xu, Zhaojun Qu, Wei Feng, Xingjian Jiang, Zongyuan Ge:
Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation. 3324-3334 - Jiawei Liu, Changkun Ye, Ruikai Cui, Nick Barnes:
Self-Calibrating Vicinal Risk Minimisation for Model Calibration. 3335-3345 - Beomyoung Kim, Joonsang Yu, Sung Ju Hwang:
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning. 3346-3356 - Yuhang Ding, Liulei Li, Wenguan Wang, Yi Yang:
Clustering Propagation for Universal Medical Image Segmentation. 3357-3369 - Lanyun Zhu, Tianrun Chen, Jianxiong Yin, Simon See, Jun Liu:
Addressing Background Context Bias in Few-Shot Segmentation Through Iterative Modulation. 3370-3379 - Jiahao Nie, Yun Xing, Gongjie Zhang, Pei Yan, Aoran Xiao, Yap-Peng Tan, Alex C. Kot, Shijian Lu:
Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining. 3380-3390 - Huayu Mai, Rui Sun, Tianzhu Zhang, Feng Wu:
RankMatch: Exploring the Better Consistency Regularization for Semi-Supervised Semantic Segmentation. 3391-3401 - Xiang Li, Jinglu Wang, Xiaohao Xu, Xiulian Peng, Rita Singh, Yan Lu, Bhiksha Raj:
QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition. 3402-3413 - Linwei Chen, Lin Gu, Dezhi Zheng, Ying Fu:
Frequency-Adaptive Dilated Convolution for Semantic Segmentation. 3414-3425 - Bin Xie, Jiale Cao, Jin Xie, Fahad Shahbaz Khan, Yanwei Pang:
SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation. 3426-3436 - Xinqiao Zhao, Ziqian Yang, Tianhong Dai, Bingfeng Zhang, Jimin Xiao:
PSDPM: Prototype-based Secondary Discriminative Pixels Mining for Weakly Supervised Semantic Segmentation. 3437-3446 - Matteo Bastico, Etienne Decencière, Laurent Corté, Yannick Tillier, David Ryckelynck:
Coupled Laplacian Eigenmaps for Locally-Aware 3D Rigid Point Cloud Matching. 3447-3458 - Yong Liu, Cairong Zhang, Yitong Wang, Jiahao Wang, Yujiu Yang, Yansong Tang:
Universal Segmentation at Arbitrary Granularity with Language Instruction. 3459-3469 - Ardian Umam, Cheng-Kun Yang, Min-Hung Chen, Jen-Hui Chuang, Yen-Yu Lin:
PartDistill: 3D Shape Part Segmentation by Vision-Language Model Distillation. 3470-3479 - Marilyn Keller, Vaibhav Arora, Abdelmouttaleb Dakri, Shivam Chandhok, Jürgen Machann, Andreas Fritsche, Michael J. Black, Sergi Pujades:
HIT: Estimating Internal Human Implicit Tissues from the Body Surface. 3480-3490 - Yong Liu, Sule Bai, Guanbin Li, Yitong Wang, Yansong Tang:
Open-Vocabulary Segmentation with Semantic-Assisted Calibration. 3491-3500 - Yian Zhao, Kehan Li, Zesen Cheng, Pengchong Qiao, Xiawu Zheng, Rongrong Ji, Chang Liu, Li Yuan, Jie Chen:
GraCo: Granularity-Controllable Interactive Segmentation. 3501-3510 - Zhiheng Cheng, Qingyue Wei, Hongru Zhu, Yan Wang, Liangqiong Qu, Wei Shao, Yuyin Zhou:
Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding. 3511-3522 - Chanyoung Kim, Woojung Han, Dayun Ju, Seong Jae Hwang:
EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation. 3523-3533 - Yuanchen Wu, Xichen Ye, Kequan Yang, Jide Li, Xiaoqiang Li:
DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation. 3534-3543 - Diandian Guo, Deng-Ping Fan, Tongyu Lu, Christos Sakaridis, Luc Van Gool:
Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes. 3544-3553 - Junjiao Tian, Lavisha Aggarwal, Andrea Colaco, Zsolt Kira, Mar González-Franco:
Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion. 3554-3563 - Ayush Jain, Pushkal Katara, Nikolaos Gkanatsios, Adam W. Harley, Gabriel Sarch, Kriti Aggarwal, Vishrav Chaudhary, Katerina Fragkiadaki:
ODIN: A Single Model for 2D and 3D Segmentation. 3564-3574 - Jiafan Zhuang, Zilei Wang, Yixin Zhang, Zhun Fan:
Infer from What You Have Seen Before: Temporally-dependent Classifier for Semi-supervised Video Segmentation. 3575-3584 - Zhaoyang Wei, Pengfei Chen, Xuehui Yu, Guorong Li, Jianbin Jiao, Zhenjun Han:
Semantic-aware SAM for Point-Prompted Instance Segmentation. 3585-3594 - Sung-Hoon Yoon, Hoyong Kwon, Hyeonseong Kim, Kuk-Jin Yoon:
Class Tokens Infusion for Weakly Supervised Semantic Segmentation. 3595-3605 - Zhiwei Yang, Kexue Fu, Minghong Duan, Linhao Qu, Shuo Wang, Zhijian Song:
Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation. 3606-3615 - Huicong Zhang, Haozhe Xie, Hongxun Yao:
Blur-Aware Spatio-Temporal Sparse Transformer for Video Deblurring. 3616-3626 - Woo-Jin Ahn, Geun-Yeong Yang, Hyun Duck Choi, Myo-Taeg Lim:
Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning. 3616-3626 - Haonan Wang, Qixiang Zhang, Yi Li, Xiaomeng Li:
AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation. 3627-3636 - Leon Sick, Dominik Engel, Pedro Hermosilla, Timo Ropinski:
Unsupervised Semantic Segmentation Through Depth-Guided Feature Correlation and Sampling. 3637-3646 - Nissim Maruani, Maks Ovsjanikov, Pierre Alliez, Mathieu Desbrun:
PoNQ: A Neural QEM-Based Mesh Representation. 3647-3657 - Dongliang Cao, Marvin Eisenberger, Nafie El Amrani, Daniel Cremers, Florian Bernard:
Spectral Meets Spatial: Harmonising 3D Shape Matching and Interpolation. 3658-3668 - Jiayi Zhu, Qing Guo, Felix Juefei-Xu, Yihao Huang, Yang Liu, Geguang Pu:
Cosalpure: Learning Concept from Group Images for Robust Co-Saliency Detection. 3669-3678 - Jiawei Wang, Changjian Li:
ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention. 3679-3688 - Luca Barsellotti, Roberto Amoroso, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara:
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation. 3689-3698 - Bo Li, Haoke Xiao, Lv Tang:
ASAM: Boosting Segment Anything Model with Adversarial Tuning. 3699-3710 - He Guo, Zixuan Ye, Zhiguo Cao, Hao Lu:
In-Context Matting. 3711-3720 - Hyeokjun Kweon, Jihun Kim, Kuk-Jin Yoon:
Weakly Supervised Point Cloud Semantic Segmentation via Artificial Oracle. 3721-3731 - Changki Sung, Wanhee Kim, Jungho An, Wooju Lee, Hyungtae Lim, Hyun Myung:
Contextrast: Contextual Contrastive Learning for Semantic Segmentation. 3732-3742 - Zelin Peng, Zhengqin Xu, Zhilin Zeng, Lingxi Xie, Qi Tian, Wei Shen:
Parameter Efficient Fine-Tuning via Cross Block Orchestration for Segment Anything Model. 3743-3752 - Haocheng Yuan, Jing Xu, Hao Pan, Adrien Bousseau, Niloy J. Mitra, Changjian Li:
CADTalk: An Algorithm and Benchmark for Semantic Commenting of CAD Programs. 3753-3762 - Yujia Liu, Anton Obukhov, Jan Dirk Wegner, Konrad Schindler:
Point2CAD: Reverse Engineering CAD Models from 3D Point Clouds. 3763-3772 - Qin Liu, Jaemin Cho, Mohit Bansal, Marc Niethammer:
Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts. 3773-3782 - Junfeng Wu, Yi Jiang, Qihao Liu, Zehuan Yuan, Xiang Bai, Song Bai:
General Object Foundation Model for Images and Videos at Scale. 3783-3795 - Bingfeng Zhang, Siyue Yu, Yunchao Wei, Yao Zhao, Jimin Xiao:
Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation. 3796-3806 - Minhyeok Lee, Suhwan Cho, Dogyoon Lee, Chaewon Park, Jungho Lee, Sangyoun Lee:
Guided Slot Attention for Unsupervised Video Object Segmentation. 3807-3816 - Ziqin Zhou, Hai-Ming Xu, Yangyang Shu, Lingqiao Liu:
Unlocking the Potential of Pre-Trained Vision Transformers for Few-Shot Semantic Segmentation through Relationship Descriptors. 3817-3827 - Walid Bousselham, Felix Petersen, Vittorio Ferrari, Hilde Kuehne:
Grounding Everything: Emerging Localization Properties in Vision-Language Transformers. 3828-3837 - Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Han Xiao, Chaoyou Fu, Hao Dong, Peng Gao:
No Time to Train: Empowering Non-Parametric Networks for Few-Shot 3D Scene Segmentation. 3838-3847 - Yizheng Gong, Siyue Yu, Xiaoyang Wang, Jimin Xiao:
Continual Segmentation with Disentangled Objectness Learning and Class Recognition. 3848-3857 - Zhuofan Xia, Dongchen Han, Yizeng Han, Xuran Pan, Shiji Song, Gao Huang:
GSVA: Generalized Segmentation via Multimodal Large Language Models. 3858-3869 - Chuong Huynh, Seoung Wug Oh, Abhinav Shrivastava, Joon-Young Lee:
MaGGIe: Masked Guided Gradual Human Instance Matting. 3870-3879 - Zitao Wang, Qiguang Miao, Yue Xi, Peipei Zhao:
EFormer: Enhanced Transformer Towards Semantic-Contour Features of Foreground for Portraits Matting. 3880-3889 - Zhiwen Chen, Zhiyu Zhu, Yifan Zhang, Junhui Hou, Guangming Shi, Jinjian Wu:
Segment Any Event Streams via Weighted Adaptation of Pivotal Tokens. 3890-3900 - Kenji Enomoto, TJ Rhodes, Brian Price, Gavin Miller:
Polar Matte: Fully Computational Ground-Truth-Quality Alpha Matte Extraction for Images and Video using Polarized Screen Matting. 3901-3909 - Wenjie Zhao, Jia Li, Xin Dong, Yu Xiang, Yunhui Guo:
Segment Every Out-of-Distribution Object. 3910-3920 - Qian Yu, Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu:
Multi-View Aggregation Network for Dichotomous Image Segmentation. 3921-3930 - Ege Ozguroglu, Ruoshi Liu, Dídac Surís, Dian Chen, Achal Dave, Pavel Tokmakov, Carl Vondrick:
pix2gestalt: Amodal Segmentation by Synthesizing Wholes. 3931-3940 - Jin Wang, Bingfeng Zhang, Jian Pang, Honglong Chen, Weifeng Liu:
Rethinking Prior Information Generation with CLIP for Few-Shot Segmentation. 3941-3951 - Yuan Wang, Rui Sun, Naisong Luo, Yuwen Pan, Tianzhu Zhang:
Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation. 3952-3963 - Zijian Wu, Jun Lu, Jing Han, Lianfa Bai, Yi Zhang, Zhuang Zhao, Siyang Song:
Domain Separation Graph Neural Networks for Saliency Object Ranking. 3964-3974 - Sandra Kara, Hejer Ammar, Julien Denize, Florian Chabot, Quoc-Cuong Pham:
DIOD: Self-Distillation Meets Object Discovery. 3975-3985 - Chengxiang Fan, Muzhi Zhu, Hao Chen, Yang Liu, Weijia Wu, Huaqi Zhang, Chunhua Shen:
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data. 3986-3995 - Zhaochong An, Guolei Sun, Yun Liu, Fayao Liu, Zongwei Wu, Dan Wang, Luc Van Gool, Serge J. Belongie:
Rethinking Few-shot 3D Point Cloud Semantic Segmentation. 3996-4006 - Xinting Hu, Li Jiang, Bernt Schiele:
Training Vision Transformers for Semi-Supervised Semantic Segmentation. 4007-4017 - Phuc D. A. Nguyen, Tuan Duc Ngo, Evangelos Kalogerakis, Chuang Gan, Anh Tuan Tran, Cuong Pham, Khoi Nguyen:
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance. 4018-4028 - Jiayun Luo, Siddhesh Khandelwal, Leonid Sigal, Boyang Li:
Emergent Open-Vocabulary Semantic Segmentation from Off-the-Shelf Vision-Language Models. 4029-4040 - Robin Magnet, Maks Ovsjanikov:
Memory-Scalable and Simplified Functional Map Learning. 4041-4050 - Chaewon Lee, Seon-Ho Lee, Chang-Su Kim:
MFP: Making Full Use of Probability Maps for Interactive Image Segmentation. 4051-4059 - Sangyun Shin, Kaichen Zhou, Madhu Vankadari, Andrew Markham, Niki Trigoni:
Spherical Mask: Coarse-to-Fine 3D Point Cloud Instance Segmentation with Spherical Representation. 4060-4069 - Hanyang Chi, Jian Pang, Bingfeng Zhang, Weifeng Liu:
Adaptive Bidirectional Displacement for Semi-Supervised Medical Image Segmentation. 4070-4080 - Wei-Ting Chen, Yu-Jiet Vong, Sy-Yen Kuo, Sizhuo Ma, Jian Wang:
RobustSAM: Segment Anything Robustly on Degraded Images. 4081-4091 - Pancheng Zhao, Peng Xu, Pengda Qin, Deng-Ping Fan, Zhicheng Zhang, Guoli Jia, Bowen Zhou, Jufeng Yang:
LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion. 4092-4101 - Jingyun Wang, Guoliang Kang:
Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation. 4102-4112 - Seokju Cho, Heeseong Shin, Sunghwan Hong, Anurag Arnab, Paul Hongsuck Seo, Seungryong Kim:
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation. 4113-4123 - Chao Shang, Zichen Song, Heqian Qiu, Lanxiao Wang, Fanman Meng, Hongliang Li:
Prompt-Driven Referring Image Segmentation with Instance Contrasting. 4124-4134 - Joren Brunekreef, Eric Marcus, Ray Sheombarsing, Jan-Jakob Sonke, Jonas Teuwen:
Kandinsky Conformal Prediction: Efficient Calibration of Image Segmentation Algorithms. 4135-4143 - Xiongwei Wu, Sicheng Yu, Ee-Peng Lim, Chong-Wah Ngo:
OVFoodSeg: Elevating Open-Vocabulary Food Image Segmentation via Image-Informed Textual Representation. 4144-4153 - Thomas Wimmer, Peter Wonka, Maks Ovsjanikov:
Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features. 4154-4164 - Xiao Zhang, David Yunis, Michael Maire:
Deciphering 'What' and 'Where' Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations. 4165-4175 - Ahmed Bourouis, Judith Ellen Fan, Yulia Gryaditskaya:
Open Vocabulary Semantic Scene Sketch Understanding. 4176-4186 - Xiaoqi Wang, Wenbin He, Xiwei Xuan, Clint Sebastian, Jorge Piazentin Ono, Xin Li, Sima Behpour, Thang Doan, Liang Gou, Han-Wei Shen, Liu Ren:
USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation. 4187-4196 - Yuhao Liu, Zhanghan Ke, Fang Liu, Nanxuan Zhao, Rynson W. H. Lau:
Diff-Plugin: Revitalizing Details for Diffusion-Based Low-Level Tasks. 4197-4208 - Xuanchi Ren, Jiahui Huang, Xiaohui Zeng, Ken Museth, Sanja Fidler, Francis Williams:
XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies. 4209-4219 - Yi-Hua Huang, Yang-Tian Sun, Ziyi Yang, Xiaoyang Lyu, Yan-Pei Cao, Xiaojuan Qi:
SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes. 4220-4230 - Anand Bhattad, James Soole, David A. Forsyth:
StyLitGAN: Image-Based Relighting via Latent Control. 4231-4240 - Jiraphon Yenphraphai, Xichen Pan, Sainan Liu, Daniele Panozzo, Saining Xie:
Image Sculpting: Precise Object Editing with 3D Geometry Control. 4241-4251 - Xianfang Zeng, Xin Chen, Zhongqi Qi, Wen Liu, Zibo Zhao, Zhibin Wang, Bin Fu, Yong Liu, Gang Yu:
Paint3D: Paint Anything 3D With Lighting-Less Texture Diffusion Models. 4252-4262 - Yiqun Mei, Yu Zeng, He Zhang, Zhixin Shu, Xuaner Zhang, Sai Bi, Jianming Zhang, Hyunjoon Jung, Vishal M. Patel:
Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image. 4263-4273 - Daniel Rebain, Soroosh Yazdani, Kwang Moo Yi, Andrea Tagliasacchi:
Neural Fields as Distributions: Signal Processing Beyond Euclidean Space. 4274-4283 - Jialun Liu, Chenming Wu, Xinqi Liu, Xing Liu, Jinbo Wu, Haotian Peng, Chen Zhao, Haocheng Feng, Jingtuo Liu, Errui Ding:
TexOct: Generating Textures of 3D Models with Octree-based Diffusion. 4284-4293 - Yishun Dou, Zhong Zheng, Qiaoqiao Jin, Rui Shi, Yuhan Li, Bingbing Ni:
Differentiable Micro-Mesh Construction. 4294-4303 - Yu-Ying Yeh, Jia-Bin Huang, Changil Kim, Lei Xiao, Thu Nguyen-Phuoc, Numair Khan, Cheng Zhang, Manmohan Chandraker, Carl S. Marshall, Zhao Dong, Zhengqin Li:
TextureDreamer: Image-Guided Texture Synthesis through Geometry-Aware Diffusion. 4304-4314 - Seungwoo Yoo, Kunho Kim, Vladimir G. Kim, Minhyuk Sung:
As-Plausible-As-Possible: Plausibility-Aware Mesh Deformation Using 2D Diffusion Priors. 4315-4324 - Rinon Gal, Yael Vinker, Yuval Alaluf, Amit Bermano, Daniel Cohen-Or, Ariel Shamir, Gal Chechik:
Breathing Life Into Sketches Using Text-to-Video Priors. 4325-4336 - Yishun Dou, Zhong Zheng, Qiaoqiao Jin, Bingbing Ni, Yugang Chen, Junxiang Ke:
Real-Time Neural BRDF with Spherically Distributed Primitives. 4337-4346 - Kim Youwang, Tae-Hyun Oh, Gerard Pons-Moll:
Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering. 4347-4356 - Jia Li, Ziling Chen, Xiaolong Wu, Lu Wang, Beibei Wang, Lei Zhang:
Neural Super-Resolution for Real-Time Rendering with Radiance Demodulation. 4357-4367 - Yifei Li, Hsiao-Yu Chen, Egor Larionov, Nikolaos Sarafianos, Wojciech Matusik, Tuur Stuyck:
DiffAvatar: Simulation-Ready Garment Optimization with Differentiable Simulation. 4368-4378 - Ivan Lopes, Fabio Pizzati, Raoul de Charette:
Material Palette: Extraction of Materials from a Single Image. 4379-4388 - Tianyi Xie, Zeshun Zong, Yuxing Qiu, Xuan Li, Yutao Feng, Yin Yang, Chenfanfu Jiang:
PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics. 4389-4398 - Hoon-Gyu Chung, Seokjun Choi, Seung-Hwan Baek:
Differentiable Point-Based Inverse Rendering. 4399-4408 - Justine Giroux, Mohammad Reza Karimi Dastjerdi, Yannick Hold-Geoffroy, Javier Vazquez-Corral, Jean-François Lalonde:
Towards a Perceptual Evaluation Framework for Lighting Estimation. 4410-4419 - Zhongyin Zhao, Ye Chen, Zhangli Hu, Xuanhong Chen, Bingbing Ni:
Vector Graphics Generation via Mutually Impulsed Dual-Domain Diffusion. 4420-4428 - Giuseppe Vecchio, Renato Sortino, Simone Palazzo, Concetto Spampinato:
MatFuse: Controllable Material Generation with Diffusion Models. 4429-4438 - Carlos Rodríguez-Pardo, Dan Casas, Elena Garces, Jorge Lopez-Moreno:
TexTile: A Differentiable Metric for Texture Tileability. 4439-4449 - Yutao Feng, Yintong Shang, Xuan Li, Tianjia Shao, Chenfanfu Jiang, Yin Yang:
PIE-NeRF: Physics-Based Interactive Elastodynamics with NeRF. 4450-4461 - Jiahao Ma, Miaomiao Liu, David Ahmedt-Aristizabal, Chuong Nguyen:
HashPoint: Accelerated Point Searching and Sampling for Neural Rendering. 4462-4472 - Dale Decatur, Itai Lang, Kfir Aberman, Rana Hanocka:
3D Paintbrush: Local Stylization of 3D Shapes with Cascaded Score Distillation. 4473-4483 - Miguel Fainstein, Viviana Siless, Emmanuel Iarussi:
DUDF: Differentiable Unsigned Distance Fields with Hyperbolic Scaling. 4484-4493 - Niladri Shekhar Dutt, Sanjeev Muralikrishnan, Niloy J. Mitra:
Diffusion 3D Features (Diff3F) Decorating Untextured Shapes with Distilled Semantic Features. 4494-4504 - Soyeon Yoon, Kwan Yun, Kwanggyoon Seo, Sihun Cha, Jung Eun Yoo, Junyong Noh:
LeGO: Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example. 4505-4514 - Yichen Sheng, Zixun Yu, Lu Ling, Zhiwen Cao, Xuaner Zhang, Xin Lu, Ke Xian, Haiting Lin, Bedrich Benes:
Dr.Bokeh: DiffeRentiable Occlusion-Aware Bokeh Rendering. 4515-4525 - Xiaoliang Ju, Zhaoyang Huang, Yijin Li, Guofeng Zhang, Yu Qiao, Hongsheng Li:
DiffInDScene: Diffusion-Based High-Quality 3D Indoor Scene Generation. 4526-4535 - Xuecan Wang, Shibang Xiao, Xiaohui Liang:
LightOctree: Lightweight 3D Spatially-Coherent Indoor Lighting Estimation. 4536-4545 - Ximing Xing, Haitao Zhou, Chuang Wang, Jing Zhang, Dong Xu, Qian Yu:
SVGDreamer: Text Guided SVG Generation with Diffusion Model. 4546-4555 - Ruizhi Shao, Jingxiang Sun, Cheng Peng, Zerong Zheng, Boyao Zhou, Hongwen Zhang, Yebin Liu:
Control4D: Efficient 4D Portrait Editing With Text. 4556-4567 - Xin Huang, Ruizhi Shao, Qi Zhang, Hongwen Zhang, Ying Feng, Yebin Liu, Qing Wang:
HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation. 4568-4577 - Hongchi Xia, Zhi-Hao Lin, Wei-Chiu Ma, Shenlong Wang:
Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video. 4578-4588 - Vikas Thamizharasan, Difan Liu, Matthew Fisher, Nanxuan Zhao, Evangelos Kalogerakis, Michal Lukác:
NIVeL: Neural Implicit Vector Layers for Text-to-Vector Generation. 4589-4597 - Jinseo Jeong, Junseo Koo, Qimeng Zhang, Gunhee Kim:
ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-View Images. 4598-4609 - Linqi Zhou, Andy Shih, Chenlin Meng, Stefano Ermon:
DreamPropeller: Supercharge Text-to-3D Generation with Parallel Sampling. 4610-4619 - Chenjian Gao, Boyan Jiang, Xinghui Li, Yingpeng Zhang, Qian Yu:
GenesisTex: Adapting Image Denoising Diffusion to Texture Space. 4620-4629 - Lior Yariv, Omri Puny, Oran Gafni, Yaron Lipman:
Mosaic-SDF for 3D Generative Models. 4630-4639 - Michael Fischer, Zhengqin Li, Thu Nguyen-Phuoc, Aljaz Bozic, Zhao Dong, Carl S. Marshall, Tobias Ritschel:
NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs. 4640-4650 - Xingtao Wang, Hongliang Wei, Xiaopeng Fan, Debin Zhao:
Hyper-MD: Mesh Denoising with Customized Parameters Aware of Noise Intensity and Geometric Characteristics. 4651-4660 - Maximilian Frühauf, Hayko Riemenschneider, Markus Gross, Christopher Schroers:
QUADify: Extracting Meshes with Pixel-Level Details and Materials from Images. 4661-4670 - Pu Li, Jianwei Guo, Huibin Li, Bedrich Benes, Dong-Ming Yan:
SfmCAD: Unsupervised CAD Reconstruction by Learning Sketch-based Feature Modeling Operations. 4671-4680 - Ramana Sundararaman, Roman Klokov, Maks Ovsjanikov:
Self-Supervised Dual Contouring. 4681-4691 - Yuan Li, Zhihao Liu, Bedrich Benes, Xiaopeng Zhang, Jianwei Guo:
SVDTree: Semantic Voxel Diffusion for Single Image Tree Reconstruction. 4692-4702 - Vanessa Sklyarova, Egor Zakharov, Otmar Hilliges, Michael J. Black, Justus Thies:
Text-Conditioned Generative Model of 3D Strand-Based Human Hairstyles. 4703-4712 - Mohammad Sadil Khan, Elona Dupont, Sk Aziz Ali, Kseniya Cherenkova, Anis Kacem, Djamila Aouada:
CAD-SIGNet: CAD Language Inference from Point Clouds Using Layer-Wise Sketch Instance Guided Attention. 4713-4722 - Biao Zhang, Peter Wonka:
Functional Diffusion. 4723-4732 - Chenyang Si, Ziqi Huang, Yuming Jiang, Ziwei Liu:
FreeU: Free Lunch in Diffusion U-Net. 4733-4743 - Yutong Feng, Biao Gong, Di Chen, Yujun Shen, Yu Liu, Jingren Zhou:
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following. 4744-4753 - Hexiang Hu, Kelvin C. K. Chan, Yu-Chuan Su, Wenhu Chen, Yandong Li, Kihyuk Sohn, Yang Zhao, Xue Ben, Boqing Gong, William W. Cohen, Ming-Wei Chang, Xuhui Jia:
Instruct-Imagen: Image Generation with Multi-modal Instruction. 4754-4763 - Yanbing Zhang, Mengping Yang, Qin Zhou, Zhe Wang:
Attention Calibration for Disentangled Text-to-Image Personalization. 4764-4774 - Amir Hertz, Andrey Voynov, Shlomi Fruchter, Daniel Cohen-Or:
Style Aligned Image Generation via Shared Attention. 4775-4785 - Damien Teney, Armand Mihai Nicolicioiu, Valentin Hartmann, Ehsan Abbasnejad:
Neural Redshift: Random Networks are not Random Functions. 4786-4796 - Runpeng Yu, Xinchao Wang:
Neural Lineage. 4797-4807 - Lucas Brynte, José Pedro Iglesias, Carl Olsson, Fredrik Kahl:
Learning Structure-From-Motion with Graph Attention Networks. 4808-4817 - Bin Xiao, Haiping Wu, Weijian Xu, Xiyang Dai, Houdong Hu, Yumao Lu, Michael Zeng, Ce Liu, Lu Yuan:
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks. 4818-4829 - Junwon Seo, Sangyoon Lee, Kwang In Kim, Jaeho Lee:
In Search of a Data Transformation that Accelerates Neural Field Training. 4830-4839 - Xiaoyang Wu, Li Jiang, Peng-Shuai Wang, Zhijian Liu, Xihui Liu, Yu Qiao, Wanli Ouyang, Tong He, Hengshuang Zhao:
Point Transformer V3: Simpler, Faster, Stronger. 4840-4851 - Axel Barroso-Laguna, Sowmya Munukutla, Victor Adrian Prisacariu, Eric Brachmann:
Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences. 4852-4863 - Hadi Alzayer, Kevin Zhang, Brandon Y. Feng, Christopher A. Metzler, Jia-Bin Huang:
Seeing the World through Your Eyes. 4864-4873 - Zhiqiang Yan, Yuankai Lin, Kun Wang, Yupeng Zheng, Yufei Wang, Zhenyu Zhang, Jun Li, Jian Yang:
Tri-Perspective view Decomposition for Geometry-Aware Depth Completion. 4874-4884 - Georg Bökman, Johan Edstedt, Michael Felsberg, Fredrik Kahl:
Steerers: A Framework for Rotation Equivariant Keypoint Descriptors. 4885-4895 - Yang Chen, Yingwei Pan, Haibo Yang, Ting Yao, Tao Mei:
VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation. 4896-4905 - Zhiyuan Min, Yawei Luo, Wei Yang, Yuesong Wang, Yi Yang:
Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields. 4906-4916 - Chengyao Wang, Li Jiang, Xiaoyang Wu, Zhuotao Tian, Bohao Peng, Hengshuang Zhao, Jiaya Jia:
GroupContrast: Semantic-Aware Self-Supervised Representation Learning for 3D Understanding. 4917-4928 - Yu Meng, Zhou Xue, Xu Chang, Xuemei Hu, Tao Yue:
iToF-Flow-Based High Frame Rate Depth Imaging. 4929-4938 - Haechan Lee, Wonjoon Jin, Seung-Hwan Baek, Sunghyun Cho:
Generalizable Novel-View Synthesis Using a Stereo Camera. 4939-4948 - Zhipeng Hu, Minda Zhao, Chaoyi Zhao, Xinyue Liang, Lincheng Li, Zeng Zhao, Changjie Fan, Xiaowei Zhou, Xin Yu:
EfficientDreamer: High-Fidelity and Stable 3D Creation via Orthogonal-view Diffusion Priors. 4949-4958 - Lalit Manam, Venu Madhav Govindu:
Leveraging Camera Triplets for Efficient and Accurate Structure-from-Motion. 4959-4968 - Lukas Radl, Michael Steiner, Andreas Kurz, Markus Steinberger:
LAENeRF: Local Appearance Editing for Neural Radiance Fields. 4969-4978 - Kirill Mazur, Gwangbin Bae, Andrew J. Davison:
SuperPrimitive: Scene Reconstruction at a Primitive Level. 4979-4989 - Felix Rydell, Angélica Torres, Viktor Larsson:
Revisiting Sampson Approximations for Geometric Estimation Problems. 4990-4998 - Shaocong Dong, Lihe Ding, Zhanpeng Huang, Zibin Wang, Tianfan Xue, Dan Xu:
Interactive3D: Create What You Want by Interactive 3D Generation. 4999-5008 - Zihan Gao, Licheng Jiao, Lingling Li, Xu Liu, Fang Liu, Puhua Chen, Yuwei Guo:
Multiplane Prior Guided Few-Shot Aerial Scene Rendering. 5009-5019 - Zhiyin Qian, Shaofei Wang, Marko Mihajlovic, Andreas Geiger, Siyu Tang:
3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting. 5020-5030 - Ange Lou, Benjamin Planche, Zhongpai Gao, Yamin Li, Tianyu Luan, Hao Ding, Terrence Chen, Jack H. Noble, Ziyan Wu:
DaReNeRF: Direction-aware Representation for Dynamic Scenes. 5031-5042 - Lukas Höllein, Aljaz Bozic, Norman Müller, David Novotný, Hung-Yu Tseng, Christian Richardt, Michael Zollhöfer, Matthias Nießner:
ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models. 5043-5052 - Jaehoon Choi, Rajvi Shah, Qinbo Li, Yipeng Wang, Ayush Saraf, Changil Kim, Jia-Bin Huang, Dinesh Manocha, Suhib Alsisan, Johannes Kopf:
LTM: Lightweight Textured Mesh Extraction and Refinement of Large Unbounded Scenes for Efficient Storage and Real-Time Rendering. 5053-5063 - Andrea Porfiri Dal Cin, Timothy Duff, Luca Magri, Tomás Pajdla:
Minimal Perspective Autocalibration. 5064-5073 - Shuofeng Sun, Yongming Rao, Jiwen Lu, Haibin Yan:
X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition. 5074-5083 - Junkai Deng, Fei Hou, Xuhui Chen, Wencheng Wang, Ying He:
2S-UDF: A Novel Two-Stage UDF Learning Method for Robust Non-Watertight Model Reconstruction from Multi-View Images. 5084-5093 - Youngju Na, Woo Jae Kim, Kyu Beom Han, Suhyeon Ha, Sung-Eui Yoon:
UFORecon: Generalizable Sparse-View Surface Reconstruction from Arbitrary and Unfavorable Sets. 5094-5104 - Xiangyue Liu, Han Xue, Kunming Luo, Ping Tan, Li Yi:
GenN2N: Generative NeRF2NeRF Translation. 5105-5114 - Lihe Ding, Shaocong Dong, Zhanpeng Huang, Zibin Wang, Yiyuan Zhang, Kaixiong Gong, Dan Xu, Tianfan Xue:
Text-to-3D Generation with Bidirectional Diffusion Using Both 2D and 3D Priors. 5115-5124 - Yaqing Ding, Jonathan Astermark, Magnus Oskarsson, Viktor Larsson:
Noisy One-Point Homographies are Surprisingly Good. 5125-5134 - Peng Xu, Zhiyu Xiang, Chengyu Qiao, Jingyun Fu, Tianyu Pu:
Adaptive Multi-Modal Cross-Entropy Loss for Stereo Matching. 5135-5144 - Zehan Zheng, Fan Lu, Weiyi Xue, Guang Chen, Changjun Jiang:
LiDAR4D: Dynamic Neural Fields for Novel Space-Time View LiDAR Synthesis. 5145-5154 - Ziyi Chen, Xiaolong Wu, Yu Zhang:
NC-SDF: Enhancing Indoor Scene Reconstruction Using Neural SDFs with View-Dependent Normal Compensation. 5155-5165 - Jiaqi Lin, Zhihao Li, Xiao Tang, Jianzhuang Liu, Shiyong Liu, Jiayue Liu, Yangdi Lu, Xiaofei Wu, Songcen Xu, Youliang Yan, Wenming Yang:
VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction. 5166-5175 - Ka-Chun Shum, Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung:
Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates. 5176-5187 - Yanzhe Liu, Rong Chen, Yushi Li, Yixi Li, Xuehou Tan:
SPU-PMD: Self-Supervised Point Cloud Upsampling via Progressive Mesh Deformation. 5188-5197 - Peter Kocsis, Vincent Sitzmann, Matthias Nießner:
Intrinsic Image Diffusion for Indoor Single-view Material Estimation. 5198-5208 - Zicheng Zhang, Ruobing Zheng, Bonan Li, Congying Han, Tianqi Li, Meng Wang, Tiande Guo, Jingdong Chen, Ziwen Liu, Ming Yang:
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis. 5209-5219 - Viktor Kocur, Daniel Kyselica, Zuzana Kukelova:
Robust Self-Calibration of Focal Lengths from the Fundamental Matrix. 5220-5229 - Baptiste Brument, Robin Bruneau, Yvain Quéau, Jean Mélou, François Bernard Lauze, Jean-Denis Durou, Lilian Calvet:
RNb-NeuS: Reflectance and Normal-Based Multi-View 3D Reconstruction. 5230-5239 - Hao-Bin Duan, Miao Wang, Yan-Xun Li, Yong-Liang Yang:
Neural 3D Strokes: Creating Stylized 3D Scenes with Vectorized 3D Strokes. 5240-5249 - Jiacheng Deng, Jiahao Lu, Tianzhu Zhang:
Unsupervised Template-assisted Point Cloud Shape Correspondence Network. 5250-5259 - Shaohan Li, Yunpeng Shi, Gilad Lerman:
Efficient Detection of Long Consistent Cycles and its Application to Distributed Synchronization. 5260-5269 - Jamie Watson, Filippo Aleotti, Mohamed Sayed, Zawar Qureshi, Oisin Mac Aodha, Gabriel J. Brostow, Michael Firman, Sara Vicente:
AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings. 5270-5280 - Jonas Kälble, Sascha Wirges, Maxim Tatarchenko, Eddy Ilg:
Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory. 5281-5290 - Qi Ma, Danda Pani Paudel, Ajad Chhatkuli, Luc Van Gool:
Continuous Pose for Monocular Cameras in Neural Implicit Representation. 5291-5301 - Fangzhou Mu, Carter Sifferman, Sacha Jungerman, Yiquan Li, Mark Han, Michael Gleicher, Mohit Gupta, Yin Li:
Towards 3D Vision with Low-Cost Single-Photon Cameras. 5302-5311 - Yongzhe Yuan, Yue Wu, Xiaolong Fan, Maoguo Gong, Qiguang Miao, Wenping Ma:
Inlier Confidence Calibration for Point Cloud Registration. 5312-5321 - Yingwenqi Jiang, Jiadong Tu, Yuan Liu, Xifeng Gao, Xiaoxiao Long, Wenping Wang, Yuexin Ma:
GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces. 5322-5332 - Jin-Chuan Shi, Miao Wang, Hao-Bin Duan, Shao-Hua Guan:
Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding. 5333-5343 - Honghua Chen, Chen Change Loy, Xingang Pan:
MVIP-NeRF: Multi-View 3D Inpainting on NeRF Scenes via Diffusion Prior. 5344-5353 - Antoine Guédon, Vincent Lepetit:
SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering. 5354-5363 - Tianyu Huang, Yihan Zeng, Zhilu Zhang, Wan Xu, Hang Xu, Songcen Xu, Rynson W. H. Lau, Wangmeng Zuo:
DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior. 5364-5373 - Silvia Zuffi, Ylva Mellbin, Ci Li, Markus Höschle, Hedvig Kjellström, Senya Polikovsky, Elin Hernlund, Michael J. Black:
VAREN: Very Accurate and Realistic Equine Network. 5374-5383 - Chaoyue Song, Jiacheng Wei, Chuan Sheng Foo, Guosheng Lin, Fayao Liu:
REACTO: Reconstructing Articulated Objects from a Single Video. 5384-5395 - Jaehyeok Shim, Kyungdon Joo:
DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction. 5396-5405 - Weiyao Wang, Pierre Gleize, Hao Tang, Xingyu Chen, Kevin J. Liang, Matt Feiszli:
ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization. 5406-5417 - Yiyang Chen, Lunhao Duan, Shanshan Zhao, Changxing Ding, Dacheng Tao:
Local-consistent Transformation Learning for Rotation-invariant Point Cloud Analysis. 5418-5427 - Xiao Tang, Min Yang, Penghui Sun, Hui Li, Yuchao Dai, Feng Zhu, Hojae Lee:
PaReNeRF: Toward Fast Large-Scale Dynamic NeRF with Patch-Based Reference. 5428-5438 - Gabriel Dogadov, Ugo Paavo Finnendahl, Marc Alexa:
Fitting Flats to Flats. 5439-5447 - Marco Pesavento, Yuanlu Xu, Nikolaos Sarafianos, Robert Maier, Ziyan Wang, Chun-Han Yao, Marco Volino, Edmond Boyer, Adrian Hilton, Tony Tung:
ANIM: Accurate Neural Implicit Model for Human Reconstruction from a Single RGB-D Image. 5448-5458 - Tongfan Guan, Chen Wang, Yun-Hui Liu:
Neural Markov Random Field for Stereo Matching. 5459-5469 - Takuhiro Kaneko:
Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization. 5470-5480 - Tobias Kirschstein, Simon Giebenhain, Matthias Nießner:
DiffusionAvatars: Deferred Diffusion for High-fidelity 3D Head Avatars. 5481-5492 - Chunlong Xia, Xinliang Wang, Feng Lv, Xin Hao, Yifeng Shi:
ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions. 5493-5502 - Ruixuan Yu, Jian Sun:
Pose-Transformed Equivariant Network for 3D Point Trajectory Prediction. 5503-5512 - Xiaohan Ding, Yiyuan Zhang, Yixiao Ge, Sijie Zhao, Lin Song, Xiangyu Yue, Ying Shan:
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition. 5513-5524 - Hugues Thomas, Yao-Hung Hubert Tsai, Timothy D. Barfoot, Jian Zhang:
KPConvX: Modernizing Kernel Point Convolution with Kernel Attention. 5525-5535 - Otniel-Bogdan Mercea, Alexey A. Gritsenko, Cordelia Schmid, Anurag Arnab:
Time-, Memory- and Parameter-Efficient Visual Adaptation. 5536-5545 - Yikang Li, Yeqing Qiu, Yuxuan Chen, Lingshen He, Zhouchen Lin:
Affine Equivariant Networks Based on Differential Invariants. 5546-5556 - Honghao Chen, Xiangxiang Chu, Yongjian Ren, Xin Zhao, Kaiqi Huang:
PeLK: Parameter-Efficient Large Kernel ConvNets with Peripheral Convolution. 5557-5567 - Renan A. Rojas-Gomez, Teck-Yian Lim, Minh N. Do, Raymond A. Yeh:
Making Vision Transformers Truly Shift-Equivariant. 5568-5577 - Hancheng Ye, Chong Yu, Peng Ye, Renqiu Xia, Yansong Tang, Jiwen Lu, Tao Chen, Bo Zhang:
Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression. 5578-5588 - Chunxiao Fan, Ziqi Wang, Dan Guo, Meng Wang:
Data-Free Quantization via Pseudo-label Filtering. 5589-5598 - Yuxiang Lu, Suizhi Huang, Yuwen Yang, Shalayiding Sirejiding, Yue Ding, Hongtao Lu:
Fedhca2: Towards Hetero-Client Federated Multi-Task Learning. 5599-5609 - Xinyu Shi, Zecheng Hao, Zhaofei Yu:
SpikingResformer: Bridging ResNet and Vision Transformer in Spiking Neural Networks. 5610-5619 - Lewei Yao, Renjie Pi, Jianhua Han, Xiaodan Liang, Hang Xu, Wei Zhang, Zhenguo Li, Dan Xu:
DetCLIPv3: Towards Versatile Generative Open-Vocabulary Object Detection. 5610-5619 - Pavlo Melnyk, Andreas Robinson, Michael Felsberg, Mårten Wadenbäck:
TetraSphere: A Neural Descriptor for O(3)-Invariant Point Cloud Analysis. 5620-5630 - Tao Li, Pan Zhou, Zhengbao He, Xinwen Cheng, Xiaolin Huang:
Friendly Sharpness-Aware Minimization. 5631-5640 - Qihang Fan, Huaibo Huang, Mingrui Chen, Hongmin Liu, Ran He:
RMT: Retentive Networks Meet Vision Transformers. 5641-5651 - Yuwen Xiong, Zhiqi Li, Yuntao Chen, Feng Wang, Xizhou Zhu, Jiapeng Luo, Wenhai Wang, Tong Lu, Hongsheng Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai:
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications. 5652-5661 - Beichen Zhang, Xiaoxing Wang, Xiaohan Qin, Junchi Yan:
Boosting Order-Preserving and Transferability for Neural Architecture Search: A Joint Architecture Refined Search and Fine-Tuning Approach. 5662-5671 - Weihao Yu, Pan Zhou, Shuicheng Yan, Xinchao Wang:
InceptionNeXt: When Inception Meets ConvNeXt. 5672-5683 - Edwin Vargas, Claudia V. Correa P., Carlos Hinojosa, Henry Arguello:
BiPer: Binary Neural Networks Using a Periodic Function. 5684-5693 - Xu Ma, Xiyang Dai, Yue Bai, Yizhou Wang, Yun Fu:
Rewrite the Stars. 5694-5703 - Ruichen Ma, Guanchao Qiao, Yian Liu, Liwei Meng, Ning Ning, Yang Liu, Shaogang Hu:
A&B BNN: Add&Bit-Operation-Only Hardware-Friendly Binary Neural Network. 5704-5713 - Guikun Chen, Xia Li, Yi Yang, Wenguan Wang:
Neural Clustering Based Visual Representation Learning. 5714-5725 - Keith G. Mills, Fred X. Han, Mohammad Salameh, Shengyao Lu, Chunhua Zhou, Jiao He, Fengyu Sun, Di Niu:
Building Optimal Neural Architectures Using Interpretable Knowledge. 5726-5735 - Mengfei Xia, Yujun Shen, Changsong Lei, Yu Zhou, Deli Zhao, Ran Yi, Wenping Wang, Yong-Jin Liu:
Towards More Accurate Diffusion Model Acceleration with a Timestep Tuner. 5736-5745 - Jingjing Xie, Yuxin Zhang, Mingbao Lin, Zhihang Lin, Liujuan Cao, Rongrong Ji:
UniPTS: A Unified Framework for Proficient Post-Training Sparsity. 5746-5755 - Seokju Yun, Youngmin Ro:
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design. 5756-5767 - Aihua Mao, Biao Yan, Zijing Ma, Ying He:
Denoising Point Clouds in Latent Space via Graph Convolution and Invertible Neural Network. 5768-5777 - Weiying Xie, Haowei Li, Jitao Ma, Yunsong Li, Jie Lei, Donglai Liu, Leyuan Fang:
JointSQ: Joint Sparsification-Quantization for Distributed Learning. 5778-5787 - Alon Zolfi, Guy Amit, Amit Baras, Satoru Koda, Ikuya Morikawa, Yuval Elovici, Asaf Shabtai:
YolOOD: Utilizing Object Detection Concepts for Multi-Label Out-of-Distribution Detection. 5788-5797 - Xiang Fei, Xiawu Zheng, Yan Wang, Fei Chao, Chenglin Wu, Liujuan Cao:
RepAn: Enhanced Annealing through Re-parameterization. 5798-5808 - Duo Su, Junjie Hou, Weizhi Gao, Yingjie Tian, Bowen Tang:
D4M: Dataset Distillation via Disentangled Diffusion Model. 5809-5818 - Fangjinhua Wang, Xudong Jiang, Silvano Galliani, Christoph Vogel, Marc Pollefeys:
GLACE: Global Local Accelerated Coordinate Encoding. 5819-5828 - Nikola Zubic, Mathias Gehrig, Davide Scaramuzza:
State Space Models for Event Cameras. 5819-5828 - Sofia Casarin, Cynthia Ifeyinwa Ugwu, Sergio Escalera, Oswald Lanz:
Your Image Is My Video: Reshaping the Receptive Field via Image-to-Video Differentiable AutoAugmentation and Fusion. 5829-5839 - Tahira Shehzadi, Khurram Azeem Hashmi, Didier Stricker, Muhammad Zeshan Afzal:
Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection. 5840-5850 - Xuzhe Zhang, Yuhao Wu, Elsa D. Angelini, Ang Li, Jia Guo, Jerod M. Rasmussen, Thomas G. O'Connor, Pathik D. Wadhwa, Andrea Parolin Jackowski, Hai Li, Jonathan Posner, Andrew F. Laine, Yun Wang:
MAPSeg: Unified Unsupervised Domain Adaptation for Heterogeneous Medical Image Segmentation Based on 3D Masked Autoencoding and Pseudo-Labeling. 5851-5862 - Ha Min Son, Moon-Hyun Kim, Tai-Myoung Chung, Chao Huang, Xin Liu:
FedUV: Uniformity and Variance for Heterogeneous Federated Learning. 5863-5872 - Ashish Kumar, Daneul Kim, Jaesik Park, Laxmidhar Behera:
Pick-or-Mix: Dynamic Channel Sampling for ConvNets. 5873-5882 - Zhiyuan Yu, Li Shen, Liang Ding, Xinmei Tian, Yixin Chen, Dacheng Tao:
Sheared Backpropagation for Fine-Tuning Foundation Models. 5883-5892 - Junghyup Lee, Bumsub Ham:
AZ-NAS: Assembling Zero-Cost Proxies for Network Architecture Search. 5893-5903 - Sumanth Udupa, Prajwal Gurunath, Aniruddh Sikdar, Suresh Sundaram:
MRFP: Learning Generalizable Semantic Segmentation from Sim-2-Real with Multi-Resolution Feature Perturbation. 5904-5914 - Zhengqi Xu, Ke Yuan, Huiqiong Wang, Yong Wang, Mingli Song, Jie Song:
Training-Free Pretrained Model Merging. 5915-5925 - Cansu Korkmaz, A. Murat Tekalp, Zafer Dogan:
Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts. 5926-5936 - Alessio Mazzucchelli, Adrian Garcia-Garcia, Elena Garces, Fernando Rivas-Manzaneque, Francesc Moreno-Noguer, Adrián Peñate Sánchez:
IReNe: Instant Recoloring of Neural Radiance Fields. 5937-5946 - Sudong Cai:
AdaShift: Learning Discriminative Self-Gated Neural Feature Activation With an Adaptive Shift Factor. 5947-5956 - Jinzhi Zheng, Heng Fan, Libo Zhang:
Kernel Adaptive Convolution for Scene Text Detection via Distance Map Prediction. 5957-5966 - Yuwei Ou, Yuqi Feng, Yanan Sun:
Towards Accurate and Robust Architectures via Neural Architecture Search. 5967-5976 - Jinfeng Xu, Siyuan Yang, Xianzhi Li, Yuan Tang, Yixue Hao, Long Hu, Min Chen:
PDF: A Probability-Driven Framework for Open World 3D Point Cloud Semantic Segmentation. 5977-5986 - Hengyuan Xu, Liyao Xiang, Hangyu Ye, Dixi Yao, Pengzhi Chu, Baochun Li:
Permutation Equivariance of Transformers and its Applications. 5987-5996 - Hyejin Park, Jeongyeon Hwang, Sunung Mun, Sangdon Park, Jungseul Ok:
MedBN: Robust Test-Time Adaptation against Malicious Test Samples. 5997-6007 - He Liu, Yikai Wang, Huaping Liu, Fuchun Sun, Anbang Yao:
Small Scale Data-Free Knowledge Distillation. 6008-6016 - Kosuke Sumiyasu, Kazuhiko Kawamoto, Hiroshi Kera:
Identifying Important Group of Pixels using Interactions. 6017-6026 - Khiem Le, Long Ho, Cuong Do, Danh Le Phuoc, Kok-Seng Wong:
Efficiently Assemble Normalization Layers and Regularization for Federated Domain Generalization. 6027-6036 - Xinyu Geng, Jiaming Wang, Jiawei Gong, Yuerong Xue, Jun Xu, Fanglin Chen, Xiaolin Huang:
OrthCaps: An Orthogonal CapsNet with Sparse Attention Routing and Pruning. 6037-6046 - Takumi Kobayashi:
Mean-Shift Feature Transformer. 6047-6056 - Shuoxi Zhang, Hanpeng Liu, Stephen Lin, Kun He:
You Only Need Less Attention at Each Stage in Vision Transformers. 6057-6066 - Oscar Carlsson, Jan E. Gerken, Hampus Linander, Heiner Spieß, Fredrik Ohlsson, Christoffer Petersson, Daniel Persson:
HEAL-SWIN: A Vision Transformer on the Sphere. 6067-6077 - David Osowiechi, Gustavo Adolfo Vargas Hakim, Mehrdad Noori, Milad Cheraghalikhani, Ali Bahri, Moslem Yazdanpanah, Ismail Ben Ayed, Christian Desrosiers:
NC-TTT: A Noise Constrastive Approach for Test-Time Training. 6078-6086 - Wenlong Deng, Christos Thrampoulidis, Xiaoxiao Li:
Unlocking the Potential of Prompt-Tuning in Bridging Generalized and Personalized Federated Learning. 6087-6097 - Siddharth Roheda, Amit Satish Unde, Loay Rashid:
MR-VNet: Media Restoration using Volterra Networks. 6098-6107 - Sherry X. Chen, Yaron Vaxman, Elad Ben Baruch, David Asulin, Aviad Moreshet, Kuo-Chin Lien, Misha Sra, Pradeep Sen:
TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing. 6108-6117 - Yiyuan Zhang, Xiaohan Ding, Kaixiong Gong, Yixiao Ge, Ying Shan, Xiangyu Yue:
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities. 6108-6117 - Mustafa Munir, William Avery, Md Mostafijur Rahman, Radu Marculescu:
GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs. 6118-6127 - Dongyeong Hwang, Hyunju Kim, Sunwoo Kim, Kijung Shin:
FlowerFormer: Empowering Neural Architecture Encoding Using a Flow-Aware Graph Transformer. 6128-6137 - Huancheng Chen, Haris Vikalo:
Mixed-Precision Quantization for Federated Learning on Resource-Constrained Heterogeneous Devices. 6138-6148 - Zhiyu Qu, Lan Yang, Honggang Zhang, Tao Xiang, Kaiyue Pang, Yi-Zhe Song:
Wired Perspectives: Multi-View Wire Art Embraces Generative AI. 6149-6158 - Ruoyi Du, Dongliang Chang, Timothy M. Hospedales, Yi-Zhe Song, Zhanyu Ma:
DemoFusion: Democratising High-Resolution Image Generation With No $$$. 6159-6168 - Chenyang Wang, Zerong Zheng, Tao Yu, Xiaoqian Lv, Bineng Zhong, Shengping Zhang, Liqiang Nie:
DiffPerformer: Iterative Learning of Consistent Latent Guidance for Diffusion-Based Human Video Generation. 6169-6179 - Jiun Tian Hoe, Xudong Jiang, Chee Seng Chan, Yap-Peng Tan, Weipeng Hu:
InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models. 6180-6189 - Chang Liu, Haoning Wu, Yujie Zhong, Xiaoyun Zhang, Yanfeng Wang, Weidi Xie:
Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models. 6190-6200 - Jonas Schult, Sam S. Tsai, Lukas Höllein, Bichen Wu, Jialiang Wang, Chih-Yao Ma, Kunpeng Li, Xiaofang Wang, Felix Wimbauer, Zijian He, Peizhao Zhang, Bastian Leibe, Peter Vajda, Ji Hou:
ControlRoom3D: Room Generation Using Semantic Proxy Rooms. 6201-6210 - Felix Wimbauer, Bichen Wu, Edgar Schönfeld, Xiaoliang Dai, Ji Hou, Zijian He, Artsiom Sanakoyeu, Peizhao Zhang, Sam S. Tsai, Jonas Kohler, Christian Rupprecht, Daniel Cremers, Peter Vajda, Jialiang Wang:
Cache Me if You Can: Accelerating Diffusion Models through Block Caching. 6211-6220 - Ziqi Cai, Kaiwen Jiang, Shu-Yu Chen, Yu-Kun Lai, Hongbo Fu, Boxin Shi, Lin Gao:
Real-Time 3D-Aware Portrait Video Relighting. 6221-6231 - Xudong Wang, Trevor Darrell, Sai Saketh Rambhatla, Rohit Girdhar, Ishan Misra:
InstanceDiffusion: Instance-Level Control for Image Generation. 6232-6242 - Junshu Tang, Yanhong Zeng, Ke Fan, Xuheng Wang, Bo Dai, Kai Chen, Lizhuang Ma:
Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text. 6243-6253 - Shanglin Li, Bohan Zeng, Yutang Feng, Sicheng Gao, Xiuhui Liu, Jiaming Liu, Lin Li, Xu Tang, Yao Hu, Jianzhuang Liu, Baochang Zhang:
ZONE: Zero-Shot Instruction-Guided Local Editing. 6254-6263 - Nicolas Dufour, Victor Besnier, Vicky Kalogeiton, David Picard:
Don't Drop Your Samples! Coherence-Aware Training Benefits Conditional Diffusion. 6264-6273 - Sachit Menon, Ishan Misra, Rohit Girdhar:
Generating Illustrated Instructions. 6274-6284 - Lin Zhu, Kangmin Jia, Yifan Zhao, Yunshan Qi, Lizhi Wang, Hua Huang:
SpikeNeRF: Learning Neural Radiance Fields from Continuous Spike Stream. 6285-6295 - Ziyu Wang, Yue Xu, Cewu Lu, Yong-Lu Li:
Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement. 6296-6304 - Lu Qi, Lehan Yang, Weidong Guo, Yu Xu, Bo Du, Varun Jampani, Ming-Hsuan Yang:
UniGS: Unified Representation for Image Generation and Segmentation. 6305-6315 - Kilichbek Haydarov, Aashiq Muhamed, Xiaoqian Shen, Jovana Lazarevic, Ivan Skorokhodov, Chamuditha Jayanga Galappaththige, Mohamed Elhoseiny:
Adversarial Text to Continuous Image Generation. 6316-6326