


18th ECCV 2024: Milan, Italy - Part LXII
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXII. Lecture Notes in Computer Science 15120, Springer 2025, ISBN 978-3-031-73032-0
- Aayam Shrestha, Pan Liu, Germán Ros, Kai Yuan, Alan Fern:
Generating Physically Realistic and Directable Human Motions from Multi-modal Inputs. 1-17
- Nikita Karaev, Ignacio Rocco, Benjamin Graham, Natalia Neverova, Andrea Vedaldi, Christian Rupprecht:
CoTracker: It Is Better to Track Together. 18-35
- Ziyi Lin, Dongyang Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Yu Qiao, Hongsheng Li:
SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models. 36-55
- Yuxuan Sun, Hao Wu, Chenglu Zhu, Sunyi Zheng, Qizi Chen, Kai Zhang, Yunlong Zhang, Dan Wan, Xiaoxiao Lan, Mengyue Zheng, Jingxiong Li, Xinheng Lyu, Tao Lin, Lin Yang:
PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology. 56-73
- Avery Ma, Amir-massoud Farahmand, Yangchen Pan, Philip Torr, Jindong Gu:
Improving Adversarial Transferability via Model Alignment. 74-92
- Wenhao Ding, Yulong Cao, Ding Zhao, Chaowei Xiao, Marco Pavone:
RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios. 93-110
- Hao Tang, Weiyao Wang, Pierre Gleize, Matt Feiszli:
ADen: Adaptive Density Representations for Sparse-View Camera Pose Estimation. 111-128
- Yunsong Zhou, Linyan Huang, Qingwen Bu, Jia Zeng, Tianyu Li, Hang Qiu, Hongzi Zhu, Minyi Guo, Yu Qiao, Hongyang Li:
Embodied Understanding of Driving Scenarios. 129-148
- Chris Zhang, Sourav Biswas, Kelvin Wong, Kion Fallah, Lunjun Zhang, Dian Chen, Sergio Casas, Raquel Urtasun:
Learning to Drive via Asymmetric Self-Play. 149-168
- Zhening Huang, Xiaoyang Wu, Xi Chen, Hengshuang Zhao, Lei Zhu, Joan Lasenby:
OpenIns3D: Snap and Lookup for 3D Open-Vocabulary Instance Segmentation. 169-185
- Xijun Wang, Junbang Liang, Chun-Kai Wang, Kenan Deng, Yu Lou, Ming C. Lin, Shan Yang:
ViLA: Efficient Video-Language Alignment for Video Question Answering. 186-204
- Rohit Girdhar, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra:
Factorizing Text-to-Video Generation by Explicit Image Conditioning. 205-224
- Yang Zhao, Yanwu Xu, Zhisheng Xiao, Haolin Jia, Tingbo Hou:
MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices. 225-242
- Yiyang Su, Minchul Kim, Feng Liu, Anil K. Jain, Xiaoming Liu:
Open-Set Biometrics: Beyond Good Closed-Set Models. 243-261
- Siyuan Cheng, Guangyu Shen, Kaiyuan Zhang, Guanhong Tao, Shengwei An, Hanxi Guo, Shiqing Ma, Xiangyu Zhang:
UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening. 262-281
- Fengyuan Liu, Haochen Luo, Yiming Li, Philip Torr, Jindong Gu:
Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution. 282-301
- Opher Bar Nathan, Deborah Levy, Tali Treibitz, Dan Rosenbaum:
Osmosis: RGBD Diffusion Prior for Underwater Image Restoration. 302-319
- Feixiang Zhou, Bryan M. Williams, Hossein Rahmani:
Towards Adaptive Pseudo-Label Learning for Semi-Supervised Temporal Action Localization. 320-338
- Anders Holst, Niels Chr. Overgaard:
Computing the Lipschitz Constant Needed for Fast Scene Recovery from CASSI Measurements. 339-353
- Yu Chi, Fangneng Zhan, Sibo Wu, Christian Theobalt, Adam Kortylewski:
DatasetNeRF: Efficient 3D-Aware Data Factory with Generative Radiance Fields. 354-372
- Mikhail Okunev, Marc Mapeke, Benjamin Attal, Christian Richardt, Matthew O'Toole, James Tompkin:
Flowed Time of Flight Radiance Fields. 373-389
- Haoran Li, Long Ma, Haolin Shi, Yanbin Hao, Yong Liao, Lechao Cheng, Peng Yuan Zhou:
3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing. 390-406
- Chaitanya Patel, Shaojie Bai, Te-Li Wang, Jason M. Saragih, Shih-En Wei:
Fast Registration of Photorealistic Avatars for VR Facial Animation. 407-423
- Cristina Mata, Kanchana Ranasinghe, Michael S. Ryoo:
CoPT: Unsupervised Domain Adaptive Segmentation Using Domain-Agnostic Text Embeddings. 424-440
- Ziwei Yao, Ruiping Wang, Xilin Chen:
HiFi-Score: Fine-Grained Image Description Evaluation with Hierarchical Parsing Graphs. 441-458
- Anas Mahmoud, Ali Harakeh, Steven L. Waslander:
Image-to-Lidar Relational Distillation for Autonomous Driving Data. 459-475
- Gemma Canet Tarres, Zhe Lin, Zhifei Zhang, Jianming Zhang, Yizhi Song, Dan Ruta, Andrew Gilbert, John P. Collomosse, Soo Ye Kim:
Thinking Outside the BBox: Unconstrained Generative Object Compositing. 476-495
