default search action
17th ECCV 2022: Tel Aviv, Israel - Volume 37
- Shai Avidan, Gabriel J. Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner:
Computer Vision - ECCV 2022 - 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XXXVII. Lecture Notes in Computer Science 13697, Springer 2022, ISBN 978-3-031-19835-9 - Liuwan Zhu, Rui Ning, Jiang Li, Chunsheng Xin, Hongyi Wu:
Most and Least Retrievable Images in Visual-Language Query Systems. 1-18 - Dekun Wu, He Zhao, Xingce Bao, Richard P. Wildes:
Sports Video Analysis on Large-Scale Data. 19-36 - Seonwoo Min, Nokyung Park, Siwon Kim, Seunghyun Park, Jinkyu Kim:
Grounding Visual Representations with Texts for Domain Generalization. 37-53 - Joaquín Ossandón, Benjamín Earle, Álvaro Soto:
Bridging the Visual Semantic Gap in VLN via Semantically Richer Instructions. 54-69 - Adyasha Maharana, Darryl Hannan, Mohit Bansal:
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation. 70-87 - Katherine Crowson, Stella Biderman, Daniel Kornis, Dashiell Stander, Eric Hallahan, Louis Castricato, Edward Raff:
VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance. 88-105 - Xian Liu, Yinghao Xu, Qianyi Wu, Hang Zhou, Wayne Wu, Bolei Zhou:
Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation. 106-125 - Juan León Alcázar, Moritz Cordes, Chen Zhao, Bernard Ghanem:
End-to-End Active Speaker Detection. 126-143 - Dingkang Yang, Shuai Huang, Shunli Wang, Yang Liu, Peng Zhai, Liuzhen Su, Mingcheng Li, Lihua Zhang:
Emotion Recognition for Multiple Context Awareness. 144-162 - Ayan Kumar Bhunia, Aneeshan Sain, Parth Hiren Shah, Animesh Gupta, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song:
Adaptive Fine-Grained Sketch-Based Image Retrieval. 163-181 - Ye Zhu, Kyle Olszewski, Yu Wu, Panos Achlioptas, Menglei Chai, Yan Yan, Sergey Tulyakov:
Quantized GAN for Complex Music Generation from Dance Videos. 182-199 - Hu Wang, Jianpeng Zhang, Yuanhong Chen, Congbo Ma, Jodie Avery, Louise Hull, Gustavo Carneiro:
Uncertainty-Aware Multi-modal Learning via Cross-Modal Random Network Prediction. 200-217 - Shentong Mo, Pedro Morgado:
Localizing Visual Sounds the Easy Way. 218-234 - Tingle Li, Yichen Liu, Andrew Owens, Hang Zhao:
Learning Visual Styles from Audio-Visual Associations. 235-252 - Jae-Ho Choi, Ki-Bong Kang, Kyung-Tae Kim:
Remote Respiration Monitoring of Moving Person Using Radio Signals. 253-270 - Karren Yang, Michael Firman, Eric Brachmann, Clément Godard:
Camera Pose Estimation and Localization with Active Audio Sensing. 271-291 - Samuel Yu, Peter Wu, Paul Pu Liang, Ruslan Salakhutdinov, Louis-Philippe Morency:
PACS: A Dataset for Physical Audiovisual CommonSense Reasoning. 292-309 - Juan F. Montesinos, Venkatesh S. Kadandale, Gloria Haro:
VoViT: Low Latency Graph-Based Audio-Visual Voice Separation Transformer. 310-326 - Zhenqiang Ying, Deepti Ghadiyaram, Alan C. Bovik:
Telepresence Video Quality Assessment. 327-347 - Roman Bachmann, David Mizrahi, Andrei Atanov, Amir Zamir:
MultiMAE: Multi-modal Multi-task Masked Autoencoders. 348-367 - Efthymios Tzinis, Scott Wisdom, Tal Remez, John R. Hershey:
AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation. 368-385 - Jinxing Zhou, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo, Lingpeng Kong, Meng Wang, Yiran Zhong:
Audio-Visual Segmentation. 386-403 - Yeying Jin, Wenhan Yang, Robby T. Tan:
Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression. 404-421 - Suprosanna Shit, Rajat Koner, Bastian Wittmann, Johannes C. Paetzold, Ivan Ezhov, Hongwei Li, Jiazhen Pan, Sahand Sharifzadeh, Georgios Kaissis, Volker Tresp, Bjoern H. Menze:
Relationformer: A Unified Framework for Image-to-Graph Generation. 422-439 - Shruti Vyas, Chen Chen, Mubarak Shah:
GAMa: Cross-View Video Geo-Localization. 440-456 - Kengo Nakata, Youyang Ng, Daisuke Miyashita, Asuka Maki, Yu-Chieh Lin, Jun Deguchi:
Revisiting a kNN-Based Image Classification System with High-Capacity Storage. 457-474 - Hao Feng, Wengang Zhou, Jiajun Deng, Yuechen Wang, Houqiang Li:
Geometric Representation Learning for Document Image Rectification. 475-492 - Guoli Jia, Jufeng Yang:
S2-VER: Semi-supervised Visual Emotion Recognition. 493-509 - Ruoyu Feng, Xin Jin, Zongyu Guo, Runsen Feng, Yixin Gao, Tianyu He, Zhizheng Zhang, Simeng Sun, Zhibo Chen:
Image Coding for Machines with Omnipotent Feature Learning. 510-528 - Conghui Hu, Gim Hee Lee:
Feature Representation Learning for Unsupervised Cross-Domain Image Retrieval. 529-544 - Shilin Xu, Xiangtai Li, Jingbo Wang, Guangliang Cheng, Yunhai Tong, Dacheng Tao:
Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition. 545-563 - Xuqian Ren, Yifan Liu:
Semantic-Guided Multi-mask Image Harmonization. 564-579 - Sagnik Das, Ke Ma, Zhixin Shu, Dimitris Samaras:
Learning an Isometric Surface Parameterization for Texture Unwrapping. 580-597 - Rahul Duggal, Hao Zhou, Shuo Yang, Jun Fang, Yuanjun Xiong, Wei Xia:
Towards Regression-Free Neural Networks for Diverse Compute Platforms. 598-614 - Xiaoyu Xu, Jiayan Qiu, Xinchao Wang, Zhou Wang:
Relationship Spatialization for Depth Estimation. 615-637 - Chenfeng Xu, Shijia Yang, Tomer Galanti, Bichen Wu, Xiangyu Yue, Bohan Zhai, Wei Zhan, Peter Vajda, Kurt Keutzer, Masayoshi Tomizuka:
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models. 638-656 - Divya Kothandaraman, Tianrui Guan, Xijun Wang, Shuowen Hu, Ming C. Lin, Dinesh Manocha:
FAR: Fourier Aerial Video Recognition. 657-676 - Ruocheng Wang, Yunzhi Zhang, Jiayuan Mao, Chin-Yi Cheng, Jiajun Wu:
Translating a Visual LEGO Manual to a Machine-Executable Plan. 677-694 - Junbang Liang, Ming C. Lin:
Fabric Material Recovery from Video Using Multi-scale Geometric Auto-Encoder. 695-714 - Jie Ren, Wenteng Liang, Ran Yan, Luo Mai, Shiwen Liu, Xiao Liu:
MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment. 715-731 - Georgios Pavlakos, Ethan Weber, Matthew Tancik, Angjoo Kanazawa:
The One Where They Reconstructed 3D Humans and Environments in TV Shows. 732-749
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.