default search action
WACV 2022: Waikoloa, HI, USA
- IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2022, Waikoloa, HI, USA, January 3-8, 2022. IEEE 2022, ISBN 978-1-6654-0915-5
- Dina Bashkirova, Ben Usman, Kate Saenko:
Evaluation of Correctness in Unsupervised Many-to-Many Image Translation. 1-10 - Pengsheng Guo, Miguel Ángel Bautista, Alex Colburn, Liang Yang, Daniel Ulbricht, Joshua M. Susskind, Qi Shan:
Fast and Explicit Neural View Synthesis. 11-20 - Aamir Mustafa, Aliaksei Mikhailiuk, Dan-Andrei Iliescu, Varun Babbar, Rafal K. Mantiuk:
Training a Task-Specific Image Reconstruction Loss. 21-30 - Karin Jakoel, Liron Efraim, Tamar Rott Shaham:
GANs Spatial Control via Inference-Time Adaptive Normalization. 31-40 - Yuhao Liu, Felipe Gutierrez-Barragan, Atul Ingle, Mohit Gupta, Andreas Velten:
Single-Photon Camera Guided Extreme Dynamic Range Imaging. 41-51 - Abdelrahman Abdelhamed, Jonghwa Yim, Abhijith Punnappurath, Michael S. Brown, Jihwan Choe, Kihwan Kim:
Extracting Vignetting and Grain Filter Effects from Photos. 52-60 - Haesoo Chung, Nam Ik Cho:
High Dynamic Range Imaging of Dynamic Scenes with Saturation Compensation but without Explicit Motion Compensation. 61-71 - Hankui Peng, Angelica I. Avilés-Rivero, Carola-Bibiane Schönlieb:
HERS Superpixels: Deep Affinity Learning for Hierarchical Entropy Rate Segmentation. 72-81 - Abdullah Abuolaim, Mahmoud Afifi, Michael S. Brown:
Improving Single-Image Defocus Deblurring: How Dual-Pixel Images Help Through Multi-Task Learning. 82-90 - Shady Abu-Hussein, Tom Tirer, Se Young Chun, Yonina C. Eldar, Raja Giryes:
Image Restoration by Deep Projected GSURE. 91-100 - Ziqiao Guan, Esther H. R. Tsai, Xiaojing Huang, Kevin G. Yager, Hong Qin:
Non-Blind Deblurring for Fluorescence: A Deformable Latent Space Approach with Kernel Parameterization. 101-109 - Vaibhav Vavilala, David A. Forsyth:
Controlled GAN-Based Creature Synthesis via a Challenging Game Art Dataset - Addressing the Noise-Latent Trade-Off. 110-119 - Reza Ghoddoosian, Saif Iftekar Sayed, Vassilis Athitsos:
Hierarchical Modeling for Task Recognition and Action Segmentation in Weakly-Labeled Instructional Videos. 120-130 - Md Taufeeq Uddin, Shaun J. Canavan:
Quantified Facial Expressiveness for Affective Behavior Analytics. 131-140 - Anshul Shah, Shlok Mishra, Ankan Bansal, Jun-Cheng Chen, Rama Chellappa, Abhinav Shrivastava:
Pose and Joint-Aware Action Recognition. 141-151 - Maheen Rashid, Sofia Broomé, Katrina Ask, Elin Hernlund, Pia Haubro Andersen, Hedvig Kjellström, Yong Jae Lee:
Equine Pain Behavior Classification via Self-Supervised Disentangled Pose Representation. 152-162 - Mirco Planamente, Chiara Plizzari, Emanuele Alberti, Barbara Caputo:
Domain Generalization through Audio-Visual Relative Norm Alignment in First Person Action Recognition. 163-174 - Zhe Wang, Hao Chen, Xinyu Li, Chunhui Liu, Yuanjun Xiong, Joseph Tighe, Charless C. Fowlkes:
SSCAP: Self-supervised Co-occurrence Action Parsing for Unsupervised Temporal Action Segmentation. 175-184 - Mohamed Elfeki, Liqiang Wang, Ali Borji:
Multi-stream dynamic video Summarization. 185-195 - Dominik Bauer, Timothy Patten, Markus Vincze:
SporeAgent: Reinforced Scene-level Plausibility for Object Pose Refinement. 196-204 - Hunter Blanton, Scott Workman, Nathan Jacobs:
A Structure-Aware Method for Direct Pose Estimation. 205-214 - Pei-Ze Chiang, Meng-Shiun Tsai, Hung-Yu Tseng, Wei-Sheng Lai, Wei-Chen Chiu:
Stylizing 3D Scene via Implicit Representation and HyperNetwork. 215-224 - Xidong Peng, Xinge Zhu, Tai Wang, Yuexin Ma:
SIDE: Center-based Stereo 3D Detector with Structure-aware Instance Depth Estimation. 225-234 - Cheng Yang, Jia Zheng, Xili Dai, Rui Tang, Yi Ma, Xiaojun Yuan:
Learning to Reconstruct 3D Non-Cuboid Room Layout from a Single RGB Image. 235-244 - Ryo Furukawa, Michihiro Mikamo, Ryusuke Sagawa, Hiroshi Kawasaki:
Single-shot dense active stereo with pixel-wise phase estimation based on grid-structure using CNN and correspondence estimation using GCN. 245-255 - Yecheng Lyu, Xinming Huang, Ziming Zhang:
EllipsoidNet: Ellipsoid Representation for Point Cloud Classification and Segmentation. 256-266 - Chuangguan Ye, Hongyuan Zhu, Yongbin Liao, Yanggang Zhang, Tao Chen, Jiayuan Fan:
What Makes for Effective Few-shot Point Cloud Classification? 267-276 - Shivam Duggal, Zihao Wang, Wei-Chiu Ma, Sivabalan Manivasagam, Justin Liang, Shenlong Wang, Raquel Urtasun:
Mending Neural Implicit Modeling for 3D Vehicle Reconstruction in the Wild. 277-286 - Otto Seiskari, Pekka Rantalankila, Juho Kannala, Jerry Ylilammi, Esa Rahtu, Arno Solin:
HybVIO: Pushing the Limits of Real-time Visual-inertial Odometry. 287-296 - Hitika Tiwari, Vinod K. Kurmi, K. S. Venkatesh, Yong-Sheng Chen:
Occlusion Resistant Network for 3D Face Reconstruction. 297-306 - Kai Fischer, Martin Simon, Stefan Milz, Patrick Mäder:
StickyLocalization: Robust End-To-End Relocalization on Point Clouds using Graph Neural Networks. 307-316 - Kazuma Minami, Hiroaki Santo, Fumio Okura, Yasuyuki Matsushita:
Symmetric-light Photometric Stereo. 317-325 - Lam Huynh, Phong Nguyen, Jirí Matas, Esa Rahtu, Janne Heikkilä:
Lightweight Monocular Depth with a Novel Neural Architecture Search Method. 326-336 - Saisubramaniam Gopalakrishnan, Pranshu Ranjan Singh, Haytham M. Fayek, Savitha Ramasamy, Arulmurugan Ambikapathi:
Knowledge Capture and Replay for Continual Learning. 337-345 - Christian Simon, Piotr Koniusz, Mehrtash Harandi:
Meta-Learning for Multi-Label Few-Shot Classification. 346-355 - Waqar Ahmed, Pietro Morerio, Vittorio Murino:
Cleaning Noisy Labels by Negative Ensemble Learning for Source-Free Unsupervised Domain Adaptation. 356-365 - Yu Yang, Hakan Bilen, Qiran Zou, Wing Yin Cheung, Xiangyang Ji:
Learning Foreground-Background Segmentation from Improved Layered GANs. 366-375 - Vaishnavi Khindkar, Chetan Arora, Vineeth N. Balasubramanian, Anbumani Subramanian, Rohit Saluja, C. V. Jawahar:
To miss-attend is to misalign! Residual Self-Attentive Feature Alignment for Adapting Object Detectors. 376-386 - Evgenii Zheltonozhskii, Chaim Baskin, Avi Mendelson, Alex M. Bronstein, Or Litany:
Contrast to Divide: Self-Supervised Pre-Training for Learning with Noisy Labels. 387-397 - Chaoliang Zhong, Jie Wang, Cheng Feng, Ying Zhang, Jun Sun, Yasuto Yokota:
PICA: Point-wise Instance and Centroid Alignment Based Few-shot Domain Adaptive Object Detection with Loose Annotations. 398-407 - Peng Yang, Shaogang Ren, Yang Zhao, Ping Li:
Calibrating CNNs for Few-Shot Meta Learning. 408-417 - Shyam Nandan Rai, Rohit Saluja, Chetan Arora, Vineeth N. Balasubramanian, Anbumani Subramanian, C. V. Jawahar:
FLUID: Few-Shot Self-Supervised Image Deraining. 418-427 - Mustafa Sercan Amac, Ahmet Sencan, Orhun Bugra Baran, Nazli Ikizler-Cinbis, Ramazan Gokberk Cinbis:
MaskSplit: Self-supervised Meta-learning for Few-shot Semantic Segmentation. 428-438 - Chull Hwan Song, Hye Joo Han, Yannis Avrithis:
All the attention you need: Global-local, spatial-channel attention for image retrieval. 439-448 - Zhibo Yang, Muhammet Bastan, Xinliang Zhu, Douglas Gray, Dimitris Samaras:
Hierarchical Proxy-based Loss for Deep Metric Learning. 449-458 - Weijun Tan, Hongwei Guo, Rushuai Liu:
A Fast Partial Video Copy Detection Using KNN and Global Feature Database. 459-467 - Sarah Ibrahimi, Arnaud Sors, Rafael Sampaio de Rezende, Stéphane Clinchant:
Learning with Label Noise for Image Retrieval by Selecting Interactions. 468-477 - Ameen Ali, Idan Schwartz, Tamir Hazan, Lior Wolf:
Video and Text Matching with Conditioned Embeddings. 478-487 - Yang Cheng, Qian Lin, Jan P. Allebach:
Re-Compose the Image by Evaluating the Crop on More Than Just a Score. 488-496 - Utkarsh Mall, Kavita Bala, Tamara Berg, Kristen Grauman:
Discovering Underground Maps from Fashion. 497-506 - Medhini Narasimhan, Shiry Ginosar, Andrew Owens, Alexei A. Efros, Trevor Darrell:
Strumming to the Beat: Audio-Conditioned Contrastive Video Textures. 507-516 - Marco Godi, Christian Joppi, Geri Skenderi, Marco Cristani:
MovingFashion: a Benchmark for the Video-to-Shop Challenge. 517-525 - Pritish Sahu, Karan Sikka, Ajay Divakaran:
Challenges in Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark. 526-535 - Yi-Wen Chen, Xiaojie Jin, Xiaohui Shen, Ming-Hsuan Yang:
Video Salient Object Detection via Contrastive Features and Attention Modules. 536-545 - Shraman Pramanick, Aniket Roy, Vishal M. Patel:
Multimodal Learning using Optimal Transport for Sarcasm and Humor Detection. 546-556 - Michal Kucer, Diane Oyen, Juan Castorena, Jian Wu:
DeepPatent: Large scale patent drawing recognition and retrieval. 557-566 - Ben Maman, Amit Bermano:
TypeNet: Towards Camera Enabled Touch Typing on Flat Surfaces through Self-Refinement. 567-576 - Arda Senocak, Hyeonggon Ryu, Junsik Kim, In So Kweon:
Less Can Be More: Sound Source Localization With a Classification Model. 577-586 - Chenge Li, István Fehérvári, Xiaonan Zhao, Ives Macêdo, Srikar Appalaraju:
SeeTek: Very Large-Scale Open-set Logo Recognition with Text-Aware Metric Learning. 587-596 - Surgan Jandial, Pinkesh Badjatiya, Pranit Chawla, Ayush Chopra, Mausoom Sarkar, Balaji Krishnamurthy:
SAC: Semantic Attention Composition for Text-Conditioned Image Retrieval. 597-606 - Ahmed Abdelreheem, Ujjwal Upadhyay, Ivan Skorokhodov, Rawan Al Yahya, Jun Chen, Mohamed Elhoseiny:
3DRefTransformer: Fine-Grained Object Identification in Real-World Scenes Using Natural Language. 607-616 - Taher Naderi, Amir Sadovnik, Jason P. Hayward, Hairong Qi:
Monocular Depth Estimation with Adaptive Geometric Attention. 617-627 - Kaustubh Sadekar, Ashish Tiwari, Shanmuganathan Raman:
Shadow Art Revisited: A Differentiable Rendering Based Approach. 628-636 - Ziwen Li, Bo Xu, Han Huang, Cheng Lu, Yandong Guo:
Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation. 637-646 - Abhinav Narayan Harish, Rajendra Nagar, Shanmuganathan Raman:
RGL-NET: A Recurrent Graph Learning framework for Progressive Part Assembly. 647-656 - Yimin Wei, Hao Liu, Tingting Xie, Qiuhong Ke, Yulan Guo:
Spatial-Temporal Transformer for 3D Point Cloud Sequences. 657-666 - Divam Gupta, Wei Pu, Trenton Tabor, Jeff Schneider:
SBEVNet: End-to-End Deep Stereo Layout Estimation. 667-676 - Faranak Shamsafar, Samuel Woerz, Rafia Rahim, Andreas Zell:
MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching. 677-686 - Aloisio Dourado, Frederico Guth, Teofilo de Campos:
Data Augmented 3D Semantic Scene Completion with 2D Segmentation Priors. 687-696 - Yawen Lu, Guoyu Lu:
3D Modeling Beneath Ground: Plant Root Detection and Reconstruction Based on Ground-Penetrating Radar. 697-706 - Alvaro Gómez, Gregory Randall, Gabriele Facciolo, Rafael Grompone von Gioi:
An experimental comparison of multi-view stereo approaches on satellite images. 707-716 - Thiago L. Gomes, Thiago M. Coutinho, Rafael Azevedo, Renato Martins, Erickson R. Nascimento:
Creating and Reenacting Controllable 3D Humans with Differentiable Rendering. 717-726 - Camilo Pestana, Naveed Akhtar, Nazanin Rahnavard, Mubarak Shah, Ajmal Mian:
Transferable 3D Adversarial Textures using End-to-end Optimization. 727-736 - Hirokatsu Kataoka, Kensho Hara, Ryusuke Hayashi, Eisuke Yamagata, Nakamasa Inoue:
Spatiotemporal Initialization for 3D CNNs with Generated Motion Patterns. 737-746 - Shubh Maheshwari, Debtanu Gupta, Ravi Kiran Sarvadevabhatla:
MUGL: Large Scale Multi Person Conditional Action Generation with Locomotion. 747-755 - Guoxi Huang, Adrian G. Bors:
Busy-Quiet Video Disentangling for Video Classification. 756-765 - He-Yen Hsieh, Ding-Jie Chen, Tyng-Luh Liu:
Contextual Proposal Network for Action Localization. 766-775 - Peipeng Chen, Yuan Gao, Andy J. Ma:
Multi-level Attentive Adversarial Learning with Temporal Dilation for Unsupervised Video Domain Adaptation. 776-785 - Jiawei Chen, Chiu Man Ho:
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition. 786-797 - Yinxiao Li, Zhichao Lu, Xuehan Xiong, Jonathan Huang:
PERF-Net: Pose Empowered RGB-Flow Net. 798-807 - Debaditya Roy, Basura Fernando:
Action anticipation using latent goal learning. 808-816 - Jun-Tae Lee, Sungrack Yun, Mihir Jain:
Leaky Gated Cross-Attention for Weakly Supervised Multi-Modal Temporal Action Localization. 817-826 - Xinyu Li, Chunhui Liu, Bing Shuai, Yi Zhu, Hao Chen, Joseph Tighe:
NUTA: Non-uniform Temporal Aggregation for Action Recognition. 827-836 - Raphael Memmesheimer, Simon Häring, Nick Theisen, Dietrich Paulus:
Skeleton-DML: Deep Metric Learning for Skeleton-Based One-Shot Action Recognition. 837-845 - Martine Toering, Ioannis Gatopoulos, Maarten Stol, Vincent Tao Hu:
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting. 846-856 - Jonathan Freer, Kwang Moo Yi, Wei Jiang, Jongwon Choi, Hyung Jin Chang:
Novel-View Synthesis of Human Tourist Photos. 857-864 - Zhenmei Shi, Fuhao Shi, Wei-Sheng Lai, Chia-Kai Liang, Yingyu Liang:
Deep Online Fused Video Stabilization. 865-873 - Andreas Lugmayr, Martin Danelljan, Fisher Yu, Luc Van Gool, Radu Timofte:
Normalizing Flow as a Flexible Fidelity Objective for Photo-Realistic Super-resolution. 874-883 - Soo Hyun Jung, Tae Bok Lee, Yong Seok Heo:
Deep Feature Prior Guided Face Deblurring. 884-893 - Man M. Ho, Jinjia Zhou:
Deep Photo Scan: Semi-Supervised Learning for dealing with the real-world degradation in Smartphone Photo Scanning. 894-903 - Bomi Kim, Sunhyeok Lee, Nahyun Kim, Donggon Jang, Dae-Shik Kim:
Learning Color Representations for Low-Light Image Enhancement. 904-912 - Cheeun Hong, Heewon Kim, Sungyong Baik, Junghun Oh, Kyoung Mu Lee:
DAQ: Channel-Wise Distribution-Aware Quantization for Deep Image Super-Resolution Networks. 913-922 - Yoshitomo Matsubara, Ruihan Yang, Marco Levorato, Stephan Mandt:
Supervised Compression for Resource-Constrained Edge Computing Systems. 923-933 - Mahmoud Afifi, Marcus A. Brubaker, Michael S. Brown:
Auto White-Balance Correction for Mixed-Illuminant Scenes. 934-943 - Xiaoyu Xiang, Ding Liu, Xiao Yang, Yiheng Zhu, Xiaohui Shen, Jan P. Allebach:
Adversarial Open Domain Adaptation for Sketch-to-Photo Synthesis. 944-954 - Ligong Han, Sri Harsha Musunuri, Martin Renqiang Min, Ruijiang Gao, Yu Tian, Dimitris N. Metaxas:
AE-StyleGAN: Improved Training of Style-Based Auto-Encoders. 955-964 - Dohyun Kim, Dajung Je, Kwangjin Lee, Moohyun Kim, Han Kim:
Late-resizing: A Simple but Effective Sketch Extraction Strategy for Improving Generalization of Line-art Colorization. 965-974 - Zehua Zhang, David Crandall:
Hierarchically Decoupled Spatial-Temporal Contrast for Self-supervised Video Representation Learning. 975-985 - Tri Huynh, Simon Kornblith, Matthew R. Walter, Michael Maire, Maryam Khademi:
Boosting Contrastive Self-Supervised Learning with False Negative Cancellation. 986-996 - Hiroyasu Akada, Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka:
Self-Supervised Learning of Domain Invariant Features for Depth Estimation. 997-1007 - Luca Robbiano, Muhammad Rameez Ur Rahman, Fabio Galasso, Barbara Caputo, Fabio Maria Carlucci:
Adversarial Branch Architecture Search for Unsupervised Domain Adaptation. 1008-1018 - Quentin Bammey, Rafael Grompone von Gioi, Jean-Michel Morel:
Forgery Detection by Internal Positional Learning of Demosaicing Traces. 1019-1029 - Silvia Bucci, Francesco Cappio Borlino, Barbara Caputo, Tatiana Tommasi:
Distance-based Hyperspherical Classification for Multi-source Open-Set Domain Adaptation. 1030-1039 - Amirreza Shaban, Amir Rahimi, Thalaiyasingam Ajanthan, Byron Boots, Richard Hartley:
Few-shot Weakly-Supervised Object Detection via Directional Statistics. 1040-1049 - Colorado J. Reed, Xiangyu Yue, Ani Nrusimha, Sayna Ebrahimi, Vivek Vijaykumar, Richard Mao, Bo Li, Shanghang Zhang, Devin Guillory, Sean Metzger, Kurt Keutzer, Trevor Darrell:
Self-Supervised Pretraining Improves Self-Supervised Pretraining. 1050-1060 - Fuxun Yu, Di Wang, Yinpeng Chen, Nikolaos Karianakis, Tong Shen, Pei Yu, Dimitrios Lymberopoulos, Sidi Lu, Weisong Shi, Xiang Chen:
SC-UDA: Style and Content Gaps aware Unsupervised Domain Adaptation for Object Detection. 1061-1070 - Ohad Amosy, Gal Chechik:
Coupled Training for Multi-Source Domain Adaptation. 1071-1080 - Chun-Han Yao, Boqing Gong, Hang Qi, Yin Cui, Yukun Zhu, Ming-Hsuan Yang:
Federated Multi-Target Domain Adaptation. 1081-1090 - Tianhong Li, Lijie Fan, Yuan Yuan, Dina Katabi:
Unsupervised Learning for Human Sensing Using Radio Signals. 1091-1100 - Hojun Lee, Myunggi Lee, Nojun Kwak:
Few-Shot Object Detection by Attending to Per-Sample-Prototype. 1101-1110 - Deblina Bhattacharjee, Martin Everaert, Mathieu Salzmann, Sabine Süsstrunk:
Estimating Image Depth in the Comics Domain. 1111-1120 - Biying Fu, Cong Chen, Olaf Henniger, Naser Damer:
A Deep Insight into Measuring Face Image Utility with General and Face-specific Image Quality Metrics. 1121-1130 - Meiling Fang, Naser Damer, Florian Kirchbuchner, Arjan Kuijper:
Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection. 1131-1140 - Veeru Talreja, Nasser M. Nasrabadi, Matthew C. Valenti:
Attribute-Based Deep Periocular Recognition: Leveraging Soft Biometrics to Improve Periocular Recognition. 1141-1150 - Adrian Popescu, Liviu-Daniel Stefan, Jérôme Deshayes-Chossart, Bogdan Ionescu:
Face Verification with Challenging Imposters and Diversified Demographics. 1151-1160