


CVPR 2022
- IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. IEEE 2022, ISBN 978-1-6654-6946-3
- Meina Zhang, Yingying Fang, Guoxi Ni, Tieyong Zeng:
Pixel screening based intermediate correction for blind deblurring. 1-9 - Elijah Cole, Xuan Yang, Kimberly Wilber, Oisin Mac Aodha, Serge J. Belongie:
When Does Contrastive Visual Representation Learning Work? 1-10 - Dengpan Fu, Dongdong Chen, Hao Yang, Jianmin Bao, Lu Yuan, Lei Zhang, Houqiang Li, Fang Wen, Dong Chen:
Large-Scale Pre-training for Person Re-identification with Noisy Labels. 1-11 - Yunhui Guo, Xudong Wang, Yubei Chen, Stella X. Yu:
Clipped Hyperbolic Classifiers Are Super-Hyperbolic Classifiers. 1-10 - Yunhui Guo, Haoran Guo, Stella X. Yu:
CO-SNE: Dimensionality Reduction and Visualization for Hyperbolic Data. 11-20 - Jinyu Cai, Jicong Fan, Wenzhong Guo, Shiping Wang, Yunhe Zhang, Zhao Zhang:
Efficient Deep Embedded Subspace Clustering. 21-30 - Jiexi Yan, Lei Luo, Chenghao Xu, Cheng Deng, Heng Huang:
Noise Is Also Useful: Negative Correlation-Steered Latent Contrastive Learning. 31-40 - Kun-Peng Ning, Xun Zhao, Yu Li, Sheng-Jun Huang:
Active Learning for Open-set Annotation. 41-49 - Theodoros Tsiligkaridis, Jay Roberts:
Understanding and Increasing Efficiency of Frank-Wolfe Adversarial Training. 50-59 - Kezhi Kong, Guohao Li, Mucong Ding, Zuxuan Wu, Chen Zhu, Bernard Ghanem, Gavin Taylor, Tom Goldstein:
Robust Optimization as Data Augmentation for Large-scale Graphs. 60-69 - Sihao Yu, Jiafeng Guo, Ruqing Zhang, Yixing Fan, Zizhen Wang, Xueqi Cheng:
A Re-Balancing Strategy for Class-Imbalanced Classification Based on Instance Difficulty. 70-79 - Bingyuan Liu, Ismail Ben Ayed, Adrian Galdran, Jose Dolz:
The Devil is in the Margin: Margin-based Label Smoothing for Network Calibration. 80-88 - Guoliang Lin, Hanlu Chu, Hanjiang Lai:
Towards Better Plasticity-Stability Trade-off in Incremental Learning: A Simple Linear Connector. 89-98 - Rishabh Tiwari, KrishnaTeja Killamsetty, Rishabh K. Iyer, Pradeep Shenoy:
GCR: Gradient Coreset based Replay Buffer Selection for Continual Learning. 99-108 - Qingsen Yan, Dong Gong, Yuhang Liu, Anton van den Hengel, Javen Qinfeng Shi:
Learning Bayesian Sparse Networks with Full Experience Replay for Continual Learning. 109-118 - Daniel Grzech, Mohammad Farid Azampour, Ben Glocker, Julia A. Schnabel, Nassir Navab, Bernhard Kainz, Loïc Le Folgoc:
A variational Bayesian method for similarity learning in non-rigid image registration. 119-128 - Yadong Ding, Yu Wu, Chengyue Huang, Siliang Tang, Yi Yang, Longhui Wei, Yueting Zhuang, Qi Tian:
Learning to Learn by Jointly Optimizing Neural Architecture and Weights. 129-138 - Zifeng Wang, Zizhao Zhang, Chen-Yu Lee, Han Zhang, Ruoxi Sun, Xiaoqi Ren, Guolong Su, Vincent Perot, Jennifer G. Dy, Tomas Pfister:
Learning to Prompt for Continual Learning. 139-149 - Mengqi Xue, Haofei Zhang, Jie Song, Mingli Song:
Meta-attention for ViT-backed Continual Learning. 150-159 - Vitor Guizilini, Rares Ambrus, Dian Chen, Sergey Zakharov, Adrien Gaidon:
Multi-Frame Self-Supervised Depth with Transformers. 160-170 - Zhen Wang, Liu Liu, Yiqun Duan, Yajing Kong, Dacheng Tao:
Continual Learning with Lifelong Vision Transformer. 171-181 - Jianfeng Wang, Thomas Lukasiewicz:
Rethinking Bayesian Deep Learning Methods for Semi-Supervised Volumetric Medical Image Segmentation. 182-190 - Yawei Li, Kamil Adamczewski, Wen Li, Shuhang Gu, Radu Timofte, Luc Van Gool:
Revisiting Random Channel Pruning for Neural Network Compression. 191-201 - Huayi Tang, Yong Liu:
Deep Safe Multi-view Clustering: Reducing the Risk of Clustering Performance Degradation Caused by View Increase. 202-211 - Jongin Lim, Sangdoo Yun, Seulki Park, Jin Young Choi:
Hypergraph-Induced Semantic Tuplet Loss for Deep Metric Learning. 212-222 - Prateek Munjal, Nasir Hayat, Munawar Hayat, Jamshid Sourati, Shadab Khan:
Towards Robust and Reproducible Active Learning using Neural Networks. 223-232 - Jiulong Liu, Zhaoqiang Liu:
Non-Iterative Recovery from Nonlinear Observations using Generative Models. 233-243 - Minyoung Kim:
Gaussian Process Modeling of Approximate Inference Errors for Variational Autoencoders. 244-253 - Kwang In Kim:
Robust Combination of Distributed Gradients Under Adversarial Perturbations. 254-263 - Lan Wang, Vishnu Naresh Boddeti:
Do learned representations respect causal relationships? 264-274 - Rafid Mahmood, James Lucas, David Acuna, Daiqing Li, Jonah Philion, José M. Álvarez, Zhiding Yu, Sanja Fidler, Marc T. Law:
How Much More Data Do I Need? Estimating Requirements for Downstream Tasks. 275-284 - Magzhan Gabidolla, Miguel Á. Carreira-Perpiñán:
Pushing the Envelope of Gradient Boosting Forests via Globally-Optimized Oblique Trees. 285-294 - Dian Chen, Dequan Wang, Trevor Darrell, Sayna Ebrahimi:
Contrastive Test-Time Adaptation. 295-305 - Paritosh Mittal, Yen-Chi Cheng, Maneesh Singh, Shubham Tulsiani:
AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation. 306-315 - Shikun Li, Xiaobo Xia, Shiming Ge, Tongliang Liu:
Selective-Supervised Contrastive Learning with Noisy Labels. 316-325 - Yufei Guo, Xinyi Tong, Yuanpei Chen, Liwen Zhang, Xiaode Liu, Zhe Ma, Xuhui Huang:
RecDis-SNN: Rectifying Membrane Potential Distribution for Directly Training Spiking Neural Networks. 326-335 - M. Saquib Sarfraz, Marios Koulakis, Constantin Seibold, Rainer Stiefelhagen:
Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction. 336-345 - Yikai Wang, Xinwei Sun, Yanwei Fu:
Scalable Penalized Regression for Noise Detection in Learning with Noisy Labels. 346-355 - Xiran Fan, Chun-Hao Yang, Baba C. Vemuri:
Nested Hyperbolic Spaces for Dimensionality Reduction and Hyperbolic NN Design. 356-365 - Ivor J. A. Simpson, Sara Vicente, Neill D. F. Campbell:
Learning Structured Gaussians to Approximate Deep Ensembles. 366-374 - Ruoyu Wang, Mingyang Yi, Zhitang Chen, Shengyu Zhu:
Out-of-distribution Generalization with Causal Invariant Transformations. 375-385 - Tom Ryder, Chen Zhang, Ning Kang, Shifeng Zhang:
Split Hierarchical Variational Compression. 386-395 - Iordanis Fostiropoulos, Barry W. Boehm:
Implicit Feature Decoupling with Depthwise Quantization. 396-405 - Jurijs Nazarovs, Zhichun Huang, Songwong Tasneeyapant, Rudrasis Chakraborty, Vikas Singh:
Understanding Uncertainty Maps in Vision with Statistical Testing. 406-416 - Anh-Dzung Doan, Michele Sasdelli, David Suter, Tat-Jun Chin:
A Hybrid Quantum-Classical Algorithm for Robust Fitting. 417-427 - Paul Roetzer, Paul Swoboda, Daniel Cremers, Florian Bernard:
A Scalable Combinatorial Solver for Elastic Geometrically Consistent 3D Shape Matching. 428-438 - Ahmed Abbas, Paul Swoboda:
FastDOG: Fast Discrete Optimization on GPU. 439-449 - Vladimir Chikin, Mikhail Antiukh:
Data-Free Network Compression via Parametric Non-uniform Mixed Precision Quantization. 450-459 - Huu Le, Rasmus Kjær Høier, Che-Tsung Lin, Christopher Zach:
AdaSTE: An Adaptive Straight-Through Estimator to Train Binary Neural Networks. 460-469 - Sanjeev Muralikrishnan, Siddhartha Chaudhuri, Noam Aigerman, Vladimir G. Kim, Matthew Fisher, Niloy J. Mitra:
GLASS: Geometric Latent Augmentation for Shape Spaces. 470-479 - Matteo Spallanzani, Gian Paolo Leonardi, Luca Benini:
Training Quantised Neural Networks with STE Variants: the Additive Noise Annealing Algorithm. 470-479 - Nuo Xu, Jianlong Chang, Xing Nie, Chunlei Huo, Shiming Xiang, Chunhong Pan:
AME: Attention and Memory Enhancement in Hyper-Parameter Optimization. 480-489 - Christina Baek, Ziyang Wu, Kwan Ho Ryan Chan, Tianjiao Ding, Yi Ma, Benjamin D. Haeffele:
Efficient Maximal Coding Rate Reduction by Variational Forms. 490-498 - Marvin Eisenberger, Aysim Toker, Laura Leal-Taixé, Florian Bernard, Daniel Cremers:
A Unified Framework for Implicit Sinkhorn Differentiation. 499-508 - Yidong Chen, Chen Li, Zhonghua Lu:
Computing Wasserstein-$p$ Distance Between Images with Linear Cost. 509-518 - Natacha Kuete Meli, Florian Mannel, Jan Lellmann:
An Iterative Quantum Approach for Transformation Estimation from Point Sets. 519-527 - Nourhan Bayasi, Ghassan Hamarneh, Rafeef Garbi:
BoosterNet: Improving Domain Generalization of Deep Neural Nets using Culpability-Ranked Features. 528-538 - Dong-Hwan Jang, Sanghyeok Chu, Joonhyuk Kim, Bohyung Han:
Pooling Revisited: Your Receptive Field is Suboptimal. 539-548 - Jiajing Chen, Burak Kakillioglu, Huantao Ren, Senem Velipasalar:
Why Discard if You can Recycle?: A Recycling Max Pooling Module for 3D Point Cloud Analysis. 549-557 - Mu Hu, Junyi Feng, Jiashen Hua, Baisheng Lai, Jianqiang Huang, Xiaojin Gong, Xiansheng Hua:
Online Convolutional Reparameterization. 558-567 - Xiaohan Ding, Honghao Chen, Xiangyu Zhang, Jungong Han, Guiguang Ding:
RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality. 568-577 - Tao Huang, Shan You, Bohan Zhang, Yuxuan Du, Fei Wang, Chen Qian, Chang Xu:
DyRep: Bootstrapping Training with Dynamic Re-parameterization. 578-587 - Tianlong Chen, Zhenyu Zhang, Yihua Zhang, Shiyu Chang, Sijia Liu, Zhangyang Wang:
Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for Free. 588-599 - Anil Kag, Venkatesh Saligrama:
Condensing CNNs with Partial Differential Equations. 600-609 - Shaojie Bai, Zhengyang Geng, Yash Savani, J. Zico Kolter:
Deep Equilibrium Optical Flow Estimation. 610-620 - Matan Atzmon, Koki Nagano, Sanja Fidler, Sameh Khamis, Yaron Lipman:
Frame Averaging for Equivariant Shape Space Learning. 621-631 - Gee-Sern Hsu, Chun-Hung Tsai, Hung-Yi Wu:
Dual-Generator Face Reenactment. 632-640 - Rongzhen Zhao, Jian Li, Zhenzhi Wu:
Convolution of Convolution: Let Kernels Spatially Collaborate. 641-650 - Matthias Wödlinger, Jan Kotera, Jan Xu, Robert Sablatnig:
SASIC: Stereo Image Compression with Latent Shifts and Stereo Attention. 651-660 - Michael Schelling, Pedro Hermosilla, Timo Ropinski:
RADU: Ray-Aligned Depth Update Convolutions for ToF Data Denoising. 661-670 - Utkarsh Singhal, Yifei Xing, Stella X. Yu:
Co-domain Symmetry for Complex-Valued Deep Learning. 671-680 - Tong Yu, Ruslan Khalitov, Lei Cheng, Zhirong Yang:
Paramixer: Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention. 681-690 - Huanyu Wang, Junjie Liu, Xin Ma, Yang Yong, Zhenhua Chai, Jianxin Wu:
Compressing Models with Few Samples: Mimicking then Replacing. 691-700 - Raymond A. Yeh, Yuan-Ting Hu, Zhongzheng Ren, Alexander G. Schwing:
Total Variation Optimization Layers for Computer Vision. 701-711 - Vinit Veerendraveer Singh, Chandra Kambhamettu:
AIM: an Auto-Augmenter for Images and Meshes. 712-721 - George Yiasemis, Jan-Jakob Sonke, Clarisa Sánchez, Jonas Teuwen:
Recurrent Variational Network: A Deep Learning Inverse Problem Solver applied to the task of Accelerated MRI Reconstruction. 722-731 - Nicolas Donati, Etienne Corman, Maks Ovsjanikov:
Deep orientation-aware functional maps: Tackling symmetry issues in Shape Matching. 732-741 - Jingqi Huang, Yue Ning, Dong Nie, Linan Guan, Xiping Jia:
Weakly-supervised Metric Learning with Cross-Module Communications for the Classification of Anterior Chamber Angle Images. 742-752 - Lei Huang, Yi Zhou, Tian Wang, Jie Luo, Xianglong Liu:
Delving into the Estimation Shift of Batch Normalization in a Network. 753-762 - Fanqing Lin, Brian L. Price, Tony R. Martinez:
Generalizing Interactive Backpropagating Refinement for Dense Prediction Networks. 763-772 - Wenshuo Li, Hanting Chen, Jianyuan Guo, Ziyang Zhang, Yunhe Wang:
Brain-inspired Multilayer Perceptron with Spiking Neurons. 773-783 - Koushik Biswas, Sandeep Kumar, Shilpak Banerjee, Ashish Kumar Pandey:
Smooth Maximum Unit: Smooth Activation Function for Deep Networks using Smoothing Maximum Technique. 784-793 - Mannat Singh, Laura Gustafson, Aaron Adcock, Vinicius de Freitas Reis, Bugra Gedik, Raj Prateek Kosaraju, Dhruv Mahajan, Ross B. Girshick, Piotr Dollár, Laurens van der Maaten:
Revisiting Weakly Supervised Pre-Training of Visual Perception Models. 794-804 - Xuran Pan, Chunjiang Ge, Rui Lu, Shiji Song, Guanfu Chen, Zeyi Huang, Gao Huang:
On the Integration of Self-Attention and Convolution. 805-815 - Jianyuan Guo, Yehui Tang, Kai Han, Xinghao Chen, Han Wu, Chao Xu, Chang Xu, Yunhe Wang:
Hire-MLP: Vision MLP via Hierarchical Rearrangement. 816-826 - Benjamin Naoto Chiche, Arnaud Woiselle, Joana Frontera-Pons, Jean-Luc Starck:
Stable Long-Term Recurrent Video Super-Resolution. 827-836 - Aming Wu, Cheng Deng:
Single-Domain Generalized Object Detection in Urban Scene via Cyclic-Disentangled Self-Distillation. 837-846 - Anlin Zheng, Yuang Zhang, Xiangyu Zhang, Xiaojuan Qi, Jian Sun:
Progressive End-to-End Object Detection in Crowded Scenes. 847-856 - Ajay Jain, Ben Mildenhall, Jonathan T. Barron, Pieter Abbeel, Ben Poole:
Zero-Shot Text-Guided Object Generation with Dream Fields. 857-866 - Mingjin Zhang, Rui Zhang, Yuxiang Yang, Haichen Bai, Jing Zhang, Jie Guo:
ISNet: Shape Matters for Infrared Small Target Detection. 867-876 - Yi-Nan Chen, Hang Dai, Yong Ding:
Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving. 877-887 - Tu Zheng, Yifei Huang, Yang Liu, Wenjian Tang, Zheng Yang, Deng Cai, Xiaofei He:
CLRNet: Cross Layer Refinement Network for Lane Detection. 888-897 - Yanan Zhang, Jiaxin Chen, Di Huang:
CAT-Det: Contrastively Augmented Transformer for Multimodal 3D Object Detection. 898-907 - Yu-Jhe Li, Jinhyung Park, Matthew O'Toole, Kris Kitani:
Modality-Agnostic Learning for Radar-Lidar Fusion in Vehicle Detection. 908-917 - Yanbin Hao, Hao Zhang, Chong-Wah Ngo, Xiangnan He:
Group Contextualization for Video Recognition. 918-928 - Suchen Wang, Yueqi Duan, Henghui Ding, Yap-Peng Tan, Kim-Hui Yap, Junsong Yuan:
Learning Transferable Human-Object Interaction Detector with Natural Language Supervision. 929-938 - Gongjie Zhang, Zhipeng Luo, Yingchen Yu, Kaiwen Cui, Shijian Lu:
Accelerating DETR Convergence via Semantic-Aligned Matching. 939-948 - Jialian Wu, Sudhir Yarram, Hui Liang, Tian Lan, Junsong Yuan, Jayan Eledath, Gérard G. Medioni:
Efficient Video Instance Segmentation via Tracklet Query and Proposal. 949-958 - Zhaozheng Chen, Tan Wang, Xiongwei Wu, Xian-Sheng Hua, Hanwang Zhang, Qianru Sun:
Class Re-Activation Maps for Weakly-Supervised Semantic Segmentation. 959-968 - Siyue Yu, Jimin Xiao, Bingfeng Zhang, Eng Gee Lim:
Democracy Does Matter: Comprehensive Feature Mining for Co-Salient Object Detection. 969-978 - Jinheng Xie, Jianfeng Xiang, Junliang Chen, Xianxu Hou, Xiaodong Zhao, Linlin Shen:
C2AM: Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation. 979-988 - Ayan Kumar Bhunia, Subhadeep Koley, Abdullah Faiz Ur Rahman Khilji, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song:
Sketching without Worrying: Noise-Tolerant Sketch-Based Image Retrieval. 989-998 - Hao Li, Tianwen Fu, Jifeng Dai, Hongsheng Li, Gao Huang, Xizhou Zhu:
AutoLoss-Zero: Searching Loss Functions from Scratch for Generic Tasks. 999-1008 - Jihwan Park, Seungjun Lee, Hwan Heo, Hyeong Kyu Choi, Hyunwoo J. Kim:
Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection. 1009-1018 - Hanyu Xuan, Zhiliang Wu, Jian Yang, Yan Yan, Xavier Alameda-Pineda:
A Proposal-based Paradigm for Self-supervised Sound Source Localization in Videos. 1019-1028 - Canjie Luo, Lianwen Jin, Jingdong Chen:
SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization. 1029-1038 - Shangbang Long, Siyang Qin, Dmitry Panteleev, Alessandro Bissacco, Yasuhisa Fujii, Michalis Raptis:
Towards End-to-End Unified Scene Text Detection and Layout Analysis. 1039-1049 - Xinqian Gu, Hong Chang, Bingpeng Ma, Shutao Bai, Shiguang Shan, Xilin Chen:
Clothes-Changing Person Re-identification with RGB Modality Only. 1050-1059 - Qing Lian, Peiliang Li, Xiaozhi Chen:
MonoJSG: Joint Semantic and Geometric Cost Volume for Monocular 3D Object Detection. 1060-1069 - Jiaqi Gu, Bojian Wu, Lubin Fan, Jianqiang Huang, Shen Cao, Zhiyu Xiang, Xian-Sheng Hua:
Homography Loss for Monocular 3D Object Detection. 1070-1079 - Xuyang Bai, Zeyu Hu, Xinge Zhu, Qingqiu Huang, Yilun Chen, Hongbo Fu, Chiew-Lan Tai:
TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers. 1080-1089 - Ruihang Chu, Xiaoqing Ye, Zhengzhe Liu, Xiao Tan, Xiaojuan Qi, Chi-Wing Fu, Jiaya Jia:
TWIST: Two-Way Inter-label Self-Training for Semi-supervised 3D Instance Segmentation. 1090-1099 - Haiyang Wang, Shaoshuai Shi, Ze Yang, Rongyao Fang, Qi Qian, Hongsheng Li, Bernt Schiele, Liwei Wang:
RBGNet: Ray-based Grouping for 3D Object Detection. 1100-1109 - Yanwei Li, Xiaojuan Qi, Yukang Chen, Liwei Wang, Zeming Li, Jian Sun, Jiaya Jia:
Voxel Field Fusion for 3D Object Detection. 1110-1119 - Yurong You, Katie Luo, Cheng Perng Phoo, Wei-Lun Chao, Wen Sun, Bharath Hariharan, Mark E. Campbell, Kilian Q. Weinberger:
Learning to Detect Mobile Objects from LiDAR Scans Without Labels. 1120-1130 - David Schinagl, Georg Krispel, Horst Possegger, Peter M. Roth, Horst Bischof:
OccAM's Laser: Occlusion-based Attribution Maps for 3D Object Detectors on LiDAR Data. 1131-1140 - Yichun Shen, Wanli Jiang, Zhen Xu, Rundong Li, Junghyun Kwon:
Confidence Propagation Cluster: Unleash Full Potential of Object Detectors. 1141-1151 - Sijie Zhu, Mubarak Shah, Chen Chen:
TransGeo: Transformer Is All You Need for Cross-view Image Geo-localization. 1152-1161 - Yongjian Deng, Hao Chen, Hai Liu, Youfu Li:
A Voxel Graph CNN for Object Classification with Event Cameras. 1162-1171 - Dongchen Lu, Dongmei Li, Yali Li, Shengjin Wang:
OSKDet: Orientation-sensitive Keypoint Localization for Rotated Object Detection. 1172-1182 - Yang You, Zelin Ye, Yujing Lou, Chengkun Li, Yong-Lu Li, Lizhuang Ma, Weiming Wang, Cewu Lu:
Canonical Voting: Towards Robust Oriented Bounding Box Detection in 3D Scenes. 1183-1192 - Jiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu, Ling Shao:
Category Contrast for Unsupervised Domain Adaptation in Visual Tasks. 1193-1204 - Xiaohua Zhai, Alexander Kolesnikov, Neil Houlsby, Lucas Beyer:
Scaling Vision Transformers. 1204-1213 - Yihong Sun, Adam Kortylewski, Alan L. Yuille:
Amodal Segmentation through Out-of-Task and Out-of-Distribution Generalization with a Bayesian Model. 1205-1214 - Xingzhe He, Bastian Wandt, Helge Rhodin:
GANSeg: Learning to Segment by Unsupervised Hierarchical Image Generation. 1215-1225 - Anirud Thyagharajan, Benjamin Ummenhofer, Prashant Laddha, Om Ji Omer, Sreenivas Subramoney:
Segment-Fusion: Hierarchical Context Fusion for Robust 3D Semantic Segmentation. 1226-1235 - Liulei Li, Tianfei Zhou, Wenguan Wang, Jianwu Li, Yi Yang:
Deep Hierarchical Semantic Segmentation. 1236-1247 - Yifan Zhang, Bo Pang, Cewu Lu:
Semantic Segmentation by Early Region Proxy. 1248-1258 - Shubhankar Borse, Hyojin Park, Hong Cai, Debasmit Das, Risheek Garrepalli, Fatih Porikli:
Panoptic, Instance and Semantic Relations: A Relational Context Encoder to Enhance Panoptic Segmentation. 1259-1269 - Zhiqi Li, Wenhai Wang, Enze Xie, Zhiding Yu, Anima Anandkumar, José M. Álvarez, Ping Luo, Tong Lu:
Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers. 1270-1279 - Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar:
Masked-attention Mask Transformer for Universal Image Segmentation. 1280-1289 - Xi Chen, Zhiyan Zhao, Yilei Zhang, Manni Duan, Donglian Qi, Hengshuang Zhao:
FocalClick: Towards Practical Interactive Image Segmentation. 1290-1299 - Tiancheng Shen, Yuechen Zhang, Lu Qi, Jason Kuen, Xingyu Xie, Jianlong Wu, Zhe Lin, Jiaya Jia:
High Quality Segmentation for Ultra High-resolution Images. 1300-1309 - Wenwen Pan, Haonan Shi, Zhou Zhao, Jieming Zhu, Xiuqiang He, Zhigeng Pan, Lianli Gao, Jun Yu, Fei Wu, Qi Tian:
Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks. 1310-1321 - Mingxing Li, Li Hu, Zhiwei Xiong, Bang Zhang, Pan Pan, Dong Liu:
Recurrent Dynamic Embedding for Video Object Segmentation. 1322-1331 - Kai Xu, Angela Yao:
Accelerating Video Object Segmentation with Compressed Video. 1332-1341 - Kwanyong Park, Sanghyun Woo, Seoung Wug Oh, In So Kweon, Joon-Young Lee:
Per-Clip Video Object Segmentation. 1342-1351 - Zhihui Lin, Tianyu Yang, Maomao Li, Ziyu Wang, Chun Yuan, Wenhao Jiang, Wei Liu:
SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization. 1352-1362 - Hanyuan Liu, Chengze Li, Xueting Liu, Tien-Tsin Wong:
Neural Recognition of Dashed Curves with Gestalt Law of Continuity. 1363-1372 - Ziqiang Xu, Chunyan Xu, Zhen Cui, Xiangwei Zheng, Jian Yang:
CVNet: Contour Vibration Network for Building Extraction. 1373-1381 - Jinsheng Wang, Yinchao Ma, Shaofei Huang, Tianrui Hui, Fei Wang, Chen Qian, Tianzhu Zhang:
A Keypoint-based Global Association Network for Lane Detection. 1382-1391 - Mengyang Pu, Yaping Huang, Yuming Liu, Qingji Guan, Haibin Ling:
EDTER: Edge Detection with Transformer. 1392-1402 - Yining Hong, Kaichun Mo, Li Yi, Leonidas J. Guibas, Antonio Torralba, Joshua B. Tenenbaum, Chuang Gan:
Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction. 1403-1413 - Aoxiang Fan, Jiayi Ma, Xin Tian, Xiaoguang Mei, Wei Lin:
Coherent Point Drift Revisited for Non-rigid Shape Matching and Registration. 1414-1424 - Tianchen Zhao, Niansong Zhang, Xuefei Ning, He Wang, Li Yi, Yu Wang:
CodedVTR: Codebook-based Sparse Voxel Transformer with Geometric Guidance. 1425-1434 - Rishubh Singh, Pranav Gupta, Pradeep Shenoy, Ravikiran Sarvadevabhatla:
FLOAT: Factorized Learning of Object Attributes for Improved Multi-object Multi-part Scene Parsing. 1435-1445 - Hong-Xing Yu, Jiajun Wu, Li Yi:
Rotationally Equivariant 3D Object Detection. 1446-1454 - Zhiqin Chen, Kangxue Yin, Sanja Fidler:
AUV-Net: Learning Aligned UV Maps for Texture Transfer and Synthesis. 1455-1464 - Hongsuk Choi, Gyeongsik Moon, JoonKyu Park, Kyoung Mu Lee:
Learning to Estimate Robust 3D Human Mesh from In-the-Wild Crowded Scenes. 1465-1474 - Georgios Pavlakos, Jitendra Malik, Angjoo Kanazawa:
Human Mesh Recovery from Multiple Shots. 1475-1485 - JoonKyu Park, Yeonguk Oh, Gyeongsik Moon, Hongsuk Choi, Kyoung Mu Lee:
HandOccNet: Occlusion-Robust 3D Hand Mesh Estimation Network. 1486-1495 - Thiemo Alldieck, Mihai Zanfir, Cristian Sminchisescu:
Photorealistic Monocular 3D Reconstruction of Humans Wearing Clothing. 1496-1505 - Ayush Tewari, Mallikarjun B. R., Xingang Pan, Ohad Fried, Maneesh Agrawala, Christian Theobalt:
Disentangled3D: Learning a 3D Generative Model with Disentangled Geometry and Appearance from Monocular Images. 1506-1515 - Keyu Wu, Yifan Ye, Lingchen Yang, Hongbo Fu, Kun Zhou, Youyi Zheng:
NeuralHDHair: Automatic High-fidelity Hair Modeling from a Single Image Using Implicit Neural Representations. 1516-1525 - Shivam Duggal, Deepak Pathak:
Topologically-Aware Deformation Fields for Single-View 3D Reconstruction. 1526-1536 - Rahul Dey, Vishnu Naresh Boddeti:
Generating Diverse 3D Reconstructions from a Single Occluded Face Image. 1537-1547 - Daniel Rebain, Mark J. Matthews, Kwang Moo Yi, Dmitry Lagun, Andrea Tagliasacchi:
LOLNeRF: Learn from One Look. 1548-1557 - Yida Wang, David Joseph Tan, Nassir Navab, Federico Tombari:
Learning Local Displacements for Point Cloud Completion. 1558-1567 - Andra Petrovai, Sergiu Nedevschi:
Exploiting Pseudo Labels in a Self-Supervised Learning Framework for Improved Monocular Depth Estimation. 1568-1578 - Yunpeng Zhang, Wenzhao Zheng, Zheng Zhu, Guan Huang, Dalong Du, Jie Zhou, Jiwen Lu:
Dimension Embeddings for Monocular 3D Object Detection. 1579-1588 - Shengyi Qian, Linyi Jin, Chris Rockwell, Siyi Chen, David F. Fouhey:
Understanding 3D Object Articulation in Internet Videos. 1589-1599 - Vaishakh Patil, Christos Sakaridis, Alexander Liniger, Luc Van Gool:
P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior. 1600-1611 - Kehan Wang, Jia Zheng, Zihan Zhou:
Neural Face Identification in a 2D Wireframe Projection of a Manifold Object. 1612-1621 - Naiyu Gao, Fei He, Jian Jia, Yanhu Shan, Haoyang Zhang, Xin Zhao, Kaiqi Huang:
PanopticDepth: A Unified Framework for Depth-aware Panoptic Segmentation. 1622-1632 - Zimeng Zhao, Binghui Zuo, Wei Xie, Yangang Wang:
Stability-driven Contact Reconstruction From Monocular Color Images. 1633-1643 - Zhigang Jiang, Zhongzheng Xiang, Jinhua Xu, Ming Zhao:
LGT-Net: Indoor Panoramic Room Layout Estimation with Geometry-Aware Transformer Network. 1644-1653 - Tze Ho Elden Tse, Kwang In Kim, Ales Leonardis, Hyung Jin Chang:
Collaborative Learning for Hand and Object Reconstruction with Attention-guided Graph Convolution. 1654-1664 - Tak-Wai Hui:
RM-Depth: Unsupervised Learning of Recurrent Monocular Depth in Dynamic Scenes. 1665-1674 - Qing Lian, Botao Ye, Ruijia Xu, Weilong Yao, Tong Zhang:
Exploring Geometric Consistency for Monocular 3D Object Detection. 1675-1684 - Georgia Gkioxari, Nikhila Ravi, Justin Johnson:
Learning 3D Object Shape and Layout without 3D Supervision. 1685-1694 - Nikolay Patakin, Anna Vorontsova, Mikhail Artemyev, Anton Konushin:
Single-Stage 3D Geometry-Preserving Depth Estimation Model Training on Dataset Mixtures with Uncalibrated Stereo Data. 1695-1704 - Rawal Khirodkar, Shashank Tripathi, Kris Kitani:
Occluded Human Mesh Recovery. 1705-1715 - Junshu Tang, Zhijun Gong, Ran Yi, Yuan Xie, Lizhuang Ma:
LAKe-Net: Topology-Aware Point Cloud Completion by Localizing Aligned Keypoints. 1716-1725 - Wenbin Lin, Chengwei Zheng, Jun-Hai Yong, Feng Xu:
OcclusionFusion: Occlusion-aware Motion Estimation for Real-time Dynamic 3D Reconstruction. 1726-1735 - Yuhua Xu, Xiaoli Yang, Yushan Yu, Wei Jia, Zhaobi Chu, Yulan Guo:
Depth Estimation by Combining Binocular Stereo and Monocular Structured-Light. 1736-1745 - Mingtao Feng, Kendong Liu, Liang Zhang, Hongshan Yu, Yaonan Wang, Ajmal Mian:
Learning from Pixel-Level Noisy Label: A New Perspective for Light Field Saliency Detection. 1746-1756 - Wele Gedara Chaminda Bandara, Vishal M. Patel:
HyperTransformer: A Textural and Spectral Feature Fusion Transformer for Pansharpening. 1757-1767 - Scott Workman, Muhammad Usman Rafique, Hunter Blanton, Nathan Jacobs:
Revisiting Near/Remote Sensing with Geospatial Attention. 1768-1777 - Gang Yang, Man Zhou, Keyu Yan, Aiping Liu, Xueyang Fu, Fan Wang:
Memory-augmented Deep Conditional Unfolding Network for Pansharpening. 1778-1787 - Man Zhou, Keyu Yan, Jie Huang, Zihe Yang, Xueyang Fu, Feng Zhao:
Mutual Information-driven Pan-sharpening. 1788-1798 - Fengyu Yang, Chenyang Ma:
Sparse and Complete Latent Organization for Geospatial Semantic Segmentation. 1799-1808 - Dominik Muhle, Lukas Koestler, Nikolaus Demmel, Florian Bernard, Daniel Cremers:
The Probabilistic Normal Epipolar Constraint for Frame-To-Frame Rotation Optimization under Uncertain Feature Positions. 1809-1818 - Wentong Li, Yijie Chen, Kaixuan Hu, Jianke Zhu:
Oriented RepPoints for Aerial Object Detection. 1819-1828 - Christina Tsalicoglou, Thomas Rösgen:
Using 3D Topological Connectivity for Ghost Particle Reduction in Flow Reconstruction. 1829-1837 - Ngoc Long Nguyen, Jérémy Anger, Axel Davy, Pablo Arias, Gabriele Facciolo:
Self-Supervised Super-Resolution for Multi-Exposure Push-Frame Satellites. 1848-1858 - Xiaoguang Li, Qing Guo, Di Lin, Ping Li, Wei Feng, Song Wang:
MISF: Multi-level Interactive Siamese Filtering for High-Fidelity Image Inpainting. 1859-1868 - Si-Yuan Cao, Jianxin Hu, Ze-Hua Sheng, Hui-Liang Shen:
Iterative Deep Homography Estimation. 1869-1878 - Jingwen He, Wu Shi, Kai Chen, Lean Fu, Chao Dong:
GCFSR: a Generative and Controllable Face Super Resolution Method Without Facial and GAN Priors. 1879-1888 - Zhao Zhang, Huan Zheng, Richang Hong, Mingliang Xu, Shuicheng Yan, Meng Wang:
Deep Color Consistent Network for Low-Light Image Enhancement. 1889-1898 - Baisong Guo, Xiaoyun Zhang, Haoning Wu, Yu Wang, Ya Zhang, Yanfeng Wang:
LAR-SR: A Local Autoregressive Model for Image Super-Resolution. 1899-1908 - Bo Ji, Angela Yao:
Multi-Scale Memory-Based Video Deblurring. 1909-1918 - Jaewon Lee, Kyong Hwan Jin:
Local Texture Estimator for Implicit Representation Function. 1919-1928 - Qing Su, Shihao Ji:
ChiTransformer: Towards Reliable Stereo from Cues. 1929-1939 - Stefano Zorzi, Shabab Bazrafkan, Stefan Habenschuss, Friedrich Fraundorfer:
PolyWorld: Polygonal Building Extraction with Graph Neural Networks in Satellite Images. 1938-1947 - Jaihyun Koh, Jangho Lee, Sungroh Yoon:
BNUDC: A Two-Branched Deep Neural Network for Restoring Images from Under-Display Cameras. 1940-1949 - Metin Ersin Arican, Ozgur Kara, Gustav Bredell, Ender Konukoglu:
ISNAS-DIP: Image-Specific Neural Architecture Search for Deep Image Prior. 1950-1958 - Lingtong Kong, Boyuan Jiang, Donghao Luo, Wenqing Chu, Xiaoming Huang, Ying Tai, Chengjie Wang, Jie Yang:
IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation. 1959-1968 - Riccardo de Lutio, Alexander Becker, Stefano D'Aronco, Stefania Russo, Jan D. Wegner, Konrad Schindler:
Learning Graph Regularisation for Guided Super-Resolution. 1969-1978 - Weixi Wang, Ji Li, Hui Ji:
Self-supervised Deep Image Restoration via Adaptive Stochastic Gradient Langevin Dynamics. 1979-1988 - Wenbo Zhao, Xianming Liu, Zhiwei Zhong, Junjun Jiang, Wei Gao, Ge Li, Xiangyang Ji:
Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural Representation. 1989-1997 - Kwanyoung Kim, Taesung Kwon, Jong Chul Ye:
Noise Distribution Adaptive Self-Supervised Image Denoising using Tweedie Distribution and Score Matching. 1998-2006 - Xiang Chen, Jinshan Pan, Kui Jiang, Yufeng Li, Yufeng Huang, Caihua Kong, Longgang Dai, Zhentao Fan:
Unpaired Deep Image Deraining Using Dual Contrastive Learning. 2007-2016 - Zejin Wang, Jiazheng Liu, Guoqing Li, Hua Han:
Blind2Unblind: Self-Supervised Image Denoising with Visible Blind Spots. 2017-2026 - Yang Yang, Chaoyue Wang, Risheng Liu, Lin Zhang, Xiaojie Guo, Dacheng Tao:
Self-augmented Unpaired Image Dehazing via Density and Depth Decomposition. 2027-2036 - Zeyuan Chen, Yinbo Chen, Jingwen Liu, Xingqian Xu, Vidit Goel, Zhangyang Wang, Humphrey Shi, Xiaolong Wang:
VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution. 2037-2047 - Ryuki Yamamoto, Hidekata Hontani, Akira Imakura, Tatsuya Yokota:
Fast Algorithm for Low-rank Tensor Completion in Delay-embedded Space. 2048-2056 - Cheng Zhang, Shaolin Su, Yu Zhu, Qingsen Yan, Jinqiu Sun, Yanning Zhang:
Exploring and Evaluating Image Restoration Potential in Dynamic Scenes. 2057-2066 - Pranjay Shyam, Kyung-Soo Kim, Kuk-Jin Yoon:
GIQE: Generic Image Quality Enhancement via Nth Order Iterative Degradation. 2067-2077 - Lai Jiang, Yifei Li, Shengxi Li, Mai Xu, Se Lei, Yichen Guo, Bo Huang:
Does text attract attention on e-commerce images: A novel saliency prediction dataset and method. 2078-2087 - Yi Zhang, Dasong Li, Ka Lung Law, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
IDR: Self-Supervised Image Denoising via Iterative Data Refinement. 2088-2097 - Biwen Lei, Xiefan Guo, Hongyu Yang, Miaomiao Cui, Xuansong Xie, Di Huang:
ABPN: Adaptive Blend Pyramid Network for Real-Time Local Retouching of Ultra High-Resolution Photo. 2098-2107 - Salma Abdel Magid, Zudi Lin, Donglai Wei, Yulun Zhang, Jinjin Gu, Hanspeter Pfister:
Texture-based Error Analysis for Image Super-Resolution. 2108-2117 - Zongsheng Yue, Qian Zhao, Jianwen Xie, Lei Zhang, Deyu Meng, Kwan-Yee K. Wong:
Blind Image Super-resolution with Elaborate Degradation Modeling on Noise and Kernel. 2118-2128 - Hunsang Lee, Hyesong Choi, Kwanghoon Sohn, Dongbo Min:
KNN Local Attention for Image Restoration. 2129-2139 - Ruijun Gao, Qing Guo, Felix Juefei-Xu, Hongkai Yu, Huazhu Fu, Wei Feng, Yang Liu, Song Wang:
Can You Spot the Chameleon? Adversarially Camouflaging Images from Co-Salient Object Detection. 2140-2149 - Youwei Pang, Xiaoqi Zhao, Tian-Zhu Xiang, Lihe Zhang, Huchuan Lu:
Zoom In and Out: A Mixed-scale Triplet Network for Camouflaged Object Detection. 2150-2160 - Jennifer J. Sun, Serim Ryou, Roni H. Goldshmid, Brandon Weissbourd, John O. Dabiri, David J. Anderson, Ann Kennedy, Yisong Yue, Pietro Perona:
Self-Supervised Keypoint Discovery in Behavioral Videos. 2161-2170 - Weizhe Liu, Bugra Tekin, Huseyin Coskun, Vibhav Vineet, Pascal Fua, Marc Pollefeys:
Learning to Align Sequential Actions in the Wild. 2171-2181 - Soma Nonaka, Shohei Nobuhara, Ko Nishino:
Dynamic 3D Gaze from Afar: Deep Gaze Estimation from Temporal Eye-Head-Body Coordination. 2182-2191 - Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, Wei Shen:
End-to-End Human-Gaze-Target Detection with Transformers. 2192-2200 - Albert Tseng, Jennifer J. Sun, Yisong Yue:
Automatic Synthesis of Diverse Weak Supervision Sources for Behavior Analysis. 2201-2210 - Mihee Lee, Samuel S. Sohn, Seonghyeon Moon, Sejong Yoon, Mubbasir Kapadia, Vladimir Pavlovic:
MUSE-VAE: Multi-Scale VAE for Environment-Aware Long Term Trajectory Prediction. 2211-2220 - Lihuan Li, Maurice Pagnucco, Yang Song:
Graph-based Spatial Transformer with Memory Replay for Multi-future Pedestrian Trajectory Prediction. 2221-2231 - Ke Guo, Wenxi Liu, Jia Pan:
End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps. 2232-2241 - Hongchen Luo, Wei Zhai, Jing Zhang, Yang Cao, Dacheng Tao:
Learning Affordance Grounding from Exocentric Images. 2242-2251 - Jaebong Jeong, Janghun Jo, Sunghyun Cho, Jaesik Park:
3D Scene Painting via Semantic Image Synthesis. 2252-2262 - Jun Jia, Zhongpai Gao, Dandan Zhu, Xiongkuo Min, Guangtao Zhai, Xiaokang Yang:
Learning Invisible Markers for Hidden Codes in Offline-to-online Photography. 2263-2272 - Lingteng Qiu, Zhangyang Xiong, Xuhao Wang, Kenkun Liu, Yihan Li, Guanying Chen, Xiaoguang Han, Shuguang Cui:
ETHSeg: An Amodel Instance Segmentation Network and a Real-world Dataset for X-Ray Waste Inspection. 2273-2282 - Ayan Kumar Bhunia, Viswanatha Reddy Gajjala, Subhadeep Koley, Rohit Kundu, Aneeshan Sain, Tao Xiang, Yi-Zhe Song:
Doodle It Yourself: Class Incremental Learning by Drawing a Few Sketches. 2283-2292 - Xiyao Liu, Ziping Ma, Junxing Ma, Jian Zhang, Gerald Schaefer, Hui Fang:
Image Disentanglement Autoencoder for Steganography without Embedding. 2293-2302 - Banghuai Li:
Adaptive Hierarchical Representation Learning for Long-Tailed Object Detection. 2303-2312 - YuanFu Yang, Min Sun:
Semiconductor Defect Detection by Hybrid Classical-Quantum Deep Learning. 2313-2322 - Yun He, Xinlin Ren, Danhang Tang, Yinda Zhang, Xiangyang Xue, Yanwei Fu:
Density-preserving Deep Point Cloud Compression. 2323-2332 - Zheheng Jiang, Hossein Rahmani, Plamen Angelov, Sue Black, Bryan M. Williams:
Graph-context Attention Networks for Size-varied Deep Graph Matching. 2333-2342 - Jeya Maria Jose Valanarasu, Rajeev Yasarla, Vishal M. Patel:
TransWeather: Transformer-based Restoration of Images Degraded by Adverse Weather Conditions. 2343-2353 - Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Ser-Nam Lim, Yu-Gang Jiang:
ObjectFormer for Image Manipulation Detection and Localization. 2354-2363 - Qichen Fu, Xingyu Liu, Kris M. Kitani:
Sequential Voting with Relational Box Fields for Active Object Detection. 2364-2373 - Fanjie Kong, Ricardo Henao:
Efficient Classification of Very Large Images with Tiny Objects. 2374-2384 - Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Viswanatha Reddy Gajjala, Aneeshan Sain, Tao Xiang, Yi-Zhe Song:
Partially Does It: Towards Scene-Level FG-SBIR with Partial Input. 2385-2395 - Ming-Fang Chang, Yipu Zhao, Rajvi Shah, Jakob J. Engel, Michael Kaess, Simon Lucey:
Long-term Visual Map Sparsification with Heterogeneous GNN. 2396-2405 - Ruize Han, Yiyang Gan, Jiacheng Li, Feifan Wang, Wei Feng, Song Wang:
Connecting the Complementary-view Videos: Joint Camera Identification and Subject Association. 2406-2415 - Gwanghyun Kim, Taesung Kwon, Jong Chul Ye:
DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation. 2416-2425 - Yizhi Wang, Guo Pu, Wenhan Luo, Yexin Wang, Pengfei Xiong, Hongwen Kang, Zhouhui Lian:
Aesthetic Text Logo Synthesis via Content-aware Layout Inferring. 2426-2435 - Gengyun Jia, Huaibo Huang, Chaoyou Fu, Ran He:
Rethinking Image Cropping: Exploring Diverse Compositions from Global Views. 2436-2445 - Jiakai Wang, Zixin Yin, Pengfei Hu, Aishan Liu, Renshuai Tao, Haotong Qin, Xianglong Liu, Dacheng Tao:
Defensive Patches for Robust Recognition in the Physical World. 2446-2455 - Xun Jiang, Xing Xu, Jingran Zhang, Fumin Shen, Zuo Cao, Heng Tao Shen:
Semi-supervised Video Paragraph Grounding with Contrastive Encoder. 2456-2465 - Hao Ni, Jingkuan Song, Xiaopeng Luo, Feng Zheng, Wen Li, Heng Tao Shen:
Meta Distribution Alignment for Generalizable Person Re-Identification. 2477-2486 - Zhenpei Yang, Zhile Ren, Miguel Ángel Bautista, Zaiwei Zhang, Qi Shan, Qixing Huang:
FvOR: Robust Joint Shape and Pose Optimization for Few-view Object Reconstruction. 2487-2497 - Charig Yang, Weidi Xie, Andrew Zisserman:
It's About Time: Analog Clock Reading in the Wild. 2498-2507 - Samrudhdhi B. Rangrej, Chetan L. Srinidhi, James J. Clark:
Consistency driven Sequential Transformers Attention Model for Partially Observable Scenes. 2508-2517 - Ran Xu, Fangzhou Mu, Jayoung Lee, Preeti Mukherjee, Somali Chaterji, Saurabh Bagchi, Yin Li:
Smartadapt: Multi-branch Object Detection Framework for Videos on Mobiles. 2518-2528 - Han Joo Chae, Seunghwan Lee, Hyewon Son, Seungyeob Han, Taebin Lim:
Generating 3D Bio-Printable Patches Using Wound Segmentation and Reconstruction to Treat Diabetic Foot Ulcers. 2529-2539 - Hanjiang Hu, Zuxin Liu, Sharad Chitlangia, Akhil Agnihotri, Ding Zhao:
Investigating the Impact of Multi-LiDAR Placement on Object Detection for Autonomous Driving. 2540-2549 - Qihang Yu, Huiyu Wang, Dahun Kim, Siyuan Qiao, Maxwell D. Collins, Yukun Zhu, Hartwig Adam, Alan L. Yuille, Liang-Chieh Chen:
CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation. 2550-2560 - Tsung-Wei Ke, Jyh-Jing Hwang, Yunhui Guo, Xudong Wang, Stella X. Yu:
Unsupervised Hierarchical Semantic Segmentation with Multiview Cosegmentation and Clustering Transformers. 2561-2571 - Tianfei Zhou, Wenguan Wang, Ender Konukoglu, Luc Van Gool:
Rethinking Semantic Segmentation: A Prototype View. 2572-2583 - Duo Peng, Yinjie Lei, Munawar Hayat, Yulan Guo, Wen Li:
Semantic-Aware Domain Generalized Segmentation. 2584-2595 - Sheng Liu, Kangning Liu, Weicheng Zhu, Yiqiu Shen, Carlos Fernandez-Granda:
Adaptive Early-Learning Correction for Segmentation from Noisy Annotations. 2596-2606 - Bowen Cheng, Omkar Parkhi, Alexander Kirillov:
Pointly-Supervised Instance Segmentation. 2607-2616 - Colin Graber, Cyril Jazra, Wenjie Luo, Liangyan Gui, Alexander G. Schwing:
Joint Forecasting of Panoptic Segmentations with Difference Attention. 2617-2626 - Zheng Lin, Zheng-Peng Duan, Zhao Zhang, Chun-Le Guo, Ming-Ming Cheng:
FocusCut: Diving into a Focus View in Interactive Segmentation. 2627-2636 - Yanan Sun, Chi-Keung Tang, Yu-Wing Tai:
Human Instance Matting via Mutual Guidance and Multi-Instance Refinement. 2637-2646 - Vickie Ye, Zhengqi Li, Richard Tucker, Angjoo Kanazawa, Noah Snavely:
Deformable Sprites for Unsupervised Video Decomposition. 2647-2656 - Wonhui Park, Dongkwon Jin, Chang-Su Kim:
Eigencontours: Novel Contour Descriptors Based on Low-Rank Approximation. 2657-2665 - Weixiao Liu, Yuwei Wu, Sipu Ruan, Gregory S. Chirikjian:
Robust and Accurate Superquadric Recovery: a Probabilistic Approach. 2666-2675 - Morteza Rezanejad, Mohammad Khodadad, Hamidreza Mahyar, Herve Lombaert, Michael Gruninger, Dirk B. Walther, Kaleem Siddiqi:
Medial Spectral Coordinates for 3D Shape Analysis. 2676-2686 - Ozan Unal, Dengxin Dai, Luc Van Gool:
Scribble-Supervised LiDAR Semantic Segmentation. 2687-2697 - Thang Vu, Kookhoi Kim, Tung Minh Luu, Thanh Xuan Nguyen, Chang D. Yoo:
SoftGroup for 3D Instance Segmentation on Point Clouds. 2698-2707 - Vasileios Choutas, Lea Müller, Chun-Hao P. Huang, Siyu Tang, Dimitrios Tzionas, Michael J. Black:
Accurate 3D Body Shape Regression using Metric and Semantic Attributes. 2708-2718 - Yukang Cao, Guanying Chen, Kai Han, Wenqi Yang, Kwan-Yee K. Wong:
JIFF: Jointly-aligned Implicit Face Function for High Quality Single View Clothed Human Reconstruction. 2719-2729 - Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Jitendra Malik:
Tracking People by Predicting 3D Appearance, Location and Pose. 2730-2739 - Lixin Yang, Kailin Li, Xinyu Zhan, Jun Lv, Wenqiang Xu, Jiefeng Li, Cewu Lu:
ArtiBoost: Boosting Articulated 3D Hand-Object Pose Estimation via Online Exploration and Synthesis. 2740-2750 - Mengcheng Li, Liang An, Hongwen Zhang, Lianpeng Wu, Feng Chen, Tao Yu, Yebin Liu:
Interacting Attention Graph for Single Image Two-Hand Reconstruction. 2751-2760 - Stylianos Ploumpis, Stylianos Moschoglou, Vasileios Triantafyllou, Stefanos Zafeiriou:
3D human tongue reconstruction from single "in-the-wild" images. 2761-2770 - Hansheng Chen, Pichao Wang, Fan Wang, Wei Tian, Lu Xiong, Hao Li:
EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation. 2771-2780 - Zhuoling Li, Zhan Qu, Yang Zhou, Jianzhuang Liu, Haoqian Wang, Lihui Jiang:
Diversity Matters: Fully Exploiting Depth Clues for Reliable Monocular 3D Object Detection. 2781-2790 - Yuyan Li, Yuliang Guo, Zhixin Yan, Xinyu Huang, Ye Duan, Liu Ren:
OmniFusion: 360 Monocular Depth Estimation via Geometry-Aware Fusion. 2791-2800 - Amanpreet Walia, Stefanie Walz, Mario Bijelic, Fahim Mannan, Frank D. Julca-Aguilar, Michael S. Langer, Werner Ritter, Felix Heide:
Gated2Gated: Self-Supervised Depth Estimation from Gated Images. 2801-2811 - Rui Zhu, Zhengqin Li, Janarbek Matai, Fatih Porikli, Manmohan Chandraker:
IRISformer: Dense Vision Transformers for Single-Image Inverse Rendering in Indoor Scenes. 2812-2821 - Tien Do, Khiem Vuong, Hyun Soo Park:
Egocentric Scene Understanding via Multimodal Spatial Rectifier. 2822-2831 - Gwangbin Bae, Ignas Budvytis, Roberto Cipolla:
Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry. 2832-2841 - Ilya Chugunov, Yuxuan Zhang, Zhihao Xia, Xuaner Zhang, Jiawen Chen, Felix Heide:
The Implicit Values of A Good Hand Shake: Handheld Multi-Frame Neural Depth Refinement. 2842-2852 - Gengshan Yang, Minh Vo, Natalia Neverova, Deva Ramanan, Andrea Vedaldi, Hanbyul Joo:
BANMo: Building Animatable 3D Neural Models from Many Casual Videos. 2853-2863 - Kanchana Ranasinghe, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, Michael S. Ryoo:
Self-supervised Video Transformer. 2864-2874 - Shusheng Yang, Xinggang Wang, Yu Li, Yuxin Fang, Jiemin Fang, Wenyu Liu, Xun Zhao, Ying Shan:
Temporally Efficient Vision Transformer for Video Instance Segmentation. 2875-2885 - Su Ho Han, Sukjun Hwang, Seoung Wug Oh, Yeonchool Park, Hyunwoo Kim, Min-Jung Kim, Seon Joo Kim:
VISOLO: Grid-Based Space-Time Aggregation for Efficient Online Video Instance Segmentation. 2886-2895 - Tengda Han, Weidi Xie, Andrew Zisserman:
Temporal Alignment Networks for Long-term Video. 2896-2906 - Shyamal Buch, Cristóbal Eyzaguirre, Adrien Gaidon, Jiajun Wu, Li Fei-Fei, Juan Carlos Niebles:
Revisiting the "Video" in Video-Language Understanding. 2907-2917 - Yicong Li, Xiang Wang, Junbin Xiao, Wei Ji, Tat-Seng Chua:
Invariant Grounding for Video Question Answering. 2918-2927 - He Zhao, Isma Hadji, Nikita Dvornik, Konstantinos G. Derpanis, Richard P. Wildes, Allan D. Jepson:
P3IV: Probabilistic Procedure Planning from Instructional Videos with Weak Supervision. 2928-2938 - Jinglin Xu, Yongming Rao, Xumin Yu, Guangyi Chen, Jie Zhou, Jiwen Lu:
FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment. 2939-2948 - Yinghao Xu, Fangyun Wei, Xiao Sun, Ceyuan Yang, Yujun Shen, Bo Dai, Bolei Zhou, Stephen Lin:
Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition. 2949-2958 - Haodong Duan, Yue Zhao, Kai Chen, Dahua Lin, Bo Dai:
Revisiting Skeleton-based Action Recognition. 2959-2968 - Wentao Bao, Qi Yu, Yu Kong:
OpenTAL: Towards Open Set Temporal Action Localization. 2969-2979 - Mingfei Han, David Junhao Zhang, Yali Wang, Rui Yan, Lina Yao, Xiaojun Chang, Yu Qiao:
Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition. 2980-2989 - Haodong Duan, Nanxuan Zhao, Kai Chen, Dahua Lin:
TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognition. 2990-3000 - Basile Van Hoorick, Purva Tendulkar, Dídac Surís, Dennis Park, Simon Stent, Carl Vondrick:
Revealing Occlusions with 4D Neural Fields. 3001-3011 - Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe:
HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static Images. 3012-3021 - Juncheng Li, Junlin Xie, Long Qian, Linchao Zhu, Siliang Tang, Fei Wu, Yi Yang, Yueting Zhuang, Xin Eric Wang:
Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning. 3022-3031 - Ye Liu, Siyuan Li, Yang Wu, Chang Wen Chen, Ying Shan, Xiaohu Qie:
UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection. 3032-3041 - Dayoung Gong, Joonseok Lee, Manjin Kim, Seong Jong Ha, Minsu Cho:
Future Transformer for Long-term Action Anticipation. 3042-3051 - Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Tao Mei:
MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing. 3052-3062 - Fanyue Wei, Biao Wang, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan:
Learning Pixel-Level Distinctions for Video Highlight Detection. 3063-3072 - Tao Han, Lei Bai, Junyu Gao, Qi Wang, Wanli Ouyang:
DR.VIC: Decomposition and Reasoning for Video Individual Counting. 3073-3082 - Yi Zhou, Hui Zhang, Hana Lee, Shuyang Sun, Pingjun Li, Yangguang Zhu, ByungIn Yoo, Xiaojuan Qi, Jae-Joon Han:
Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation. 3083-3093 - Kailai Zhou, Yibo Wang, Tao Lv, Yunqian Li, Linsen Chen, Qiu Shen, Xun Cao:
Explore Spatio-temporal Aggregation for Insubstantial Object Detection: Benchmark Dataset and Baseline. 3094-3105 - Xiao Lu, Yihong Cao, Sheng Liu, Chengjiang Long, Zipei Chen, Xuanyu Zhou, Yimin Yang, Chunxia Xiao:
Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training. 3106-3115 - Guolei Sun, Yun Liu, Henghui Ding, Thomas Probst, Luc Van Gool:
Coarse-to-Fine Feature Mining for Video Semantic Segmentation. 3116-3127 - Zhaoyang Zeng, Yongsheng Luo, Zhenhua Liu, Fengyun Rao, Dian Li, Weidong Guo, Zhen Wen:
Tencent-MVSE: A Large-Scale Benchmark Dataset for Multi-Modal Video Similarity Evaluation. 3128-3137 - Roei Herzig, Elad Ben-Avraham, Karttikeya Mangalam, Amir Bar, Gal Chechik, Anna Rohrbach, Trevor Darrell, Amir Globerson:
Object-Region Video Transformers. 3138-3149 - Le Yang, Junwei Han, Dingwen Zhang:
Colar: Effective and Efficient Online Action Detection by Consulting Exemplars. 3150-3159 - Zhangyang Gao, Cheng Tan, Lirong Wu, Stan Z. Li:
SimVP: Simpler yet Better Video Prediction. 3160-3170 - Jisoo Jeong, Jamie Menjay Lin, Fatih Porikli, Nojun Kwak:
Imposing Consistency for Optical Flow Estimation. 3171-3181 - Fuchen Long, Zhaofan Qiu, Yingwei Pan, Ting Yao, Jiebo Luo, Tao Mei:
Stand-Alone Inter-Frame Attention in Video Models. 3182-3191 - Ze Liu, Jia Ning, Yue Cao, Yixuan Wei, Zheng Zhang, Stephen Lin, Han Hu:
Video Swin Transformer. 3192-3201 - Hitesh Sapkota, Qi Yu:
Bayesian Nonparametric Submodular Video Partition for Robust Anomaly Detection. 3202-3211 - Angchi Xu, Ling-An Zeng, Wei-Shi Zheng:
Likert Scoring with Grade Decoupling for Long-term Action Assessment. 3222-3231 - Yang Jin, Linchao Zhu, Yadong Mu:
Complex Video Action Reasoning via Learnable Markov Logic Network. 3232-3241 - Junfei Xiao, Longlong Jing, Lin Zhang, Ju He, Qi She, Zongwei Zhou, Alan L. Yuille, Yingwei Li:
Learning from Temporal Gradient for Semi-supervised Action Recognition. 3242-3252 - Jiafan Zhuang, Zilei Wang, Yuan Gao:
Semi-Supervised Video Semantic Segmentation with Inter-Frame Feature Reconstruction. 3253-3261 - Linjiang Huang, Liang Wang, Hongsheng Li:
Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation. 3262-3271 - Shaowei Liu, Subarna Tripathi, Somdeb Majumdar, Xiaolong Wang:
Joint Hand Motion and Interaction Hotspots Prediction from Egocentric Videos. 3272-3282 - Mohit Goyal, Sahil Modi, Rishabh Goyal, Saurabh Gupta:
Human Hands as Probes for Interactive Object Understanding. 3283-3293 - Dan Liu, Libo Zhang, Yanjun Wu:
LD-ConGR: A Large RGB-D Video Dataset for Long-Distance Continuous Gesture Recognition. 3294-3302 - Alex Jinpeng Wang, Yixiao Ge, Guanyu Cai, Rui Yan, Xudong Lin, Ying Shan, Xiaohu Qie, Mike Zheng Shou:
Object-aware Video-language Pre-training for Retrieval. 3303-3312 - Zexing Du, Xue Wang, Guoqing Zhou, Qing Wang:
Fast and Unsupervised Action Boundary Detection for Action Segmentation. 3313-3322 - Shen Yan, Xuehan Xiong, Anurag Arnab, Zhichao Lu, Mi Zhang, Chen Sun, Cordelia Schmid:
Multiview Transformers for Video Recognition. 3323-3333 - Yuhan Shen, Ehsan Elhamifar:
Semi-Weakly-Supervised Learning of Complex Actions from Instructional Task Videos. 3334-3344 - Jiaqi Tang, Zhaoyang Liu, Chen Qian, Wayne Wu, Limin Wang:
Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection. 3345-3354 - Daniel Geng, Max Hamilton, Andrew Owens:
Comparing Correspondences: Video Prediction with Correspondence-wise Losses. 3355-3366 - Seung Hyun Lee, Wonseok Roh, Wonmin Byeon, Sang Ho Yoon, Chanyoung Kim, Jinkyu Kim, Sangpil Kim:
Sound-Guided Semantic Image Manipulation. 3367-3376 - Borong Liang, Yan Pan, Zhizhi Guo, Hang Zhou, Zhibin Hong, Xiaoguang Han, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
Expressive Talking Head Generation with Granular Audio-Visual Control. 3377-3386 - Fa-Ting Hong, Longhao Zhang, Li Shen, Dan Xu:
Depth-Aware Generative Adversarial Network for Talking Head Video Generation. 3387-3396 - Jae Shin Yoon, Duygu Ceylan, Tuanfeng Y. Wang, Jingwan Lu, Jimei Yang, Zhixin Shu, Hyun Soo Park:
Learning Motion-Dependent Appearance for High-Fidelity Rendering of Dynamic Humans from a Single Camera. 3397-3407 - Yang Zhou, Jimei Yang, Dingzeyu Li, Jun Saito, Deepali Aneja, Evangelos Kalogerakis:
Audio-driven Neural Gesture Reenactment with Video Motion Graphs. 3408-3418 - Junfeng Lyu, Zhibo Wang, Feng Xu:
Portrait Eyeglasses and Shadow Removal by Leveraging 3D Synthetic Data. 3419-3429 - Ruili Feng, Cheng Ma, Chengji Shen, Xin Gao, Zhenjiang Liu, Xiaobo Li, Kairi Ou, Deli Zhao, Zheng-Jun Zha:
Weakly Supervised High-Fidelity Clothing Model Generation. 3430-3439 - You Xie, Huiqi Mao, Angela Yao, Nils Thuerey:
TemporalUV: Capturing Loose Clothing with Temporally Coherent UV Coordinates. 3440-3449 - Han Yang, Xinrui Yu, Ziwei Liu:
Full-Range Virtual Try-On with Recurrent Tri-Level Transform. 3450-3459 - Sen He, Yi-Zhe Song, Tao Xiang:
Style-Based Global Appearance Flow for Virtual Try-On. 3460-3469 - Xin Dong, Fuwei Zhao, Zhenyu Xie, Xijin Zhang, Daniel K. Du, Min Zheng, Xiang Long, Xiaodan Liang, Jianchao Yang:
Dressing in the Wild by Watching Dance Videos. 3470-3479 - Jinwoo Kim, Heeseok Oh, Seongjean Kim, Hoseok Tong, Sanghoon Lee:
A Brand New Dance Partner: Music-Conditioned Pluralistic Dancing Controlled by Multiple Dance Genres. 3480-3490 - Yifang Men, Yuan Yao, Miaomiao Cui, Zhouhui Lian, Xuansong Xie, Xian-Sheng Hua:
Unpaired Cartoon Image Synthesis via Gated Cycle Mapping. 3491-3500 - Jingjing Ren, Qingqing Zheng, Yuanyuan Zhao, Xuemiao Xu, Chen Li:
DLFormer: Discrete Latent Transformer for Video Inpainting. 3501-3510 - Duolikun Danier, Fan Zhang, David Bull:
ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation. 3511-3521 - Liying Lu, Ruizheng Wu, Huaijia Lin, Jiangbo Lu, Jiaya Jia:
Video Frame Interpolation with Transformer. 3522-3532 - Dawit Mureja Argaw, In So Kweon:
Long-term Video Frame Interpolation via Feature Propagation. 3533-3542 - Ping Hu, Simon Niklaus, Stan Sclaroff, Kate Saenko:
Many-to-many Splatting for Efficient Video Frame Interpolation. 3543-3552 - Xuanchi Ren, Xiaolong Wang:
Look Outside the Room: Synthesizing A Consistent Long-Term 3D Scene Video from A Single Image. 3553-3563 - Mengshun Hu, Kui Jiang, Liang Liao, Jing Xiao, Junjun Jiang, Zheng Wang:
Spatial-Temporal Space Hand-in-Hand: Spatial-Temporal Video Super-Resolution via Cycle-Projected Mutual Learning. 3564-3573 - Willi Menapace, Stéphane Lathuilière, Aliaksandr Siarohin, Christian Theobalt, Sergey Tulyakov, Vladislav Golyanik, Elisa Ricci:
Playable Environments: Video Manipulation in Space and Time. 3574-3583 - Lin Zhu, Xiao Wang, Yi Chang, Jianing Li, Tiejun Huang, Yonghong Tian:
Event-based Video Reconstruction via Potential-assisted Spiking Neural Network. 3584-3594 - Wei Yu, Wenxin Chen, Songheng Yin, Steve Easterbrook, Animesh Garg:
Modular Action Concept Grounding in Semantic Video Prediction. 3595-3604 - Ligong Han, Jian Ren, Hsin-Ying Lee, Francesco Barbieri, Kyle Olszewski, Shervin Minaee, Dimitris N. Metaxas, Sergey Tulyakov:
Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning. 3605-3615 - Ivan Skorokhodov, Sergey Tulyakov, Mohamed Elhoseiny:
StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2. 3616-3626 - Jiale Tao, Biao Wang, Borun Xu, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan:
Structure-Aware Motion Transfer with Deformable Anchor Model. 3627-3636 - Yoav Shalev, Lior Wolf:
Image Animation with Perturbed Masks. 3637-3646 - Jian Zhao, Hui Zhang:
Thin-Plate Spline Motion Model for Image Animation. 3647-3656 - Aniruddha Mahapatra, Kuldeep Kulkarni:
Controllable Animation of Fluid Elements in Still Images. 3657-3666 - Atsuhiro Noguchi, Umar Iqbal, Jonathan Tremblay, Tatsuya Harada, Orazio Gallo:
Watch It Move: Unsupervised Discovery of 3D Joints for Re-Posing of Articulated Objects. 3667-3677 - Peng Du, Jifeng Ning, Jiguang Cui, Shaoli Huang, Xinchao Wang, Jiaxin Wang:
Geometric Structure Preserving Warp for Natural Image Stitching. 3678-3686 - Pei Chen, Yangkang Zhang, Zejian Li, Lingyun Sun:
Few-Shot Incremental Learning for Label-to-Image Translation. 3687-3697 - Haiwei Chen, Jiayi Liu, Weikai Chen, Shichen Liu, Yajie Zhao:
Exemplar-based Pattern Synthesis with Implicit Periodic Field Network. 3698-3707 - Xianling Zhang, Nathan Tseng, Ameerah Syed, Rohan Bhasin, Nikita Jaipuria:
SIMBAR: Single Image-Based Scene Relighting For Effective Data Augmentation For Automated Driving Vision Tasks. 3708-3718 - Jiahao Yu, Li Chen, Mingrui Zhang, Mading Li:
SoftCollage: A Differentiable Probabilistic Tree Generator for Image Collage. 3719-3728 - Ning Kang, Shanzhao Qiu, Shifeng Zhang, Zhenguo Li, Shutao Xia:
PILC: Practical Image Lossless Compression with an End-to-end GPU Oriented Neural Framework. 3729-3738 - Klaus Greff, Francois Belletti, Lucas Beyer, Carl Doersch, Yilun Du, Daniel Duckworth, David J. Fleet, Dan Gnanapragasam, Florian Golemo, Charles Herrmann, Thomas Kipf, Abhijit Kundu, Dmitry Lagun, Issam H. Laradji, Hsueh-Ti Derek Liu, Henning Meyer, Yishu Miao, Derek Nowrouzezahrai, A. Cengiz Öztireli, Etienne Pot, Noha Radwan, Daniel Rebain, Sara Sabour, Mehdi S. M. Sajjadi, Matan Sela, Vincent Sitzmann, Austin Stone, Deqing Sun, Suhani Vora, Ziyu Wang, Tianhao Wu, Kwang Moo Yi, Fangcheng Zhong, Andrea Tagliasacchi:
Kubric: A scalable dataset generator. 3739-3751 - Manuel Rey-Area, Mingze Yuan, Christian Richardt:
360MonoDepth: High-Resolution 360° Monocular Depth Estimation. 3752-3762 - Kalyan Vasudev Alwala, Abhinav Gupta, Shubham Tulsiani:
Pretrain, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction. 3763-3772 - Tuo Cao, Fei Luo, Yanping Fu, Wenxiao Zhang, Shengjie Zheng, Chunxia Xiao:
DGECN: A Depth-Guided Edge Convolutional Network for End-to-End 6D Pose Estimation. 3773-3782 - Zequn Qin, Xi Li:
MonoGround: Detecting Monocular 3D Objects from the Ground. 3783-3792 - Xin Wen, Junsheng Zhou, Yu-Shen Liu, Hua Su, Zhen Dong, Zhizhong Han:
3D Shape Reconstruction from 2D Images with Disentangled Attribute Flow. 3793-3803 - Cho-Ying Wu, Jialiang Wang, Michael Hall, Ulrich Neumann, Shuochen Su:
Toward Practical Monocular Indoor Depth Estimation. 3804-3814 - Georgy Ponimatkin, Yann Labbé, Bryan C. Russell, Mathieu Aubry, Josef Sivic:
Focal Length and Object Pose Estimation via Render and Compare. 3815-3824 - Can Wang, Menglei Chai, Mingming He, Dongdong Chen, Jing Liao:
CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields. 3825-3834 - Heming Zhu, Lingteng Qiu, Yuda Qiu, Xiaoguang Han:
Registering Explicit to Implicit: Towards High-Fidelity Garment mesh Reconstruction from Single Images. 3835-3844 - Soo Ye Kim, Jianming Zhang, Simon Niklaus, Yifei Fan, Simon Chen, Zhe Lin, Munchurl Kim:
Layered Depth Refinement with Mask Guidance. 3845-3855 - Jiacheng Chen, Yiming Qian, Yasutaka Furukawa:
HEAT: Holistic Edge Attention Transformer for Structured Reconstruction. 3856-3865 - Nadine Rüegg, Silvia Zuffi, Konrad Schindler, Michael J. Black:
BARC: Learning to Regress 3D Dog Shape from Images by Exploiting Breed Information. 3866-3874 - Peixuan Li, Jieyu Jin:
Time3D: End-to-End Joint Monocular 3D Object Detection and Tracking for Autonomous Driving. 3875-3884 - Yufei Ye, Abhinav Gupta, Shubham Tulsiani:
What's in your hands? 3D Reconstruction of Generic Objects in Hands. 3885-3895 - Qianqian Wang, Zhengqi Li, David Salesin, Noah Snavely, Brian Curless, Janne Kontkanen:
3D Moments from Near-Duplicate Photos. 3896-3905 - Weihao Yuan, Xiaodong Gu, Zuozhuo Dai, Siyu Zhu, Ping Tan:
Neural Window Fully-connected CRFs for Monocular Depth Estimation. 3906-3915 - Jérôme Revaud, Vincent Leroy, Philippe Weinzaepfel, Boris Chidlovskii:
PUMP: Pyramidal and Uniqueness Matching Priors for Unsupervised Learning of Local Descriptors. 3916-3926 - Yannick Verdié, Jifei Song, Barnabé Mas, Benjamin Busam, Ales Leonardis, Steven McDonagh:
CroMo: Cross-Modal Learning for Monocular Depth Estimation. 3927-3937 - Navami Kairanda, Edith Tretschk, Mohamed Elgharib, Christian Theobalt, Vladislav Golyanik:
$\phi$-SfT: Shape-from-Template with a Physics-Based Deformation Model. 3938-3948 - Hongwei Yi, Chun-Hao P. Huang, Dimitrios Tzionas, Muhammed Kocabas, Mohamed Hassan, Siyu Tang, Justus Thies, Michael J. Black:
Human-Aware Object Placement for Visual Environment Reconstruction. 3949-3960 - Norman Müller, Andrea Simonelli, Lorenzo Porzi, Samuel Rota Bulò, Matthias Nießner, Peter Kontschieder:
AutoRF: Learning 3D Object Radiance Fields from Single View Observations. 3961-3970 - Shengqu Cai, Anton Obukhov, Dengxin Dai, Luc Van Gool:
Pix2NeRF: Unsupervised Conditional $\pi$-GAN for Single Image to Neural Radiance Fields Translation. 3971-3980 - Anh-Quan Cao, Raoul de Charette:
MonoScene: Monocular 3D Semantic Scene Completion. 3981-3991 - Felix Petersen, Bastian Goldluecke, Christian Borgelt, Oliver Deussen:
GenDR: A Generalized Differentiable Renderer. 3992-4001 - Kuan-Chih Huang, Tsung-Han Wu, Hung-Ting Su, Winston H. Hsu:
MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer. 4002-4011 - Can Gümeli, Angela Dai, Matthias Nießner:
ROCA: Robust CAD Model Retrieval and Alignment from a Single Image. 4012-4021 - Chang Yu, Xiangyu Zhu, Xiaomei Zhang, Zidu Wang, Zhaoxiang Zhang, Zhen Lei:
HP-Capsule: Unsupervised Face Part Discovery by Hierarchical Parsing Capsule Network. 4022-4031 - Xiang An, Jiankang Deng, Jia Guo, Ziyong Feng, Xuhan Zhu, Jing Yang, Tongliang Liu:
Killing Two Birds with One Stone: Efficient and Robust Training of Face Recognition CNNs by Partial FC. 4032-4041 - Jiahao Xia, Weiwei Qu, Wenjian Huang, Jianguo Zhang, Xi Wang, Min Xu:
Sparse Local Patch Transformer for Robust Face Alignment and Landmarks Inherent Relation Learning. 4042-4051 - Mingjie He, Jie Zhang, Shiguang Shan, Xilin Chen:
Enhancing Face Recognition with Self-Supervised 3D Reconstruction. 4052-4061 - Chang Liu, Xiang Yu, Yi-Hsuan Tsai, Masoud Faraki, Ramin Moslemi, Manmohan Chandraker, Yun Fu:
Learning to Learn across Diverse Data Biases in Deep Face Recognition. 4062-4072 - Kai Wang, Shuo Wang, Panpan Zhang, Zhipeng Zhou, Zheng Zhu, Xiaobo Wang, Xiaojiang Peng, Baigui Sun, Hao Li, Yang You:
An Efficient Training Approach for Very Large Scale Face Recognition. 4073-4082 - Yang Liu, Fei Wang, Jiankang Deng, Zhipeng Zhou, Baigui Sun, Hao Li:
MogFace: Towards a Deeper Appreciation on Face Detection. 4083-4092 - Shuai Jia, Chao Ma, Taiping Yao, Bangjie Yin, Shouhong Ding, Xiaokang Yang:
Exploring Frequency Adversarial Attacks for Face Forgery Detection. 4093-4102 - Junyi Cao, Chao Ma, Taiping Yao, Shen Chen, Shouhong Ding, Xiaokang Yang:
End-to-End Reconstruction-Classification Learning for Face Forgery Detection. 4103-4112 - Zhuo Wang, Zezheng Wang, Zitong Yu, Weihong Deng, Jiahong Li, Tingting Gao, Zhongyuan Wang:
Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing. 4113-4123 - Chenqian Yan, Yuge Zhang, Quanlu Zhang, Yaming Yang, Xinyang Jiang, Yuqing Yang, Baoyuan Wang:
Privacy-preserving Online AutoML for Domain-Specific Face Detection. 4124-4134 - Nataniel Ruiz, Adam Kortylewski, Weichao Qiu, Cihang Xie, Sarah Adel Bargal, Alan L. Yuille, Stan Sclaroff:
Simulated Adversarial Testing of Face Recognition Models. 4135-4145 - Qingping Zheng, Jiankang Deng, Zheng Zhu, Ying Li, Stefanos Zafeiriou:
Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing. 4146-4155 - Hangyu Li, Nannan Wang, Xi Yang, Xiaoyu Wang, Xinbo Gao:
Towards Semi-Supervised Deep Facial Expression Recognition with An Adaptive Confidence Margin. 4156-4165 - Hui Li, Zidong Guo, Seon-Min Rhee, Seungju Han, Jae-Joon Han:
Towards Accurate Facial Landmark Detection via Cascaded Transformers. 4166-4175 - Zitong Yu, Yuming Shen, Jingang Shi, Hengshuang Zhao, Philip H. S. Torr, Guoying Zhao:
PhysFormer: Facial Video-based Physiological Measurement with Temporal Difference Transformer. 4176-4186 - Mingfang Zhang, Yunfei Liu, Feng Lu:
GazeOnce: Real-Time Multi-Person Gaze Estimation. 4187-4196 - Yiwei Bao, Yunfei Liu, Haofei Wang, Feng Lu:
Generalizing Gaze Estimation with Rotation Consistency. 4197-4206 - Andrew Z. Hou, Michel Sarkis, Ning Bi, Yiying Tong, Xiaoming Liu:
Face Relighting with Geometrically Consistent Shadows. 4207-4216 - Yiqian Wu, Yong-Liang Yang, Xiaogang Jin:
HairMapper: Removing Hair from Portraits Using GANs. 4217-4226 - Zhenyu Zhang, Yanhao Ge, Ying Tai, Xiaoming Huang, Chengjie Wang, Hao Tang, Dongjin Huang, Zhifeng Xie:
Learning to Restore 3D Face from In-the-Wild Degraded Images. 4227-4237 - Yuchao Wang, Haochen Wang, Yujun Shen, Jingjing Fei, Wei Li, Guoqiang Jin, Liwei Wu, Rui Zhao, Xinyi Le:
Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels. 4238-4247 - Yuyuan Liu, Yu Tian, Yuanhong Chen, Fengbei Liu, Vasileios Belagiannis, Gustavo Carneiro:
Perturbed and Strict Mean Teachers for Semi-supervised Semantic Segmentation. 4248-4257 - Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi, Yang Gao:
ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation. 4258-4267 - Beomyoung Kim, Youngjoon Yoo, Chaeeun Rhee, Junmo Kim:
Beyond Semantic to Instance Segmentation: Weakly-Supervised Instance Segmentation via Semantic Knowledge Transfer and Self-Refinement. 4268-4277 - Qi Chen, Lingxiao Yang, Jianhuang Lai, Xiaohua Xie:
Self-supervised Image-specific Prototype Exploration for Weakly Supervised Semantic Segmentation. 4278-4288 - Tianfei Zhou, Meijie Zhang, Fang Zhao, Jianwu Li:
Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation. 4289-4299 - Lian Xu, Wanli Ouyang, Mohammed Bennamoun, Farid Boussaïd, Dan Xu:
Multi-class Token Transformer for Weakly Supervised Semantic Segmentation. 4300-4309 - Ye Du, Zehua Fu, Qingjie Liu, Yunhong Wang:
Weakly Supervised Semantic Segmentation by Pixel-to-Prototype Contrast. 4310-4319 - Minhyun Lee, Dongseob Kim, Hyunjung Shim:
Threshold Matters in WSSS: Manipulating the Activation for the Robust and Accurate Segmentation Model Against Thresholds. 4320-4329 - Yuyang Zhao, Zhun Zhong, Nicu Sebe, Gim Hee Lee:
Novel Class Discovery in Semantic Segmentation. 4330-4339 - Jin Kim, Jiyoung Lee, Jungin Park, Dongbo Min, Kwanghoon Sohn:
Pin the Memory: Learning to Generalize Semantic Segmentation. 4340-4350 - Shaohua Guo, Liang Liu, Zhenye Gan, Yabiao Wang, Wuhao Zhang, Chengjie Wang, Guannan Jiang, Wei Zhang, Ran Yi, Lizhuang Ma, Ke Xu:
ISDNet: Integrating Shallow and Deep Networks for Efficient Ultra-high Resolution Segmentation. 4351-4360 - Fabio Cermelli, Dario Fontanel, Antonio Tavera, Marco Ciccone, Barbara Caputo:
Incremental Learning in Semantic Segmentation from Image Labels. 4361-4371 - Justin Lazarow, Weijian Xu, Zhuowen Tu:
Instance Segmentation with Mask-supervised Polygonal Boundary Transformers. 4372-4381 - Chenming Zhu, Xuanye Zhang, Yanran Li, Liangdong Qiu, Kai Han, Xiaoguang Han:
SharpContour: A Contour-based Boundary Refinement Approach for Efficient and Accurate Instance Segmentation. 4382-4391 - Adrian Wolny, Qin Yu, Constantin Pape, Anna Kreshuk:
Sparse Object-level Supervision for Instance Segmentation with Pixel Embeddings. 4392-4401 - Lei Ke, Martin Danelljan, Xia Li, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu:
Mask Transfiner for High-Quality Instance Segmentation. 4402-4411 - Weiyao Wang, Matt Feiszli, Heng Wang, Jitendra Malik, Du Tran:
Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity. 4412-4422 - Tianheng Cheng, Xinggang Wang, Shaoyu Chen, Wenqiang Zhang, Qian Zhang, Chang Huang, Zhaoxiang Zhang, Wenyu Liu:
Sparse Instance Activation for Real-Time Instance Segmentation. 4423-4432 - Tao Zhang, Shiqing Wei, Shunping Ji:
E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance Segmentation. 4433-4442 - Mina Ghadimi Atigh, Julian Schoep, Erman Acar, Nanne van Noord, Pascal Mettes:
Hyperbolic Image Segmentation. 4443-4452 - Dasol Han, Jaewook Yoo, Dokwan Oh:
SeeThroughNet: Resurrection of Auxiliary Loss by Preserving Class Probability Information. 4453-4462 - Kunliang Liu, Ouk Choi, Jianming Wang, Wonjun Hwang:
CDGNet: Class Distribution Guided Network for Human Parsing. 4463-4472 - Jinheng Xie, Xianxu Hou, Kai Ye, Linlin Shen:
CLIMS: Cross Language Image Matching for Weakly Supervised Semantic Segmentation. 4473-4482 - Olga Veksler, Yuri Boykov:
Sparse Non-local CRF. 4483-4493 - Yijie Zhong, Bo Li, Lv Tang, Senyun Kuang, Shuang Wu, Shouhong Ding:
Detecting Camouflaged Object in Frequency Domain. 4494-4503 - Wei Liao:
Progressive Minimal Path Method with Embedded CNN. 4504-4512 - Chang Liu, Chun Yang, Xu-Cheng Yin:
Open-Set Text Recognition via Character-Context Decoupling. 4513-4522 - Hao Liu, Xin Li, Bing Liu, Deqiang Jiang, Yinsong Liu, Bo Ren:
Neural Collaborative Graph Machines for Table Structure Recognition. 4523-4532 - Xiangwei Jiang, Rujiao Long, Nan Xue, Zhibo Yang, Cong Yao, Gui-Song Xia:
Revisiting Document Image Dewarping by Grid Regularization. 4533-4542 - Ye Yuan, Xiao Liu, Wondimu Dikubab, Hui Liu, Zhilong Ji, Zhongqin Wu, Xiang Bai:
Syntax-Aware Network for Handwritten Mathematical Expression Recognition. 4543-4552 - Jingqun Tang, Wenqing Zhang, Hongye Liu, Mingkun Yang, Bo Jiang, Guanglong Hu, Xiang Bai:
Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection. 4553-4562 - Chuhui Xue, Zichen Tian, Fangneng Zhan, Shijian Lu, Song Bai:
Fourier Document Restoration for Robust Document Dewarping and Recognition. 4563-4572 - Zhangxuan Gu, Changhua Meng, Ke Wang, Jun Lan, Weiqiang Wang, Ming Gu, Liqing Zhang:
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding. 4573-4582 - Mingxin Huang, Yuliang Liu, Zhenghao Peng, Chongyu Liu, Dahua Lin, Shenggao Zhu, Nicholas Jing Yuan, Kai Ding, Lianwen Jin:
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition. 4583-4593 - Yair Kittenplon, Inbal Lavi, Sharon Fogel, Yarin Bar, R. Manmatha, Pietro Perona:
Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer. 4594-4603 - Ahmed S. Nassar, Nikolaos Livathinos, Maksym Lysak, Peter W. J. Staar:
TableFormer: Table Structure Understanding with Transformers. 4604-4613 - Hao Wang, Junchao Liao, Tianheng Cheng, Zewen Gao, Hao Liu, Bo Ren, Xiang Bai, Wenyu Liu:
Knowledge Mining with Scene Text for Fine-Grained Recognition. 4614-4623 - Brandon Smock, Rohith Pesala, Robin Abraham:
PubTables-1M: Towards comprehensive table extraction from unstructured documents. 4624-4632 - Zhendong Yang, Zhe Li, Xiaohu Jiang, Yuan Gong, Zehuan Yuan, Danpei Zhao, Chun Yuan:
Focal and Global Knowledge Distillation for Detectors. 4633-4642 - Jiahao Fan, Huabin Liu, Wenjie Yang, John See, Aixin Zhang, Weiyao Lin:
Speed up Object Detection on Gigapixel-level Images with Patch Arrangement. 4643-4653 - Weixiang Hong, Jiangwei Lao, Wang Ren, Jian Wang, Jingdong Chen, Wei Chu:
Training Object Detectors from Scratch: An Empirical Study in the Era of Vision Transformer. 4652-4661 - Ahmet Iscen, Jack Valmadre, Anurag Arnab, Cordelia Schmid:
Learning with Neighbor Consistency for Noisy Labels. 4662-4671 - Chaoqun Wan, Xu Shen, Yonggang Zhang, Zhiheng Yin, Xinmei Tian, Feng Gao, Jianqiang Huang, Xian-Sheng Hua:
Meta Convolutional Neural Networks for Single Domain Generalization. 4672-4681 - Haowei Zhu, Wenjing Ke, Dong Li, Ji Liu, Lu Tian, Yi Shan:
Dual Cross-Attention Learning for Fine-Grained Visual Categorization and Object Re-Identification. 4682-4692 - Zhuangzhuang Chen, Jin Zhang, Zhuonan Lai, Jie Chen, Zun Liu, Jianqiang Li:
Geometry-Aware Guided Loss for Deep Crack Recognition. 4693-4702 - Qi Jia, Shuilian Yao, Yu Liu, Xin Fan, Risheng Liu, Zhongxuan Luo:
Segment, Magnify and Reiterate: Detecting Camouflaged Objects the Hard Way. 4703-4712 - Qinghang Hong, Fengming Liu, Dong Li, Ji Liu, Lu Tian, Yi Shan:
Dynamic Sparse R-CNN. 4713-4722 - Senqi Cao, Zhongfei Zhang:
Deep Hybrid Models for Out-of-Distribution Detection. 4723-4733 - Hongyang Gu, Jianmin Li, Guangyuan Fu, Chifong Wong, Xinghao Chen, Jun Zhu:
AutoLoss-GMS: Searching Generalized Margin-based Softmax Loss Function for Person Re-identification. 4734-4743 - Zhikang Wang, Feng Zhu, Shixiang Tang, Rui Zhao, Lihuo He, Jiangning Song:
Feature Erasing and Diffusion Network for Occluded Person Re-Identification. 4744-4753 - Emanuel Ben Baruch, Tal Ridnik, Itamar Friedman, Avi Ben-Cohen, Nadav Zamir, Asaf Noy, Lihi Zelnik-Manor:
Multi-label Classification with Partial Annotations using Class-aware Selective Loss. 4754-4762 - Duy-Kien Nguyen, Jihong Ju, Olaf Booij, Martin R. Oswald, Cees G. M. Snoek:
BoxeR: Box-Attention for 2D and 3D Transformers. 4763-4772 - Sai Rajeswar, Pau Rodríguez, Soumye Singhal, David Vázquez, Aaron C. Courville:
Multi-label Iterated Learning for Image Classification with Label Ambiguity. 4773-4783 - Zhuofan Xia, Xuran Pan, Shiji Song, Li Erran Li, Gao Huang:
Vision Transformer with Deformable Attention. 4784-4793 - Yanghao Li, Chao-Yuan Wu, Haoqi Fan, Karttikeya Mangalam, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer:
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection. 4794-4804 - Binghui Chen, Pengyu Li, Xiang Chen, Biao Wang, Lei Zhang, Xian-Sheng Hua:
Dense Learning based Semi-Supervised Object Detection. 4805-4814 - Yali Li, Shengjin Wang:
R(Det)2: Randomized Decision Routing for Object Detection. 4815-4824 - Kareem M. Metwaly, Aerin Kim, Elliot Branson, Vishal Monga:
GlideNet: Global, Local and Intrinsic based Dense Embedding NETwork for Multi-category Attributes Prediction. 4825-4836 - Jongmin Lee, Byungjin Kim, Minsu Cho:
Self-Supervised Equivariant Learning for Oriented Keypoint Detection. 4837-4847 - Jingzhou Chen, Peng Wang, Jian Liu, Yuntao Qian:
Label Relation Graphs Enhanced Hierarchical Residual Network for Hierarchical Multi-Granularity Classification. 4848-4857 - Xuehui Yu, Pengfei Chen, Di Wu, Najmul Hassan, Guorong Li, Junchi Yan, Humphrey Shi, Qixiang Ye, Zhenjun Han:
Object Localization under Single Coarse Point Supervision. 4858-4867 - Gabriele Moreno Berton, Carlo Masone, Barbara Caputo:
Rethinking Visual Geo-localization for Large-Scale Applications. 4868-4878 - Supreeth Narasimhaswamy, Thanh Nguyen, Mingzhen Huang, Minh Hoai:
Whose Hands are These? Hand Detection and Hand-Body Association in the Wild. 4879-4889 - Yanan Wang, Xuezhi Liang, Shengcai Liao:
Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification. 4890-4899 - Xingxuan Zhang, Linjun Zhou, Renzhe Xu, Peng Cui, Zheyan Shen, Haoxin Liu:
Towards Unsupervised Domain Generalization. 4900-4910 - Haoqi Wang, Zhizhong Li, Litong Feng, Wayne Zhang:
ViM: Out-Of-Distribution with Virtual-logit Matching. 4911-4920 - Arnav Chavan, Zhiqiang Shen, Zhuang Liu, Zechun Liu, Kwang-Ting Cheng, Eric P. Xing:
Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space. 4921-4931 - Zechun Liu, Kwang-Ting Cheng, Dong Huang, Eric P. Xing, Zhiqiang Shen:
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. 4932-4942 - Dongxu Li, Junnan Li, Hongdong Li, Juan Carlos Niebles, Steven C. H. Hoi:
Align and Prompt: Video-and-Language Pre-training with Entity Prompts. 4943-4953 - Zihan Ding, Tianrui Hui, Junshi Huang, Xiaoming Wei, Jizhong Han, Si Liu:
Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation. 4954-4963 - Jiannan Wu, Yi Jiang, Peize Sun, Zehuan Yuan, Ping Luo:
Language as Queries for Referring Video Object Segmentation. 4964-4974 - Adam Botach, Evgenii Zheltonozhskii, Chaim Baskin:
End-to-End Referring Video Object Segmentation with Multimodal Transformers. 4975-4985 - Dongming Wu, Xingping Dong, Ling Shao, Jianbing Shen:
Multi-Level Representation Learning with Semantic Alignment for Referring Video Object Segmentation. 4986-4995 - Satya Krishna Gorti, Noël Vouitsis, Junwei Ma, Keyvan Golestan, Maksims Volkovs, Animesh Garg, Guangwei Yu:
X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval. 4996-5005 - Dohwan Ko, Joonmyung Choi, Juyeon Ko, Shinyeong Noh, Kyoung-Woon On, Eun-Sol Kim, Hyunwoo J. Kim:
Video-Text Representation Learning via Differentiable Weak Temporal Alignment. 5006-5015 - Mattia Soldan, Alejandro Pardo, Juan León Alcázar, Fabian Caba Heilbron, Chen Zhao, Silvio Giancola, Bernard Ghanem:
MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions. 5016-5025 - Hongwei Xue, Tiankai Hang, Yanhong Zeng, Yuchong Sun, Bei Liu, Huan Yang, Jianlong Fu, Baining Guo:
Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions. 5026-5035 - Mona Gandhi, Mustafa Omer Gul, Eva Prakash, Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala:
Measuring Compositional Consistency for Video Question Answering. 5036-5045 - Paola Cascante-Bonilla, Hui Wu, Letao Wang, Rogério Feris, Vicente Ordonez:
Sim VQA: Exploring Simulated Environments for Visual Question Answering. 5046-5056 - Feng Gao, Qing Ping, Govind Thattai, Aishwarya N. Reganti, Ying Nian Wu, Prem Natarajan:
Transform-Retrieve-Generate: Natural Language-Centric Outside-Knowledge Visual Question Answering. 5057-5067 - Vipul Gupta, Zhuowan Li, Adam Kortylewski, Chenyu Zhang, Yingwei Li, Alan L. Yuille:
SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering. 5068-5078 - Yang Ding, Jing Yu, Bang Liu, Yue Hu, Mingxin Cui, Qi Wu:
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering. 5079-5088 - Chenchen Jing, Yunde Jia, Yuwei Wu, Xinyu Liu, Qi Wu:
Maintaining Reasoning Consistency in Compositional Visual Question Answering. 5089-5098 - Aoxiong Yin, Zhou Zhao, Weike Jin, Meng Zhang, Xingshan Zeng, Xiaofei He:
MLSLT: Towards Multilingual Sign Language Translation. 5099-5109 - Yutong Chen, Fangyun Wei, Xiao Sun, Zhirong Wu, Stephen Lin:
A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation. 5110-5120 - Ronglai Zuo, Brian Mak:
C2SLR: Consistency-enhanced Continuous Sign Language Recognition. 5121-5130 - Ben Saunders, Necati Cihan Camgöz, Richard Bowden:
Signing at Scale: Learning to Co-Articulate Signs for Large-Scale Photo-Realistic Sign Language Production. 5131-5141 - Chuan Guo, Shihao Zou, Xinxin Zuo, Sen Wang, Wei Ji, Xingyu Li, Li Cheng:
Generating Diverse and Natural 3D Human Motions from Text. 5142-5151 - K. R. Prajwal, Triantafyllos Afouras, Andrew Zisserman:
Sub-word Level Lip Reading With Visual Attention. 5152-5162 - Ram Ramrakhya, Eric Undersander, Dhruv Batra, Abhishek Das:
Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale. 5163-5173 - Mengjun Cheng, Yipeng Sun, Longchao Wang, Xiongwei Zhu, Kun Yao, Jie Chen, Guoli Song, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval. 5174-5183 - Simion-Vlad Bogolin, Ioana Croitoru, Hailin Jin, Yang Liu, Samuel Albanie:
Cross Modal Retrieval with Querybank Normalisation. 5184-5195 - Yuning Lu, Jianzhuang Liu, Yonggang Zhang, Yajing Liu, Xinmei Tian:
Prompt Distribution Learning. 5196-5205 - Yi Li, Rameswar Panda, Yoon Kim, Chun-Fu Richard Chen, Rogério Feris, David D. Cox, Nuno Vasconcelos:
VALHALLA: Visual Hallucination for Machine Translation. 5206-5216 - Yi-Lin Sung, Jaemin Cho, Mohit Bansal:
VL-ADAPTER: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks. 5217-5227 - Tristan Thrush, Ryan Jiang, Max Bartolo, Amanpreet Singh, Adina Williams, Douwe Kiela, Candace Ross:
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality. 5228-5238 - Qiang Chen, Qiman Wu, Jian Wang, Qinghao Hu, Tao Hu, Errui Ding, Jian Cheng, Jingdong Wang:
MixFormer: Mixing Features across Windows and Dimensions. 5239-5249 - Zhe Chen, Jing Zhang, Dacheng Tao:
Recurrent Glimpse-based Decoder for Detection with Transformer. 5250-5259 - Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Xiaoyi Dong, Lu Yuan, Zicheng Liu:
Mobile-Former: Bridging MobileNet and Transformer. 5260-5269 - Sivan Harary, Eli Schwartz, Assaf Arbelle, Peter W. J. Staar, Shady Abu-Hussein, Elad Amrani, Roei Herzig, Amit Alfassy, Raja Giryes, Hilde Kuehne, Dina Katabi, Kate Saenko, Rogério Feris, Leonid Karlinsky:
Unsupervised Domain Generalization by Learning a Bridge Across Domains. 5270-5280 - Wuyang Li, Xinyu Liu, Yixuan Yuan:
SIGMA: Semantic-complete Graph Matching for Domain Adaptive Object Detection. 5281-5290 - Jiaxi Wu, Jiaxin Chen, Mengzhe He, Yiru Wang, Bo Li, Bingqi Ma, Weihao Gan, Wei Wu, Yali Wang, Di Huang:
Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection. 5291-5300 - Zeren Sun, Fumin Shen, Dan Huang, Qiong Wang, Xiangbo Shu, Yazhou Yao, Jinhui Tang:
PNP: Robust Learning from Noisy Labels by Probabilistic Noise Prediction. 5301-5310 - Guangxing Han, Jiawei Ma, Shiyuan Huang, Long Chen, Shih-Fu Chang:
Few-Shot Object Detection with Fully Cross-Transformer. 5311-5320 - Su Been Lee, WonJun Moon, Jae-Pil Heo:
Task Discrepancy Maximization for Fine-grained Few-Shot Classification. 5321-5330 - Weizhe Liu, Nikita Durasov, Pascal Fua:
Leveraging Self-Supervision for Cross-Domain Crowd Counting. 5331-5342 - A. S. M. Iftekhar, Hao Chen, Kaustav Kundu, Xinyu Li, Joseph Tighe, Davide Modolo:
What to look at and where: Semantic and Spatial Refined Transformer for detecting human-object interactions. 5343-5353 - Ziteng Gao, Limin Wang, Bing Han, Sheng Guo:
AdaMixer: A Fast-Converging Query-Based Object Detector. 5354-5363 - Seongwon Lee, Hongje Seong, Suhyeon Lee, Euntai Kim:
Correlation Verification for Image Retrieval. 5364-5374 - Jinrong Yang, Songtao Liu, Zeming Li, Xiaoping Li, Jian Sun:
Real-time Object Detection for Streaming Perception. 5375-5385 - Gabriele Moreno Berton, Riccardo Mereu, Gabriele Trivigno, Carlo Masone, Gabriela Csurka, Torsten Sattler, Barbara Caputo:
Deep Visual Geo-localization Benchmark. 5386-5397 - Ruoxi Shi, Xinyang Jiang, Caihua Shan, Yansen Wang, Dongsheng Li:
RendNet: Unified 2D/3D Recognizer with Latent Space Rendering. 5398-5407 - Xiaopei Wu, Liang Peng, Honghui Yang, Liang Xie, Chenxi Huang, Chengqi Deng, Haifeng Liu, Deng Cai:
Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion. 5408-5417 - Yukang Chen, Yanwei Li, Xiangyu Zhang, Jian Sun, Jiaya Jia:
Focal Sparse Convolutional Networks for 3D Object Detection. 5418-5427 - Qiangeng Xu, Zexiang Xu, Julien Philip, Sai Bi, Zhixin Shu, Kalyan Sunkavalli, Ulrich Neumann:
Point-NeRF: Point-based Neural Radiance Fields. 5428-5438 - Xiaoshuai Zhang, Sai Bi, Kalyan Sunkavalli, Hao Su, Zexiang Xu:
NeRFusion: Fusing Radiance Fields for Large-Scale Scene Reconstruction. 5439-5448 - Cheng Sun, Min Sun, Hwann-Tzong Chen:
Direct Voxel Grid Optimization: Super-fast Convergence for Radiance Fields Reconstruction. 5449-5459 - Jonathan T. Barron, Ben Mildenhall, Dor Verbin, Pratul P. Srinivasan, Peter Hedman:
Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields. 5460-5469 - Michael Niemeyer, Jonathan T. Barron, Ben Mildenhall, Mehdi S. M. Sajjadi, Andreas Geiger, Noha Radwan:
RegNeRF: Regularizing Neural Radiance Fields for View Synthesis from Sparse Inputs. 5470-5480 - Dor Verbin, Peter Hedman, Ben Mildenhall, Todd E. Zickler, Jonathan T. Barron, Pratul P. Srinivasan:
Ref-NeRF: Structured View-Dependent Appearance for Neural Radiance Fields. 5481-5490 - Sara Fridovich-Keil, Alex Yu, Matthew Tancik, Qinhong Chen, Benjamin Recht, Angjoo Kanazawa:
Plenoxels: Radiance Fields without Neural Networks. 5491-5500 - Haoyu Guo, Sida Peng, Haotong Lin, Qianqian Wang, Guofeng Zhang, Hujun Bao, Xiaowei Zhou:
Neural 3D Scene Reconstruction with the Manhattan-world Assumption. 5501-5510 - Tianye Li, Mira Slavcheva, Michael Zollhöfer, Simon Green, Christoph Lassner, Changil Kim, Tanner Schmidt, Steven Lovegrove, Michael Goesele, Richard A. Newcombe, Zhaoyang Lv:
Neural 3D Video Synthesis from Multi-view Video. 5511-5521 - Petr Hruby, Timothy Duff, Anton Leykin, Tomás Pajdla:
Learning to Solve Hard Minimal Problems. 5522-5532 - Yingjie Cai, Kwan-Yee Lin, Chao Zhang, Qiang Wang, Xiaogang Wang, Hongsheng Li:
Learning a Structured Latent Space for Unsupervised Point Cloud Completion. 5533-5543 - Yang Li, Tatsuya Harada:
Lepard: Learning partial point cloud matching in rigid and deformable scenes. 5544-5554 - Kai Zhang, Fujun Luan, Zhengqi Li, Noah Snavely:
IRON: Inverse Rendering by Optimizing Neural SDFs and Materials from Photometric Images. 5555-5564 - Damien Robert, Bruno Vallet, Loïc Landrieu:
Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation. 5565-5574 - Yu Zheng, Yueqi Duan, Jiwen Lu, Jie Zhou, Qi Tian:
HyperDet3D: Learning a Scene-conditioned 3D Object Detector. 5575-5584 - David Novotný, Ignacio Rocco, Samarth Sinha, Alexandre Carlier, Gael Kerchenbaum, Roman Shapovalov, Nikita Smetanin, Natalia Neverova, Benjamin Graham, Andrea Vedaldi:
KeyTr: Keypoint Transporter for 3D Reconstruction of Deformable Objects in Videos. 5585-5594 - Boyi Jiang, Yang Hong, Hujun Bao, Juyong Zhang:
SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video. 5595-5605 - Zhenyu Jiang, Cheng-Chun Hsu, Yuke Zhu:
Ditto: Building Digital Twins of Articulated Objects from Interaction. 5606-5616 - Yurui Zhu, Jie Huang, Xueyang Fu, Feng Zhao, Qibin Sun, Zheng-Jun Zha:
Bijective Mapping Network for Shadow Removal. 5617-5626 - Long Ma, Tengyu Ma, Risheng Liu, Xin Fan, Zhongxuan Luo:
Toward Fast, Flexible, and Robust Low-Light Image Enhancement. 5627-5636 - Dongdong Chen, Julián Tachella, Mike E. Davies:
Robust Equivariant Imaging: a fully unsupervised framework for learning to image from noisy and partial measurements. 5637-5646 - Jie Liang, Hui Zeng, Lei Zhang:
Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution. 5647-5656 - Xiaoqian Xu, Pengxu Wei, Weikai Chen, Yang Liu, Mingzhi Mao, Liang Lin, Guanbin Li:
Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution. 5657-5666 - Youngho Yoon, Inchul Chung, Lin Wang, Kuk-Jin Yoon:
SphereSR: 360° Image Super-Resolution with Arbitrary Projection via Continuous Spherical Image Representation. 5667-5676 - Chengxu Liu, Huan Yang, Jianlong Fu, Xueming Qian:
Learning Trajectory-Aware Transformer for Video Super-Resolution. 5677-5686 - Zixiang Zhao, Jiangshe Zhang, Shuang Xu, Zudi Lin, Hanspeter Pfister:
Discrete Cosine Transform Network for Guided Depth Map Super-Resolution. 5687-5697 - Zhixuan Zhong, Liangyu Chai, Yang Zhou, Bailin Deng, Jia Pan, Shengfeng He:
Faithful Extreme Rescaling via Generative Prior Reciprocated Invertible Representations. 5698-5707 - Dailan He, Ziming Yang, Weikun Peng, Rui Ma, Hongwei Qin, Yan Wang:
ELIC: Efficient Learned Image Compression with Unevenly Grouped Space-Channel Contextual Adaptive Coding. 5708-5717 - Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Restormer: Efficient Transformer for High-Resolution Image Restoration. 5718-5729 - Lang Nie, Chunyu Lin, Kang Liao, Shuaicheng Liu, Yao Zhao:
Deep Rectangling for Image Stitching: A Learning Baseline. 5730-5738 - Shanel Gauthier, Benjamin Thérien, Laurent Alsène-Racicot, Muawiz Chaudhary, Irina Rish, Eugene Belilovsky, Michael Eickenberg, Guy Wolf:
Parametric Scattering Networks. 5739-5748 - Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Burst Image Restoration and Enhancement. 5749-5758 - Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan C. Bovik, Yinxiao Li:
MAXIM: Multi-Axis MLP for Image Processing. 5759-5770 - Javier Hidalgo-Carrió, Guillermo Gallego, Davide Scaramuzza:
Event-aided Direct Sparse Odometry. 5771-5780 - Haisong Liu, Tao Lu, Yihui Xu, Jia Liu, Wenjie Li, Lijun Chen:
CamLiFlow: Bidirectional Camera-LiDAR Fusion for Joint Optical Flow and Scene Flow Estimation. 5781-5791 - Jinyuan Liu, Xin Fan, Zhanbo Huang, Guanyao Wu, Risheng Liu, Wei Zhong, Zhongxuan Luo:
Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection. 5792-5801 - Chunle Guo, Qixin Yan, Saeed Anwar, Runmin Cong, Wenqi Ren, Chongyi Li:
Image Dehazing Transformer with Transmission-Aware 3D Position Embedding. 5802-5810 - Yuntong Ye, Changfeng Yu, Yi Chang, Lin Zhu, Xi-Le Zhao, Luxin Yan, Yonghong Tian:
Unsupervised Deraining: Where Contrastive Learning Meets Self-similarity. 5811-5820 - Huan Liu, Zijun Wu, Liangyan Li, Sadaf Salehkalaibar, Jun Chen, Keyan Wang:
Towards Multi-domain Single Image Dehazing via Test-time Training. 5821-5830 - Yi Li, Yi Chang, Yan Gao, Changfeng Yu, Luxin Yan:
Physically Disentangled Intra- and Inter-domain Adaptation for Varicolored Haze Removal. 5831-5840 - Yue Cao, Zhaolin Wan, Dongwei Ren, Zifei Yan, Wangmeng Zuo:
Incorporating Semi-Supervised and Positive-Unlabeled Learning for Boosting Full Reference Image Quality Assessment. 5841-5851 - Lina Guo, Xinjie Shi, Dailan He, Yuanyuan Wang, Rui Ma, Hongwei Qin, Yan Wang:
Practical Learned Lossless JPEG Recompression with Multi-Level Cross-Channel Entropy Model in the DCT Domain. 5852-5861 - Cong Huang, Jiahao Li, Bin Li, Dong Liu, Yan Lu:
Neural Compression-Based Feature Learning for Video Restoration. 5862-5871 - Xin Tian, Ke Xu, Xin Yang, Lin Du, Baocai Yin, Rynson W. H. Lau:
Bi-directional Object-Context Prioritization Learning for Saliency Ranking. 5872-5881 - Wenhui Wu, Jian Weng, Pingping Zhang, Xu Wang, Wenhan Yang, Jianmin Jiang:
URetinex-Net: Retinex-based Deep Unfolding Network for Low-light Image Enhancement. 5891-5900 - Jianqi Ma, Zhetong Liang, Lei Zhang:
A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution. 5901-5910 - Zhihao Hu, Guo Lu, Jinyang Guo, Shan Liu, Wei Jiang, Dong Xu:
Coarse-To-Fine Deep Video Coding with Hyperprior-Guided Mode Prediction. 5911-5920 - Yixuan Huang, Xiaoyun Zhang, Yu Fu, Siheng Chen, Ya Zhang, Yanfeng Wang, Dazhi He:
Task Decoupled Framework for Reference-based Super-Resolution. 5921-5930 - Huankang Guan, Jiaying Lin, Rynson W. H. Lau:
Learning Semantic Associations for Mirror Detection. 5931-5940 - Yu Zeng, Zhe Lin, Vishal M. Patel:
SketchEdit: Mask-Free Local Image Manipulation with Partial Sketches. 5941-5951 - Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy:
Investigating Tradeoffs in Real-World Video Super-Resolution. 5952-5961 - Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy:
BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment. 5962-5971 - Kaidong Zhang, Jingjing Fu, Dong Liu:
Inertia-Guided Flow Completion and Style Fusion for Video Inpainting. 5972-5981 - Jun-Hyuk Kim, Byeongho Heo, Jong-Seok Lee:
Joint Global and Local Hierarchical Priors for Learned Image Compression. 5982-5991 - Xiangtao Kong, Xina Liu, Jinjin Gu, Yu Qiao, Chao Dong:
Reflash Dropout in Image Super-Resolution. 5992-6002 - Yi Yu, Wenhan Yang, Yap-Peng Tan, Alex C. Kot:
Towards Robust Rain Removal Against Adversarial Attacks: A Comprehensive Benchmark Analysis and Beyond. 6003-6012 - Weiqi Zou, Yang Wang, Xueyang Fu, Yang Cao:
Dreaming to Prune Image Deraining Networks. 6013-6022 - Hochang Rhee, Yeong Il Jang, Seyun Kim, Nam Ik Cho:
LC-FDNet: Learned Lossless Image Compression with Frequency Decomposition Network. 6023-6032 - Jie Huang, Yajing Liu, Xueyang Fu, Man Zhou, Yang Wang, Feng Zhao, Zhiwei Xiong:
Exposure Normalization and Compensation for Multiple-Exposure Correction. 6033-6042 - Kun Zhou, Wenbo Li, Liying Lu, Xiaoguang Han, Jiangbo Lu:
Revisiting Temporal Alignment for Video Restoration. 6043-6052 - Zhenghao Chen, Guo Lu, Zhihao Hu, Shan Liu, Wei Jiang, Dong Xu:
LSVC: A Learning-based Stereo Video Compression Framework. 6063-6072 - Guo Lu, Tianxiong Zhong, Jing Geng, Qiang Hu, Dong Xu:
Learning based Multi-modality Image and Video Compression. 6073-6082 - Xin Tong, Xianghua Ying, Yongjie Shi, Ruibin Wang, Jinfa Yang:
Transformer Based Line Segment Classifier with Image Context for Real-Time Vanishing Point Detection in Manhattan World. 6083-6092 - Yancong Lin, Ruben Wiersma, Silvia L. Pintea, Klaus Hildebrandt, Elmar Eisemann, Jan C. van Gemert:
Deep vanishing point detection: Geometric priors make dataset variations vanish. 6093-6103 - Yeongwoo Nam, S. Mohammad Mostafavi I., Kuk-Jin Yoon, Jonghyun Choi:
Stereo Depth from Events Cameras: Concentrate and Focus on the Future. 6104-6113 - Ronald Clark:
Volumetric Bundle Adjustment for Online Photorealistic Scene Capture. 6114-6122 - Zhongzheng Ren, Aseem Agarwala, Bryan C. Russell, Alexander G. Schwing, Oliver Wang:
Neural Volumetric Object Selection. 6123-6132 - Ziyan Wang, Giljoo Nam, Tuur Stuyck, Stephen Lombardi, Michael Zollhöfer, Jessica K. Hodgins, Christoph Lassner:
HVH: Learning a Hybrid Neural Volumetric Representation for Dynamic Hair Performance Capture. 6133-6144 - Yuheng Jiang, Suyi Jiang, Guoxing Sun, Zhuo Su, Kaiwen Guo, Minye Wu, Jingyi Yu, Lan Xu:
NeuralHOFusion: Neural Volumetric Rendering under Human-object Interactions. 6145-6155 - Kejie Li, Yansong Tang, Victor Adrian Prisacariu, Philip H. S. Torr:
BNV-Fusion: Dense 3D Reconstruction using Bi-level Neural Volume Fusion. 6156-6165 - Wang Yifan, Carl Doersch, Relja Arandjelovic, João Carreira, Andrew Zisserman:
Input-level Inductive Biases for 3D Reconstruction. 6166-6176 - Markus Worchel, Rodrigo Diaz, Weiwen Hu, Oliver Schreer, Ingo Feldmann, Peter Eisert:
Multi-View Mesh Reconstruction with Neural Deferred Shading. 6177-6187 - Lukas Höllein, Justin Johnson, Matthias Nießner:
StyleMesh: Style Transfer for Indoor 3D Scene Reconstructions. 6188-6198 - Haowen Wang, Mingyuan Wang, Zhengping Che, Zhiyuan Xu, Xiuquan Qiao, Mengshi Qi, Feifei Feng, Jian Tang:
RGB-Depth Fusion GAN for Indoor Depth Completion. 6199-6208 - Yiming Xie, Matheus Gadelha, Fengting Yang, Xiaowei Zhou, Huaizu Jiang:
PlanarRecon: Realtime 3D Plane Detection and Reconstruction from Posed Monocular Videos. 6209-6218 - Mehdi S. M. Sajjadi, Henning Meyer,