


default search action
ICCV 2019: Seoul, South Korea
- 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019, Seoul, Korea (South), October 27 - November 2, 2019. IEEE 2019, ISBN 978-1-7281-4803-8

Poster 1.1
Deep Learning
- Andreas Rössler, Davide Cozzolino

, Luisa Verdoliva, Christian Riess, Justus Thies, Matthias Nießner:
FaceForensics++: Learning to Detect Manipulated Facial Images. 1-11 - Weixin Lu, Guowei Wan, Yao Zhou, Xiangyu Fu, Pengfei Yuan, Shiyu Song:

DeepVCP: An End-to-End Deep Neural Network for Point Cloud Registration. 12-21 - Matheus Gadelha, Rui Wang, Subhransu Maji:

Shape Reconstruction Using Differentiable Projections and Deep Priors. 22-30 - Måns Larsson, Erik Stenborg, Carl Toft, Lars Hammarstrand, Torsten Sattler, Fredrik Kahl:

Fine-Grained Segmentation Networks: Self-Supervised Segmentation for Improved Long-Term Visual Localization. 31-41 - Luwei Yang, Ziqian Bai, Chengzhou Tang, Honghua Li, Yasutaka Furukawa, Ping Tan:

SANet: Scene Agnostic Network for Camera Localization. 42-51 - Pedro Hermosilla Casajus

, Tobias Ritschel, Timo Ropinski
:
Total Denoising: Unsupervised Learning of 3D Point Cloud Cleaning. 52-60 - Rizard Renanda Adhi Pramono, Yie-Tarng Chen, Wen-Hsien Fang:

Hierarchical Self-Attention Network for Action Localization in Videos. 61-70 - Umar Riaz Muhammad, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song

:
Goal-Driven Sequential Data Abstraction. 71-80 - Roberto Annunziata, Christos Sagonas, Jacques Calì:

Jointly Aligning Millions of Images With Deep Penalised Reconstruction Congealing. 81-90 - Seungmin Lee, Dongwan Kim, Namil Kim, Seong-Gyun Jeong:

Drop to Adapt: Learning Discriminative Features for Unsupervised Domain Adaptation. 91-100 - Youngdong Kim, Junho Yim, Juseung Yun, Junmo Kim:

NLNL: Negative Learning for Noisy Labels. 101-110 - Shaokai Ye, Xue Lin, Kaidi Xu, Sijia Liu, Hao Cheng, Jan-Henrik Lambrechts, Huan Zhang, Aojun Zhou, Kaisheng Ma

, Yanzhi Wang:
Adversarial Robustness vs. Model Compression, or Both? 111-120 - Pu Zhao

, Sijia Liu, Pin-Yu Chen, Nghia Hoang, Kaidi Xu, Bhavya Kailkhura
, Xue Lin:
On the Design of Black-Box Adversarial Examples by Leveraging Gradient-Free Optimization and Operator Splitting Method. 121-130 - Sagnik Das, Ke Ma, Zhixin Shu, Dimitris Samaras, Roy Shilkrot:

DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks. 131-140 - Xu Zou, Sheng Zhong, Luxin Yan, Xiangyun Zhao, Jiahuan Zhou, Ying Wu:

Learning Robust Facial Landmark Detection via Hierarchical Structured Ensemble. 141-150 - Zitong Yu, Wei Peng

, Xiaobai Li, Xiaopeng Hong, Guoying Zhao:
Remote Heart Rate Measurement From Highly Compressed Facial Videos: An End-to-End Deep Learning Solution With Video Enhancement. 151-160 - Tianyang Shi, Yi Yuan

, Changjie Fan, Zhengxia Zou, Zhenwei Shi, Yong Liu
:
Face-to-Parameter Translation for Game Character Auto-Creation. 161-170 - Guha Balakrishnan

, Adrian V. Dalca, Amy Zhao, John V. Guttag, Frédo Durand, William T. Freeman:
Visual Deprojection: Probabilistic Recovery of Collapsed Dimensions. 171-180 - Yurui Ren, Xiaoming Yu, Ruonan Zhang, Thomas H. Li, Shan Liu, Ge Li:

StructureFlow: Image Inpainting via Structure-Aware Appearance Flow. 181-190 - Md Mahfuzur Rahman Siddiquee

, Zongwei Zhou
, Nima Tajbakhsh, Ruibin Feng, Michael B. Gotway, Yoshua Bengio, Jianming Liang
:
Learning Fixed Points in Generative Adversarial Networks: From Image-to-Image Translation to Disease Detection and Localization. 191-200 - Zhengxia Zou, Wenyuan Li, Tianyang Shi, Zhenwei Shi, Jieping Ye:

Generative Adversarial Training for Weakly Supervised Cloud Matting. 201-210 - Zheng Tang, Milind Naphade, Stan Birchfield, Jonathan Tremblay, William Hodge, Ratnesh Kumar, Shuo Wang, Xiaodong Yang:

PAMTRI: Pose-Aware Multi-Task Learning for Vehicle Re-Identification Using Highly Randomized Synthetic Data. 211-220 - Eirikur Agustsson, Michael Tschannen, Fabian Mentzer, Radu Timofte

, Luc Van Gool:
Generative Adversarial Networks for Extreme Learned Image Compression. 221-231 - Yanbei Chen, Xiatian Zhu, Shaogang Gong:

Instance-Guided Context Rendering for Cross-Domain Person Re-Identification. 232-242 - Mahmoud Afifi, Michael S. Brown:

What Else Can Fool Deep Learning? Addressing Color Constancy Errors on Deep Neural Network Performance. 243-252 - Patrick Ebel, Eduard Trulls, Kwang Moo Yi, Pascal Fua, Anastasiia Mishchuk:

Beyond Cartesian Representations for Local Descriptors. 253-262 - Muhamad Risqi Utama Saputra

, Pedro Porto Buarque de Gusmão, Yasin Almalioglu
, Andrew Markham, Niki Trigoni
:
Distilling Knowledge From a Deep Pose Regressor Network. 263-272 - Kyung-Rae Kim, Whan Choi, Yeong Jun Koh, Seong-Gyun Jeong, Chang-Su Kim

:
Instance-Level Future Motion Estimation in a Single Image Based on Ordinal Regression. 273-282 - Hang Zhou, Ziwei Liu, Xudong Xu, Ping Luo, Xiaogang Wang:

Vision-Infused Deep Audio Inpainting. 283-292 - Zhen Dong, Zhewei Yao, Amir Gholami, Michael W. Mahoney, Kurt Keutzer:

HAWQ: Hessian AWare Quantization of Neural Networks With Mixed-Precision. 293-302 - Jun-Ho Choi, Huan Zhang, Jun-Hyuk Kim

, Cho-Jui Hsieh, Jong-Seok Lee:
Evaluating Robustness of Deep Image Super-Resolution Against Adversarial Attacks. 303-311 - Kibok Lee, Kimin Lee, Jinwoo Shin, Honglak Lee:

Overcoming Catastrophic Forgetting With Unlabeled Data in the Wild. 312-321 - Yisen Wang, Xingjun Ma

, Zaiyi Chen, Yuan Luo, Jinfeng Yi, James Bailey:
Symmetric Cross Entropy for Robust Learning With Noisy Labels. 322-330 - Avinash Ravichandran, Rahul Bhotika, Stefano Soatto:

Few-Shot Learning With Embedded Class Models and Shot-Free Meta Training. 331-339 - Maneet Singh, Shruti Nagpal, Richa Singh, Mayank Vatsa:

Dual Directed Capsule Network for Very Low Resolution Image Recognition. 340-349 - Xiangyun Zhao, Yi Yang, Feng Zhou, Xiao Tan, Yuchen Yuan, Yingze Bao, Ying Wu:

Recognizing Part Attributes With Insufficient Data. 350-360 - Jiaxin Li, Gim Hee Lee:

USIP: Unsupervised Stable Interest Point Detection From 3D Point Clouds. 361-370 - Binghui Chen, Weihong Deng

, Jiani Hu:
Mixed High-Order Attention Network for Person Re-Identification. 371-381 - Rodrigo Ferreira Berriel, Stéphane Lathuilière, Moin Nabi, Tassilo Klein

, Thiago Oliveira-Santos, Nicu Sebe
, Elisa Ricci
:
Budget-Aware Adapters for Multi-Domain Learning. 382-391 - Tuong Do

, Huy Tran, Thanh-Toan Do, Erman Tjiputra, Quang D. Tran
:
Compact Trilinear Interaction for Visual Question Answering. 392-401 - Ishan Nigam, Pavel Tokmakov, Deva Ramanan

:
Towards Latent Attribute Discovery From Triplet Similarities. 402-410 - Utkarsh Mall, Kevin Matzen, Bharath Hariharan, Noah Snavely, Kavita Bala

:
GeoStyle: Discovering Fashion Trends and Events. 411-420 - Haichao Zhang, Jianyu Wang:

Towards Adversarially Robust Object Detection. 421-430
Recognition
- Junli Zhao

, Xin Qi, Chengfeng Wen, Na Lei, Xianfeng Gu
:
Automatic and Robust Skull Registration Based on Discrete Uniformization. 431-440 - Zhimao Peng, Zechao Li, Junge Zhang, Yan Li, Guo-Jun Qi

, Jinhui Tang
:
Few-Shot Image Recognition With Knowledge Transfer. 441-449 - Michael Wray

, Gabriela Csurka, Diane Larlus, Dima Damen
:
Fine-Grained Action Retrieval Through Multiple Parts-of-Speech Embeddings. 450-459 - Peng Wang, Bingliang Jiao, Lu Yang, Yifei Yang, Shizhou Zhang, Wei Wei, Yanning Zhang:

Vehicle Re-Identification in Aerial Imagery: Dataset and Approach. 460-469 - Krishna Regmi, Mubarak Shah

:
Bridging the Domain Gap for Ground-to-Aerial Image Matching. 470-479 - Mehran Khodabandeh, Arash Vahdat, Mani Ranjbar, William G. Macready:

A Robust Learning Approach to Domain Adaptive Object Detection. 480-490 - Yin Bi, Aaron Chadha, Alhabib Abbas, Eirina Bourtsoulatze, Yiannis Andreopoulos:

Graph-Based Object Classification for Neuromorphic Vision Sensing. 491-501 - Jiwoong Choi

, Dayoung Chun, Hyun Kim
, Hyuk-Jae Lee:
Gaussian YOLOv3: An Accurate and Fast Object Detector Using Localization Uncertainty for Autonomous Driving. 502-511 - Lezi Wang, Ziyan Wu, Srikrishna Karanam, Kuan-Chuan Peng, Rajat Vikram Singh, Bo Liu, Dimitris N. Metaxas:

Sharpen Focus: Learning With Attention Separability and Consistency. 512-521 - Tianshui Chen, Muxin Xu, Xiaolu Hui, Hefeng Wu, Liang Lin:

Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition. 522-531 - Sergey Zakharov, Wadim Kehl, Slobodan Ilic:

DeceptionNet: Network-Driven Domain Randomization. 532-541 - Jiaxu Miao, Yu Wu

, Ping Liu, Yuhang Ding, Yi Yang:
Pose-Guided Feature Alignment for Occluded Person Re-Identification. 542-551 - Tianyuan Yu, Da Li, Yongxin Yang, Timothy M. Hospedales, Tao Xiang:

Robust Person Re-Identification by Modelling Feature Uncertainty. 552-561 - Arulkumar Subramaniam, Athira M. Nambiar, Anurag Mittal:

Co-Segmentation Inspired Attention Networks for Video-Based Person Re-Identification. 562-572 - Huizi Mao, Xiaodong Yang, Bill Dally:

A Delay Metric for Video Object Detection: What Average Precision Fails to Tell. 573-582 - Eden Belouadah, Adrian Popescu:

IL2M: Class Incremental Learning With Dual Memory. 583-592
Segmentation, Grouping, & Shape
- Zhen Zhu

, Mengdu Xu, Song Bai, Tengteng Huang, Xiang Bai:
Asymmetric Non-Local Neural Networks for Semantic Segmentation. 593-602 - Zilong Huang, Xinggang Wang

, Lichao Huang, Chang Huang, Yunchao Wei, Wenyu Liu:
CCNet: Criss-Cross Attention for Semantic Segmentation. 603-612 - Shousheng Luo, Xue-Cheng Tai

, Limei Huo, Yang Wang, Roland Glowinski:
Convex Shape Prior for Multi-Object Segmentation Using a Single Level Set Function. 613-621 - Khoi Nguyen, Sinisa Todorovic:

Feature Weighting and Boosting for Few-Shot Segmentation. 622-631 - Niv Haim, Nimrod Segol, Heli Ben-Hamu, Haggai Maron, Yaron Lipman:

Surface Networks via General Covers. 632-641 - Naiyu Gao

, Yanhu Shan, Yupei Wang, Xin Zhao
, Yinan Yu, Ming Yang
, Kaiqi Huang:
SSAP: Single-Shot Instance Segmentation With Affinity Pyramid. 642-651 - Sifei Liu

, Xueting Li, Varun Jampani, Shalini De Mello, Jan Kautz:
Learning Propagation for Arbitrarily-Structured Data. 652-661 - Jun Hao Liew, Scott Cohen, Brian L. Price, Long Mai, Sim Heng Ong, Jiashi Feng:

MultiSeg: Semantically Meaningful, Scale-Diverse Segmentations From Minimal User Input. 662-670 - Federica Arrigoni

, Tomás Pajdla:
Robust Motion Segmentation From Pairwise Matches. 671-681 - Haoshu Fang, Jianhua Sun, Runzhong Wang

, Minghao Gou, Yong-Lu Li, Cewu Lu:
InstaBoost: Boosting Instance Segmentation via Probability Map Guided Copy-Pasting. 682-691
Face & Body
- Mei Wang, Weihong Deng

, Jiani Hu, Xunqiang Tao, Yaohai Huang:
Racial Faces in the Wild: Reducing Racial Bias by Information Maximization Adaptation Network. 692-702 - Jingxiao Zheng, Ruichi Yu, Jun-Cheng Chen

, Boyu Lu, Carlos Domingo Castillo, Rama Chellappa:
Uncertainty Modeling of Contextual-Connections Between Tracklets for Unconstrained Video-Based Face Recognition. 703-712 - Xingxuan Zhang, Feng Cheng, Shilin Wang:

Spatio-Temporal Fusion Based Convolutional Sequence Learning for Lip Reading. 713-722 - Yu Cheng, Bo Yang, Bo Wang, Wending Yan, Robby T. Tan:

Occlusion-Aware Networks for 3D Human Pose Estimation in Video. 723-732 - Yong Zhang, Haiyong Jiang, Baoyuan Wu, Yanbo Fan, Qiang Ji:

Context-Aware Feature and Label Fusion for Facial Action Unit Intensity Estimation With Partially Labeled Data. 733-742 - Chaoyang Wang, Chen Kong, Simon Lucey

:
Distill Knowledge From NRSfM for Weakly Supervised 3D Pose Learning. 743-752 - Yuan Yao, Yasamin Jafarian, Hyun Soo Park:

MONET: Multiview Semi-Supervised Keypoint Detection via Epipolar Divergence. 753-762 - Gilwoo Lee, Zhiwei Deng, Shugao Ma, Takaaki Shiratori, Siddhartha S. Srinivasa, Yaser Sheikh:

Talking With Hands 16.2M: A Large-Scale Dataset of Synchronized Body-Finger Motion and Audio for Conversational Motion Analysis and Synthesis. 763-772 - Lingxue Song, Dihong Gong, Zhifeng Li, Changsong Liu, Wei Liu

:
Occlusion Robust Face Recognition Based on Mask Learning With Pairwise Differential Siamese Network. 773-782 - Xuanyi Dong, Yi Yang:

Teacher Supervises Students How to Learn From Partially Labeled Images for Facial Landmark Detection. 783-792 - Fu Xiong, Boshen Zhang, Yang Xiao, Zhiguo Cao

, Taidong Yu, Joey Tianyi Zhou, Junsong Yuan:
A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation From a Single Depth Image. 793-802 - Georgios Pavlakos, Nikos Kolotouros, Kostas Daniilidis:

TexturePose: Supervising Human Mesh Estimation With Texture Consistency. 803-812 - Christian Zimmermann, Duygu Ceylan, Jimei Yang, Bryan C. Russell, Max J. Argus, Thomas Brox:

FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape From Single RGB Images. 813-822 - Nitin Saini, Eric Price, Rahul Tallamraju, Raffi Enficiaud

, Roman Ludwig, Igor Martinovic, Aamir Ahmad, Michael J. Black:
Markerless Outdoor Human Motion Capture Using Multiple Autonomous Micro Aerial Vehicles. 823-832
Action & Video
- Srijan Das, Rui Dai

, Michal Koperski, Luca Minciullo, Lorenzo Garattoni, François Brémond, Gianpiero Francesca:
Toyota Smarthome: Real-World Activities of Daily Living. 833-842 - Penghao Zhou, Mingmin Chi:

Relation Parsing Neural Network for Human-Object Interaction Detection. 843-851 - Rohit Girdhar, Du Tran, Lorenzo Torresani, Deva Ramanan

:
DistInit: Learning Video Representations Without a Single Labeled Video. 852-861 - Fadime Sener, Angela Yao:

Zero-Shot Anticipation for Instructional Activities. 862-871 - Tianhong Li, Lijie Fan, Mingmin Zhao

, Yingcheng Liu, Dina Katabi:
Making the Invisible Visible: Action Recognition Through Walls and Occlusions. 872-881 - Xudong Xu, Bo Dai, Dahua Lin:

Recursive Visual Sound Separation Using Minus-Plus Net. 882-891
Motion & Tracking
- Fitsum A. Reda, Deqing Sun, Aysegul Dundar, Mohammad Shoeybi, Guilin Liu, Kevin J. Shih, Andrew Tao, Jan Kautz, Bryan Catanzaro:

Unsupervised Video Interpolation Using Cycle Consistency. 892-900 - Tao Wang, Haibin Ling, Congyan Lang, Songhe Feng, Xiaohui Hou:

Deformable Surface Tracking by Graph Matching. 901-910 - Janghoon Choi

, Junseok Kwon, Kyoung Mu Lee:
Deep Meta Learning for Real-Time Target-Aware Visual Tracking. 911-920 - Chiho Choi, Behzad Dariush:

Looking to Relations for Future Trajectory Forecast. 921-930 - Zhao Yang, Qiang Wang, Luca Bertinetto, Song Bai, Weiming Hu, Philip H. S. Torr:

Anchor Diffusion for Unsupervised Video Object Segmentation. 931-940 - Philipp Bergmann, Tim Meinhardt, Laura Leal-Taixé:

Tracking Without Bells and Whistles. 941-951
Scene Understanding
- Zhaoyi Yan, Yuchen Yuan, Wangmeng Zuo, Xiao Tan, Yezhen Wang, Shilei Wen, Errui Ding:

Perspective-Guided Convolution Networks for Crowd Counting. 952-961 - Yichao Zhou, Haozhi Qi, Yi Ma:

End-to-End Wireframe Parsing. 962-971 - Yoshikatsu Nakajima, Byeongkeun Kang, Hideo Saito, Kris Kitani:

Incremental Class Discovery for Semantic Segmentation With RGBD Sensing. 972-981 - Liang Du, Jingang Tan, Hongye Yang, Jianfeng Feng, Xiangyang Xue, Qibao Zheng, Xiaoqing Ye, Xiaolin Zhang:

SSF-DAN: Separated Semantic Feature Based Domain Adaptation Network for Semantic Segmentation. 982-991 - Nicholas Weir, David Lindenbaum, Alexei Bastidas, Adam Van Etten, Varun Kumar Vijay, Sean McPherson, Jacob Shermeyer, Hanlin Tang:

SpaceNet MVOI: A Multi-View Overhead Imagery Dataset. 992-1001 - Vishwanath Sindagi, Vishal M. Patel:

Multi-Level Bottom-Top and Top-Bottom Feature Fusion for Crowd Counting. 1002-1012 - Yuenan Hou, Zheng Ma, Chunxiao Liu, Chen Change Loy:

Learning Lightweight Lane Detection CNNs by Self Attention Distillation. 1013-1021 - Daniel Gordon, Abhishek Kadian, Devi Parikh, Judy Hoffman

, Dhruv Batra:
SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation. 1022-1031
3D From Multiview & Sensors
- Wentao Cheng, Weisi Lin, Kan Chen, Xinfeng Zhang:

Cascaded Parallel Filtering for Memory-Efficient Image-Based Localization. 1032-1041 - Chao Wen, Yinda Zhang, Zhuwen Li, Yanwei Fu

:
Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation. 1042-1051 - Fotios Logothetis, Roberto Mecca, Roberto Cipolla:

A Differential Volumetric Approach to Multi-View Photometric Stereo. 1052-1061 - Viktor Larsson, Torsten Sattler, Zuzana Kukelova

, Marc Pollefeys
:
Revisiting Radial Distortion Absolute Pose. 1062-1071 - Tobias Würfl, André Aichert, Nicole Maass, Frank Dennerlein, Andreas K. Maier:

Estimating the Fundamental Matrix Without Point Correspondences With Application to Transmission Imaging. 1072-1081 - Devesh Adlakha, Adlane Habed, Fabio Morbidi, Cédric Demonceaux

, Michel de Mathelin:
QUARCH: A New Quasi-Affine Reconstruction Stratum From Vague Relative Camera Orientation Knowledge. 1082-1090 - Dániel Baráth, Zuzana Kukelova

:
Homography From Two Orientation- and Scale-Covariant Features. 1091-1099
Applications. Medical, & Robotics
- Hyukryul Yang, Hao Ouyang, Vladlen Koltun, Qifeng Chen:

Hiding Video in Audio via Reversible Generative Models. 1100-1109 - Yong Zhao, Shibiao Xu, Shuhui Bu, Hongkai Jiang, Pengcheng Han:

GSLAM: A General SLAM Framework and Benchmark. 1110-1120 - Sang Jun Lee

, Sung Soo Hwang:
Elaborate Monocular Point and Line SLAM With Robust Initialization. 1121-1129 - Jia Wan

, Antoni B. Chan
:
Adaptive Density Map Generation for Crowd Counting. 1130-1139 - Xingxu Yao, Dongyu She, Sicheng Zhao, Jie Liang, Yu-Kun Lai, Jufeng Yang:

Attention-Aware Polarity Sensitive Embedding for Affective Image Retrieval. 1140-1150 - Chi Zhan, Dongyu She, Sicheng Zhao, Ming-Ming Cheng

, Jufeng Yang:
Zero-Shot Emotion Recognition via Affective Structural Embedding. 1151-1160 - Haoye Dong

, Xiaodan Liang, Xiaohui Shen, Bowen Wu, Bing-Cheng Chen, Jian Yin:
FW-GAN: Flow-Navigated Warping GAN for Video Virtual Try-On. 1161-1170 - Arnab Ghosh, Richard Zhang, Puneet K. Dokania, Oliver Wang, Alexei A. Efros

, Philip H. S. Torr, Eli Shechtman:
Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation. 1171-1180 - Shi Chen

, Qi Zhao:
Attention-Based Autism Spectrum Disorder Screening With Privileged Modality. 1181-1190 - Jun-Tae Lee, Chang-Su Kim

:
Image Aesthetic Assessment Based on Pairwise Comparison A Unified Approach to Score Regression, Binary Classification, and Personalization. 1191-1200 - Zhenyu Wu, Karthik Suresh, Priya Narayanan, Hongyu Xu, Heesung Kwon, Zhangyang Wang:

Delving Into Robust Object Detection From Unmanned Aerial Vehicles: A Deep Nuisance Disentanglement Approach. 1201-1210 - Adnan Siraj Rakin, Zhezhi He, Deliang Fan:

Bit-Flip Attack: Crushing Neural Network With Progressive Bit Search. 1211-1220 - Vishwanath Sindagi, Rajeev Yasarla, Vishal M. Patel:

Pushing the Frontiers of Unconstrained Crowd Counting: New Dataset and Benchmark Method. 1221-1231 - Yi Liu, Qiang Zhang, Dingwen Zhang, Jungong Han:

Employing Deep Part-Object Relationships for Salient Object Detection. 1232-1241 - Vladimiros Sterzentsenko, Leonidas Saroglou, Anargyros Chatzitofis

, Spiros Thermos, Nikolaos Zioulis
, Alexandros Doumanoglou, Dimitrios Zarpalas, Petros Daras
:
Self-Supervised Deep Depth Denoising. 1242-1251 - Hanxiao Wang

, Venkatesh Saligrama
, Stan Sclaroff, Vitaly Ablavsky
:
Cost-Aware Fine-Grained Recognition for IoTs Based on Sequential Fixations. 1252-1261 - Ruichi Yu, Hongcheng Wang, Ang Li, Jingxiao Zheng, Vlad I. Morariu, Larry Davis:

Layout-Induced Video Representation for Recognizing Agent-in-Place Actions. 1262-1272 - Trong-Nguyen Nguyen, Jean Meunier:

Anomaly Detection in Video Sequence With Appearance-Motion Correspondence. 1273-1283
Oral 1.2A
Architectures, Multi-Task Learning, Domain Adaptation
- Saining Xie, Alexander Kirillov, Ross B. Girshick, Kaiming He:

Exploring Randomly Wired Neural Networks for Image Recognition. 1284-1293 - Xin Chen, Lingxi Xie, Jun Wu, Qi Tian:

Progressive Differentiable Architecture Search: Bridging the Depth Gap Between Search and Evaluation. 1294-1303 - Xiawu Zheng, Rongrong Ji

, Lang Tang, Baochang Zhang, Jianzhuang Liu, Qi Tian:
Multinomial Distribution Learning for Effective Neural Architecture Search. 1304-1313 - Andrew Howard, Ruoming Pang, Hartwig Adam, Quoc V. Le, Mark Sandler, Bo Chen, Weijun Wang, Liang-Chieh Chen, Mingxing Tan, Grace Chu, Vijay Vasudevan, Yukun Zhu:

Searching for MobileNetV3. 1314-1324 - Markus Nagel, Mart van Baalen, Tijmen Blankevoort, Max Welling:

Data-Free Quantization Through Weight Equalization and Bias Correction. 1325-1334 - Laurie Bose, Piotr Dudek

, Jianing Chen, Stephen J. Carey, Walterio W. Mayol-Cuevas
:
A Camera That CNNs: Towards Embedded Neural Networks on Pixel Processor Arrays. 1335-1344 - Xiao Jin, Baoyun Peng

, Yichao Wu, Yu Liu, Jiaheng Liu, Ding Liang, Junjie Yan, Xiaolin Hu:
Knowledge Distillation via Route Constrained Optimization. 1345-1354 - Mary Phuong, Christoph Lampert:

Distillation-Based Training for Multi-Exit Architectures. 1355-1364 - Frederick Tung, Greg Mori:

Similarity-Preserving Knowledge Distillation. 1365-1374 - Gjorgji Strezoski, Nanne van Noord

, Marcel Worring
:
Many Task Learning With Task Routing. 1375-1384 - Felix J. S. Bragman, Ryutaro Tanno, Sébastien Ourselin

, Daniel C. Alexander
, Manuel Jorge Cardoso
:
Stochastic Filter Groups for Multi-Task CNNs: Learning Specialist and Generalist Convolution Kernels. 1385-1394 - Anh Tuan Tran, Cuong V. Nguyen

, Tal Hassner:
Transferability and Hardness of Supervised Classification Tasks. 1395-1405 - Xingchao Peng, Qinxun Bai, Xide Xia, Zijun Huang, Kate Saenko

, Bo Wang
:
Moment Matching for Multi-Source Domain Adaptation. 1406-1415 - Safa Cicek, Stefano Soatto:

Unsupervised Domain Adaptation via Regularized Conditional Alignment. 1416-1425 - Ruijia Xu, Guanbin Li, Jihan Yang, Liang Lin:

Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation. 1426-1435 - Jogendra Nath Kundu, Nishank Lakkakula, Venkatesh Babu Radhakrishnan

:
UM-Adapt: Unsupervised Multi-Task Adaptation Using Adversarial Cross-Task Distillation. 1436-1445 - Da Li

, Jianshu Zhang, Yongxin Yang, Cong Liu, Yi-Zhe Song
, Timothy M. Hospedales:
Episodic Training for Domain Generalization. 1446-1455 - Yi-Hsuan Tsai, Kihyuk Sohn, Samuel Schulter, Manmohan Chandraker:

Domain Adaptation for Structured Output via Discriminative Patch Representations. 1456-1465 - Qin Wang, Wen Li, Luc Van Gool:

Semi-Supervised Learning by Augmented Distribution Alignment. 1466-1475 - Lucas Beyer, Xiaohua Zhai, Avital Oliver, Alexander Kolesnikov:

S4L: Self-Supervised Semi-Supervised Learning. 1476-1485
Oral 1.2B
Multi-View Geometry, 3D Scene Understanding
- Pablo Speciale, Johannes L. Schönberger, Sudipta N. Sinha, Marc Pollefeys

:
Privacy Preserving Image Queries for Camera Localization. 1486-1496 - Songyou Peng, Peter F. Sturm:

Calibration Wizard: A Guidance System for Camera Calibration Based on Modelling Geometric and Corner Uncertainty. 1497-1505 - Tobias Gruber

, Frank D. Julca-Aguilar, Mario Bijelic, Felix Heide:
Gated2Depth: Real-Time Dense Lidar From Gated Images. 1506-1516 - Andrea Nicastro, Ronald Clark, Stefan Leutenegger:

X-Section: Cross-Section Prediction for Enhanced RGB-D Fusion. 1517-1526 - Stepan Tulyakov, François Fleuret, Martin Kiefel, Peter V. Gehler, Michael Hirsch:

Learning an Event Sequence Embedding for Dense Event-Based Deep Stereo. 1527-1537 - Rui Chen, Songfang Han

, Jing Xu, Hao Su:
Point-Based Multi-View Stereo Network. 1538-1547 - Xiangyu Xu

, Enrique Dunn
:
Discrete Laplace Operator Estimation for Dynamic 3D Reconstruction. 1548-1557 - Chen Kong, Simon Lucey

:
Deep Non-Rigid Structure From Motion. 1558-1567 - Carlos Esteves

, Yinshuang Xu, Christine Allen-Blanchette, Kostas Daniilidis:
Equivariant Multi-View Networks. 1568-1577 - Jiageng Mao

, Xiaogang Wang, Hongsheng Li
:
Interpolated Convolutional Networks for 3D Point Cloud Understanding. 1578-1587 - Mikaela Angelina Uy, Quang-Hieu Pham

, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung:
Revisiting Point Cloud Classification: A New Benchmark Dataset and Classification Model on Real-World Data. 1588-1597 - Tianhang Zheng

, Changyou Chen, Junsong Yuan, Bo Li, Kui Ren:
PointCloud Saliency Maps. 1598-1606 - Zhiyuan Zhang

, Binh-Son Hua, Sai-Kit Yeung:
ShellNet: Efficient Point Cloud Convolutional Neural Networks Using Concentric Shells Statistics. 1607-1616 - Jean-Michel Roufosse, Abhishek Sharma, Maks Ovsjanikov:

Unsupervised Deep Learning for Structured Shape Matching. 1617-1627 - Nadav Dym, Shahar Z. Kovalsky:

Linearly Converging Quasi Branch and Bound Algorithms for Global Rigid Registration. 1628-1636 - Zhipeng Cai, Tat-Jun Chin, Vladlen Koltun:

Consensus Maximization Tree Search Revisited. 1637-1645 - Haoang Li, Ji Zhao

, Jean-Charles Bazin, Wen Chen, Zhe Liu, Yunhui Liu:
Quasi-Globally Optimal and Efficient Vanishing Point Estimation in Manhattan World. 1646-1654 - Yaqing Ding

, Jian Yang, Jean Ponce, Hui Kong
:
An Efficient Solution to the Homography-Based Relative Pose Problem With a Common Reference Direction. 1655-1664 - Heng Yang, Luca Carlone:

A Quaternion-Based Certifiably Optimal Solution to the Wahba Problem With Outliers. 1665-1674 - Timothy Duff

, Kathlén Kohn
, Anton Leykin, Tomás Pajdla:
PLMP - Point-Line Minimal Problems in Complete Multi-View Visibility. 1675-1684
Poster 1.2
Deep Learning
- Jian Zhang, Chenglong Zhao, Bingbing Ni, Minghao Xu, Xiaokang Yang:

Variational Few-Shot Learning. 1685-1694 - Sankha Subhra Mullick, Shounak Datta, Swagatam Das

:
Generative Adversarial Minority Oversampling. 1695-1704 - Dong Gong

, Lingqiao Liu
, Vuong Le, Budhaditya Saha, Moussa Reda Mansour, Svetha Venkatesh, Anton van den Hengel
:
Memorizing Normality to Detect Anomaly: Memory-Augmented Deep Autoencoder for Unsupervised Anomaly Detection. 1705-1714 - Zuoyue Li, Jan Dirk Wegner, Aurélien Lucchi

:
Topological Map Extraction From Overhead Images. 1715-1724 - Haokui Zhang, Ying Li, Yuanzhouhan Cao, Yu Liu, Chunhua Shen, Youliang Yan:

Exploiting Temporal Consistency for Real-Time Video Depth Estimation. 1725-1734 - Hang Zhao, Chuang Gan, Wei-Chiu Ma, Antonio Torralba:

The Sound of Motions. 1735-1744 - Youngjoo Jo, Jongyoul Park:

SC-FEGAN: Face Editing Generative Adversarial Network With User's Sketch and Color. 1745-1753 - Hongwei Ge, Zehang Yan, Kai Zhang, Mingde Zhao, Liang Sun:

Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style. 1754-1763 - Zhuoyuan Chen, Kavya Srinet, Charles R. Qi, Haoqi Fan, Jerry Ma, Larry Zitnick, Demi Guo, Tong Xiao, Saining Xie, Xinlei Chen, Arthur Szlam, Shubham Tulsiani, Haonan Yu, Jonathan Gray:

Order-Aware Generative Modeling Using the 3D-Craft Dataset. 1764-1773 - Lingbo Liu, Zhilin Qiu, Guanbin Li, Shufan Liu, Wanli Ouyang

, Liang Lin:
Crowd Counting With Deep Structured Scale Integration Network. 1774-1783 - Tomer Cohen, Lior Wolf:

Bidirectional One-Shot Unsupervised Domain Mapping. 1784-1792 - A. J. Piergiovanni, Anelia Angelova, Alexander Toshev, Michael S. Ryoo:

Evolving Space-Time Neural Architectures for Videos. 1793-1802 - Jiahui Yu, Thomas S. Huang:

Universally Slimmable Networks and Improved Training Techniques. 1803-1811 - Tonmoy Saikia, Yassine Marrakchi, Arber Zela

, Frank Hutter, Thomas Brox:
AutoDispNet: Improving Disparity Estimation With AutoML. 1812-1823 - Gidi Littwin, Lior Wolf:

Deep Meta Functionals for Shape Representation. 1824-1833 - Yu Liu, Jihao Liu, Xiaogang Wang, Ailing Zeng:

Differentiable Kernel Evolution. 1834-1843 - Mikolaj Binkowski, R. Devon Hjelm, Aaron C. Courville:

Batch Weight for Domain Adaptation With Mass Shift. 1844-1853 - HyunJae Lee, Hyo-Eun Kim, Hyeonseob Nam:

SRM: A Style-Based Recalibration Module for Convolutional Neural Networks. 1854-1862 - Xingang Pan

, Xiaohang Zhan, Jianping Shi, Xiaoou Tang, Ping Luo:
Switchable Whitening for Deep Representation Learning. 1863-1871 - Adria Ruiz, Jakob Verbeek:

Adaptative Inference Cost With Convolutional Neural Mixture Models. 1872-1881 - Ilija Radosavovic, Justin Johnson, Saining Xie, Wan-Yen Lo, Piotr Dollár:

On Network Design Spaces for Visual Recognition. 1882-1890 - Hao Li, Hong Zhang, Xiaojuan Qi, Ruigang Yang

, Gao Huang:
Improved Techniques for Training Adaptive Deep Networks. 1891-1900 - Yunyang Xiong, Ronak Mehta, Vikas Singh:

Resource Constrained Neural Network Architecture Search: Will a Submodularity Assumption Help? 1901-1910 - Xiaohan Ding, Yuchen Guo, Guiguang Ding, Jungong Han:

ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks. 1911-1920 - Byeongho Heo, Jeesoo Kim, Sangdoo Yun, Hyojin Park, Nojun Kwak, Jin Young Choi:

A Comprehensive Overhaul of Feature Distillation. 1921-1930
Recognition
- Yew Siang Tang, Gim Hee Lee:

Transferable Semi-Supervised 3D Object Detection From RGB-D Data. 1931-1940 - Sergey Zakharov, Ivan Shugurov, Slobodan Ilic:

DPOD: 6D Pose Object Detector and Refiner. 1941-1950 - Zetong Yang, Yanan Sun

, Shu Liu, Xiaoyong Shen, Jiaya Jia
:
STD: Sparse-to-Dense 3D Object Detector for Point Cloud. 1951-1960 - Hang Zhou, Kejiang Chen

, Weiming Zhang, Han Fang, Wenbo Zhou, Nenghai Yu:
DUP-Net: Denoiser and Upsampler Network for 3D Adversarial Point Clouds Defense. 1961-1970 - Tiancai Wang, Rao Muhammad Anwer

, Hisham Cholakkal
, Fahad Shahbaz Khan
, Yanwei Pang, Ling Shao
:
Learning Rich Features at High-Speed for Single-Shot Object Detection. 1971-1980 - Julia Peyre, Josef Sivic, Ivan Laptev, Cordelia Schmid:

Detecting Unseen Visual Relations Using Analogies. 1981-1990 - Andrea Simonelli, Samuel Rota Bulò, Lorenzo Porzi, Manuel Lopez-Antequera, Peter Kontschieder:

Disentangling Monocular 3D Object Detection. 1991-1999 - Boyuan Jiang, Mengmeng Wang, Weihao Gan, Wei Wu, Junjie Yan:

STM: SpatioTemporal and Motion Encoding for Action Recognition. 2000-2009 - Shuaiyi Huang, Qiuyue Wang, Songyang Zhang

, Shipeng Yan, Xuming He:
Dynamic Context Correspondence Network for Semantic Alignment. 2010-2019 - Akshayvarun Subramanya, Vipin Pillai, Hamed Pirsiavash:

Fooling Network Interpretation in Image Classification. 2020-2029 - Yinan Zhao, Brian L. Price, Scott Cohen, Danna Gurari:

Unconstrained Foreground Object Search. 2030-2039 - Jianwei Yang, Zhile Ren, Mingze Xu, Xinlei Chen, David J. Crandall

, Devi Parikh, Dhruv Batra:
Embodied Amodal Recognition: Learning to Move to Perceive Objects. 2040-2050 - Kaiyu Yang, Olga Russakovsky

, Jia Deng:
SpatialSense: An Adversarially Crowdsourced Benchmark for Spatial Relation Recognition. 2051-2060 - Xinlei Chen, Ross B. Girshick, Kaiming He, Piotr Dollár:

TensorMask: A Foundation for Dense Object Segmentation. 2061-2069 - Peng-Tao Jiang, Qibin Hou, Yang Cao, Ming-Ming Cheng

, Yunchao Wei, Hongkai Xiong
:
Integral Object Mining via Online Attention Accumulation. 2070-2079
Segmentation, Grouping, & Shape
- Vladislav Golyanik, Christian Theobalt

, Didier Stricker
:
Accelerated Gravitational Point Set Alignment With Altered Physical Laws. 2080-2089 - Minghao Chen, Hongyang Xue, Deng Cai:

Domain Adaptation for Semantic Segmentation With Maximum Squares Loss. 2090-2099 - Xiangyu Yue, Yang Zhang, Sicheng Zhao, Alberto L. Sangiovanni-Vincentelli, Kurt Keutzer, Boqing Gong:

Domain Randomization and Pyramid Consistency: Simulation-to-Real Generalization Without Accessing Target Domain Data. 2100-2110 - Yi He, Jiayuan Shi, Chuan Wang, Haibin Huang, Jiaming Liu, Guanbin Li, Risheng Liu

, Jue Wang
:
Semi-Supervised Skin Detection by Network With Mutual Guidance. 2111-2120 - Zuxuan Wu, Xin Wang, Joseph Gonzalez

, Tom Goldstein, Larry Davis:
ACE: Adapting to Changing Environments for Semantic Segmentation. 2121-2130 - Dmitrii Marin

, Zijian He, Peter Vajda, Priyam Chatterjee, Sam S. Tsai, Fei Yang, Yuri Boykov:
Efficient Segmentation: Learning Downsampling Near Semantic Boundaries. 2131-2141 - Wei Wang, Kaicheng Yu, Joachim Hugonot, Pascal Fua, Mathieu Salzmann:

Recurrent U-Net for Resource-Constrained Segmentation. 2142-2151 - Krzysztof Lis

, Krishna Kanth Nakka, Pascal Fua, Mathieu Salzmann:
Detecting the Unexpected via Image Resynthesis. 2152-2161
3D From Single View & RGBD
- Jamie Watson, Michael Firman, Gabriel J. Brostow, Daniyar Turmukhambetov:

Self-Supervised Monocular Depth Hints. 2162-2171 - Daeyun Shin, Zhile Ren, Erik B. Sudderth

, Charless C. Fowlkes:
3D Scene Reconstruction With Multi-Layer Depth and Epipolar Transformers. 2172-2182 - Tom van Dijk

, Guido de Croon:
How Do Neural Networks See Depth in Single Images? 2183-2191 - Zhi Li, Xuan Wang, Fei Wang

, Peilin Jiang:
On Boosting Single-Frame 3D Human Pose Estimation via Monocular Videos. 2192-2201 - Nilesh Kulkarni, Shubham Tulsiani, Abhinav Gupta:

Canonical Surface Mapping via Geometric Cycle Consistency. 2202-2211 - Nilesh Kulkarni, Ishan Misra, Shubham Tulsiani, Abhinav Gupta:

3D-RelNet: Joint Object and Relational Network for 3D Prediction. 2212-2221 - Alexander Grabner, Peter M. Roth, Vincent Lepetit:

GP2C: Geometric Projection Parameter Consensus for Joint 3D Pose and Focal Length Estimation in the Wild. 2222-2231
Face & Body
- Valentin Gabeur, Jean-Sébastien Franco, Xavier Martin, Cordelia Schmid, Grégory Rogez:

Moulding Humans: Non-Parametric 3D Human Shape Estimation From Single Images. 2232-2241 - Albert Pumarola, Jordi Sanchez, Gary P. T. Choi

, Alberto Sanfeliu, Francesc Moreno:
3DPeople: Modeling the Geometry of Dressed Humans. 2242-2251 - Nikos Kolotouros, Georgios Pavlakos, Michael J. Black, Kostas Daniilidis:

Learning to Reconstruct 3D Human Pose and Shape via Model-Fitting in the Loop. 2252-2261 - Hai Ci, Chunyu Wang, Xiaoxuan Ma

, Yizhou Wang:
Optimizing Network Structure for 3D Human Pose Estimation. 2262-2271 - Yujun Cai

, Liuhao Ge, Jun Liu
, Jianfei Cai, Tat-Jen Cham
, Junsong Yuan, Nadia Magnenat-Thalmann
:
Exploiting Spatial-Temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks. 2272-2281 - Mohamed Hassan, Vasileios Choutas, Dimitrios Tzionas, Michael J. Black:

Resolving 3D Human Pose Ambiguities With 3D Scene Constraints. 2282-2292 - Thiemo Alldieck

, Gerard Pons-Moll, Christian Theobalt
, Marcus A. Magnor
:
Tex2Shape: Detailed Full Human Body Geometry From a Single Image. 2293-2303 - Shunsuke Saito, Zeng Huang, Ryota Natsume, Shigeo Morishima

, Hao Li
, Angjoo Kanazawa:
PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization. 2304-2314 - Xiaoxing Zeng, Xiaojiang Peng, Yu Qiao

:
DF2Net: A Dense-Fine-Finer Network for Detailed 3D Face Reconstruction. 2315-2324 - Saurabh Sharma, Pavan Teja Varigonda, Prashast Bindal, Abhishek Sharma, Arjun Jain:

Monocular 3D Human Pose Estimation by Generation and Ordinal Ranking. 2325-2334 - Linlin Yang, Shile Li, Dongheui Lee, Angela Yao:

Aligning Latent Spaces for 3D Hand Pose Estimation. 2335-2343 - Kun Zhou

, Xiaoguang Han, Nianjuan Jiang, Kui Jia, Jiangbo Lu
:
HEMlets Pose: Learning Part-Centric Heatmap Triplets for Accurate 3D Human Pose Estimation. 2344-2353 - Xiong Zhang, Qiang Li, Hong Mo, Wenbo Zhang, Wen Zheng:

End-to-End Hand Mesh Recovery From a Monocular RGB Image. 2354-2364
Motion & Tracking
- Wenwei Zhang, Hui Zhou, Shuyang Sun, Zhe Wang, Jianping Shi, Chen Change Loy:

Robust Multi-Modality Multi-Object Tracking. 2365-2374 - Boris Ivanovic, Marco Pavone

:
The Trajectron: Probabilistic Multi-Agent Trajectory Modeling With Dynamic Spatiotemporal Graphs. 2375-2384 - Bin Yan, Haojie Zhao, Dong Wang, Huchuan Lu, Xiaoyun Yang:

'Skimming-Perusal' Tracking: A Framework for Real-Time and Robust Long-Term Tracking. 2385-2393 - Kyle Min, Jason J. Corso:

TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection. 2394-2403 - Anurag Ranjan, Joel Janai, Andreas Geiger, Michael J. Black:

Attacking Optical Flow. 2404-2413
Computational Photography & Graphics
- Chunyu Li

, Yusuke Monno, Hironori Hidaka, Masatoshi Okutomi:
Pro-Cam SSfM: Projector-Camera System for Structure and Spectral Reflectance From Motion. 2414-2423 - Bin He, Ce Wang, Boxin Shi, Lingyu Duan:

Mop Moiré Patterns Using MopNet. 2424-2432 - Ruofan Zhou, Sabine Süsstrunk:

Kernel Modeling Super-Resolution on Real Low-Resolution Images. 2433-2443 - Daiqian Ma, Renjie Wan

, Boxin Shi, Alex C. Kot, Lingyu Duan:
Learning to Jointly Generate and Separate Reflections. 2444-2452 - Zijun Deng, Lei Zhu, Xiaowei Hu

, Chi-Wing Fu
, Xuemiao Xu, Qing Zhang, Jing Qin
, Pheng-Ann Heng:
Deep Multi-Model Fusion for Single-Image Dehazing. 2453-2462 - Yuhui Quan, Shijie Deng, Yixin Chen, Hui Ji

:
Deep Learning for Seeing Through Window With Raindrops. 2463-2471 - Xiaowei Hu

, Yitong Jiang, Chi-Wing Fu
, Pheng-Ann Heng:
Mask-ShadowGAN: Learning to Remove Shadows From Unpaired Data. 2472-2481
Low-Level Vision & Optimization
- Shangchen Zhou, Jiawei Zhang, Jinshan Pan, Wangmeng Zuo, Haozhe Xie

, Jimmy S. J. Ren:
Spatio-Temporal Filter Adaptive Network for Video Deblurring. 2482-2491 - Yang Liu, Jinshan Pan, Jimmy S. J. Ren, Zhixun Su

:
Learning Deep Priors for Image Dehazing. 2492-2500 - Xueyang Fu

, Zheng-Jun Zha, Feng Wu, Xinghao Ding, John W. Paisley:
JPEG Artifacts Reduction via Deep Convolutional Sparse Coding. 2501-2510 - Shuhang Gu, Yawei Li

, Luc Van Gool, Radu Timofte
:
Self-Guided Network for Fast Image Denoising. 2511-2520 - Ziang Cheng, Yinqiang Zheng

, Shaodi You, Imari Sato:
Non-Local Intrinsic Decomposition With Near-Infrared Priors. 2521-2530
Scene Understanding
- Romain Cohendet, Claire-Hélène Demarty

, Ngoc Q. K. Duong, Martin Engilberge:
VideoMem: Constructing, Analyzing, Predicting Short-Term and Long-Term Video Memorability. 2531-2540 - Maciej Halber, Yifei Shi, Kai Xu, Thomas A. Funkhouser:

Rescan: Inductive Instance Segmentation for Indoor RGBD Scans. 2541-2550 - Armen Avetisyan, Angela Dai, Matthias Nießner:

End-to-End CAD Model Retrieval and 9DoF Alignment in 3D Scans. 2551-2560 - Tianhao Yang, Zheng-Jun Zha, Hanwang Zhang

:
Making History Matter: History-Advantage Sequence Training for Visual Dialog. 2561-2569 - Liu Liu, Hongdong Li

, Yuchao Dai:
Stochastic Attraction-Repulsion Embedding for Large Scale Image Localization. 2570-2579 - Ranjay Krishna, Vincent S. Chen, Paroma Varma, Michael S. Bernstein

, Christopher Ré, Li Fei-Fei:
Scene Graph Prediction With Limited Labels. 2580-2590
Language & Reasoning
- Ramprasaath Ramasamy Selvaraju, Stefan Lee, Yilin Shen, Hongxia Jin, Shalini Ghosh, Larry P. Heck, Dhruv Batra, Devi Parikh:

Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded. 2591-2600 - Samyak Datta, Karan Sikka, Anirban Roy, Karuna Ahuja, Devi Parikh, Ajay Divakaran:

Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment. 2601-2610 - Xuejing Liu, Liang Li

, Shuhui Wang, Zheng-Jun Zha, Dechao Meng, Qingming Huang:
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding. 2611-2620 - Ting Yao, Yingwei Pan

, Yehao Li, Tao Mei
:
Hierarchy Parsing for Image Captioning. 2621-2629 - Antoine Miech, Dimitri Zhukov, Jean-Baptiste Alayrac, Makarand Tapaswi, Ivan Laptev, Josef Sivic:

HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips. 2630-2640 - Bairui Wang, Lin Ma, Wei Zhang, Wenhao Jiang, Jingwen Wang, Wei Liu

:
Controllable Video Captioning With POS Sequence Guidance Based on Gated Fusion Network. 2641-2650
3D From Multiview & Sensors
- Yuxin Hou, Juho Kannala, Arno Solin

:
Multi-View Stereo by Temporal Nonparametric Fusion. 2651-2660 - Jiacheng Chen, Chen Liu, Jiaye Wu, Yasutaka Furukawa:

Floor-SP: Inverse CAD for Floorplans by Sequential Room-Wise Shortest Path. 2661-2670 - Zhaopeng Cui, Viktor Larsson, Marc Pollefeys

:
Polarimetric Relative Pose Estimation. 2671-2680 - Seong Hun Lee

, Javier Civera:
Closed-Form Optimal Two-View Triangulation Based on Angular Errors. 2681-2689 - Haozhe Xie

, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, Shengping Zhang:
Pix2Vox: Context-Aware 3D Reconstruction From Single and Multi-View Images. 2690-2698
Image & Video Synthesis
- Patrick Esser, Johannes Haux, Björn Ommer:

Unsupervised Robust Disentangling of Latent Characteristics for Image Synthesis. 2699-2709 - Mohammad Saeed Rad, Behzad Bozorgtabar

, Urs-Viktor Marti, Max Basler, Hazim Kemal Ekenel
, Jean-Philippe Thiran
:
SROBB: Targeted Perceptual Loss for Single Image Super-Resolution. 2710-2719 - Haotian Zhang, Long Mai, Hailin Jin, Zhaowen Wang, Ning Xu, John P. Collomosse

:
An Internal Learning Approach to Video Inpainting. 2720-2729 - Sai Bi, Kalyan Sunkavalli, Federico Perazzi, Eli Shechtman, Vladimir G. Kim, Ravi Ramamoorthi:

Deep CG2Real: Synthetic-to-Real Translation via Image Disentanglement. 2730-2739 - Yunseok Jang, Tianchen Zhao, Seunghoon Hong, Honglak Lee:

Adversarial Defense via Learning to Generate Diverse Attacks. 2740-2749 - Atsuhiro Noguchi, Tatsuya Harada:

Image Generation From Small Datasets via Batch Statistics Adaptation. 2750-2758 - Mengyao Zhai, Lei Chen, Frederick Tung, Jiawei He, Megha Nawhal, Greg Mori:

Lifelong GAN: Continual Learning for Conditional Image Generation. 2759-2768
Applications. Medical, & Robotics
- Yi Wu, Yuxin Wu, Aviv Tamar, Stuart Russell, Georgia Gkioxari, Yuandong Tian:

Bayesian Relational Memory for Semantic Visual Navigation. 2769-2779 - Fabian Brickwedde, Steffen Abraham, Rudolf Mester:

Mono-SF: Multi-View Geometry Meets Single-View Depth for Monocular Scene Flow Estimation of Dynamic Traffic Scenes. 2780-2790 - Zhaoyang Huang, Yan Xu, Jianping Shi, Xiaowei Zhou, Hujun Bao, Guofeng Zhang:

Prior Guided Dropout for Robust Visual Localization in Dynamic Environments. 2791-2800 - Manuel Martin, Alina Roitberg

, Monica Haurilet, Matthias Horne, Simon Reiß
, Michael Voit, Rainer Stiefelhagen:
Drive&Act: A Multi-Modal Dataset for Fine-Grained Driver Behavior Recognition in Autonomous Vehicles. 2801-2810 - Yan Xu, Xinge Zhu

, Jianping Shi, Guofeng Zhang, Hujun Bao, Hongsheng Li
:
Depth Completion From Sparse LiDAR Data With Depth-Normal Constraints. 2811-2820 - Nicholas Rhinehart

, Rowan McAllister, Kris Kitani, Sergey Levine:
PRECOG: PREdiction Conditioned on Goals in Visual Multi-Agent Settings. 2821-2830 - Zhe Liu, Shunbo Zhou, Chuanzhe Suo, Peng Yin

, Wen Chen, Hesheng Wang
, Haoang Li, Yunhui Liu:
LPD-Net: 3D Point Cloud Learning for Large-Scale Place Recognition and Environment Analysis. 2831-2840 - Fei Xue, Xin Wang, Zike Yan, Qiuyuan Wang, Junqiu Wang, Hongbin Zha:

Local Supports Global: Deep Camera Relocalization With Sequence Enhancement. 2841-2850 - Shunkai Li, Fei Xue, Xin Wang, Zike Yan, Hongbin Zha:

Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry. 2851-2860 - Ziyang Hong, Yvan R. Petillot

, David Lane, Yishu Miao, Sen Wang
:
TextPlace: Visual Place Recognition and Topological Localization Through Reading Scene Texts. 2861-2870 - Mingyu Ding, Zhe Wang, Jiankai Sun, Jianping Shi, Ping Luo:

CamNet: Coarse-to-Fine Retrieval for Camera Re-Localization. 2871-2880 - William B. Shen, Danfei Xu, Yuke Zhu, Li Fei-Fei, Leonidas J. Guibas, Silvio Savarese:

Situational Fusion of Visual Representation for Visual Navigation. 2881-2890 - Ziyuan Huang

, Changhong Fu
, Yiming Li
, Fuling Lin, Peng Lu
:
Learning Aberrance Repressed Correlation Filters for Real-Time UAV Tracking. 2891-2900 - Arsalan Mousavian, Clemens Eppner, Dieter Fox:

6-DOF GraspNet: Variational Grasp Generation for Object Manipulation. 2901-2910 - Namdar Homayounfar, Justin Liang, Wei-Chiu Ma, Jack Fan, Xinyu Wu, Raquel Urtasun:

DAGMapper: Learning to Map by Discovering Lane Topology. 2911-2920 - Noa Garnett, Rafi Cohen, Tomer Pe'er, Roee Lahav, Dan Levi:

3D-LaneNet: End-to-End 3D Multiple Lane Detection. 2921-2930
Oral 2.1A
Feature Representations, Similarity Learning
- Janis Postels, Francesco Ferroni, Huseyin Coskun, Nassir Navab, Federico Tombari:

Sampling-Free Epistemic Uncertainty Estimation Using Approximated Variance Propagation. 2931-2940 - Hong Liu, Rongrong Ji

, Jie Li
, Baochang Zhang, Yue Gao, Yongjian Wu, Feiyue Huang:
Universal Adversarial Perturbation via Prior Driven Uncertainty Approximation. 2941-2949 - Ruth Fong, Mandela Patrick, Andrea Vedaldi:

Understanding Deep Networks via Extremal Perturbations and Smooth Masks. 2950-2958 - Mathilde Caron, Piotr Bojanowski, Julien Mairal, Armand Joulin:

Unsupervised Pre-Training of Image Features on Non-Curated Data. 2959-2968 - Linguang Zhang, Szymon Rusinkiewicz

:
Learning Local Descriptors With a CDF-Based Dynamic Soft Margin. 2969-2978 - Minyoung Kim, Yuting Wang, Pritish Sahu, Vladimir Pavlovic

:
Bayes-Factor-VAE: Hierarchical Bayesian Deep Auto-Encoder Models for Factor Disentanglement. 2979-2987 - Wei Jiang, Weiwei Sun, Andrea Tagliasacchi, Eduard Trulls, Kwang Moo Yi:

Linearized Multi-Sampling for Differentiable Image Transformation. 2988-2997 - Zhiqiang Tang, Xi Peng, Tingfeng Li, Yizhe Zhu, Dimitris N. Metaxas:

AdaTransform: Adaptive Data Transformation. 2998-3006 - Jiaqi Wang

, Kai Chen, Rui Xu, Ziwei Liu, Chen Change Loy, Dahua Lin:
CARAFE: Content-Aware ReAssembly of FEatures. 3007-3016 - Dou Quan, Xuefeng Liang, Shuang Wang, Shaowei Wei, Yanfeng Li, Ning Huyan, Licheng Jiao

:
AFD-Net: Aggregated Feature Difference Learning for Cross-Spectral Image Patch Matching. 3017-3026 - Shupeng Su, Zhisheng Zhong, Chao Zhang:

Deep Joint-Semantics Reconstructing Hashing for Large-Scale Unsupervised Cross-Modal Retrieval. 3027-3035 - Stanislav Morozov, Artem Babenko:

Unsupervised Neural Quantization for Compressed-Domain Similarity Search. 3036-3045 - Soumava Kumar Roy, Mehrtash Harandi

, Richard Nock, Richard I. Hartley:
Siamese Networks: The Tale of Two Manifolds. 3046-3055 - Runzhong Wang

, Junchi Yan, Xiaokang Yang:
Learning Combinatorial Embedding Networks for Deep Graph Matching. 3056-3065 - Zhanghui Kuang, Yiming Gao, Guanbin Li, Ping Luo, Yimin Chen, Liang Lin, Wayne Zhang

:
Fashion Retrieval via Graph Reasoning Networks on a Similarity Pyramid. 3066-3075
Oral 2.1B
Low Level Vision
- Xin Deng, Ren Yang

, Mai Xu, Pier Luigi Dragotti
:
Wavelet Domain Style Transfer for an Effective Perception-Distortion Tradeoff in Single Image Super-Resolution. 3076-3085 - Jianrui Cai, Hui Zeng, Hongwei Yong, Zisheng Cao, Lei Zhang

:
Toward Real-World Single Image Super-Resolution: A New Benchmark and a New Model. 3086-3095 - Wenlong Zhang

, Yihao Liu
, Chao Dong, Yu Qiao:
RankSRGAN: Generative Adversarial Networks With Ranker for Image Super-Resolution. 3096-3105 - Peng Yi, Zhongyuan Wang, Kui Jiang

, Junjun Jiang
, Jiayi Ma:
Progressive Fusion Video Super-Resolution Network via Exploiting Non-Local Spatio-Temporal Correlations. 3106-3115 - Soo Ye Kim, Jihyong Oh

, Munchurl Kim:
Deep SR-ITM: Joint Learning of Super-Resolution and Inverse Tone-Mapping for 4K UHD HDR Applications. 3116-3125 - Tatsuya Yokota, Kazuya Kawai, Muneyuki Sakata, Yuichi Kimura, Hidekata Hontani:

Dynamic PET Image Reconstruction Using Nonnegative Matrix Factorization Incorporated With Deep Image Prior. 3126-3135 - Jerry Liu, Shenlong Wang, Raquel Urtasun:

DSIC: Deep Stereo Image Compression. 3136-3145 - Yoojin Choi, Mostafa El-Khamy

, Jungwon Lee:
Variable Rate Deep Image Compression With a Conditional Autoencoder. 3146-3154 - Saeed Anwar, Nick Barnes

:
Real Image Denoising With Feature Attention. 3155-3164 - Abdelrahman Abdelhamed

, Marcus A. Brubaker
, Michael S. Brown:
Noise Flow: Noise Modeling With Conditional Normalizing Flows. 3165-3173 - Ahmed Abbas

, Paul Swoboda:
Bottleneck Potentials in Markov Random Fields. 3174-3183 - Chen Chen, Qifeng Chen, Minh N. Do

, Vladlen Koltun:
Seeing Motion in the Dark. 3184-3193 - Huaizu Jiang, Deqing Sun, Varun Jampani, Zhaoyang Lv, Erik G. Learned-Miller, Jan Kautz:

SENSE: A Shared Encoder Network for Scene-Flow Estimation. 3194-3203
Poster 2.1
Deep Learning
- Firas Shama, Roey Mechrez, Alon Shoshan, Lihi Zelnik-Manor:

Adversarial Feedback Loop. 3204-3213 - Alon Shoshan, Roey Mechrez, Lihi Zelnik-Manor:

Dynamic-Net: Tuning the Objective Without Re-Training for Synthesis Tasks. 3214-3222 - Xinyu Gong, Shiyu Chang, Yifan Jiang, Zhangyang Wang:

AutoGAN: Neural Architecture Search for Generative Adversarial Networks. 3223-3233 - Han Shu, Yunhe Wang, Xu Jia, Kai Han, Hanting Chen, Chunjing Xu, Qi Tian, Chang Xu

:
Co-Evolutionary Compression for Unpaired Image Translation. 3234-3243 - Zeyu Feng, Chang Xu

, Dacheng Tao
:
Self-Supervised Representation Learning From Multi-Domain Data. 3244-3254 - Michael Möller, Thomas Möllenhoff

, Daniel Cremers
:
Controlling Neural Networks via Energy Dissipation. 3255-3264 - Hao Lu, Yutong Dai, Chunhua Shen, Songcen Xu

:
Indices Matter: Learning to Index for Deep Image Matting. 3265-3274 - Yunan Li

, Qiguang Miao
, Wanli Ouyang
, Zhenxin Ma, Huijuan Fang, Chao Dong, Yi-Ning Quan:
LAP-Net: Level-Aware Progressive Network for Image Dehazing. 3275-3284 - Irwan Bello, Barret Zoph, Quoc Le, Ashish Vaswani, Jonathon Shlens:

Attention Augmented Convolutional Networks. 3285-3294 - Zechun Liu, Haoyuan Mu, Xiangyu Zhang, Zichao Guo, Xin Yang, Kwang-Ting Cheng

, Jian Sun:
MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning. 3295-3304 - Yuefu Zhou, Ya Zhang

, Yanfeng Wang, Qi Tian:
Accelerate CNN via Recursive Bayesian Pruning. 3305-3314 - Duo Li, Aojun Zhou, Anbang Yao:

HBONet: Harmonious Bottleneck on Two Orthogonal Dimensions. 3315-3324 - Jinchi Huang, Lie Qu, Rongfei Jia, Binqiang Zhao:

O2U-Net: A Simple Noisy Label Detection Approach for Deep Neural Networks. 3325-3333 - Dongmin Park, Seokil Hong, Bohyung Han, Kyoung Mu Lee:

Continual Learning by Asymmetric Loss Approximation With Single-Side Overestimation. 3334-3343 - Weifeng Ge, Weilin Huang, Sheng Guo, Matthew R. Scott

:
Label-PEnet: Sequential Label Propagation and Enhancement Networks for Weakly Supervised Instance Segmentation. 3344-3353 - Ziteng Gao, Limin Wang, Gangshan Wu:

LIP: Local Importance-Based Pooling. 3354-3363 - Takumi Kobayashi:

Global Feature Guided Local Pooling. 3364-3373 - Jinghua Wang, Jianmin Jiang:

Conditional Coupled Generative Adversarial Networks for Zero-Shot Domain Adaptation. 3374-3383 - Aamir Mustafa, Salman H. Khan, Munawar Hayat

, Roland Goecke
, Jianbing Shen, Ling Shao
:
Adversarial Defense by Restricting the Hidden Space of Deep Neural Networks. 3384-3393 - Juhong Min, Jongmin Lee, Jean Ponce, Minsu Cho:

Hyperpixel Flow: Semantic Correspondence With Multi-Layer Neural Features. 3394-3403 - Weitao Wan, Jiansheng Chen, Tianpeng Li, Yiqing Huang

, Jingqi Tian, Cheng Yu, Youze Xue:
Information Entropy Based Feature Pooling for Convolutional Neural Networks. 3404-3413 - Yuning Chai:

Patchwork: A Patch-Wise Attention Network for Efficient Object Detection and Segmentation in Video Streams. 3414-3423 - Siddhesh Khandelwal, Leonid Sigal:

AttentionRNN: A Structured Spatial Attention Mechanism. 3424-3433 - Yunpeng Chen

, Haoqi Fan, Bing Xu, Zhicheng Yan, Yannis Kalantidis, Marcus Rohrbach, Shuicheng Yan, Jiashi Feng:
Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution. 3434-3443 - Sagie Benaim, Michael Khaitov, Tomer Galanti, Lior Wolf:

Domain Intersection and Domain Difference. 3444-3452 - Oren Rippel, Sanjay Nair, Carissa Lew, Steve Branson, Alexander G. Anderson, Lubomir D. Bourdev:

Learned Video Compression. 3453-3462 - Han Hu, Zheng Zhang, Zhenda Xie, Stephen Lin:

Local Relation Networks for Image Recognition. 3463-3472 - Éloi Mehr, Ariane Jourdan, Nicolas Thome, Matthieu Cord, Vincent Guitteny:

DiscoNet: Shapes Learning on Disconnected Manifolds for 3D Editing. 3473-3482 - Max Ehrlich, Larry Davis:

Deep Residual Learning in the JPEG Transform Domain. 3483-3492 - Xinqi Zhu, Chang Xu

, Langwen Hui, Cewu Lu, Dacheng Tao
:
Approximated Bilinear Modules for Temporal Modeling. 3493-3502 - Chengchao Shen, Mengqi Xue

, Xinchao Wang
, Jie Song, Li Sun, Mingli Song:
Customizing Student Networks From Heterogeneous Teachers via Adaptive Knowledge Amalgamation. 3503-3512 - Hanting Chen, Yunhe Wang, Chang Xu

, Zhaohui Yang, Chuanjian Liu, Boxin Shi, Chunjing Xu, Chao Xu, Qi Tian:
Data-Free Learning of Student Networks. 3513-3521 - Yue Wang, Justin Solomon:

Deep Closest Point: Learning Representations for Point Cloud Registration. 3522-3531 - Chao Zhang, Stephan Liwicki

, William Smith
, Roberto Cipolla:
Orientation-Aware Semantic Segmentation on Icosahedron Spheres. 3532-3540 - Zhaoyang Zhang, Jingyu Li, Wenqi Shao, Zhanglin Peng, Ruimao Zhang

, Xiaogang Wang, Ping Luo:
Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks. 3541-3550 - Ping Chao, Chao-Yang Kao, Yu-Shan Ruan, Chien-Hsiang Huang, Youn-Long Lin:

HarDNet: A Low Memory Traffic Network. 3551-3560 - Junjun He, Zhongying Deng, Yu Qiao

:
Dynamic Multi-Scale Filters for Semantic Segmentation. 3561-3571 - Ravi Teja Mullapudi, Steven Chen, Keyi Zhang, Deva Ramanan

, Kayvon Fatahalian:
Online Model Distillation for Efficient Video Inference. 3572-3581
Recognition
- Kai Li, Martin Renqiang Min

, Yun Fu:
Rethinking Zero-Shot Learning: A Conditional Visual Classification Perspective. 3582-3591 - Senthil Purushwalkam, Maximilian Nickel

, Abhinav Gupta, Marc'Aurelio Ranzato:
Task-Driven Modular Networks for Zero-Shot Compositional Learning. 3592-3601 - Limeng Qiao, Yemin Shi, Jia Li, Yonghong Tian, Tiejun Huang, Yaowei Wang:

Transductive Episodic-Wise Adaptive Metric for Few-Shot Learning. 3602-3611 - Wei Zhai, Yang Cao, Jing Zhang

, Zheng-Jun Zha:
Deep Multiple-Attribute-Perceived Network for Real-World Texture Recognition. 3612-3621 - Guan'an Wang, Tianzhu Zhang, Jian Cheng, Si Liu, Yang Yang, Zengguang Hou:

RGB-Infrared Cross-Modality Person Re-Identification via Joint Pixel and Feature Alignment. 3622-3631 - Saurabh Singh, Abhinav Shrivastava:

EvalNorm: Estimating Batch Normalization Statistics for Evaluation. 3632-3640 - Jianyuan Guo

, Yuhui Yuan, Lang Huang, Chao Zhang, Jin-Ge Yao, Kai Han:
Beyond Human Parts: Dual Part-Aligned Representations for Person Re-Identification. 3641-3650 - Qi Dong, Xiatian Zhu, Shaogang Gong:

Person Search by Text Attribute Query As Zero-Shot Learning. 3651-3660 - Qing Liu, Lingxi Xie, Huiyu Wang, Alan L. Yuille

:
Semantic-Aware Knowledge Preservation for Zero-Shot Sketch-Based Image Retrieval. 3661-3670 - Hamed H. Aghdam, Abel Gonzalez-Garcia, Antonio M. López

, Joost van de Weijer
:
Active Learning for Deep Detection Neural Networks. 3671-3679 - Xuanyi Dong, Yi Yang:

One-Shot Neural Architecture Search via Self-Evaluated Template Network. 3680-3689 - Zuozhuo Dai, Mingqiang Chen, Xiaodong Gu, Siyu Zhu, Ping Tan:

Batch DropBlock Network for Person Re-Identification and Beyond. 3690-3700 - Kaiyang Zhou, Yongxin Yang, Andrea Cavallaro, Tao Xiang:

Omni-Scale Feature Learning for Person Re-Identification. 3701-3711 - Linfeng Zhang

, Jiebo Song, Anni Gao, Jingwei Chen, Chenglong Bao, Kaisheng Ma
:
Be Your Own Teacher: Improve the Performance of Convolutional Neural Networks via Self Distillation. 3712-3721 - Nikita Dvornik, Julien Mairal, Cordelia Schmid:

Diversity With Cooperation: Ensemble Methods for Few-Shot Classification. 3722-3730 - Cheng Xu, Zhaoqun Li, Qiang Qiu, Biao Leng, Jingfei Jiang:

Enhancing 2D Representation via Adjacent Views for 3D Shape Retrieval. 3731-3739 - Kun Wei, Muli Yang, Hao Wang, Cheng Deng

, Xianglong Liu
:
Adversarial Fine-Grained Composition Learning for Unseen Attribute-Object Recognition. 3740-3748 - Ruijie Quan, Xuanyi Dong, Yu Wu

, Linchao Zhu
, Yi Yang:
Auto-ReID: Searching for a Part-Aware ConvNet for Person Re-Identification. 3749-3758 - Bryan Bryan, Yuan Gong

, Yizhe Zhang, Christian Poellabauer
:
Second-Order Non-Local Attention Networks for Person Re-Identification. 3759-3768
Segmentation, Grouping, & Shape
- Zipeng Ye, Ran Yi

, Minjing Yu, Yong-Jin Liu, Ying He
:
Fast Computation of Content-Sensitive Superpixels and Supervoxels Using Q-Distances. 3769-3778 - Dániel Baráth, Jiri Matas

:
Progressive-X: Efficient, Anytime, Multi-Model Fitting Algorithm. 3779-3787 - Yingyue Xu, Dan Xu, Xiaopeng Hong, Wanli Ouyang

, Rongrong Ji, Min Xu
, Guoying Zhao:
Structured Modeling of Joint Deep Feature and Prediction Refinement for Salient Object Detection. 3788-3797 - Jinming Su, Jia Li, Yu Zhang, Changqun Xia, Yonghong Tian:

Selectivity or Invariance: Boundary-Aware Salient Object Detection. 3798-3807 - Urbano Miguel Nunes

, Yiannis Demiris
:
Online Unsupervised Learning of the 3D Kinematic Structure of Arbitrary Rigid Bodies. 3808-3816
3D From Single View & RGBD
- Bram Wallace, Bharath Hariharan:

Few-Shot Generalization for Single-Image 3D Reconstruction via Priors. 3817-3826 - Clément Godard

, Oisin Mac Aodha, Michael Firman, Gabriel J. Brostow:
Digging Into Self-Supervised Monocular Depth Estimation. 3827-3837 - Jing Zhu, Yi Fang:

Learning Object-Specific Distance From a Monocular Image. 3838-3847 - Geonho Cha, Minsik Lee

, Songhwai Oh:
Unsupervised 3D Reconstruction Networks. 3848-3857 - Dong Wook Shu, Sung Woo Park, Junseok Kwon:

3D Point Cloud Generative Adversarial Network Based on Tree Structured Graph Convolutions. 3858-3867 - Junjie Hu

, Yan Zhang, Takayuki Okatani:
Visualization of Convolutional Neural Networks for Monocular Depth Estimation. 3868-3877
Action & Video
- Ruohan Gao, Kristen Grauman:

Co-Separating Sounds of Visual Objects. 3878-3887 - Tianwei Lin, Xiao Liu, Xin Li, Errui Ding, Shilei Wen:

BMN: Boundary-Matching Network for Temporal Action Proposal Generation. 3888-3897 - Ziyi Liu

, Le Wang, Qilin Zhang
, Zhanning Gao, Zhenxing Niu, Nanning Zheng, Gang Hua:
Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks. 3898-3907 - Chaoxu Guo, Bin Fan, Jie Gu, Qian Zhang, Shiming Xiang, Véronique Prinet, Chunhong Pan:

Progressive Sparse Local Attention for Video Object Detection. 3908-3917 - Tete Xiao, Quanfu Fan, Danny Gutfreund, Mathew Monfort, Aude Oliva, Bolei Zhou:

Reasoning About Human-Object Interactions Through Dual Attention Networks. 3918-3927 - Xiaohui Zeng, Renjie Liao, Li Gu, Yuwen Xiong, Sanja Fidler

, Raquel Urtasun:
DMM-Net: Differentiable Mask-Matching Network for Video Object Segmentation. 3928-3937 - Hao Wang, Cheng Deng

, Junchi Yan, Dacheng Tao
:
Asymmetric Cross-Guided Attention Network for Actor and Action Video Segmentation From Natural Language Query. 3938-3947 - Huaijia Lin, Xiaojuan Qi, Jiaya Jia

:
AGSS-VOS: Attention Guided Single-Shot Video Object Segmentation. 3948-3956 - Jianing Li, Shiliang Zhang, Jingdong Wang

, Wen Gao, Qi Tian:
Global-Local Temporal Representations for Video Person Re-Identification. 3957-3966 - Chaowei Xiao, Ruizhi Deng, Bo Li, Taesung Lee, Benjamin Edwards, Jinfeng Yi, Dawn Song, Mingyan Liu, Ian M. Molloy:

AdvIT: Adversarial Frames Identifier Based on Temporal Consistency in Videos. 3967-3976
Motion & Tracking
- Ziqin Wang, Jun Xu, Li Liu, Fan Zhu, Ling Shao

:
RANet: Ranking Attention Network for Fast Video Object Segmentation. 3977-3986 - Jiarui Xu, Yue Cao, Zheng Zhang, Han Hu:

Spatial-Temporal Relation Networks for Multi-Object Tracking. 3987-3997 - Lianghua Huang, Xin Zhao

, Kaiqi Huang:
Bridging the Gap Between Detection and Tracking: A Unified Approach. 3998-4008 - Lichao Zhang, Abel Gonzalez-Garcia, Joost van de Weijer

, Martin Danelljan, Fahad Shahbaz Khan
:
Learning the Model Update for Siamese Trackers. 4009-4018 - Linyu Zheng, Ming Tang, Yingying Chen, Jinqiao Wang, Hanqing Lu:

Fast-deepKCF Without Boundary Effect. 4019-4028
Computational Photography & Graphics
- Xiuming Zhang, Jiayuan Mao, Yikai Li, William T. Freeman, Joshua B. Tenenbaum, Jiajun Wu:

Program-Guided Image Manipulators. 4029-4038 - Pierre-André Brousseau, Sébastien Roy:

Calibration of Axial Fisheye Cameras Through Generic Virtual Central Models. 4039-4047 - Vishwanath Saragadam

, Raja Venkata, Jian Wang, Shree K. Nayar, Mohit Gupta:
Micro-Baseline Structured Light. 4048-4057 - Xin Miao, Xin Yuan

, Yunchen Pu, Vassilis Athitsos
:
lambda-Net: Reconstruct Hyperspectral Images From a Snapshot Measurement. 4058-4068 - Masako Kashiwagi, Nao Mishima, Tatsuo Kozakaya, Shinsaku Hiura:

Deep Depth From Aberration Map. 4069-4078 - Lukas Murmann, Michaël Gharbi, Miika Aittala, Frédo Durand:

A Dataset of Multi-Illumination Images in the Wild. 4079-4088 - Jie Song, Xu Chen, Otmar Hilliges:

Monocular Neural Image Based Rendering With Continuous View Control. 4089-4099 - Marc Comino-Trinidad

, Ricardo Martin-Brualla, Florian Kainz, Janne Kontkanen:
Multi-View Image Fusion. 4100-4109
Low-Level & Optimization
- Wei Wang, Xin Chen, Cheng Yang, Xiang Li, Xuemei Hu, Tao Yue

:
Enhancing Low Light Videos by Exploring High Sensitivity Camera Noise. 4110-4118 - Qifan Gao, Xiao Shu, Xiaolin Wu:

Deep Restoration of Vintage Photographs From Scanned Halftone Prints. 4119-4128 - Qiqi Hou, Feng Liu:

Context-Aware Image Matting for Simultaneous Foreground and Alpha Estimation. 4129-4138 - Wei Wang, Ruiming Guo

, Yapeng Tian, Wenming Yang:
CFSNet: Toward a Controllable Feature Space for Image Restoration. 4139-4148 - Wu Wang

, Weihong Zeng, Yue Huang, Xinghao Ding, John W. Paisley:
Deep Blind Hyperspectral Image Fusion. 4149-4158 - Sungmin Cha, Taesup Moon:

Fully Convolutional Pixel Adaptive Image Denoiser. 4159-4168 - Hongyu Liu, Bin Jiang, Yi Xiao, Chao Yang:

Coherent Semantic Attention for Image Inpainting. 4169-4178 - Yajun Qiu, Ruxin Wang, Dapeng Tao, Jun Cheng:

Embedded Block Residual Network: A Recursive Restoration Model for Single-Image Super-Resolution. 4179-4188 - Shuhang Gu, Wen Li, Luc Van Gool, Radu Timofte

:
Fast Image Restoration With Multi-Bin Trainable Linear Units. 4189-4198
Scene Understanding
- Zenglin Shi, Pascal Mettes, Cees Snoek:

Counting With Focus for Free. 4199-4208 - Behzad Bozorgtabar

, Mohammad Saeed Rad, Dwarikanath Mahapatra, Jean-Philippe Thiran
:
SynDeMo: Synergistic Deep Feature Alignment for Joint Learning of Depth and Ego-Motion. 4209-4218 - Ke Li, Tianhao Zhang, Jitendra Malik:

Diverse Image Synthesis From Semantic Layouts via Conditional IMLE. 4219-4228 - Yanwei Pang, Yazhao Li, Jianbing Shen, Ling Shao

:
Towards Bridging Semantic Gap to Improve Semantic Segmentation. 4229-4238
Language & Reasoning
- Lixin Liu, Jiajun Tang

, Xiaojun Wan, Zongming Guo:
Generating Diverse and Descriptive Image Captions Using Visual Paraphrases. 4239-4248 - Xu Yang, Hanwang Zhang

, Jianfei Cai:
Learning to Collocate Neural Modules for Image Captioning. 4249-4259 - Jyoti Aneja, Harsh Agrawal

, Dhruv Batra, Alexander G. Schwing:
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning. 4260-4269 - Nilavra Bhattacharya

, Qing Li, Danna Gurari:
Why Does a Visual Question Have Different Answers? 4270-4279 - Mohit Bajaj, Lanjun Wang

, Leonid Sigal:
G3raphGround: Graph-Based Language Grounding. 4280-4289 - Ali Furkan Biten, Rubèn Tito, Andrés Mafla, Lluís Gómez i Bigorda

, Marçal Rusiñol, C. V. Jawahar
, Ernest Valveny, Dimosthenis Karatzas
:
Scene Text Visual Question Answering. 4290-4300
3D From Multiview & Sensors
- Lu Sheng

, Dan Xu, Wanli Ouyang
, Xiaogang Wang:
Unsupervised Collaborative Learning of Keyframe Detection and Visual Odometry Towards Monocular Deep SLAM. 4301-4310 - Youze Xue, Jiansheng Chen, Weitao Wan, Yiqing Huang

, Cheng Yu, Tianpeng Li, Jiayu Bao:
MVSCRF: Learning Multi-View Stereo With Conditional Random Fields. 4311-4320 - Eric Brachmann, Carsten Rother:

Neural-Guided RANSAC: Learning Where to Sample Model Hypotheses. 4321-4330 - Sergey Prokudin, Christoph Lassner, Javier Romero:

Efficient Learning on Point Clouds With Basis Point Sets. 4331-4340 - Haibo Qiu, Chunyu Wang, Jingdong Wang

, Naiyan Wang, Wenjun Zeng
:
Cross View Fusion for 3D Human Pose Estimation. 4341-4350 - Junbang Liang, Ming C. Lin:

Shape-Aware Human Pose and Shape Reconstruction Using Multi-View Images. 4351-4361 - Di Yan, Henrique Morimitsu

, Shan Gao, Xiangyang Ji:
Monocular Piecewise Depth Estimation in Dynamic Scenes by Exploiting Superpixel Relations. 4362-4371 - Hajime Taira, Ignacio Rocco, Jirí Sedlár

, Masatoshi Okutomi, Josef Sivic, Tomás Pajdla, Torsten Sattler, Akihiko Torii:
Is This the Right Place? Geometric-Semantic Pose Verification for Indoor Visual Localization. 4372-4382 - Shivam Duggal, Shenlong Wang, Wei-Chiu Ma, Rui Hu, Raquel Urtasun:

DeepPruner: Learning Efficient Stereo Matching via Differentiable PatchMatch. 4383-4392
Image & Video Synthesis
- Sijie Yan, Zhizhong Li

, Yuanjun Xiong, Huahan Yan, Dahua Lin:
Convolutional Sequence Generation for Skeleton-Based Action Synthesis. 4393-4401 - Seoung Wug Oh, Sungho Lee

, Joon-Young Lee, Seon Joo Kim:
Onion-Peel Networks for Deep Video Completion. 4402-4411 - Sungho Lee

, Seoung Wug Oh, DaeYeun Won, Seon Joo Kim:
Copy-and-Paste Networks for Deep Video Inpainting. 4412-4420 - Dmytro Kotovenko, Artsiom Sanakoyeu, Sabine Lang, Björn Ommer:

Content and Style Disentanglement for Artistic Style Transfer. 4421-4430
Oral 3.1A
Generative Modeling & Synthesis
- Rameen Abdal

, Yipeng Qin
, Peter Wonka:
Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space? 4431-4440 - Shuai Yang

, Zhangyang Wang, Zhaowen Wang, Ning Xu, Jiaying Liu
, Zongming Guo:
Controllable Artistic Text Style Transfer via Shape-Matching GAN. 4441-4450 - Tai-Yin Chiu:

Understanding Generalized Whitening and Coloring Transform for Universal Style Transfer. 4451-4459 - Cícero Nogueira dos Santos, Youssef Mroueh, Inkit Padhi, Pierre L. Dognin:

Learning Implicit Generative Models by Matching Perceptual Features. 4460-4469 - Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, Thomas S. Huang:

Free-Form Image Inpainting With Gated Convolution. 4470-4479 - Xintong Han, Zuxuan Wu, Weilin Huang, Matthew R. Scott

, Larry Davis:
FiNet: Compatible and Diverse Fashion Image Inpainting. 4480-4490 - Assaf Shocher, Shai Bagon, Phillip Isola, Michal Irani:

InGAN: Capturing and Retargeting the "DNA" of a Natural Image. 4491-4500 - David Bau

, Jun-Yan Zhu, Jonas Wulff, William S. Peebles, Bolei Zhou, Hendrik Strobelt, Antonio Torralba:
Seeing What a GAN Cannot Generate. 4501-4510 - Chieh Hubert Lin, Chia-Che Chang, Yu-Sheng Chen

, Da-Cheng Juan, Wei Wei, Hwann-Tzong Chen:
COCO-GAN: Generation by Parts via Conditional Coordinating. 4511-4520 - Hang Chu, Daiqing Li, David Acuna, Amlan Kar, Maria Shugrina, Xinkai Wei, Ming-Yu Liu, Antonio Torralba, Sanja Fidler

:
Neural Turtle Graphics for Modeling City Road Layouts. 4521-4529 - Michael Oechsle, Lars M. Mescheder, Michael Niemeyer, Thilo Strauss, Andreas Geiger:

Texture Fields: Learning Texture Representations in Function Space. 4530-4539 - Guandao Yang, Xun Huang, Zekun Hao, Ming-Yu Liu, Serge J. Belongie

, Bharath Hariharan:
PointFlow: 3D Point Cloud Generation With Continuous Normalizing Flows. 4540-4549 - Amlan Kar, Aayush Prakash, Ming-Yu Liu, Eric Cameracci, Justin Yuan, Matt Rusiniak, David Acuna, Antonio Torralba, Sanja Fidler

:
Meta-Sim: Learning to Generate Synthetic Datasets. 4550-4559 - Oron Ashual, Lior Wolf:

Specifying Object Attributes and Relations in Interactive Scene Generation. 4560-4568 - Tamar Rott Shaham, Tali Dekel, Tomer Michaeli:

SinGAN: Learning a Generative Model From a Single Natural Image. 4569-4579
Oral 3.1B
Vision, Language, & Text
- Xin Wang, Jiawei Wu

, Jun-Kun Chen, Lei Li
, Yuan-Fang Wang, William Yang Wang:
VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research. 4580-4590 - Yu Xiong, Qingqiu Huang, Lingfeng Guo, Hang Zhou, Bolei Zhou, Dahua Lin:

A Graph-Based Framework to Bridge Movies and Synopses. 4591-4600 - Ajeet Kumar Singh, Anand Mishra

, Shashank Shekhar, Anirban Chakraborty:
From Strings to Things: Knowledge-Enabled VQA Model That Can Read and Reason. 4601-4611 - Long Chen

, Hanwang Zhang
, Jun Xiao, Xiangnan He, Shiliang Pu, Shih-Fu Chang:
Counterfactual Critic Multi-Agent Training for Scene Graph Generation. 4612-4622 - Dong Huk Park, Trevor Darrell, Anna Rohrbach:

Robust Change Captioning. 4623-4632 - Lun Huang, Wenmin Wang, Jie Chen, Xiaoyong Wei

:
Attention on Attention for Image Captioning. 4633-4642 - Sibei Yang

, Guanbin Li, Yizhou Yu:
Dynamic Graph Attention for Referring Expression Comprehension. 4643-4652 - Kunpeng Li, Yulun Zhang

, Kai Li, Yuanyuan Li, Yun Fu:
Visual Semantic Reasoning for Image-Text Matching. 4653-4661 - Josiah Wang, Lucia Specia:

Phrase Localization Without Paired Training Examples. 4662-4671 - Daqing Liu, Hanwang Zhang

, Feng Wu, Zheng-Jun Zha:
Learning to Assemble Neural Module Tree Networks for Visual Grounding. 4672-4681 - Zhengyuan Yang, Boqing Gong, Liwei Wang, Wenbing Huang, Dong Yu, Jiebo Luo

:
A Fast and Accurate One-Stage Approach to Visual Grounding. 4682-4692 - Arka Sadhu, Kan Chen, Ram Nevatia:

Zero-Shot Grounding of Objects From Natural Language Queries. 4693-4702 - Siyang Qin, Alessandro Bissacco, Michalis Raptis, Yasuhisa Fujii, Ying Xiao:

Towards Unconstrained End-to-End Text Spotting. 4703-4713 - Jeonghun Baek

, Geewook Kim
, Junyeop Lee, Sungrae Park, Dongyoon Han, Sangdoo Yun, Seong Joon Oh, Hwalsuk Lee:
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis. 4714-4722
Poster 3.1
Deep Learning
- Francesco Croce, Matthias Hein:

Sparse and Imperceivable Adversarial Attacks. 4723-4731 - Qian Huang, Isay Katsman

, Zeqi Gu, Horace He, Serge J. Belongie
, Ser-Nam Lim:
Enhancing Adversarial Example Transferability With an Intermediate Level Attack. 4732-4741 - Mateusz Michalkiewicz, Jhony Kaesemodel Pontes, Dominic Jack, Mahsa Baktashmotlagh

, Anders P. Eriksson:
Implicit Surface Representations As Layers in Neural Networks. 4742-4751 - Pablo Navarrete Michelini, Hanwen Liu, Yunhua Lu, Xingqun Jiang:

A Tour of Convolutional Networks Guided by Linear Interpreters. 4752-4761 - João F. Henriques, Sébastien Ehrhardt, Samuel Albanie, Andrea Vedaldi:

Small Steps and Giant Leaps: Minimal Newton Solvers for Deep Learning. 4762-4771 - Ameya Joshi

, Amitangshu Mukherjee, Soumik Sarkar, Chinmay Hegde
:
Semantic Adversarial Attacks: Parametric Transformations That Fool Deep Classifiers. 4772-4782 - Yang Bai

, Yan Feng, Yisen Wang, Tao Dai, Shutao Xia, Yong Jiang:
Hilbert-Based Generative Defense for Adversarial Examples. 4783-4792 - Jang Hyun Cho, Bharath Hariharan:

On the Efficacy of Knowledge Distillation. 4793-4801 - Simyung Chang, Seonguk Park, John Yang, Nojun Kwak:

Sym-Parameterized Dynamic Inference for Mixed-Domain Image Translation. 4802-4810 - Shuang Wang, Yanfeng Li, Xuefeng Liang, Dou Quan, Bowu Yang, Shaowei Wei, Licheng Jiao

:
Better and Faster: Exponential Loss for Image Patch Matching. 4811-4820 - Rey Wiyatno, Anqi Xu:

Physical Adversarial Textures That Fool Visual Object Tracking. 4821-4830 - Huidong Liu, Xianfeng Gu

, Dimitris Samaras:
Wasserstein GAN With Quadratic Transport Cost. 4831-4840 - Sven Gowal, Krishnamurthy Dvijotham, Robert Stanforth, Rudy Bunel, Chongli Qin, Jonathan Uesato, Relja Arandjelovic, Timothy Arthur Mann, Pushmeet Kohli:

Scalable Verified Training for Provably Robust Image Classification. 4841-4850 - Ruihao Gong

, Xianglong Liu
, Shenghu Jiang, Tianxiang Li, Peng Hu, Jiazhen Lin, Fengwei Yu, Junjie Yan:
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks. 4851-4860 - Chris Finlay, Aram-Alexandre Pooladian, Adam M. Oberman:

The LogBarrier Adversarial Attack: Making Effective Use of Decision Boundary Information. 4861-4869 - Thalaiyasingam Ajanthan, Puneet K. Dokania, Richard Hartley, Philip H. S. Torr:

Proximal Mean-Field for Neural Network Quantization. 4870-4879 - Hao-Yun Chen, Jhao-Hong Liang, Shih-Chieh Chang

, Jia-Yu Pan, Yu-Ting Chen, Wei Wei, Da-Cheng Juan:
Improving Adversarial Robustness via Guided Complement Entropy. 4880-4888 - Yujia Liu, Seyed-Mohsen Moosavi-Dezfooli

, Pascal Frossard:
A Geometry-Inspired Decision-Based Attack. 4889-4897 - Jie Li

, Rongrong Ji
, Hong Liu, Xiaopeng Hong, Yue Gao, Qi Tian:
Universal Perturbation Attack Against Image Retrieval. 4898-4907 - Jiaxin Gu, Junhe Zhao, Xiaolong Jiang, Baochang Zhang, Jianzhuang Liu, Guodong Guo, Rongrong Ji:

Bayesian Optimized 1-Bit CNNs. 4908-4916 - Kaiming He, Ross B. Girshick, Piotr Dollár:

Rethinking ImageNet Pre-Training. 4917-4926 - Chaithanya Kumar Mummadi, Thomas Brox, Jan Hendrik Metzen:

Defending Against Universal Perturbations With Shared Adversarial Training. 4927-4936 - Yiyou Sun, Sathya N. Ravi

, Vikas Singh:
Adaptive Activation Thresholding: Dynamic Routing Type Behavior for Interpretability in Convolutional Neural Networks. 4937-4946 - Andrei Kapishnikov, Tolga Bolukbasi, Fernanda B. Viégas, Michael Terry:

XRAI: Better Attributions Through Regions. 4947-4956 - Thomas Brunner

, Frederik Diehl, Michael Truong-Le, Alois C. Knoll
:
Guessing Smart: Biased Sampling for Efficient Black-Box Adversarial Attacks. 4957-4965
Recognition
- Yanwei Pang, Jin Xie, Muhammad Haris Khan

, Rao Muhammad Anwer
, Fahad Shahbaz Khan
, Ling Shao
:
Mask-Guided Attention Network for Occluded Pedestrian Detection. 4966-4974 - Chuanchen Luo, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang:

Spectral Feature Transformation for Person Re-Identification. 4975-4984 - Xiaofeng Liu, Zhenhua Guo, Site Li

, Ping Jia, Lingsheng Kong, Jane You, B. V. K. Vijaya Kumar
:
Permutation-Invariant Feature Restructuring for Correlation-Aware Image Set-Based Recognition. 4985-4995 - Chufeng Tang, Lu Sheng

, Zhaoxiang Zhang, Xiaolin Hu:
Improving Pedestrian Attribute Recognition With Weakly-Supervised Multi-Scale Attribute-Specific Localization. 4996-5005 - Baoyun Peng

, Xiao Jin, Dongsheng Li, Shunfeng Zhou, Yichao Wu, Jiaheng Liu, Zhaoning Zhang, Yu Liu:
Correlation Congruence for Knowledge Distillation. 5006-5015 - Yiru Wang, Weihao Gan, Jie Yang, Wei Wu, Junjie Yan:

Dynamic Curriculum Learning for Imbalanced Data Classification. 5016-5025 - Makarand Tapaswi, Marc T. Law, Sanja Fidler

:
Video Face Clustering With Unknown Number of Clusters. 5026-5035 - Giorgos Tolias

, Filip Radenovic, Ondrej Chum:
Targeted Mismatch Adversarial Attack: Query With a Flower to Retrieve the Tower. 5036-5045 - Wei-Lin Hsiao, Isay Katsman, Chao-Yuan Wu, Devi Parikh, Kristen Grauman:

Fashion++: Minimal Edits for Outfit Improvement. 5046-5055 - Si Wu, Sihao Lin

, Wenhao Wu, Mohamed Azzam
, Hau-San Wong
:
Semi-Supervised Pedestrian Instance Synthesis and Detection With Mutual Reinforcement. 5056-5065 - Tao Hu, Pascal Mettes, Jia-Hong Huang, Cees Snoek:

SILCO: Show a Few Images, Localize the Common Object. 5066-5075 - Jimmy Addison Lee, Peng Liu

, Jun Cheng, Huazhu Fu
:
A Deep Step Pattern Representation for Multimodal Retinal Image Registration. 5076-5085 - Zhen Zhang

, Wee Sun Lee:
Deep Graphical Feature Learning for the Feature Matching Problem. 5086-5095 - Dong Lao

, Ganesh Sundaramoorthi:
Minimum Delay Object Detection From Video. 5096-5105 - Jérôme Revaud, Jon Almazán, Rafael S. Rezende, César Roberto de Souza:

Learning With Average Precision: Training Image Retrieval With a Listwise Loss. 5106-5115 - Amirreza Shaban, Amir Rahimi, Shray Bansal, Stephen Gould, Byron Boots, Richard Hartley:

Learning to Find Common Objects Across Few Image Collections. 5116-5125 - Lu Zhang, Xiangyu Zhu, Xiangyu Chen, Xu Yang, Zhen Lei, Zhiyong Liu:

Weakly Aligned Cross-Modal Learning for Multispectral Pedestrian Detection. 5126-5136 - Jiangfan Han, Ping Luo, Xiaogang Wang:

Deep Self-Learning From Noisy Labels. 5137-5146 - Marcelo Gennari Do Nascimento, Victor Prisacariu, Roger Fawcett:

DSConv: Efficient Convolution Operator. 5147-5156 - Jiangfan Han, Xiaoyi Dong, Ruimao Zhang

, Dongdong Chen, Weiming Zhang, Nenghai Yu, Ping Luo, Xiaogang Wang:
Once a MAN: Towards Multi-Target Attack via Learning Multi-Target Adversarial Network Once. 5157-5166
Segmentation, Grouping, & Shape
- Wenqiang Xu, Haiyang Wang, Fubo Qi, Cewu Lu:

Explicit Shape Encoding for Real-Time Instance Segmentation. 5167-5176 - Cheng-Yang Fu

, Tamara L. Berg, Alexander C. Berg:
IMP: Instance Mask Projection for High Accuracy Semantic Segmentation of Things. 5177-5186 - Linjie Yang, Yuchen Fan, Ning Xu:

Video Instance Segmentation. 5187-5196 - Kunpeng Li, Yulun Zhang

, Kai Li, Yuanyuan Li, Yun Fu:
Attention Bridging Network for Knowledge Transfer. 5197-5206 - Wataru Shimoda, Keiji Yanai

:
Self-Supervised Difference Detection for Weakly-Supervised Semantic Segmentation. 5207-5216 - Bowen Cheng, Liang-Chieh Chen, Yunchao Wei, Yukun Zhu, Zilong Huang, Jinjun Xiong

, Thomas S. Huang, Wen-Mei Hwu, Honghui Shi:
SPGNet: Semantic Prediction Guidance for Scene Parsing. 5217-5227 - Towaki Takikawa, David Acuna, Varun Jampani, Sanja Fidler

:
Gated-SCNN: Gated Shape CNNs for Semantic Segmentation. 5228-5237 - Yongcheng Liu, Bin Fan, Gaofeng Meng, Jiwen Lu

, Shiming Xiang, Chunhong Pan:
DensePoint: Learning Densely Contextual Representation for Efficient Point Cloud Processing. 5238-5247 - Mennatullah Siam, Boris N. Oreshkin, Martin Jägersand:

AMP: Adaptive Masked Proxies for Few-Shot Segmentation. 5248-5257 - Tarun Kalluri, Girish Varma, Manmohan Chandraker, C. V. Jawahar

:
Universal Semi-Supervised Semantic Segmentation. 5258-5269
Statistics, Physics, Theory & Datasets
- Long-Kai Huang, Jianda Chen, Sinno Jialin Pan

:
Accelerate Learning of Deep Hashing With Gradient Attention. 5270-5279 - Qing-Yuan Jiang, Yi He, Gen Li, Jian Lin, Lei Li, Wu-Jun Li:

SVD: A Large-Scale Short Video Dataset for Near-Duplicate Video Retrieval. 5280-5288 - Hubert Lin, Paul Upchurch, Kavita Bala

:
Block Annotation: Better Image Annotation With Sub-Image Decomposition. 5289-5299 - Yanzhu Liu

, Fan Wang, Adams Wai-Kin Kong:
Probabilistic Deep Ordinal Regression Based on Gaussian Processes. 5300-5308 - Tianlu Wang, Jieyu Zhao, Mark Yatskar, Kai-Wei Chang, Vicente Ordonez

:
Balanced Datasets Are Not Enough: Estimating and Mitigating Gender Bias in Deep Image Representations. 5309-5318 - Pouya Bashivan, Mark Tensen, James J. DiCarlo:

Teacher Guided Architecture Search. 5319-5328
3D From Single View & RGBD
- David Smith, Matthew Loper, Xiaochen Hu, Paris Mavroidis, Javier Romero:

FACSIMILE: Fast and Accurate Scans From an Image in Less Than a Second. 5329-5338 - Yu Rong, Ziwei Liu, Cheng Li, Kaidi Cao, Chen Change Loy:

Delving Deep Into Hybrid Annotations for 3D Human Recovery in the Wild. 5339-5347 - Yu Sun, Yun Ye, Wu Liu, Wenpeng Gao, Yili Fu, Tao Mei

:
Human Mesh Recovery From Monocular Images via a Skeleton-Disentangled Representation. 5348-5357 - Silvia Zuffi

, Angjoo Kanazawa, Tanya Y. Berger-Wolf
, Michael J. Black:
Three-D Safari: Learning to Estimate Zebra Pose, Shape, and Texture From Images "In the Wild". 5358-5367 - Helisa Dhamo, Nassir Navab, Federico Tombari:

Object-Driven Multi-Layer Scene Decomposition From a Single Image. 5368-5377 - Michael Niemeyer, Lars M. Mescheder, Michael Oechsle, Andreas Geiger:

Occupancy Flow: 4D Reconstruction by Learning Particle Dynamics. 5378-5388 - Hou-Ning Hu, Qi-Zhi Cai, Dequan Wang, Ji Lin, Min Sun, Philipp Krähenbühl, Trevor Darrell, Fisher Yu:

Joint Monocular 3D Vehicle Detection and Tracking. 5389-5398
Face & Body
- Bowen Shi, Aurora Martinez Del Rio, Jonathan Keane, Diane Brentari, Greg Shakhnarovich, Karen Livescu

:
Fingerspelling Recognition in the Wild With Iterative Visual Attention. 5399-5408 - Hang Dai, Ling Shao

:
PointAE: Point Auto-Encoder for 3D Statistical Shape and Texture Modelling. 5409-5418 - Bharat Lal Bhatnagar, Garvita Tiwari, Christian Theobalt

, Gerard Pons-Moll:
Multi-Garment Net: Learning to Dress 3D People From Images. 5419-5429 - Haiyong Jiang, Jianfei Cai, Jianmin Zheng

:
Skeleton-Aware 3D Human Shape Reconstruction From Point Clouds. 5430-5440 - Naureen Mahmood, Nima Ghorbani, Nikolaus F. Troje

, Gerard Pons-Moll, Michael J. Black:
AMASS: Archive of Motion Capture As Surface Shapes. 5441-5450 - Fei Wang

, Sanping Zhou, Stanislav Panev
, Jinsong Han, Dong Huang:
Person-in-WiFi: Fine-Grained Person Perception Using WiFi. 5451-5460 - Keqiang Sun, Wayne Wu, Tinghao Liu, Shuo Yang, Quan Wang, Qiang Zhou, Zuochang Ye, Chen Qian:

FAB: A Robust Facial Landmark Detection Framework for Motion-Blurred Videos. 5461-5470 - Bong-Nam Kang, Yonghyun Kim, Bongjin Jun, Daijin Kim:

Attentional Feature-Pair Relation Networks for Accurate Face Recognition. 5471-5480
Action & Video
- Brais Martínez, Davide Modolo, Yuanjun Xiong, Joseph Tighe:

Action Recognition With Spatial-Temporal Discriminative Filter Banks. 5481-5490 - Evangelos Kazakos, Arsha Nagrani, Andrew Zisserman, Dima Damen

:
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition. 5491-5500 - Phuc Xuan Nguyen, Deva Ramanan

, Charless C. Fowlkes:
Weakly-Supervised Action Localization With Background Modeling. 5501-5510 - Chenxu Luo, Alan L. Yuille

:
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition. 5511-5520 - Tan Yu, Zhou Ren, Yuncheng Li, Enxu Yan, Ning Xu, Junsong Yuan

:
Temporal Structure Mining for Weakly Supervised Action Detection. 5521-5530 - Mingze Xu, Mingfei Gao, Yi-Ting Chen, Larry Davis, David J. Crandall

:
Temporal Recurrent Networks for Online Action Detection. 5531-5540 - Mingfei Gao, Mingze Xu, Larry Davis, Richard Socher, Caiming Xiong:

StartNet: Online Detection of Action Start in Untrimmed Videos. 5541-5550 - Du Tran, Heng Wang, Matt Feiszli, Lorenzo Torresani:

Video Classification With Channel-Separated Convolutional Networks. 5551-5560 - Harshala Gammulle

, Simon Denman, Sridha Sridharan
, Clinton Fookes:
Predicting the Future: A Jointly Learnt Model for Action Anticipation. 5561-5570
Low-Level & Optimization
- Ziyi Shen, Wenguan Wang

, Xiankai Lu, Jianbing Shen, Haibin Ling, Tingfa Xu, Ling Shao
:
Human-Aware Motion Deblurring. 5571-5580 - Lu Zhang, Zhe Lin, Jianming Zhang, Huchuan Lu, You He:

Fast Video Object Segmentation via Dynamic Targeting Network. 5581-5590 - Sean I. Young, Aous Thabit Naman

, Bernd Girod, David Taubman
:
Solving Vision Problems via Filtering. 5591-5600 - Ankit Raj, Yuqi Li, Yoram Bresler

:
GAN-Based Projector for Faster Recovery With Convergence Guarantees in Linear Inverse Problems. 5601-5610 - Deng-Ping Fan

, Shengchuan Zhang, Yu-Huan Wu, Yun Liu, Ming-Ming Cheng
, Bo Ren
, Paul L. Rosin, Rongrong Ji:
Scoot: A Perceptual Metric for Facial Sketches. 5611-5621 - Yawei Li

, Shuhang Gu, Luc Van Gool, Radu Timofte
:
Learning Filter Basis for Convolutional Neural Network Compression. 5622-5631 - Daniel Gehrig

, Antonio Loquercio
, Konstantinos G. Derpanis, Davide Scaramuzza
:
End-to-End Learning of Representations for Asynchronous Event-Based Data. 5632-5642 - Guoqing Wang, Changming Sun

, Arcot Sowmya:
ERL-Net: Entangled Representation Learning for Single Image De-Raining. 5643-5651 - Oleg Voynov, Alexey Artemov

, Vage Egiazarian
, Alexandr Notchenko, Gleb Bobrovskikh, Evgeny Burnaev
, Denis Zorin:
Perceptual Deep Depth Super-Resolution. 5652-5662
Scene Understanding
- Iro Armeni, Zhi-Yang He, Amir Zamir, JunYoung Gwak, Jitendra Malik, Martin Fischer

, Silvio Savarese:
3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera. 5663-5672 - Cheng Lin, Changjian Li, Wenping Wang:

Floorplan-Jigsaw: Jointly Estimating Scene Layout and Aligning Partial Scans. 5673-5682 - Wei Yin, Yifan Liu

, Chunhua Shen, Youliang Yan:
Enforcing Geometric Constraints of Virtual Normal for Depth Prediction. 5683-5692 - Tiancai Wang, Rao Muhammad Anwer

, Muhammad Haris Khan
, Fahad Shahbaz Khan
, Yanwei Pang, Ling Shao
, Jorma Laaksonen
:
Deep Contextual Attention for Human-Object Interaction Detection. 5693-5701 - Wenguan Wang

, Zhijie Zhang, Siyuan Qi, Jianbing Shen, Yanwei Pang, Ling Shao
:
Learning Compositional Neural Information Fusion for Human Parsing. 5702-5712 - Anran Zhang, Lei Yue, Jiayi Shen, Fan Zhu, Xiantong Zhen, Xianbin Cao, Ling Shao

:
Attentional Neural Fields for Crowd Counting. 5713-5722 - Lifeng Fan, Wenguan Wang

, Song-Chun Zhu, Xinyu Tang, Siyuan Huang:
Understanding Human Gaze Communication by Spatio-Temporal Graph Reasoning. 5723-5732 - Jean-Baptiste Alayrac, João Carreira, Relja Arandjelovic, Andrew Zisserman:

Controllable Attention for Structured Layered Video Decomposition. 5733-5742 - Lore Goetschalckx, Alex Andonian, Aude Oliva, Phillip Isola:

GANalyze: Toward Visual Definitions of Cognitive Image Properties. 5743-5752
Language & Reasoning
- Zhong Ji, Haoran Wang, Jungong Han, Yanwei Pang:

Saliency-Guided Attention Network for Image-Sentence Matching. 5753-5762 - Zihao Wang, Xihui Liu

, Hongsheng Li
, Lu Sheng
, Junjie Yan, Xiaogang Wang, Jing Shao:
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval. 5763-5772 - Yan Huang, Liang Wang:

ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching. 5773-5782 - Mohamed Elhoseiny, Mohamed Elfeki:

Creativity Inspired Zero-Shot Learning. 5783-5792 - Mikihiro Tanaka, Takayuki Itamochi, Kenichi Narioka, Ikuro Sato, Yoshitaka Ushiku

, Tatsuya Harada:
Generating Easy-to-Understand Referring Expressions for Target Identifications. 5793-5802 - Jonatas Wehrmann, Maurício Armani Lopes, Douglas M. Souza, Rodrigo C. Barros

:
Language-Agnostic Visual-Semantic Embeddings. 5803-5812 - Nikolaos Sarafianos, Xiang Xu, Ioannis A. Kakadiaris:

Adversarial Representation Learning for Text-to-Image Matching. 5813-5823 - Peng Gao, Haoxuan You

, Zhanpeng Zhang, Xiaogang Wang, Hongsheng Li
:
Multi-Modality Latent Interaction Network for Visual Question Answering. 5824-5834
3D From Multiview & Sensors
- Axel Barroso Laguna, Edgar Riba, Daniel Ponsa

, Krystian Mikolajczyk:
Key.Net: Keypoint Detection by Handcrafted and Learned CNN Filters. 5835-5843 - Jiahui Zhang, Dawei Sun, Zixin Luo, Anbang Yao, Lei Zhou, Tianwei Shen, Yurong Chen

, Hongen Liao, Long Quan:
Learning Two-View Correspondences and Geometry Using Order-Aware Network. 5844-5853 - Michael Bloesch, Tristan Laidlow, Ronald Clark, Stefan Leutenegger, Andrew J. Davison:

Learning Meshes for Dense Visual SLAM. 5854-5863 - Michael Strecke

, Jörg Stückler:
EM-Fusion: Dynamic Object-Level SLAM With Probabilistic Data Association. 5864-5873 - Jiahui Huang, Sheng Yang, Zishuo Zhao, Yu-Kun Lai, Shimin Hu:

ClusterSLAM: A SLAM Backend for Simultaneous Rigid Body Clustering and Motion Estimation. 5874-5883 - Uttaran Bhattacharya

, Venu Madhav Govindu:
Efficient and Robust Registration on the 3D Special Euclidean Group. 5884-5893 - Yoni Kasten, Amnon Geifman, Meirav Galun, Ronen Basri:

Algebraic Characterization of Essential Matrices and Their Averaging in Multiview Settings. 5894-5902
Image & Video Synthesis
- Wen Liu

, Zhixin Piao, Jie Min, Wenhan Luo
, Lin Ma, Shenghua Gao:
Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis. 5903-5912 - Yu-Jing Lin, Po-Wei Wu, Che-Han Chang, Edward Y. Chang, Shih-Wei Liao:

RelGAN: Multi-Domain Image-to-Image Translation via Relative Attributes. 5913-5921 - Ruizheng Wu, Xin Tao

, Xiaodong Gu, Xiaoyong Shen, Jiaya Jia
:
Attribute-Driven Spontaneous Motion in Unpaired Image Translation. 5922-5931 - Caroline Chan, Shiry Ginosar, Tinghui Zhou, Alexei A. Efros

:
Everybody Dance Now. 5932-5941 - Yulun Zhang

, Chen Fang, Yilin Wang, Zhaowen Wang, Zhe Lin, Yun Fu, Jimei Yang:
Multimodal Style Transfer via Graph Cuts. 5942-5950 - Ming Lu, Hao Zhao, Anbang Yao, Yurong Chen

, Feng Xu, Li Zhang:
A Closed-Form Solution to Universal Style Transfer. 5951-5960 - Jingyuan Li, Fengxiang He, Lefei Zhang, Bo Du, Dacheng Tao

:
Progressive Reconstruction of Visual Structure for Image Inpainting. 5961-5970
Oral 3.2A
Recognition, Detection, & Re-Identification
- Samarth Sinha, Sayna Ebrahimi, Trevor Darrell:

Variational Adversarial Active Learning. 5971-5980 - Yang Zou, Zhiding Yu, Xiaofeng Liu, B. V. K. Vijaya Kumar

, Jinsong Wang:
Confidence Regularized Self-Training. 5981-5990 - Serim Ryou, Seong-Gyun Jeong, Pietro Perona:

Anchor Loss: Modulating Loss Scale Based on Prediction Difficulty. 5991-6000 - Chengxu Zhuang, Alex Lin Zhai, Daniel Yamins:

Local Aggregation for Unsupervised Learning of Visual Embeddings. 6001-6011 - Zhennan Wang, Wenbin Zou, Chen Xu:

PR Product: A Substitute for Inner Product in Neural Networks. 6012-6021 - Sangdoo Yun, Dongyoon Han, Sanghyuk Chun, Seong Joon Oh, Youngjoon Yoo, Junsuk Choe

:
CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features. 6022-6031 - Tianfu Wu

, Xi Song:
Towards Interpretable Object Detection by Unfolding Latent Structures. 6032-6042 - Jason Kuen, Federico Perazzi, Zhe Lin, Jianming Zhang, Yap-Peng Tan:

Scaling Object Detection by Transferring Classification Weights. 6043-6052 - Yanghao Li, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang:

Scale-Aware Trident Networks for Object Detection. 6053-6062 - Satoshi Kosugi, Toshihiko Yamasaki, Kiyoharu Aizawa:

Object-Aware Instance Labeling for Weakly Supervised Object Detection. 6063-6071 - Lanlan Liu, Michael Muelly, Jia Deng, Tomas Pfister, Li-Jia Li

:
Generative Modeling for Small-Data Object Detection. 6072-6080 - Shafin Rahman

, Salman H. Khan, Nick Barnes
:
Transductive Learning for Zero-Shot Object Detection. 6081-6090 - Seunghyeon Kim, Jaehoon Choi

, Taekyung Kim
, Changick Kim:
Self-Training and Adversarial Background Regularization for Unsupervised Domain Adaptive One-Stage Object Detection. 6091-6100 - Suichan Li, Dapeng Chen, Bin Liu, Nenghai Yu, Rui Zhao:

Memory-Based Neighbourhood Embedding for Visual Recognition. 6101-6110 - Yang Fu

, Yunchao Wei, Guanshuo Wang, Yuqian Zhou, Honghui Shi, Thomas S. Huang:
Self-Similarity Grouping: A Simple Unsupervised Cross Domain Adaptation Approach for Person Re-Identification. 6111-6120 - Zimo Liu, Jingya Wang, Shaogang Gong, Dacheng Tao

, Huchuan Lu:
Deep Reinforcement Active Learning for Human-in-the-Loop Person Re-Identification. 6121-6130 - Pirazh Khorramshahi, Amit Kumar, Neehar Peri, Sai Saketh Rambhatla, Jun-Cheng Chen

, Rama Chellappa:
A Dual-Path Model With Adaptive Attention for Vehicle Re-Identification. 6131-6140 - Zhiheng Ma

, Xing Wei, Xiaopeng Hong, Yihong Gong:
Bayesian Loss for Crowd Count Estimation With Point Supervision. 6141-6150 - Zhi-Qi Cheng

, Jun-Xiu Li, Qi Dai, Xiao Wu, Alexander G. Hauptmann:
Learning Spatial Awareness to Improve Crowd Counting. 6151-6160
Oral 3.2B
Video & Action Understanding
- Peixia Li, Boyu Chen, Wanli Ouyang

, Dong Wang, Xiaoyun Yang, Huchuan Lu:
GradNet: Gradient-Guided Network for Visual Object Tracking. 6161-6170 - Peng Chu, Haibin Ling:

FAMNet: Joint Learning of Feature, Affinity and Multi-Dimensional Assignment for Online Multiple Object Tracking. 6171-6180 


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID