


default search action
ICCV 2019: Seoul, South Korea
- 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019, Seoul, Korea (South), October 27 - November 2, 2019. IEEE 2019, ISBN 978-1-7281-4803-8
Poster 1.1
Deep Learning
- Andreas Rössler, Davide Cozzolino
, Luisa Verdoliva, Christian Riess, Justus Thies, Matthias Nießner:
FaceForensics++: Learning to Detect Manipulated Facial Images. 1-11 - Weixin Lu, Guowei Wan, Yao Zhou, Xiangyu Fu, Pengfei Yuan, Shiyu Song:
DeepVCP: An End-to-End Deep Neural Network for Point Cloud Registration. 12-21 - Matheus Gadelha, Rui Wang, Subhransu Maji:
Shape Reconstruction Using Differentiable Projections and Deep Priors. 22-30 - Måns Larsson, Erik Stenborg, Carl Toft, Lars Hammarstrand, Torsten Sattler, Fredrik Kahl:
Fine-Grained Segmentation Networks: Self-Supervised Segmentation for Improved Long-Term Visual Localization. 31-41 - Luwei Yang, Ziqian Bai, Chengzhou Tang, Honghua Li, Yasutaka Furukawa, Ping Tan:
SANet: Scene Agnostic Network for Camera Localization. 42-51 - Pedro Hermosilla Casajus, Tobias Ritschel, Timo Ropinski
:
Total Denoising: Unsupervised Learning of 3D Point Cloud Cleaning. 52-60 - Rizard Renanda Adhi Pramono, Yie-Tarng Chen, Wen-Hsien Fang:
Hierarchical Self-Attention Network for Action Localization in Videos. 61-70 - Umar Riaz Muhammad, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song
:
Goal-Driven Sequential Data Abstraction. 71-80 - Roberto Annunziata, Christos Sagonas, Jacques Calì:
Jointly Aligning Millions of Images With Deep Penalised Reconstruction Congealing. 81-90 - Seungmin Lee, Dongwan Kim, Namil Kim, Seong-Gyun Jeong:
Drop to Adapt: Learning Discriminative Features for Unsupervised Domain Adaptation. 91-100 - Youngdong Kim, Junho Yim, Juseung Yun, Junmo Kim:
NLNL: Negative Learning for Noisy Labels. 101-110 - Shaokai Ye, Xue Lin, Kaidi Xu, Sijia Liu, Hao Cheng, Jan-Henrik Lambrechts, Huan Zhang, Aojun Zhou, Kaisheng Ma
, Yanzhi Wang:
Adversarial Robustness vs. Model Compression, or Both? 111-120 - Pu Zhao
, Sijia Liu, Pin-Yu Chen, Nghia Hoang, Kaidi Xu, Bhavya Kailkhura, Xue Lin:
On the Design of Black-Box Adversarial Examples by Leveraging Gradient-Free Optimization and Operator Splitting Method. 121-130 - Sagnik Das, Ke Ma, Zhixin Shu, Dimitris Samaras, Roy Shilkrot:
DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks. 131-140 - Xu Zou, Sheng Zhong, Luxin Yan, Xiangyun Zhao, Jiahuan Zhou, Ying Wu:
Learning Robust Facial Landmark Detection via Hierarchical Structured Ensemble. 141-150 - Zitong Yu, Wei Peng
, Xiaobai Li, Xiaopeng Hong, Guoying Zhao:
Remote Heart Rate Measurement From Highly Compressed Facial Videos: An End-to-End Deep Learning Solution With Video Enhancement. 151-160 - Tianyang Shi, Yi Yuan
, Changjie Fan, Zhengxia Zou, Zhenwei Shi, Yong Liu
:
Face-to-Parameter Translation for Game Character Auto-Creation. 161-170 - Guha Balakrishnan
, Adrian V. Dalca, Amy Zhao, John V. Guttag, Frédo Durand, William T. Freeman:
Visual Deprojection: Probabilistic Recovery of Collapsed Dimensions. 171-180 - Yurui Ren, Xiaoming Yu, Ruonan Zhang, Thomas H. Li, Shan Liu, Ge Li:
StructureFlow: Image Inpainting via Structure-Aware Appearance Flow. 181-190 - Md Mahfuzur Rahman Siddiquee
, Zongwei Zhou
, Nima Tajbakhsh, Ruibin Feng, Michael B. Gotway, Yoshua Bengio, Jianming Liang
:
Learning Fixed Points in Generative Adversarial Networks: From Image-to-Image Translation to Disease Detection and Localization. 191-200 - Zhengxia Zou, Wenyuan Li, Tianyang Shi, Zhenwei Shi, Jieping Ye:
Generative Adversarial Training for Weakly Supervised Cloud Matting. 201-210 - Zheng Tang, Milind Naphade, Stan Birchfield, Jonathan Tremblay, William Hodge, Ratnesh Kumar, Shuo Wang, Xiaodong Yang:
PAMTRI: Pose-Aware Multi-Task Learning for Vehicle Re-Identification Using Highly Randomized Synthetic Data. 211-220 - Eirikur Agustsson, Michael Tschannen, Fabian Mentzer, Radu Timofte
, Luc Van Gool:
Generative Adversarial Networks for Extreme Learned Image Compression. 221-231 - Yanbei Chen, Xiatian Zhu, Shaogang Gong:
Instance-Guided Context Rendering for Cross-Domain Person Re-Identification. 232-242 - Mahmoud Afifi, Michael S. Brown:
What Else Can Fool Deep Learning? Addressing Color Constancy Errors on Deep Neural Network Performance. 243-252 - Patrick Ebel, Eduard Trulls, Kwang Moo Yi, Pascal Fua, Anastasiia Mishchuk:
Beyond Cartesian Representations for Local Descriptors. 253-262 - Muhamad Risqi Utama Saputra
, Pedro Porto Buarque de Gusmão, Yasin Almalioglu
, Andrew Markham, Niki Trigoni
:
Distilling Knowledge From a Deep Pose Regressor Network. 263-272 - Kyung-Rae Kim, Whan Choi, Yeong Jun Koh, Seong-Gyun Jeong, Chang-Su Kim
:
Instance-Level Future Motion Estimation in a Single Image Based on Ordinal Regression. 273-282 - Hang Zhou, Ziwei Liu, Xudong Xu, Ping Luo, Xiaogang Wang:
Vision-Infused Deep Audio Inpainting. 283-292 - Zhen Dong, Zhewei Yao, Amir Gholami, Michael W. Mahoney, Kurt Keutzer:
HAWQ: Hessian AWare Quantization of Neural Networks With Mixed-Precision. 293-302 - Jun-Ho Choi, Huan Zhang, Jun-Hyuk Kim
, Cho-Jui Hsieh, Jong-Seok Lee:
Evaluating Robustness of Deep Image Super-Resolution Against Adversarial Attacks. 303-311 - Kibok Lee, Kimin Lee, Jinwoo Shin, Honglak Lee:
Overcoming Catastrophic Forgetting With Unlabeled Data in the Wild. 312-321 - Yisen Wang, Xingjun Ma
, Zaiyi Chen, Yuan Luo, Jinfeng Yi, James Bailey:
Symmetric Cross Entropy for Robust Learning With Noisy Labels. 322-330 - Avinash Ravichandran, Rahul Bhotika, Stefano Soatto:
Few-Shot Learning With Embedded Class Models and Shot-Free Meta Training. 331-339 - Maneet Singh, Shruti Nagpal, Richa Singh, Mayank Vatsa:
Dual Directed Capsule Network for Very Low Resolution Image Recognition. 340-349 - Xiangyun Zhao, Yi Yang, Feng Zhou, Xiao Tan, Yuchen Yuan, Yingze Bao, Ying Wu:
Recognizing Part Attributes With Insufficient Data. 350-360 - Jiaxin Li, Gim Hee Lee:
USIP: Unsupervised Stable Interest Point Detection From 3D Point Clouds. 361-370 - Binghui Chen, Weihong Deng
, Jiani Hu:
Mixed High-Order Attention Network for Person Re-Identification. 371-381 - Rodrigo Ferreira Berriel, Stéphane Lathuilière, Moin Nabi, Tassilo Klein
, Thiago Oliveira-Santos, Nicu Sebe, Elisa Ricci
:
Budget-Aware Adapters for Multi-Domain Learning. 382-391 - Tuong Do
, Huy Tran, Thanh-Toan Do, Erman Tjiputra, Quang D. Tran
:
Compact Trilinear Interaction for Visual Question Answering. 392-401 - Ishan Nigam, Pavel Tokmakov, Deva Ramanan
:
Towards Latent Attribute Discovery From Triplet Similarities. 402-410 - Utkarsh Mall, Kevin Matzen, Bharath Hariharan, Noah Snavely, Kavita Bala
:
GeoStyle: Discovering Fashion Trends and Events. 411-420 - Haichao Zhang, Jianyu Wang:
Towards Adversarially Robust Object Detection. 421-430
Recognition
- Junli Zhao
, Xin Qi, Chengfeng Wen, Na Lei, Xianfeng Gu
:
Automatic and Robust Skull Registration Based on Discrete Uniformization. 431-440 - Zhimao Peng, Zechao Li, Junge Zhang, Yan Li, Guo-Jun Qi
, Jinhui Tang
:
Few-Shot Image Recognition With Knowledge Transfer. 441-449 - Michael Wray
, Gabriela Csurka, Diane Larlus, Dima Damen
:
Fine-Grained Action Retrieval Through Multiple Parts-of-Speech Embeddings. 450-459 - Peng Wang, Bingliang Jiao, Lu Yang, Yifei Yang, Shizhou Zhang, Wei Wei, Yanning Zhang:
Vehicle Re-Identification in Aerial Imagery: Dataset and Approach. 460-469 - Krishna Regmi, Mubarak Shah
:
Bridging the Domain Gap for Ground-to-Aerial Image Matching. 470-479 - Mehran Khodabandeh, Arash Vahdat, Mani Ranjbar, William G. Macready:
A Robust Learning Approach to Domain Adaptive Object Detection. 480-490 - Yin Bi, Aaron Chadha, Alhabib Abbas, Eirina Bourtsoulatze, Yiannis Andreopoulos:
Graph-Based Object Classification for Neuromorphic Vision Sensing. 491-501 - Jiwoong Choi
, Dayoung Chun, Hyun Kim, Hyuk-Jae Lee:
Gaussian YOLOv3: An Accurate and Fast Object Detector Using Localization Uncertainty for Autonomous Driving. 502-511 - Lezi Wang, Ziyan Wu, Srikrishna Karanam, Kuan-Chuan Peng, Rajat Vikram Singh, Bo Liu, Dimitris N. Metaxas:
Sharpen Focus: Learning With Attention Separability and Consistency. 512-521 - Tianshui Chen, Muxin Xu, Xiaolu Hui, Hefeng Wu, Liang Lin:
Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition. 522-531 - Sergey Zakharov, Wadim Kehl, Slobodan Ilic:
DeceptionNet: Network-Driven Domain Randomization. 532-541 - Jiaxu Miao, Yu Wu
, Ping Liu, Yuhang Ding, Yi Yang:
Pose-Guided Feature Alignment for Occluded Person Re-Identification. 542-551 - Tianyuan Yu, Da Li, Yongxin Yang, Timothy M. Hospedales, Tao Xiang:
Robust Person Re-Identification by Modelling Feature Uncertainty. 552-561 - Arulkumar Subramaniam, Athira M. Nambiar, Anurag Mittal:
Co-Segmentation Inspired Attention Networks for Video-Based Person Re-Identification. 562-572 - Huizi Mao, Xiaodong Yang, Bill Dally:
A Delay Metric for Video Object Detection: What Average Precision Fails to Tell. 573-582 - Eden Belouadah, Adrian Popescu:
IL2M: Class Incremental Learning With Dual Memory. 583-592
Segmentation, Grouping, & Shape
- Zhen Zhu
, Mengdu Xu, Song Bai, Tengteng Huang, Xiang Bai:
Asymmetric Non-Local Neural Networks for Semantic Segmentation. 593-602 - Zilong Huang, Xinggang Wang
, Lichao Huang, Chang Huang, Yunchao Wei, Wenyu Liu:
CCNet: Criss-Cross Attention for Semantic Segmentation. 603-612 - Shousheng Luo, Xue-Cheng Tai
, Limei Huo, Yang Wang, Roland Glowinski:
Convex Shape Prior for Multi-Object Segmentation Using a Single Level Set Function. 613-621 - Khoi Nguyen, Sinisa Todorovic:
Feature Weighting and Boosting for Few-Shot Segmentation. 622-631 - Niv Haim, Nimrod Segol, Heli Ben-Hamu, Haggai Maron, Yaron Lipman:
Surface Networks via General Covers. 632-641 - Naiyu Gao
, Yanhu Shan, Yupei Wang, Xin Zhao
, Yinan Yu, Ming Yang
, Kaiqi Huang:
SSAP: Single-Shot Instance Segmentation With Affinity Pyramid. 642-651 - Sifei Liu
, Xueting Li, Varun Jampani, Shalini De Mello, Jan Kautz:
Learning Propagation for Arbitrarily-Structured Data. 652-661 - Jun Hao Liew, Scott Cohen, Brian L. Price, Long Mai, Sim Heng Ong, Jiashi Feng:
MultiSeg: Semantically Meaningful, Scale-Diverse Segmentations From Minimal User Input. 662-670 - Federica Arrigoni
, Tomás Pajdla:
Robust Motion Segmentation From Pairwise Matches. 671-681 - Haoshu Fang, Jianhua Sun, Runzhong Wang
, Minghao Gou, Yong-Lu Li, Cewu Lu:
InstaBoost: Boosting Instance Segmentation via Probability Map Guided Copy-Pasting. 682-691
Face & Body
- Mei Wang, Weihong Deng
, Jiani Hu, Xunqiang Tao, Yaohai Huang:
Racial Faces in the Wild: Reducing Racial Bias by Information Maximization Adaptation Network. 692-702 - Jingxiao Zheng, Ruichi Yu, Jun-Cheng Chen
, Boyu Lu, Carlos Domingo Castillo, Rama Chellappa:
Uncertainty Modeling of Contextual-Connections Between Tracklets for Unconstrained Video-Based Face Recognition. 703-712 - Xingxuan Zhang, Feng Cheng, Shilin Wang:
Spatio-Temporal Fusion Based Convolutional Sequence Learning for Lip Reading. 713-722 - Yu Cheng, Bo Yang, Bo Wang, Wending Yan, Robby T. Tan:
Occlusion-Aware Networks for 3D Human Pose Estimation in Video. 723-732 - Yong Zhang, Haiyong Jiang, Baoyuan Wu, Yanbo Fan, Qiang Ji:
Context-Aware Feature and Label Fusion for Facial Action Unit Intensity Estimation With Partially Labeled Data. 733-742 - Chaoyang Wang, Chen Kong, Simon Lucey
:
Distill Knowledge From NRSfM for Weakly Supervised 3D Pose Learning. 743-752 - Yuan Yao, Yasamin Jafarian, Hyun Soo Park:
MONET: Multiview Semi-Supervised Keypoint Detection via Epipolar Divergence. 753-762 - Gilwoo Lee, Zhiwei Deng, Shugao Ma, Takaaki Shiratori, Siddhartha S. Srinivasa, Yaser Sheikh:
Talking With Hands 16.2M: A Large-Scale Dataset of Synchronized Body-Finger Motion and Audio for Conversational Motion Analysis and Synthesis. 763-772 - Lingxue Song, Dihong Gong, Zhifeng Li, Changsong Liu, Wei Liu
:
Occlusion Robust Face Recognition Based on Mask Learning With Pairwise Differential Siamese Network. 773-782 - Xuanyi Dong, Yi Yang:
Teacher Supervises Students How to Learn From Partially Labeled Images for Facial Landmark Detection. 783-792 - Fu Xiong, Boshen Zhang, Yang Xiao, Zhiguo Cao
, Taidong Yu, Joey Tianyi Zhou, Junsong Yuan:
A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation From a Single Depth Image. 793-802 - Georgios Pavlakos, Nikos Kolotouros, Kostas Daniilidis:
TexturePose: Supervising Human Mesh Estimation With Texture Consistency. 803-812 - Christian Zimmermann, Duygu Ceylan, Jimei Yang, Bryan C. Russell, Max J. Argus, Thomas Brox:
FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape From Single RGB Images. 813-822 - Nitin Saini, Eric Price, Rahul Tallamraju, Raffi Enficiaud, Roman Ludwig, Igor Martinovic, Aamir Ahmad, Michael J. Black:
Markerless Outdoor Human Motion Capture Using Multiple Autonomous Micro Aerial Vehicles. 823-832
Action & Video
- Srijan Das, Rui Dai
, Michal Koperski, Luca Minciullo, Lorenzo Garattoni, François Brémond, Gianpiero Francesca:
Toyota Smarthome: Real-World Activities of Daily Living. 833-842 - Penghao Zhou, Mingmin Chi:
Relation Parsing Neural Network for Human-Object Interaction Detection. 843-851 - Rohit Girdhar, Du Tran, Lorenzo Torresani, Deva Ramanan
:
DistInit: Learning Video Representations Without a Single Labeled Video. 852-861 - Fadime Sener, Angela Yao:
Zero-Shot Anticipation for Instructional Activities. 862-871 - Tianhong Li, Lijie Fan, Mingmin Zhao
, Yingcheng Liu, Dina Katabi:
Making the Invisible Visible: Action Recognition Through Walls and Occlusions. 872-881 - Xudong Xu, Bo Dai, Dahua Lin:
Recursive Visual Sound Separation Using Minus-Plus Net. 882-891
Motion & Tracking
- Fitsum A. Reda, Deqing Sun, Aysegul Dundar, Mohammad Shoeybi, Guilin Liu, Kevin J. Shih, Andrew Tao, Jan Kautz, Bryan Catanzaro:
Unsupervised Video Interpolation Using Cycle Consistency. 892-900 - Tao Wang, Haibin Ling, Congyan Lang, Songhe Feng, Xiaohui Hou:
Deformable Surface Tracking by Graph Matching. 901-910 - Janghoon Choi
, Junseok Kwon, Kyoung Mu Lee:
Deep Meta Learning for Real-Time Target-Aware Visual Tracking. 911-920 - Chiho Choi, Behzad Dariush:
Looking to Relations for Future Trajectory Forecast. 921-930 - Zhao Yang, Qiang Wang, Luca Bertinetto, Song Bai, Weiming Hu, Philip H. S. Torr:
Anchor Diffusion for Unsupervised Video Object Segmentation. 931-940 - Philipp Bergmann, Tim Meinhardt, Laura Leal-Taixé:
Tracking Without Bells and Whistles. 941-951
Scene Understanding
- Zhaoyi Yan, Yuchen Yuan, Wangmeng Zuo, Xiao Tan, Yezhen Wang, Shilei Wen, Errui Ding:
Perspective-Guided Convolution Networks for Crowd Counting. 952-961 - Yichao Zhou, Haozhi Qi, Yi Ma:
End-to-End Wireframe Parsing. 962-971 - Yoshikatsu Nakajima, Byeongkeun Kang, Hideo Saito, Kris Kitani:
Incremental Class Discovery for Semantic Segmentation With RGBD Sensing. 972-981 - Liang Du, Jingang Tan, Hongye Yang, Jianfeng Feng, Xiangyang Xue, Qibao Zheng, Xiaoqing Ye, Xiaolin Zhang:
SSF-DAN: Separated Semantic Feature Based Domain Adaptation Network for Semantic Segmentation. 982-991 - Nicholas Weir, David Lindenbaum, Alexei Bastidas, Adam Van Etten, Varun Kumar Vijay, Sean McPherson, Jacob Shermeyer, Hanlin Tang:
SpaceNet MVOI: A Multi-View Overhead Imagery Dataset. 992-1001 - Vishwanath Sindagi, Vishal M. Patel:
Multi-Level Bottom-Top and Top-Bottom Feature Fusion for Crowd Counting. 1002-1012 - Yuenan Hou, Zheng Ma, Chunxiao Liu, Chen Change Loy:
Learning Lightweight Lane Detection CNNs by Self Attention Distillation. 1013-1021 - Daniel Gordon, Abhishek Kadian, Devi Parikh, Judy Hoffman
, Dhruv Batra:
SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation. 1022-1031
3D From Multiview & Sensors
- Wentao Cheng, Weisi Lin, Kan Chen, Xinfeng Zhang:
Cascaded Parallel Filtering for Memory-Efficient Image-Based Localization. 1032-1041 - Chao Wen, Yinda Zhang, Zhuwen Li, Yanwei Fu
:
Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation. 1042-1051 - Fotios Logothetis, Roberto Mecca, Roberto Cipolla:
A Differential Volumetric Approach to Multi-View Photometric Stereo. 1052-1061 - Viktor Larsson, Torsten Sattler, Zuzana Kukelova
, Marc Pollefeys
:
Revisiting Radial Distortion Absolute Pose. 1062-1071 - Tobias Würfl, André Aichert, Nicole Maass, Frank Dennerlein, Andreas K. Maier:
Estimating the Fundamental Matrix Without Point Correspondences With Application to Transmission Imaging. 1072-1081 - Devesh Adlakha, Adlane Habed, Fabio Morbidi, Cédric Demonceaux
, Michel de Mathelin:
QUARCH: A New Quasi-Affine Reconstruction Stratum From Vague Relative Camera Orientation Knowledge. 1082-1090 - Dániel Baráth, Zuzana Kukelova
:
Homography From Two Orientation- and Scale-Covariant Features. 1091-1099
Applications. Medical, & Robotics
- Hyukryul Yang, Hao Ouyang, Vladlen Koltun, Qifeng Chen:
Hiding Video in Audio via Reversible Generative Models. 1100-1109 - Yong Zhao, Shibiao Xu, Shuhui Bu, Hongkai Jiang, Pengcheng Han:
GSLAM: A General SLAM Framework and Benchmark. 1110-1120 - Sang Jun Lee
, Sung Soo Hwang:
Elaborate Monocular Point and Line SLAM With Robust Initialization. 1121-1129 - Jia Wan
, Antoni B. Chan
:
Adaptive Density Map Generation for Crowd Counting. 1130-1139 - Xingxu Yao, Dongyu She, Sicheng Zhao, Jie Liang, Yu-Kun Lai, Jufeng Yang:
Attention-Aware Polarity Sensitive Embedding for Affective Image Retrieval. 1140-1150 - Chi Zhan, Dongyu She, Sicheng Zhao, Ming-Ming Cheng
, Jufeng Yang:
Zero-Shot Emotion Recognition via Affective Structural Embedding. 1151-1160 - Haoye Dong
, Xiaodan Liang, Xiaohui Shen, Bowen Wu, Bing-Cheng Chen, Jian Yin:
FW-GAN: Flow-Navigated Warping GAN for Video Virtual Try-On. 1161-1170 - Arnab Ghosh, Richard Zhang, Puneet K. Dokania, Oliver Wang, Alexei A. Efros
, Philip H. S. Torr, Eli Shechtman:
Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation. 1171-1180 - Shi Chen
, Qi Zhao:
Attention-Based Autism Spectrum Disorder Screening With Privileged Modality. 1181-1190 - Jun-Tae Lee, Chang-Su Kim
:
Image Aesthetic Assessment Based on Pairwise Comparison A Unified Approach to Score Regression, Binary Classification, and Personalization. 1191-1200 - Zhenyu Wu, Karthik Suresh, Priya Narayanan, Hongyu Xu, Heesung Kwon, Zhangyang Wang:
Delving Into Robust Object Detection From Unmanned Aerial Vehicles: A Deep Nuisance Disentanglement Approach. 1201-1210 - Adnan Siraj Rakin, Zhezhi He, Deliang Fan:
Bit-Flip Attack: Crushing Neural Network With Progressive Bit Search. 1211-1220 - Vishwanath Sindagi, Rajeev Yasarla, Vishal M. Patel:
Pushing the Frontiers of Unconstrained Crowd Counting: New Dataset and Benchmark Method. 1221-1231 - Yi Liu, Qiang Zhang, Dingwen Zhang, Jungong Han:
Employing Deep Part-Object Relationships for Salient Object Detection. 1232-1241 - Vladimiros Sterzentsenko, Leonidas Saroglou, Anargyros Chatzitofis
, Spiros Thermos, Nikolaos Zioulis
, Alexandros Doumanoglou, Dimitrios Zarpalas, Petros Daras
:
Self-Supervised Deep Depth Denoising. 1242-1251 - Hanxiao Wang
, Venkatesh Saligrama
, Stan Sclaroff, Vitaly Ablavsky
:
Cost-Aware Fine-Grained Recognition for IoTs Based on Sequential Fixations. 1252-1261 - Ruichi Yu, Hongcheng Wang, Ang Li, Jingxiao Zheng, Vlad I. Morariu, Larry Davis:
Layout-Induced Video Representation for Recognizing Agent-in-Place Actions. 1262-1272 - Trong-Nguyen Nguyen, Jean Meunier:
Anomaly Detection in Video Sequence With Appearance-Motion Correspondence. 1273-1283
Oral 1.2A
Architectures, Multi-Task Learning, Domain Adaptation
- Saining Xie, Alexander Kirillov, Ross B. Girshick, Kaiming He:
Exploring Randomly Wired Neural Networks for Image Recognition. 1284-1293 - Xin Chen, Lingxi Xie, Jun Wu, Qi Tian:
Progressive Differentiable Architecture Search: Bridging the Depth Gap Between Search and Evaluation. 1294-1303 - Xiawu Zheng, Rongrong Ji
, Lang Tang, Baochang Zhang, Jianzhuang Liu, Qi Tian:
Multinomial Distribution Learning for Effective Neural Architecture Search. 1304-1313 - Andrew Howard, Ruoming Pang, Hartwig Adam, Quoc V. Le, Mark Sandler, Bo Chen, Weijun Wang, Liang-Chieh Chen, Mingxing Tan, Grace Chu, Vijay Vasudevan, Yukun Zhu:
Searching for MobileNetV3. 1314-1324 - Markus Nagel, Mart van Baalen, Tijmen Blankevoort, Max Welling:
Data-Free Quantization Through Weight Equalization and Bias Correction. 1325-1334 - Laurie Bose, Piotr Dudek
, Jianing Chen, Stephen J. Carey, Walterio W. Mayol-Cuevas
:
A Camera That CNNs: Towards Embedded Neural Networks on Pixel Processor Arrays. 1335-1344 - Xiao Jin, Baoyun Peng
, Yichao Wu, Yu Liu, Jiaheng Liu, Ding Liang, Junjie Yan, Xiaolin Hu:
Knowledge Distillation via Route Constrained Optimization. 1345-1354 - Mary Phuong, Christoph Lampert:
Distillation-Based Training for Multi-Exit Architectures. 1355-1364 - Frederick Tung, Greg Mori:
Similarity-Preserving Knowledge Distillation. 1365-1374 - Gjorgji Strezoski, Nanne van Noord
, Marcel Worring
:
Many Task Learning With Task Routing. 1375-1384 - Felix J. S. Bragman, Ryutaro Tanno, Sébastien Ourselin
, Daniel C. Alexander
, Manuel Jorge Cardoso
:
Stochastic Filter Groups for Multi-Task CNNs: Learning Specialist and Generalist Convolution Kernels. 1385-1394 - Anh Tuan Tran, Cuong V. Nguyen, Tal Hassner:
Transferability and Hardness of Supervised Classification Tasks. 1395-1405 - Xingchao Peng, Qinxun Bai, Xide Xia, Zijun Huang, Kate Saenko
, Bo Wang
:
Moment Matching for Multi-Source Domain Adaptation. 1406-1415 - Safa Cicek, Stefano Soatto:
Unsupervised Domain Adaptation via Regularized Conditional Alignment. 1416-1425 - Ruijia Xu, Guanbin Li, Jihan Yang, Liang Lin:
Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation. 1426-1435 - Jogendra Nath Kundu, Nishank Lakkakula, Venkatesh Babu Radhakrishnan
:
UM-Adapt: Unsupervised Multi-Task Adaptation Using Adversarial Cross-Task Distillation. 1436-1445 - Da Li
, Jianshu Zhang, Yongxin Yang, Cong Liu, Yi-Zhe Song
, Timothy M. Hospedales:
Episodic Training for Domain Generalization. 1446-1455 - Yi-Hsuan Tsai, Kihyuk Sohn, Samuel Schulter, Manmohan Chandraker:
Domain Adaptation for Structured Output via Discriminative Patch Representations. 1456-1465 - Qin Wang, Wen Li, Luc Van Gool:
Semi-Supervised Learning by Augmented Distribution Alignment. 1466-1475 - Lucas Beyer, Xiaohua Zhai, Avital Oliver, Alexander Kolesnikov:
S4L: Self-Supervised Semi-Supervised Learning. 1476-1485
Oral 1.2B
Multi-View Geometry, 3D Scene Understanding
- Pablo Speciale, Johannes L. Schönberger, Sudipta N. Sinha, Marc Pollefeys
:
Privacy Preserving Image Queries for Camera Localization. 1486-1496 - Songyou Peng, Peter F. Sturm:
Calibration Wizard: A Guidance System for Camera Calibration Based on Modelling Geometric and Corner Uncertainty. 1497-1505 - Tobias Gruber
, Frank D. Julca-Aguilar, Mario Bijelic, Felix Heide:
Gated2Depth: Real-Time Dense Lidar From Gated Images. 1506-1516 - Andrea Nicastro, Ronald Clark, Stefan Leutenegger:
X-Section: Cross-Section Prediction for Enhanced RGB-D Fusion. 1517-1526 - Stepan Tulyakov, François Fleuret, Martin Kiefel, Peter V. Gehler, Michael Hirsch:
Learning an Event Sequence Embedding for Dense Event-Based Deep Stereo. 1527-1537 - Rui Chen, Songfang Han
, Jing Xu, Hao Su:
Point-Based Multi-View Stereo Network. 1538-1547 - Xiangyu Xu
, Enrique Dunn
:
Discrete Laplace Operator Estimation for Dynamic 3D Reconstruction. 1548-1557 - Chen Kong, Simon Lucey
:
Deep Non-Rigid Structure From Motion. 1558-1567 - Carlos Esteves
, Yinshuang Xu, Christine Allen-Blanchette, Kostas Daniilidis:
Equivariant Multi-View Networks. 1568-1577 - Jiageng Mao
, Xiaogang Wang, Hongsheng Li
:
Interpolated Convolutional Networks for 3D Point Cloud Understanding. 1578-1587 - Mikaela Angelina Uy, Quang-Hieu Pham
, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung:
Revisiting Point Cloud Classification: A New Benchmark Dataset and Classification Model on Real-World Data. 1588-1597 - Tianhang Zheng
, Changyou Chen, Junsong Yuan, Bo Li, Kui Ren:
PointCloud Saliency Maps. 1598-1606 - Zhiyuan Zhang
, Binh-Son Hua, Sai-Kit Yeung:
ShellNet: Efficient Point Cloud Convolutional Neural Networks Using Concentric Shells Statistics. 1607-1616 - Jean-Michel Roufosse, Abhishek Sharma, Maks Ovsjanikov:
Unsupervised Deep Learning for Structured Shape Matching. 1617-1627 - Nadav Dym, Shahar Z. Kovalsky:
Linearly Converging Quasi Branch and Bound Algorithms for Global Rigid Registration. 1628-1636 - Zhipeng Cai, Tat-Jun Chin, Vladlen Koltun:
Consensus Maximization Tree Search Revisited. 1637-1645 - Haoang Li, Ji Zhao
, Jean-Charles Bazin, Wen Chen, Zhe Liu, Yunhui Liu:
Quasi-Globally Optimal and Efficient Vanishing Point Estimation in Manhattan World. 1646-1654 - Yaqing Ding
, Jian Yang, Jean Ponce, Hui Kong
:
An Efficient Solution to the Homography-Based Relative Pose Problem With a Common Reference Direction. 1655-1664 - Heng Yang, Luca Carlone:
A Quaternion-Based Certifiably Optimal Solution to the Wahba Problem With Outliers. 1665-1674 - Timothy Duff
, Kathlén Kohn
, Anton Leykin, Tomás Pajdla:
PLMP - Point-Line Minimal Problems in Complete Multi-View Visibility. 1675-1684
Poster 1.2
Deep Learning
- Jian Zhang, Chenglong Zhao, Bingbing Ni, Minghao Xu, Xiaokang Yang:
Variational Few-Shot Learning. 1685-1694 - Sankha Subhra Mullick, Shounak Datta, Swagatam Das
:
Generative Adversarial Minority Oversampling. 1695-1704 - Dong Gong
, Lingqiao Liu
, Vuong Le, Budhaditya Saha, Moussa Reda Mansour, Svetha Venkatesh, Anton van den Hengel
:
Memorizing Normality to Detect Anomaly: Memory-Augmented Deep Autoencoder for Unsupervised Anomaly Detection. 1705-1714 - Zuoyue Li, Jan Dirk Wegner, Aurélien Lucchi
:
Topological Map Extraction From Overhead Images. 1715-1724 - Haokui Zhang, Ying Li, Yuanzhouhan Cao, Yu Liu, Chunhua Shen, Youliang Yan:
Exploiting Temporal Consistency for Real-Time Video Depth Estimation. 1725-1734 - Hang Zhao, Chuang Gan, Wei-Chiu Ma, Antonio Torralba:
The Sound of Motions. 1735-1744 - Youngjoo Jo, Jongyoul Park:
SC-FEGAN: Face Editing Generative Adversarial Network With User's Sketch and Color. 1745-1753 - Hongwei Ge, Zehang Yan, Kai Zhang, Mingde Zhao, Liang Sun:
Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style. 1754-1763 - Zhuoyuan Chen, Kavya Srinet, Charles R. Qi, Haoqi Fan, Jerry Ma, Larry Zitnick, Demi Guo, Tong Xiao, Saining Xie, Xinlei Chen, Arthur Szlam, Shubham Tulsiani, Haonan Yu, Jonathan Gray:
Order-Aware Generative Modeling Using the 3D-Craft Dataset. 1764-1773 - Lingbo Liu, Zhilin Qiu, Guanbin Li, Shufan Liu, Wanli Ouyang
, Liang Lin:
Crowd Counting With Deep Structured Scale Integration Network. 1774-1783 - Tomer Cohen, Lior Wolf:
Bidirectional One-Shot Unsupervised Domain Mapping. 1784-1792 - A. J. Piergiovanni, Anelia Angelova, Alexander Toshev, Michael S. Ryoo:
Evolving Space-Time Neural Architectures for Videos. 1793-1802 - Jiahui Yu, Thomas S. Huang:
Universally Slimmable Networks and Improved Training Techniques. 1803-1811 - Tonmoy Saikia, Yassine Marrakchi, Arber Zela
, Frank Hutter, Thomas Brox:
AutoDispNet: Improving Disparity Estimation With AutoML. 1812-1823 - Gidi Littwin, Lior Wolf:
Deep Meta Functionals for Shape Representation. 1824-1833 - Yu Liu, Jihao Liu, Xiaogang Wang, Ailing Zeng:
Differentiable Kernel Evolution. 1834-1843 - Mikolaj Binkowski, R. Devon Hjelm, Aaron C. Courville:
Batch Weight for Domain Adaptation With Mass Shift. 1844-1853 - HyunJae Lee, Hyo-Eun Kim, Hyeonseob Nam:
SRM: A Style-Based Recalibration Module for Convolutional Neural Networks. 1854-1862 - Xingang Pan
, Xiaohang Zhan, Jianping Shi, Xiaoou Tang, Ping Luo:
Switchable Whitening for Deep Representation Learning. 1863-1871 - Adria Ruiz, Jakob Verbeek:
Adaptative Inference Cost With Convolutional Neural Mixture Models. 1872-1881 - Ilija Radosavovic, Justin Johnson, Saining Xie, Wan-Yen Lo, Piotr Dollár:
On Network Design Spaces for Visual Recognition. 1882-1890 - Hao Li, Hong Zhang, Xiaojuan Qi, Ruigang Yang
, Gao Huang:
Improved Techniques for Training Adaptive Deep Networks. 1891-1900 - Yunyang Xiong, Ronak Mehta, Vikas Singh:
Resource Constrained Neural Network Architecture Search: Will a Submodularity Assumption Help? 1901-1910 - Xiaohan Ding, Yuchen Guo, Guiguang Ding, Jungong Han:
ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks. 1911-1920 - Byeongho Heo, Jeesoo Kim, Sangdoo Yun, Hyojin Park, Nojun Kwak, Jin Young Choi:
A Comprehensive Overhaul of Feature Distillation. 1921-1930
Recognition
- Yew Siang Tang, Gim Hee Lee:
Transferable Semi-Supervised 3D Object Detection From RGB-D Data. 1931-1940 - Sergey Zakharov, Ivan Shugurov, Slobodan Ilic:
DPOD: 6D Pose Object Detector and Refiner. 1941-1950 - Zetong Yang, Yanan Sun
, Shu Liu, Xiaoyong Shen, Jiaya Jia
:
STD: Sparse-to-Dense 3D Object Detector for Point Cloud. 1951-1960 - Hang Zhou, Kejiang Chen
, Weiming Zhang, Han Fang, Wenbo Zhou, Nenghai Yu:
DUP-Net: Denoiser and Upsampler Network for 3D Adversarial Point Clouds Defense. 1961-1970 - Tiancai Wang, Rao Muhammad Anwer
, Hisham Cholakkal
, Fahad Shahbaz Khan
, Yanwei Pang, Ling Shao
:
Learning Rich Features at High-Speed for Single-Shot Object Detection. 1971-1980 - Julia Peyre, Josef Sivic, Ivan Laptev, Cordelia Schmid:
Detecting Unseen Visual Relations Using Analogies. 1981-1990 - Andrea Simonelli, Samuel Rota Bulò, Lorenzo Porzi, Manuel Lopez-Antequera, Peter Kontschieder:
Disentangling Monocular 3D Object Detection. 1991-1999 - Boyuan Jiang, Mengmeng Wang, Weihao Gan, Wei Wu, Junjie Yan:
STM: SpatioTemporal and Motion Encoding for Action Recognition. 2000-2009 - Shuaiyi Huang, Qiuyue Wang, Songyang Zhang
, Shipeng Yan, Xuming He:
Dynamic Context Correspondence Network for Semantic Alignment. 2010-2019 - Akshayvarun Subramanya, Vipin Pillai, Hamed Pirsiavash:
Fooling Network Interpretation in Image Classification. 2020-2029 - Yinan Zhao, Brian L. Price, Scott Cohen, Danna Gurari:
Unconstrained Foreground Object Search. 2030-2039 - Jianwei Yang, Zhile Ren, Mingze Xu, Xinlei Chen, David J. Crandall, Devi Parikh, Dhruv Batra:
Embodied Amodal Recognition: Learning to Move to Perceive Objects. 2040-2050 - Kaiyu Yang, Olga Russakovsky
, Jia Deng:
SpatialSense: An Adversarially Crowdsourced Benchmark for Spatial Relation Recognition. 2051-2060 - Xinlei Chen, Ross B. Girshick, Kaiming He, Piotr Dollár:
TensorMask: A Foundation for Dense Object Segmentation. 2061-2069 - Peng-Tao Jiang, Qibin Hou, Yang Cao, Ming-Ming Cheng
, Yunchao Wei, Hongkai Xiong
:
Integral Object Mining via Online Attention Accumulation. 2070-2079
Segmentation, Grouping, & Shape
- Vladislav Golyanik, Christian Theobalt
, Didier Stricker
:
Accelerated Gravitational Point Set Alignment With Altered Physical Laws. 2080-2089 - Minghao Chen, Hongyang Xue, Deng Cai:
Domain Adaptation for Semantic Segmentation With Maximum Squares Loss. 2090-2099 - Xiangyu Yue, Yang Zhang, Sicheng Zhao, Alberto L. Sangiovanni-Vincentelli, Kurt Keutzer, Boqing Gong:
Domain Randomization and Pyramid Consistency: Simulation-to-Real Generalization Without Accessing Target Domain Data. 2100-2110 - Yi He, Jiayuan Shi, Chuan Wang, Haibin Huang, Jiaming Liu, Guanbin Li, Risheng Liu, Jue Wang
:
Semi-Supervised Skin Detection by Network With Mutual Guidance. 2111-2120 - Zuxuan Wu, Xin Wang, Joseph Gonzalez
, Tom Goldstein, Larry Davis:
ACE: Adapting to Changing Environments for Semantic Segmentation. 2121-2130 - Dmitrii Marin
, Zijian He, Peter Vajda, Priyam Chatterjee, Sam S. Tsai, Fei Yang, Yuri Boykov:
Efficient Segmentation: Learning Downsampling Near Semantic Boundaries. 2131-2141 - Wei Wang, Kaicheng Yu, Joachim Hugonot, Pascal Fua, Mathieu Salzmann:
Recurrent U-Net for Resource-Constrained Segmentation. 2142-2151 - Krzysztof Lis
, Krishna Kanth Nakka, Pascal Fua, Mathieu Salzmann:
Detecting the Unexpected via Image Resynthesis. 2152-2161
3D From Single View & RGBD
- Jamie Watson, Michael Firman, Gabriel J. Brostow, Daniyar Turmukhambetov:
Self-Supervised Monocular Depth Hints. 2162-2171 - Daeyun Shin, Zhile Ren, Erik B. Sudderth
, Charless C. Fowlkes:
3D Scene Reconstruction With Multi-Layer Depth and Epipolar Transformers. 2172-2182 - Tom van Dijk
, Guido de Croon:
How Do Neural Networks See Depth in Single Images? 2183-2191 - Zhi Li, Xuan Wang, Fei Wang
, Peilin Jiang:
On Boosting Single-Frame 3D Human Pose Estimation via Monocular Videos. 2192-2201 - Nilesh Kulkarni, Shubham Tulsiani, Abhinav Gupta:
Canonical Surface Mapping via Geometric Cycle Consistency. 2202-2211 - Nilesh Kulkarni, Ishan Misra, Shubham Tulsiani, Abhinav Gupta:
3D-RelNet: Joint Object and Relational Network for 3D Prediction. 2212-2221 - Alexander Grabner, Peter M. Roth, Vincent Lepetit:
GP2C: Geometric Projection Parameter Consensus for Joint 3D Pose and Focal Length Estimation in the Wild. 2222-2231
Face & Body
- Valentin Gabeur, Jean-Sébastien Franco, Xavier Martin, Cordelia Schmid, Grégory Rogez:
Moulding Humans: Non-Parametric 3D Human Shape Estimation From Single Images. 2232-2241 - Albert Pumarola, Jordi Sanchez, Gary P. T. Choi
, Alberto Sanfeliu, Francesc Moreno:
3DPeople: Modeling the Geometry of Dressed Humans. 2242-2251 - Nikos Kolotouros, Georgios Pavlakos, Michael J. Black, Kostas Daniilidis:
Learning to Reconstruct 3D Human Pose and Shape via Model-Fitting in the Loop. 2252-2261 - Hai Ci, Chunyu Wang, Xiaoxuan Ma
, Yizhou Wang:
Optimizing Network Structure for 3D Human Pose Estimation. 2262-2271 - Yujun Cai
, Liuhao Ge, Jun Liu
, Jianfei Cai, Tat-Jen Cham
, Junsong Yuan, Nadia Magnenat-Thalmann
:
Exploiting Spatial-Temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks. 2272-2281 - Mohamed Hassan, Vasileios Choutas, Dimitrios Tzionas, Michael J. Black:
Resolving 3D Human Pose Ambiguities With 3D Scene Constraints. 2282-2292 - Thiemo Alldieck
, Gerard Pons-Moll, Christian Theobalt
, Marcus A. Magnor
:
Tex2Shape: Detailed Full Human Body Geometry From a Single Image. 2293-2303 - Shunsuke Saito, Zeng Huang, Ryota Natsume, Shigeo Morishima
, Hao Li
, Angjoo Kanazawa:
PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization. 2304-2314 - Xiaoxing Zeng, Xiaojiang Peng, Yu Qiao
:
DF2Net: A Dense-Fine-Finer Network for Detailed 3D Face Reconstruction. 2315-2324 - Saurabh Sharma, Pavan Teja Varigonda, Prashast Bindal, Abhishek Sharma, Arjun Jain:
Monocular 3D Human Pose Estimation by Generation and Ordinal Ranking. 2325-2334 - Linlin Yang, Shile Li, Dongheui Lee, Angela Yao:
Aligning Latent Spaces for 3D Hand Pose Estimation. 2335-2343 - Kun Zhou
, Xiaoguang Han, Nianjuan Jiang, Kui Jia, Jiangbo Lu
:
HEMlets Pose: Learning Part-Centric Heatmap Triplets for Accurate 3D Human Pose Estimation. 2344-2353 - Xiong Zhang, Qiang Li, Hong Mo, Wenbo Zhang, Wen Zheng:
End-to-End Hand Mesh Recovery From a Monocular RGB Image. 2354-2364
Motion & Tracking
- Wenwei Zhang, Hui Zhou, Shuyang Sun, Zhe Wang, Jianping Shi, Chen Change Loy:
Robust Multi-Modality Multi-Object Tracking. 2365-2374 - Boris Ivanovic, Marco Pavone
:
The Trajectron: Probabilistic Multi-Agent Trajectory Modeling With Dynamic Spatiotemporal Graphs. 2375-2384 - Bin Yan, Haojie Zhao, Dong Wang, Huchuan Lu, Xiaoyun Yang:
'Skimming-Perusal' Tracking: A Framework for Real-Time and Robust Long-Term Tracking. 2385-2393 - Kyle Min, Jason J. Corso:
TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection. 2394-2403 - Anurag Ranjan, Joel Janai, Andreas Geiger, Michael J. Black:
Attacking Optical Flow. 2404-2413
Computational Photography & Graphics
- Chunyu Li
, Yusuke Monno, Hironori Hidaka, Masatoshi Okutomi:
Pro-Cam SSfM: Projector-Camera System for Structure and Spectral Reflectance From Motion. 2414-2423 - Bin He, Ce Wang, Boxin Shi, Lingyu Duan:
Mop Moiré Patterns Using MopNet. 2424-2432 - Ruofan Zhou, Sabine Süsstrunk:
Kernel Modeling Super-Resolution on Real Low-Resolution Images. 2433-2443 - Daiqian Ma, Renjie Wan, Boxin Shi, Alex C. Kot, Lingyu Duan:
Learning to Jointly Generate and Separate Reflections. 2444-2452 - Zijun Deng, Lei Zhu, Xiaowei Hu
, Chi-Wing Fu
, Xuemiao Xu, Qing Zhang, Jing Qin
, Pheng-Ann Heng:
Deep Multi-Model Fusion for Single-Image Dehazing. 2453-2462 - Yuhui Quan, Shijie Deng, Yixin Chen, Hui Ji
:
Deep Learning for Seeing Through Window With Raindrops. 2463-2471 - Xiaowei Hu
, Yitong Jiang, Chi-Wing Fu
, Pheng-Ann Heng:
Mask-ShadowGAN: Learning to Remove Shadows From Unpaired Data. 2472-2481
Low-Level Vision & Optimization
- Shangchen Zhou, Jiawei Zhang, Jinshan Pan, Wangmeng Zuo, Haozhe Xie
, Jimmy S. J. Ren:
Spatio-Temporal Filter Adaptive Network for Video Deblurring. 2482-2491 - Yang Liu, Jinshan Pan, Jimmy S. J. Ren, Zhixun Su
:
Learning Deep Priors for Image Dehazing. 2492-2500 - Xueyang Fu
, Zheng-Jun Zha, Feng Wu, Xinghao Ding, John W. Paisley:
JPEG Artifacts Reduction via Deep Convolutional Sparse Coding. 2501-2510 - Shuhang Gu, Yawei Li
, Luc Van Gool, Radu Timofte
:
Self-Guided Network for Fast Image Denoising. 2511-2520 - Ziang Cheng, Yinqiang Zheng
, Shaodi You, Imari Sato:
Non-Local Intrinsic Decomposition With Near-Infrared Priors. 2521-2530
Scene Understanding
- Romain Cohendet, Claire-Hélène Demarty
, Ngoc Q. K. Duong, Martin Engilberge:
VideoMem: Constructing, Analyzing, Predicting Short-Term and Long-Term Video Memorability. 2531-2540 - Maciej Halber, Yifei Shi, Kai Xu, Thomas A. Funkhouser:
Rescan: Inductive Instance Segmentation for Indoor RGBD Scans. 2541-2550 - Armen Avetisyan, Angela Dai, Matthias Nießner:
End-to-End CAD Model Retrieval and 9DoF Alignment in 3D Scans. 2551-2560 - Tianhao Yang, Zheng-Jun Zha, Hanwang Zhang
:
Making History Matter: History-Advantage Sequence Training for Visual Dialog. 2561-2569 - Liu Liu, Hongdong Li
, Yuchao Dai:
Stochastic Attraction-Repulsion Embedding for Large Scale Image Localization. 2570-2579 - Ranjay Krishna, Vincent S. Chen, Paroma Varma, Michael S. Bernstein
, Christopher Ré, Li Fei-Fei:
Scene Graph Prediction With Limited Labels. 2580-2590
Language & Reasoning
- Ramprasaath Ramasamy Selvaraju, Stefan Lee, Yilin Shen, Hongxia Jin, Shalini Ghosh, Larry P. Heck, Dhruv Batra, Devi Parikh:
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded. 2591-2600 - Samyak Datta, Karan Sikka, Anirban Roy, Karuna Ahuja, Devi Parikh, Ajay Divakaran:
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment. 2601-2610 - Xuejing Liu, Liang Li
, Shuhui Wang, Zheng-Jun Zha, Dechao Meng, Qingming Huang:
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding. 2611-2620 - Ting Yao, Yingwei Pan
, Yehao Li, Tao Mei
:
Hierarchy Parsing for Image Captioning. 2621-2629 - Antoine Miech, Dimitri Zhukov, Jean-Baptiste Alayrac, Makarand Tapaswi, Ivan Laptev, Josef Sivic:
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips. 2630-2640 - Bairui Wang, Lin Ma, Wei Zhang, Wenhao Jiang, Jingwen Wang, Wei Liu
:
Controllable Video Captioning With POS Sequence Guidance Based on Gated Fusion Network. 2641-2650
3D From Multiview & Sensors
- Yuxin Hou, Juho Kannala, Arno Solin
:
Multi-View Stereo by Temporal Nonparametric Fusion. 2651-2660 - Jiacheng Chen, Chen Liu, Jiaye Wu, Yasutaka Furukawa:
Floor-SP: Inverse CAD for Floorplans by Sequential Room-Wise Shortest Path. 2661-2670 - Zhaopeng Cui, Viktor Larsson, Marc Pollefeys
:
Polarimetric Relative Pose Estimation. 2671-2680 - Seong Hun Lee
, Javier Civera:
Closed-Form Optimal Two-View Triangulation Based on Angular Errors. 2681-2689 - Haozhe Xie
, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, Shengping Zhang:
Pix2Vox: Context-Aware 3D Reconstruction From Single and Multi-View Images. 2690-2698
Image & Video Synthesis
- Patrick Esser, Johannes Haux, Björn Ommer:
Unsupervised Robust Disentangling of Latent Characteristics for Image Synthesis. 2699-2709 - Mohammad Saeed Rad, Behzad Bozorgtabar
, Urs-Viktor Marti, Max Basler, Hazim Kemal Ekenel
, Jean-Philippe Thiran
:
SROBB: Targeted Perceptual Loss for Single Image Super-Resolution. 2710-2719 - Haotian Zhang, Long Mai, Hailin Jin, Zhaowen Wang, Ning Xu, John P. Collomosse:
An Internal Learning Approach to Video Inpainting. 2720-2729 - Sai Bi, Kalyan Sunkavalli, Federico Perazzi, Eli Shechtman, Vladimir G. Kim, Ravi Ramamoorthi:
Deep CG2Real: Synthetic-to-Real Translation via Image Disentanglement. 2730-2739 - Yunseok Jang, Tianchen Zhao, Seunghoon Hong, Honglak Lee:
Adversarial Defense via Learning to Generate Diverse Attacks. 2740-2749 - Atsuhiro Noguchi, Tatsuya Harada:
Image Generation From Small Datasets via Batch Statistics Adaptation. 2750-2758 - Mengyao Zhai, Lei Chen, Frederick Tung, Jiawei He, Megha Nawhal, Greg Mori:
Lifelong GAN: Continual Learning for Conditional Image Generation. 2759-2768
Applications. Medical, & Robotics
- Yi Wu, Yuxin Wu, Aviv Tamar, Stuart Russell, Georgia Gkioxari, Yuandong Tian:
Bayesian Relational Memory for Semantic Visual Navigation. 2769-2779 - Fabian Brickwedde, Steffen Abraham, Rudolf Mester:
Mono-SF: Multi-View Geometry Meets Single-View Depth for Monocular Scene Flow Estimation of Dynamic Traffic Scenes. 2780-2790 - Zhaoyang Huang, Yan Xu, Jianping Shi, Xiaowei Zhou, Hujun Bao, Guofeng Zhang:
Prior Guided Dropout for Robust Visual Localization in Dynamic Environments. 2791-2800 - Manuel Martin, Alina Roitberg
, Monica Haurilet, Matthias Horne, Simon Reiß
, Michael Voit, Rainer Stiefelhagen:
Drive&Act: A Multi-Modal Dataset for Fine-Grained Driver Behavior Recognition in Autonomous Vehicles. 2801-2810 - Yan Xu, Xinge Zhu
, Jianping Shi, Guofeng Zhang, Hujun Bao, Hongsheng Li
:
Depth Completion From Sparse LiDAR Data With Depth-Normal Constraints. 2811-2820 - Nicholas Rhinehart
, Rowan McAllister, Kris Kitani, Sergey Levine:
PRECOG: PREdiction Conditioned on Goals in Visual Multi-Agent Settings. 2821-2830 - Zhe Liu, Shunbo Zhou, Chuanzhe Suo, Peng Yin
, Wen Chen, Hesheng Wang
, Haoang Li, Yunhui Liu:
LPD-Net: 3D Point Cloud Learning for Large-Scale Place Recognition and Environment Analysis. 2831-2840 - Fei Xue, Xin Wang, Zike Yan, Qiuyuan Wang, Junqiu Wang, Hongbin Zha:
Local Supports Global: Deep Camera Relocalization With Sequence Enhancement. 2841-2850 - Shunkai Li, Fei Xue, Xin Wang, Zike Yan, Hongbin Zha:
Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry. 2851-2860 - Ziyang Hong, Yvan R. Petillot
, David Lane, Yishu Miao, Sen Wang
:
TextPlace: Visual Place Recognition and Topological Localization Through Reading Scene Texts. 2861-2870 - Mingyu Ding, Zhe Wang, Jiankai Sun, Jianping Shi, Ping Luo:
CamNet: Coarse-to-Fine Retrieval for Camera Re-Localization. 2871-2880 - William B. Shen, Danfei Xu, Yuke Zhu, Li Fei-Fei, Leonidas J. Guibas, Silvio Savarese:
Situational Fusion of Visual Representation for Visual Navigation. 2881-2890 - Ziyuan Huang
, Changhong Fu
, Yiming Li
, Fuling Lin, Peng Lu
:
Learning Aberrance Repressed Correlation Filters for Real-Time UAV Tracking. 2891-2900 - Arsalan Mousavian, Clemens Eppner, Dieter Fox:
6-DOF GraspNet: Variational Grasp Generation for Object Manipulation. 2901-2910 - Namdar Homayounfar, Justin Liang, Wei-Chiu Ma, Jack Fan, Xinyu Wu, Raquel Urtasun:
DAGMapper: Learning to Map by Discovering Lane Topology. 2911-2920 - Noa Garnett, Rafi Cohen, Tomer Pe'er, Roee Lahav, Dan Levi:
3D-LaneNet: End-to-End 3D Multiple Lane Detection. 2921-2930
Oral 2.1A
Feature Representations, Similarity Learning
- Janis Postels, Francesco Ferroni, Huseyin Coskun, Nassir Navab, Federico Tombari:
Sampling-Free Epistemic Uncertainty Estimation Using Approximated Variance Propagation. 2931-2940 - Hong Liu, Rongrong Ji
, Jie Li
, Baochang Zhang, Yue Gao, Yongjian Wu, Feiyue Huang:
Universal Adversarial Perturbation via Prior Driven Uncertainty Approximation. 2941-2949 - Ruth Fong, Mandela Patrick, Andrea Vedaldi:
Understanding Deep Networks via Extremal Perturbations and Smooth Masks. 2950-2958 - Mathilde Caron, Piotr Bojanowski, Julien Mairal, Armand Joulin:
Unsupervised Pre-Training of Image Features on Non-Curated Data. 2959-2968 - Linguang Zhang, Szymon Rusinkiewicz
:
Learning Local Descriptors With a CDF-Based Dynamic Soft Margin. 2969-2978 - Minyoung Kim, Yuting Wang, Pritish Sahu, Vladimir Pavlovic
:
Bayes-Factor-VAE: Hierarchical Bayesian Deep Auto-Encoder Models for Factor Disentanglement. 2979-2987 - Wei Jiang, Weiwei Sun, Andrea Tagliasacchi, Eduard Trulls, Kwang Moo Yi:
Linearized Multi-Sampling for Differentiable Image Transformation. 2988-2997 - Zhiqiang Tang, Xi Peng, Tingfeng Li, Yizhe Zhu, Dimitris N. Metaxas:
AdaTransform: Adaptive Data Transformation. 2998-3006 - Jiaqi Wang, Kai Chen, Rui Xu, Ziwei Liu, Chen Change Loy, Dahua Lin:
CARAFE: Content-Aware ReAssembly of FEatures. 3007-3016 - Dou Quan, Xuefeng Liang, Shuang Wang, Shaowei Wei, Yanfeng Li, Ning Huyan, Licheng Jiao
:
AFD-Net: Aggregated Feature Difference Learning for Cross-Spectral Image Patch Matching. 3017-3026 - Shupeng Su, Zhisheng Zhong, Chao Zhang:
Deep Joint-Semantics Reconstructing Hashing for Large-Scale Unsupervised Cross-Modal Retrieval. 3027-3035 - Stanislav Morozov, Artem Babenko:
Unsupervised Neural Quantization for Compressed-Domain Similarity Search. 3036-3045 - Soumava Kumar Roy, Mehrtash Harandi
, Richard Nock, Richard I. Hartley:
Siamese Networks: The Tale of Two Manifolds. 3046-3055 - Runzhong Wang
, Junchi Yan, Xiaokang Yang:
Learning Combinatorial Embedding Networks for Deep Graph Matching. 3056-3065 - Zhanghui Kuang, Yiming Gao, Guanbin Li, Ping Luo, Yimin Chen, Liang Lin, Wayne Zhang
:
Fashion Retrieval via Graph Reasoning Networks on a Similarity Pyramid. 3066-3075
Oral 2.1B
Low Level Vision
- Xin Deng, Ren Yang
, Mai Xu, Pier Luigi Dragotti
:
Wavelet Domain Style Transfer for an Effective Perception-Distortion Tradeoff in Single Image Super-Resolution. 3076-3085 - Jianrui Cai, Hui Zeng, Hongwei Yong, Zisheng Cao, Lei Zhang
:
Toward Real-World Single Image Super-Resolution: A New Benchmark and a New Model. 3086-3095 - Wenlong Zhang
, Yihao Liu
, Chao Dong, Yu Qiao:
RankSRGAN: Generative Adversarial Networks With Ranker for Image Super-Resolution. 3096-3105 - Peng Yi, Zhongyuan Wang, Kui Jiang
, Junjun Jiang
, Jiayi Ma:
Progressive Fusion Video Super-Resolution Network via Exploiting Non-Local Spatio-Temporal Correlations. 3106-3115 - Soo Ye Kim, Jihyong Oh
, Munchurl Kim:
Deep SR-ITM: Joint Learning of Super-Resolution and Inverse Tone-Mapping for 4K UHD HDR Applications. 3116-3125 - Tatsuya Yokota, Kazuya Kawai, Muneyuki Sakata, Yuichi Kimura, Hidekata Hontani:
Dynamic PET Image Reconstruction Using Nonnegative Matrix Factorization Incorporated With Deep Image Prior. 3126-3135 - Jerry Liu, Shenlong Wang, Raquel Urtasun:
DSIC: Deep Stereo Image Compression. 3136-3145 - Yoojin Choi, Mostafa El-Khamy
, Jungwon Lee:
Variable Rate Deep Image Compression With a Conditional Autoencoder. 3146-3154 - Saeed Anwar, Nick Barnes
:
Real Image Denoising With Feature Attention. 3155-3164 - Abdelrahman Abdelhamed
, Marcus A. Brubaker
, Michael S. Brown:
Noise Flow: Noise Modeling With Conditional Normalizing Flows. 3165-3173 - Ahmed Abbas
, Paul Swoboda:
Bottleneck Potentials in Markov Random Fields. 3174-3183 - Chen Chen, Qifeng Chen, Minh N. Do
, Vladlen Koltun:
Seeing Motion in the Dark. 3184-3193 - Huaizu Jiang, Deqing Sun, Varun Jampani, Zhaoyang Lv, Erik G. Learned-Miller, Jan Kautz:
SENSE: A Shared Encoder Network for Scene-Flow Estimation. 3194-3203
Poster 2.1
Deep Learning
- Firas Shama, Roey Mechrez, Alon Shoshan, Lihi Zelnik-Manor:
Adversarial Feedback Loop. 3204-3213 - Alon Shoshan, Roey Mechrez, Lihi Zelnik-Manor:
Dynamic-Net: Tuning the Objective Without Re-Training for Synthesis Tasks. 3214-3222 - Xinyu Gong, Shiyu Chang, Yifan Jiang, Zhangyang Wang:
AutoGAN: Neural Architecture Search for Generative Adversarial Networks. 3223-3233 - Han Shu, Yunhe Wang, Xu Jia, Kai Han, Hanting Chen, Chunjing Xu, Qi Tian, Chang Xu
:
Co-Evolutionary Compression for Unpaired Image Translation. 3234-3243 - Zeyu Feng, Chang Xu
, Dacheng Tao:
Self-Supervised Representation Learning From Multi-Domain Data. 3244-3254 - Michael Möller, Thomas Möllenhoff
, Daniel Cremers
:
Controlling Neural Networks via Energy Dissipation. 3255-3264 - Hao Lu, Yutong Dai, Chunhua Shen, Songcen Xu
:
Indices Matter: Learning to Index for Deep Image Matting. 3265-3274 - Yunan Li
, Qiguang Miao
, Wanli Ouyang
, Zhenxin Ma, Huijuan Fang, Chao Dong, Yi-Ning Quan:
LAP-Net: Level-Aware Progressive Network for Image Dehazing. 3275-3284 - Irwan Bello, Barret Zoph, Quoc Le, Ashish Vaswani, Jonathon Shlens:
Attention Augmented Convolutional Networks. 3285-3294 - Zechun Liu, Haoyuan Mu, Xiangyu Zhang, Zichao Guo, Xin Yang, Kwang-Ting Cheng
, Jian Sun:
MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning. 3295-3304 - Yuefu Zhou, Ya Zhang
, Yanfeng Wang, Qi Tian:
Accelerate CNN via Recursive Bayesian Pruning. 3305-3314 - Duo Li, Aojun Zhou, Anbang Yao:
HBONet: Harmonious Bottleneck on Two Orthogonal Dimensions. 3315-3324 - Jinchi Huang, Lie Qu, Rongfei Jia, Binqiang Zhao:
O2U-Net: A Simple Noisy Label Detection Approach for Deep Neural Networks. 3325-3333 - Dongmin Park, Seokil Hong, Bohyung Han, Kyoung Mu Lee:
Continual Learning by Asymmetric Loss Approximation With Single-Side Overestimation. 3334-3343 - Weifeng Ge, Weilin Huang, Sheng Guo, Matthew R. Scott
:
Label-PEnet: Sequential Label Propagation and Enhancement Networks for Weakly Supervised Instance Segmentation. 3344-3353 - Ziteng Gao, Limin Wang, Gangshan Wu:
LIP: Local Importance-Based Pooling. 3354-3363 - Takumi Kobayashi:
Global Feature Guided Local Pooling. 3364-3373 - Jinghua Wang, Jianmin Jiang:
Conditional Coupled Generative Adversarial Networks for Zero-Shot Domain Adaptation. 3374-3383 - Aamir Mustafa, Salman H. Khan, Munawar Hayat
, Roland Goecke
, Jianbing Shen, Ling Shao
:
Adversarial Defense by Restricting the Hidden Space of Deep Neural Networks. 3384-3393 - Juhong Min, Jongmin Lee, Jean Ponce, Minsu Cho:
Hyperpixel Flow: Semantic Correspondence With Multi-Layer Neural Features. 3394-3403 - Weitao Wan, Jiansheng Chen, Tianpeng Li, Yiqing Huang
, Jingqi Tian, Cheng Yu, Youze Xue:
Information Entropy Based Feature Pooling for Convolutional Neural Networks. 3404-3413 - Yuning Chai:
Patchwork: A Patch-Wise Attention Network for Efficient Object Detection and Segmentation in Video Streams. 3414-3423 - Siddhesh Khandelwal, Leonid Sigal:
AttentionRNN: A Structured Spatial Attention Mechanism. 3424-3433 - Yunpeng Chen
, Haoqi Fan, Bing Xu, Zhicheng Yan, Yannis Kalantidis, Marcus Rohrbach, Shuicheng Yan, Jiashi Feng:
Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution. 3434-3443 - Sagie Benaim, Michael Khaitov, Tomer Galanti, Lior Wolf:
Domain Intersection and Domain Difference. 3444-3452 - Oren Rippel, Sanjay Nair, Carissa Lew, Steve Branson, Alexander G. Anderson, Lubomir D. Bourdev:
Learned Video Compression. 3453-3462 - Han Hu, Zheng Zhang, Zhenda Xie, Stephen Lin:
Local Relation Networks for Image Recognition. 3463-3472 - Éloi Mehr, Ariane Jourdan, Nicolas Thome, Matthieu Cord, Vincent Guitteny:
DiscoNet: Shapes Learning on Disconnected Manifolds for 3D Editing. 3473-3482 - Max Ehrlich, Larry Davis:
Deep Residual Learning in the JPEG Transform Domain. 3483-3492 - Xinqi Zhu, Chang Xu
, Langwen Hui, Cewu Lu, Dacheng Tao:
Approximated Bilinear Modules for Temporal Modeling. 3493-3502 - Chengchao Shen, Mengqi Xue
, Xinchao Wang
, Jie Song, Li Sun, Mingli Song:
Customizing Student Networks From Heterogeneous Teachers via Adaptive Knowledge Amalgamation. 3503-3512 - Hanting Chen, Yunhe Wang, Chang Xu
, Zhaohui Yang, Chuanjian Liu, Boxin Shi, Chunjing Xu, Chao Xu, Qi Tian:
Data-Free Learning of Student Networks. 3513-3521 - Yue Wang, Justin Solomon:
Deep Closest Point: Learning Representations for Point Cloud Registration. 3522-3531 - Chao Zhang, Stephan Liwicki
, William Smith
, Roberto Cipolla:
Orientation-Aware Semantic Segmentation on Icosahedron Spheres. 3532-3540 - Zhaoyang Zhang, Jingyu Li, Wenqi Shao, Zhanglin Peng, Ruimao Zhang
, Xiaogang Wang, Ping Luo:
Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks. 3541-3550 - Ping Chao, Chao-Yang Kao, Yu-Shan Ruan, Chien-Hsiang Huang, Youn-Long Lin:
HarDNet: A Low Memory Traffic Network. 3551-3560 - Junjun He, Zhongying Deng, Yu Qiao
:
Dynamic Multi-Scale Filters for Semantic Segmentation. 3561-3571 - Ravi Teja Mullapudi, Steven Chen, Keyi Zhang, Deva Ramanan
, Kayvon Fatahalian:
Online Model Distillation for Efficient Video Inference. 3572-3581
Recognition
- Kai Li, Martin Renqiang Min
, Yun Fu:
Rethinking Zero-Shot Learning: A Conditional Visual Classification Perspective. 3582-3591 - Senthil Purushwalkam, Maximilian Nickel
, Abhinav Gupta, Marc'Aurelio Ranzato:
Task-Driven Modular Networks for Zero-Shot Compositional Learning. 3592-3601 - Limeng Qiao, Yemin Shi, Jia Li, Yonghong Tian, Tiejun Huang, Yaowei Wang:
Transductive Episodic-Wise Adaptive Metric for Few-Shot Learning. 3602-3611 - Wei Zhai, Yang Cao, Jing Zhang
, Zheng-Jun Zha:
Deep Multiple-Attribute-Perceived Network for Real-World Texture Recognition. 3612-3621 - Guan'an Wang, Tianzhu Zhang, Jian Cheng, Si Liu, Yang Yang, Zengguang Hou:
RGB-Infrared Cross-Modality Person Re-Identification via Joint Pixel and Feature Alignment. 3622-3631 - Saurabh Singh, Abhinav Shrivastava:
EvalNorm: Estimating Batch Normalization Statistics for Evaluation. 3632-3640 - Jianyuan Guo
, Yuhui Yuan, Lang Huang, Chao Zhang, Jin-Ge Yao, Kai Han:
Beyond Human Parts: Dual Part-Aligned Representations for Person Re-Identification. 3641-3650 - Qi Dong, Xiatian Zhu, Shaogang Gong:
Person Search by Text Attribute Query As Zero-Shot Learning. 3651-3660 - Qing Liu, Lingxi Xie, Huiyu Wang, Alan L. Yuille
:
Semantic-Aware Knowledge Preservation for Zero-Shot Sketch-Based Image Retrieval. 3661-3670 - Hamed H. Aghdam, Abel Gonzalez-Garcia, Antonio M. López
, Joost van de Weijer
:
Active Learning for Deep Detection Neural Networks. 3671-3679 - Xuanyi Dong, Yi Yang:
One-Shot Neural Architecture Search via Self-Evaluated Template Network. 3680-3689 - Zuozhuo Dai, Mingqiang Chen, Xiaodong Gu, Siyu Zhu, Ping Tan:
Batch DropBlock Network for Person Re-Identification and Beyond. 3690-3700 - Kaiyang Zhou, Yongxin Yang, Andrea Cavallaro, Tao Xiang:
Omni-Scale Feature Learning for Person Re-Identification. 3701-3711 - Linfeng Zhang
, Jiebo Song, Anni Gao, Jingwei Chen, Chenglong Bao, Kaisheng Ma
:
Be Your Own Teacher: Improve the Performance of Convolutional Neural Networks via Self Distillation. 3712-3721 - Nikita Dvornik, Julien Mairal, Cordelia Schmid:
Diversity With Cooperation: Ensemble Methods for Few-Shot Classification. 3722-3730 - Cheng Xu, Zhaoqun Li, Qiang Qiu, Biao Leng, Jingfei Jiang:
Enhancing 2D Representation via Adjacent Views for 3D Shape Retrieval. 3731-3739 - Kun Wei, Muli Yang, Hao Wang, Cheng Deng
, Xianglong Liu
:
Adversarial Fine-Grained Composition Learning for Unseen Attribute-Object Recognition. 3740-3748 - Ruijie Quan, Xuanyi Dong, Yu Wu
, Linchao Zhu
, Yi Yang:
Auto-ReID: Searching for a Part-Aware ConvNet for Person Re-Identification. 3749-3758 - Bryan Bryan, Yuan Gong
, Yizhe Zhang, Christian Poellabauer
:
Second-Order Non-Local Attention Networks for Person Re-Identification. 3759-3768
Segmentation, Grouping, & Shape
- Zipeng Ye, Ran Yi
, Minjing Yu, Yong-Jin Liu, Ying He
:
Fast Computation of Content-Sensitive Superpixels and Supervoxels Using Q-Distances. 3769-3778 - Dániel Baráth, Jiri Matas
:
Progressive-X: Efficient, Anytime, Multi-Model Fitting Algorithm. 3779-3787 - Yingyue Xu, Dan Xu, Xiaopeng Hong, Wanli Ouyang
, Rongrong Ji, Min Xu
, Guoying Zhao:
Structured Modeling of Joint Deep Feature and Prediction Refinement for Salient Object Detection. 3788-3797 - Jinming Su, Jia Li, Yu Zhang, Changqun Xia, Yonghong Tian:
Selectivity or Invariance: Boundary-Aware Salient Object Detection. 3798-3807 - Urbano Miguel Nunes
, Yiannis Demiris
:
Online Unsupervised Learning of the 3D Kinematic Structure of Arbitrary Rigid Bodies. 3808-3816
3D From Single View & RGBD
- Bram Wallace, Bharath Hariharan:
Few-Shot Generalization for Single-Image 3D Reconstruction via Priors. 3817-3826 - Clément Godard
, Oisin Mac Aodha, Michael Firman, Gabriel J. Brostow:
Digging Into Self-Supervised Monocular Depth Estimation. 3827-3837 - Jing Zhu, Yi Fang:
Learning Object-Specific Distance From a Monocular Image. 3838-3847 - Geonho Cha, Minsik Lee
, Songhwai Oh:
Unsupervised 3D Reconstruction Networks. 3848-3857 - Dong Wook Shu, Sung Woo Park, Junseok Kwon:
3D Point Cloud Generative Adversarial Network Based on Tree Structured Graph Convolutions. 3858-3867 - Junjie Hu
, Yan Zhang, Takayuki Okatani:
Visualization of Convolutional Neural Networks for Monocular Depth Estimation. 3868-3877
Action & Video
- Ruohan Gao, Kristen Grauman:
Co-Separating Sounds of Visual Objects. 3878-3887 - Tianwei Lin, Xiao Liu, Xin Li, Errui Ding, Shilei Wen:
BMN: Boundary-Matching Network for Temporal Action Proposal Generation. 3888-3897 - Ziyi Liu
, Le Wang, Qilin Zhang
, Zhanning Gao, Zhenxing Niu, Nanning Zheng, Gang Hua:
Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks. 3898-3907 - Chaoxu Guo, Bin Fan, Jie Gu, Qian Zhang, Shiming Xiang, Véronique Prinet, Chunhong Pan:
Progressive Sparse Local Attention for Video Object Detection. 3908-3917 - Tete Xiao, Quanfu Fan, Danny Gutfreund, Mathew Monfort, Aude Oliva, Bolei Zhou:
Reasoning About Human-Object Interactions Through Dual Attention Networks. 3918-3927 - Xiaohui Zeng, Renjie Liao, Li Gu, Yuwen Xiong, Sanja Fidler, Raquel Urtasun:
DMM-Net: Differentiable Mask-Matching Network for Video Object Segmentation. 3928-3937 - Hao Wang, Cheng Deng
, Junchi Yan, Dacheng Tao:
Asymmetric Cross-Guided Attention Network for Actor and Action Video Segmentation From Natural Language Query. 3938-3947 - Huaijia Lin, Xiaojuan Qi, Jiaya Jia
:
AGSS-VOS: Attention Guided Single-Shot Video Object Segmentation. 3948-3956 - Jianing Li, Shiliang Zhang, Jingdong Wang
, Wen Gao, Qi Tian:
Global-Local Temporal Representations for Video Person Re-Identification. 3957-3966 - Chaowei Xiao, Ruizhi Deng, Bo Li, Taesung Lee, Benjamin Edwards, Jinfeng Yi, Dawn Song, Mingyan Liu, Ian M. Molloy:
AdvIT: Adversarial Frames Identifier Based on Temporal Consistency in Videos. 3967-3976
Motion & Tracking
- Ziqin Wang, Jun Xu, Li Liu, Fan Zhu, Ling Shao
:
RANet: Ranking Attention Network for Fast Video Object Segmentation. 3977-3986 - Jiarui Xu, Yue Cao, Zheng Zhang, Han Hu:
Spatial-Temporal Relation Networks for Multi-Object Tracking. 3987-3997 - Lianghua Huang, Xin Zhao
, Kaiqi Huang:
Bridging the Gap Between Detection and Tracking: A Unified Approach. 3998-4008 - Lichao Zhang, Abel Gonzalez-Garcia, Joost van de Weijer
, Martin Danelljan, Fahad Shahbaz Khan
:
Learning the Model Update for Siamese Trackers. 4009-4018 - Linyu Zheng, Ming Tang, Yingying Chen, Jinqiao Wang, Hanqing Lu:
Fast-deepKCF Without Boundary Effect. 4019-4028
Computational Photography & Graphics
- Xiuming Zhang, Jiayuan Mao, Yikai Li, William T. Freeman, Joshua B. Tenenbaum, Jiajun Wu:
Program-Guided Image Manipulators. 4029-4038 - Pierre-André Brousseau, Sébastien Roy:
Calibration of Axial Fisheye Cameras Through Generic Virtual Central Models. 4039-4047 - Vishwanath Saragadam
, Raja Venkata, Jian Wang, Shree K. Nayar, Mohit Gupta:
Micro-Baseline Structured Light. 4048-4057 - Xin Miao, Xin Yuan
, Yunchen Pu, Vassilis Athitsos
:
lambda-Net: Reconstruct Hyperspectral Images From a Snapshot Measurement. 4058-4068 - Masako Kashiwagi, Nao Mishima, Tatsuo Kozakaya, Shinsaku Hiura:
Deep Depth From Aberration Map. 4069-4078 - Lukas Murmann, Michaël Gharbi, Miika Aittala, Frédo Durand:
A Dataset of Multi-Illumination Images in the Wild. 4079-4088 - Jie Song, Xu Chen, Otmar Hilliges:
Monocular Neural Image Based Rendering With Continuous View Control. 4089-4099 - Marc Comino Trinidad
, Ricardo Martin-Brualla, Florian Kainz, Janne Kontkanen:
Multi-View Image Fusion. 4100-4109
Low-Level & Optimization
- Wei Wang, Xin Chen, Cheng Yang, Xiang Li, Xuemei Hu, Tao Yue
:
Enhancing Low Light Videos by Exploring High Sensitivity Camera Noise. 4110-4118 - Qifan Gao, Xiao Shu, Xiaolin Wu:
Deep Restoration of Vintage Photographs From Scanned Halftone Prints. 4119-4128 - Qiqi Hou, Feng Liu:
Context-Aware Image Matting for Simultaneous Foreground and Alpha Estimation. 4129-4138 - Wei Wang, Ruiming Guo
, Yapeng Tian, Wenming Yang:
CFSNet: Toward a Controllable Feature Space for Image Restoration. 4139-4148 - Wu Wang
, Weihong Zeng, Yue Huang, Xinghao Ding, John W. Paisley:
Deep Blind Hyperspectral Image Fusion. 4149-4158 - Sungmin Cha, Taesup Moon:
Fully Convolutional Pixel Adaptive Image Denoiser. 4159-4168 - Hongyu Liu, Bin Jiang, Yi Xiao, Chao Yang:
Coherent Semantic Attention for Image Inpainting. 4169-4178 - Yajun Qiu, Ruxin Wang, Dapeng Tao, Jun Cheng:
Embedded Block Residual Network: A Recursive Restoration Model for Single-Image Super-Resolution. 4179-4188 - Shuhang Gu, Wen Li, Luc Van Gool, Radu Timofte
:
Fast Image Restoration With Multi-Bin Trainable Linear Units. 4189-4198
Scene Understanding
- Zenglin Shi, Pascal Mettes, Cees Snoek:
Counting With Focus for Free. 4199-4208 - Behzad Bozorgtabar
, Mohammad Saeed Rad, Dwarikanath Mahapatra, Jean-Philippe Thiran
:
SynDeMo: Synergistic Deep Feature Alignment for Joint Learning of Depth and Ego-Motion. 4209-4218 - Ke Li, Tianhao Zhang, Jitendra Malik:
Diverse Image Synthesis From Semantic Layouts via Conditional IMLE. 4219-4228 - Yanwei Pang, Yazhao Li, Jianbing Shen, Ling Shao
:
Towards Bridging Semantic Gap to Improve Semantic Segmentation. 4229-4238
Language & Reasoning
- Lixin Liu, Jiajun Tang
, Xiaojun Wan, Zongming Guo:
Generating Diverse and Descriptive Image Captions Using Visual Paraphrases. 4239-4248 - Xu Yang, Hanwang Zhang
, Jianfei Cai:
Learning to Collocate Neural Modules for Image Captioning. 4249-4259 - Jyoti Aneja, Harsh Agrawal
, Dhruv Batra, Alexander G. Schwing:
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning. 4260-4269 - Nilavra Bhattacharya
, Qing Li, Danna Gurari:
Why Does a Visual Question Have Different Answers? 4270-4279 - Mohit Bajaj, Lanjun Wang
, Leonid Sigal:
G3raphGround: Graph-Based Language Grounding. 4280-4289 - Ali Furkan Biten, Rubèn Tito, Andrés Mafla, Lluís Gómez i Bigorda
, Marçal Rusiñol, C. V. Jawahar
, Ernest Valveny, Dimosthenis Karatzas
:
Scene Text Visual Question Answering. 4290-4300
3D From Multiview & Sensors
- Lu Sheng
, Dan Xu, Wanli Ouyang
, Xiaogang Wang:
Unsupervised Collaborative Learning of Keyframe Detection and Visual Odometry Towards Monocular Deep SLAM. 4301-4310 - Youze Xue, Jiansheng Chen, Weitao Wan, Yiqing Huang
, Cheng Yu, Tianpeng Li, Jiayu Bao:
MVSCRF: Learning Multi-View Stereo With Conditional Random Fields. 4311-4320 - Eric Brachmann, Carsten Rother:
Neural-Guided RANSAC: Learning Where to Sample Model Hypotheses. 4321-4330 - Sergey Prokudin, Christoph Lassner, Javier Romero:
Efficient Learning on Point Clouds With Basis Point Sets. 4331-4340 - Haibo Qiu, Chunyu Wang, Jingdong Wang
, Naiyan Wang, Wenjun Zeng
:
Cross View Fusion for 3D Human Pose Estimation. 4341-4350 - Junbang Liang, Ming C. Lin:
Shape-Aware Human Pose and Shape Reconstruction Using Multi-View Images. 4351-4361 - Di Yan, Henrique Morimitsu
, Shan Gao, Xiangyang Ji:
Monocular Piecewise Depth Estimation in Dynamic Scenes by Exploiting Superpixel Relations. 4362-4371 - Hajime Taira, Ignacio Rocco, Jirí Sedlár
, Masatoshi Okutomi, Josef Sivic, Tomás Pajdla, Torsten Sattler, Akihiko Torii:
Is This the Right Place? Geometric-Semantic Pose Verification for Indoor Visual Localization. 4372-4382 - Shivam Duggal, Shenlong Wang, Wei-Chiu Ma, Rui Hu, Raquel Urtasun:
DeepPruner: Learning Efficient Stereo Matching via Differentiable PatchMatch. 4383-4392
Image & Video Synthesis
- Sijie Yan, Zhizhong Li
, Yuanjun Xiong, Huahan Yan, Dahua Lin:
Convolutional Sequence Generation for Skeleton-Based Action Synthesis. 4393-4401 - Seoung Wug Oh, Sungho Lee
, Joon-Young Lee, Seon Joo Kim:
Onion-Peel Networks for Deep Video Completion. 4402-4411 - Sungho Lee
, Seoung Wug Oh, DaeYeun Won, Seon Joo Kim:
Copy-and-Paste Networks for Deep Video Inpainting. 4412-4420 - Dmytro Kotovenko, Artsiom Sanakoyeu, Sabine Lang, Björn Ommer:
Content and Style Disentanglement for Artistic Style Transfer. 4421-4430
Oral 3.1A
Generative Modeling & Synthesis
- Rameen Abdal
, Yipeng Qin
, Peter Wonka:
Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space? 4431-4440 - Shuai Yang
, Zhangyang Wang, Zhaowen Wang, Ning Xu, Jiaying Liu
, Zongming Guo:
Controllable Artistic Text Style Transfer via Shape-Matching GAN. 4441-4450 - Tai-Yin Chiu:
Understanding Generalized Whitening and Coloring Transform for Universal Style Transfer. 4451-4459 - Cícero Nogueira dos Santos, Youssef Mroueh, Inkit Padhi, Pierre L. Dognin:
Learning Implicit Generative Models by Matching Perceptual Features. 4460-4469 - Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, Thomas S. Huang:
Free-Form Image Inpainting With Gated Convolution. 4470-4479 - Xintong Han, Zuxuan Wu, Weilin Huang, Matthew R. Scott
, Larry Davis:
FiNet: Compatible and Diverse Fashion Image Inpainting. 4480-4490 - Assaf Shocher, Shai Bagon, Phillip Isola, Michal Irani:
InGAN: Capturing and Retargeting the "DNA" of a Natural Image. 4491-4500 - David Bau
, Jun-Yan Zhu, Jonas Wulff, William S. Peebles, Bolei Zhou, Hendrik Strobelt, Antonio Torralba:
Seeing What a GAN Cannot Generate. 4501-4510 - Chieh Hubert Lin, Chia-Che Chang, Yu-Sheng Chen
, Da-Cheng Juan, Wei Wei, Hwann-Tzong Chen:
COCO-GAN: Generation by Parts via Conditional Coordinating. 4511-4520 - Hang Chu, Daiqing Li, David Acuna, Amlan Kar, Maria Shugrina, Xinkai Wei, Ming-Yu Liu, Antonio Torralba, Sanja Fidler:
Neural Turtle Graphics for Modeling City Road Layouts. 4521-4529 - Michael Oechsle, Lars M. Mescheder, Michael Niemeyer, Thilo Strauss, Andreas Geiger:
Texture Fields: Learning Texture Representations in Function Space. 4530-4539 - Guandao Yang, Xun Huang, Zekun Hao, Ming-Yu Liu, Serge J. Belongie
, Bharath Hariharan:
PointFlow: 3D Point Cloud Generation With Continuous Normalizing Flows. 4540-4549 - Amlan Kar, Aayush Prakash, Ming-Yu Liu, Eric Cameracci, Justin Yuan, Matt Rusiniak, David Acuna, Antonio Torralba, Sanja Fidler:
Meta-Sim: Learning to Generate Synthetic Datasets. 4550-4559 - Oron Ashual, Lior Wolf:
Specifying Object Attributes and Relations in Interactive Scene Generation. 4560-4568 - Tamar Rott Shaham, Tali Dekel, Tomer Michaeli:
SinGAN: Learning a Generative Model From a Single Natural Image. 4569-4579
Oral 3.1B
Vision, Language, & Text
- Xin Wang, Jiawei Wu
, Jun-Kun Chen, Lei Li
, Yuan-Fang Wang, William Yang Wang:
VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research. 4580-4590 - Yu Xiong, Qingqiu Huang, Lingfeng Guo, Hang Zhou, Bolei Zhou, Dahua Lin:
A Graph-Based Framework to Bridge Movies and Synopses. 4591-4600 - Ajeet Kumar Singh, Anand Mishra
, Shashank Shekhar, Anirban Chakraborty:
From Strings to Things: Knowledge-Enabled VQA Model That Can Read and Reason. 4601-4611 - Long Chen
, Hanwang Zhang
, Jun Xiao, Xiangnan He, Shiliang Pu, Shih-Fu Chang:
Counterfactual Critic Multi-Agent Training for Scene Graph Generation. 4612-4622 - Dong Huk Park, Trevor Darrell, Anna Rohrbach:
Robust Change Captioning. 4623-4632 - Lun Huang, Wenmin Wang, Jie Chen, Xiaoyong Wei
:
Attention on Attention for Image Captioning. 4633-4642 - Sibei Yang
, Guanbin Li, Yizhou Yu:
Dynamic Graph Attention for Referring Expression Comprehension. 4643-4652 - Kunpeng Li, Yulun Zhang
, Kai Li, Yuanyuan Li, Yun Fu:
Visual Semantic Reasoning for Image-Text Matching. 4653-4661 - Josiah Wang, Lucia Specia:
Phrase Localization Without Paired Training Examples. 4662-4671 - Daqing Liu, Hanwang Zhang
, Feng Wu, Zheng-Jun Zha:
Learning to Assemble Neural Module Tree Networks for Visual Grounding. 4672-4681 - Zhengyuan Yang, Boqing Gong, Liwei Wang, Wenbing Huang, Dong Yu, Jiebo Luo
:
A Fast and Accurate One-Stage Approach to Visual Grounding. 4682-4692 - Arka Sadhu, Kan Chen, Ram Nevatia:
Zero-Shot Grounding of Objects From Natural Language Queries. 4693-4702 - Siyang Qin, Alessandro Bissacco, Michalis Raptis, Yasuhisa Fujii, Ying Xiao:
Towards Unconstrained End-to-End Text Spotting. 4703-4713 - Jeonghun Baek
, Geewook Kim, Junyeop Lee, Sungrae Park, Dongyoon Han, Sangdoo Yun, Seong Joon Oh, Hwalsuk Lee:
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis. 4714-4722
Poster 3.1
Deep Learning
- Francesco Croce, Matthias Hein:
Sparse and Imperceivable Adversarial Attacks. 4723-4731 - Qian Huang, Isay Katsman
, Zeqi Gu, Horace He, Serge J. Belongie
, Ser-Nam Lim:
Enhancing Adversarial Example Transferability With an Intermediate Level Attack. 4732-4741 - Mateusz Michalkiewicz, Jhony Kaesemodel Pontes, Dominic Jack, Mahsa Baktashmotlagh
, Anders P. Eriksson:
Implicit Surface Representations As Layers in Neural Networks. 4742-4751 - Pablo Navarrete Michelini, Hanwen Liu, Yunhua Lu, Xingqun Jiang:
A Tour of Convolutional Networks Guided by Linear Interpreters. 4752-4761 - João F. Henriques, Sébastien Ehrhardt, Samuel Albanie, Andrea Vedaldi:
Small Steps and Giant Leaps: Minimal Newton Solvers for Deep Learning. 4762-4771 - Ameya Joshi
, Amitangshu Mukherjee, Soumik Sarkar, Chinmay Hegde
:
Semantic Adversarial Attacks: Parametric Transformations That Fool Deep Classifiers. 4772-4782 - Yang Bai
, Yan Feng, Yisen Wang, Tao Dai, Shutao Xia, Yong Jiang:
Hilbert-Based Generative Defense for Adversarial Examples. 4783-4792 - Jang Hyun Cho, Bharath Hariharan:
On the Efficacy of Knowledge Distillation. 4793-4801 - Simyung Chang, Seonguk Park, John Yang, Nojun Kwak:
Sym-Parameterized Dynamic Inference for Mixed-Domain Image Translation. 4802-4810 - Shuang Wang, Yanfeng Li, Xuefeng Liang, Dou Quan, Bowu Yang, Shaowei Wei, Licheng Jiao
:
Better and Faster: Exponential Loss for Image Patch Matching. 4811-4820 - Rey Wiyatno, Anqi Xu:
Physical Adversarial Textures That Fool Visual Object Tracking. 4821-4830 - Huidong Liu, Xianfeng Gu
, Dimitris Samaras:
Wasserstein GAN With Quadratic Transport Cost. 4831-4840 - Sven Gowal, Krishnamurthy Dvijotham, Robert Stanforth, Rudy Bunel, Chongli Qin, Jonathan Uesato, Relja Arandjelovic, Timothy Arthur Mann, Pushmeet Kohli:
Scalable Verified Training for Provably Robust Image Classification. 4841-4850 - Ruihao Gong
, Xianglong Liu
, Shenghu Jiang, Tianxiang Li, Peng Hu, Jiazhen Lin, Fengwei Yu, Junjie Yan:
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks. 4851-4860 - Chris Finlay, Aram-Alexandre Pooladian, Adam M. Oberman:
The LogBarrier Adversarial Attack: Making Effective Use of Decision Boundary Information. 4861-4869 - Thalaiyasingam Ajanthan, Puneet K. Dokania, Richard Hartley, Philip H. S. Torr:
Proximal Mean-Field for Neural Network Quantization. 4870-4879 - Hao-Yun Chen, Jhao-Hong Liang, Shih-Chieh Chang
, Jia-Yu Pan, Yu-Ting Chen, Wei Wei, Da-Cheng Juan:
Improving Adversarial Robustness via Guided Complement Entropy. 4880-4888 - Yujia Liu, Seyed-Mohsen Moosavi-Dezfooli
, Pascal Frossard:
A Geometry-Inspired Decision-Based Attack. 4889-4897 - Jie Li
, Rongrong Ji
, Hong Liu, Xiaopeng Hong, Yue Gao, Qi Tian:
Universal Perturbation Attack Against Image Retrieval. 4898-4907 - Jiaxin Gu, Junhe Zhao, Xiaolong Jiang, Baochang Zhang, Jianzhuang Liu, Guodong Guo, Rongrong Ji:
Bayesian Optimized 1-Bit CNNs. 4908-4916 - Kaiming He, Ross B. Girshick, Piotr Dollár:
Rethinking ImageNet Pre-Training. 4917-4926 - Chaithanya Kumar Mummadi, Thomas Brox, Jan Hendrik Metzen:
Defending Against Universal Perturbations With Shared Adversarial Training. 4927-4936 - Yiyou Sun, Sathya N. Ravi
, Vikas Singh:
Adaptive Activation Thresholding: Dynamic Routing Type Behavior for Interpretability in Convolutional Neural Networks. 4937-4946 - Andrei Kapishnikov, Tolga Bolukbasi, Fernanda B. Viégas, Michael Terry:
XRAI: Better Attributions Through Regions. 4947-4956 - Thomas Brunner
, Frederik Diehl, Michael Truong-Le, Alois C. Knoll
:
Guessing Smart: Biased Sampling for Efficient Black-Box Adversarial Attacks. 4957-4965
Recognition
- Yanwei Pang, Jin Xie, Muhammad Haris Khan
, Rao Muhammad Anwer
, Fahad Shahbaz Khan
, Ling Shao
:
Mask-Guided Attention Network for Occluded Pedestrian Detection. 4966-4974 - Chuanchen Luo, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang:
Spectral Feature Transformation for Person Re-Identification. 4975-4984 - Xiaofeng Liu, Zhenhua Guo, Site Li
, Ping Jia, Lingsheng Kong, Jane You, B. V. K. Vijaya Kumar
:
Permutation-Invariant Feature Restructuring for Correlation-Aware Image Set-Based Recognition. 4985-4995 - Chufeng Tang, Lu Sheng
, Zhaoxiang Zhang, Xiaolin Hu:
Improving Pedestrian Attribute Recognition With Weakly-Supervised Multi-Scale Attribute-Specific Localization. 4996-5005 - Baoyun Peng
, Xiao Jin, Dongsheng Li, Shunfeng Zhou, Yichao Wu, Jiaheng Liu, Zhaoning Zhang, Yu Liu:
Correlation Congruence for Knowledge Distillation. 5006-5015 - Yiru Wang, Weihao Gan, Jie Yang, Wei Wu, Junjie Yan:
Dynamic Curriculum Learning for Imbalanced Data Classification. 5016-5025 - Makarand Tapaswi, Marc T. Law, Sanja Fidler:
Video Face Clustering With Unknown Number of Clusters. 5026-5035 - Giorgos Tolias
, Filip Radenovic, Ondrej Chum:
Targeted Mismatch Adversarial Attack: Query With a Flower to Retrieve the Tower. 5036-5045 - Wei-Lin Hsiao, Isay Katsman, Chao-Yuan Wu, Devi Parikh, Kristen Grauman:
Fashion++: Minimal Edits for Outfit Improvement. 5046-5055 - Si Wu, Sihao Lin
, Wenhao Wu, Mohamed Azzam
, Hau-San Wong
:
Semi-Supervised Pedestrian Instance Synthesis and Detection With Mutual Reinforcement. 5056-5065 - Tao Hu, Pascal Mettes, Jia-Hong Huang, Cees Snoek:
SILCO: Show a Few Images, Localize the Common Object. 5066-5075 - Jimmy Addison Lee, Peng Liu
, Jun Cheng, Huazhu Fu
:
A Deep Step Pattern Representation for Multimodal Retinal Image Registration. 5076-5085 - Zhen Zhang
, Wee Sun Lee:
Deep Graphical Feature Learning for the Feature Matching Problem. 5086-5095 - Dong Lao
, Ganesh Sundaramoorthi:
Minimum Delay Object Detection From Video. 5096-5105 - Jérôme Revaud, Jon Almazán, Rafael S. Rezende, César Roberto de Souza:
Learning With Average Precision: Training Image Retrieval With a Listwise Loss. 5106-5115 - Amirreza Shaban, Amir Rahimi, Shray Bansal, Stephen Gould, Byron Boots, Richard Hartley:
Learning to Find Common Objects Across Few Image Collections. 5116-5125 - Lu Zhang, Xiangyu Zhu, Xiangyu Chen, Xu Yang, Zhen Lei, Zhiyong Liu:
Weakly Aligned Cross-Modal Learning for Multispectral Pedestrian Detection. 5126-5136 - Jiangfan Han, Ping Luo, Xiaogang Wang:
Deep Self-Learning From Noisy Labels. 5137-5146 - Marcelo Gennari Do Nascimento, Victor Prisacariu, Roger Fawcett:
DSConv: Efficient Convolution Operator. 5147-5156 - Jiangfan Han, Xiaoyi Dong, Ruimao Zhang
, Dongdong Chen, Weiming Zhang, Nenghai Yu, Ping Luo, Xiaogang Wang:
Once a MAN: Towards Multi-Target Attack via Learning Multi-Target Adversarial Network Once. 5157-5166
Segmentation, Grouping, & Shape
- Wenqiang Xu, Haiyang Wang, Fubo Qi, Cewu Lu:
Explicit Shape Encoding for Real-Time Instance Segmentation. 5167-5176 - Cheng-Yang Fu
, Tamara L. Berg, Alexander C. Berg:
IMP: Instance Mask Projection for High Accuracy Semantic Segmentation of Things. 5177-5186 - Linjie Yang, Yuchen Fan, Ning Xu:
Video Instance Segmentation. 5187-5196 - Kunpeng Li, Yulun Zhang
, Kai Li, Yuanyuan Li, Yun Fu:
Attention Bridging Network for Knowledge Transfer. 5197-5206 - Wataru Shimoda, Keiji Yanai
:
Self-Supervised Difference Detection for Weakly-Supervised Semantic Segmentation. 5207-5216 - Bowen Cheng, Liang-Chieh Chen, Yunchao Wei, Yukun Zhu, Zilong Huang, Jinjun Xiong
, Thomas S. Huang, Wen-Mei Hwu, Honghui Shi:
SPGNet: Semantic Prediction Guidance for Scene Parsing. 5217-5227 - Towaki Takikawa, David Acuna, Varun Jampani, Sanja Fidler:
Gated-SCNN: Gated Shape CNNs for Semantic Segmentation. 5228-5237 - Yongcheng Liu, Bin Fan, Gaofeng Meng, Jiwen Lu
, Shiming Xiang, Chunhong Pan:
DensePoint: Learning Densely Contextual Representation for Efficient Point Cloud Processing. 5238-5247 - Mennatullah Siam, Boris N. Oreshkin, Martin Jägersand:
AMP: Adaptive Masked Proxies for Few-Shot Segmentation. 5248-5257 - Tarun Kalluri, Girish Varma, Manmohan Chandraker, C. V. Jawahar
:
Universal Semi-Supervised Semantic Segmentation. 5258-5269
Statistics, Physics, Theory & Datasets
- Long-Kai Huang, Jianda Chen, Sinno Jialin Pan
:
Accelerate Learning of Deep Hashing With Gradient Attention. 5270-5279 - Qing-Yuan Jiang, Yi He, Gen Li, Jian Lin, Lei Li, Wu-Jun Li:
SVD: A Large-Scale Short Video Dataset for Near-Duplicate Video Retrieval. 5280-5288 - Hubert Lin, Paul Upchurch, Kavita Bala
:
Block Annotation: Better Image Annotation With Sub-Image Decomposition. 5289-5299 - Yanzhu Liu
, Fan Wang, Adams Wai-Kin Kong:
Probabilistic Deep Ordinal Regression Based on Gaussian Processes. 5300-5308 - Tianlu Wang, Jieyu Zhao, Mark Yatskar, Kai-Wei Chang, Vicente Ordonez
:
Balanced Datasets Are Not Enough: Estimating and Mitigating Gender Bias in Deep Image Representations. 5309-5318 - Pouya Bashivan, Mark Tensen, James J. DiCarlo:
Teacher Guided Architecture Search. 5319-5328
3D From Single View & RGBD
- David Smith, Matthew Loper, Xiaochen Hu, Paris Mavroidis, Javier Romero:
FACSIMILE: Fast and Accurate Scans From an Image in Less Than a Second. 5329-5338 - Yu Rong, Ziwei Liu, Cheng Li, Kaidi Cao, Chen Change Loy:
Delving Deep Into Hybrid Annotations for 3D Human Recovery in the Wild. 5339-5347 - Yu Sun, Yun Ye, Wu Liu, Wenpeng Gao, Yili Fu, Tao Mei
:
Human Mesh Recovery From Monocular Images via a Skeleton-Disentangled Representation. 5348-5357 - Silvia Zuffi
, Angjoo Kanazawa, Tanya Y. Berger-Wolf
, Michael J. Black:
Three-D Safari: Learning to Estimate Zebra Pose, Shape, and Texture From Images "In the Wild". 5358-5367 - Helisa Dhamo, Nassir Navab, Federico Tombari:
Object-Driven Multi-Layer Scene Decomposition From a Single Image. 5368-5377 - Michael Niemeyer, Lars M. Mescheder, Michael Oechsle, Andreas Geiger:
Occupancy Flow: 4D Reconstruction by Learning Particle Dynamics. 5378-5388 - Hou-Ning Hu, Qi-Zhi Cai, Dequan Wang, Ji Lin, Min Sun, Philipp Krähenbühl, Trevor Darrell, Fisher Yu:
Joint Monocular 3D Vehicle Detection and Tracking. 5389-5398
Face & Body
- Bowen Shi, Aurora Martinez Del Rio, Jonathan Keane, Diane Brentari, Greg Shakhnarovich, Karen Livescu
:
Fingerspelling Recognition in the Wild With Iterative Visual Attention. 5399-5408 - Hang Dai, Ling Shao
:
PointAE: Point Auto-Encoder for 3D Statistical Shape and Texture Modelling. 5409-5418 - Bharat Lal Bhatnagar, Garvita Tiwari, Christian Theobalt
, Gerard Pons-Moll:
Multi-Garment Net: Learning to Dress 3D People From Images. 5419-5429 - Haiyong Jiang, Jianfei Cai, Jianmin Zheng
:
Skeleton-Aware 3D Human Shape Reconstruction From Point Clouds. 5430-5440 - Naureen Mahmood, Nima Ghorbani, Nikolaus F. Troje
, Gerard Pons-Moll, Michael J. Black:
AMASS: Archive of Motion Capture As Surface Shapes. 5441-5450 - Fei Wang, Sanping Zhou, Stanislav Panev
, Jinsong Han, Dong Huang:
Person-in-WiFi: Fine-Grained Person Perception Using WiFi. 5451-5460 - Keqiang Sun, Wayne Wu, Tinghao Liu, Shuo Yang, Quan Wang, Qiang Zhou, Zuochang Ye, Chen Qian:
FAB: A Robust Facial Landmark Detection Framework for Motion-Blurred Videos. 5461-5470 - Bong-Nam Kang, Yonghyun Kim, Bongjin Jun, Daijin Kim:
Attentional Feature-Pair Relation Networks for Accurate Face Recognition. 5471-5480
Action & Video
- Brais Martínez, Davide Modolo, Yuanjun Xiong, Joseph Tighe:
Action Recognition With Spatial-Temporal Discriminative Filter Banks. 5481-5490 - Evangelos Kazakos, Arsha Nagrani, Andrew Zisserman, Dima Damen
:
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition. 5491-5500 - Phuc Xuan Nguyen, Deva Ramanan
, Charless C. Fowlkes:
Weakly-Supervised Action Localization With Background Modeling. 5501-5510 - Chenxu Luo, Alan L. Yuille
:
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition. 5511-5520 - Tan Yu, Zhou Ren, Yuncheng Li, Enxu Yan, Ning Xu, Junsong Yuan
:
Temporal Structure Mining for Weakly Supervised Action Detection. 5521-5530 - Mingze Xu, Mingfei Gao, Yi-Ting Chen, Larry Davis, David J. Crandall:
Temporal Recurrent Networks for Online Action Detection. 5531-5540 - Mingfei Gao, Mingze Xu, Larry Davis, Richard Socher, Caiming Xiong:
StartNet: Online Detection of Action Start in Untrimmed Videos. 5541-5550 - Du Tran, Heng Wang, Matt Feiszli, Lorenzo Torresani:
Video Classification With Channel-Separated Convolutional Networks. 5551-5560 - Harshala Gammulle
, Simon Denman, Sridha Sridharan
, Clinton Fookes:
Predicting the Future: A Jointly Learnt Model for Action Anticipation. 5561-5570
Low-Level & Optimization
- Ziyi Shen, Wenguan Wang
, Xiankai Lu, Jianbing Shen, Haibin Ling, Tingfa Xu, Ling Shao
:
Human-Aware Motion Deblurring. 5571-5580 - Lu Zhang, Zhe Lin, Jianming Zhang, Huchuan Lu, You He:
Fast Video Object Segmentation via Dynamic Targeting Network. 5581-5590 - Sean I. Young, Aous Thabit Naman
, Bernd Girod, David Taubman
:
Solving Vision Problems via Filtering. 5591-5600 - Ankit Raj, Yuqi Li, Yoram Bresler
:
GAN-Based Projector for Faster Recovery With Convergence Guarantees in Linear Inverse Problems. 5601-5610 - Deng-Ping Fan
, Shengchuan Zhang, Yu-Huan Wu, Yun Liu, Ming-Ming Cheng
, Bo Ren, Paul L. Rosin, Rongrong Ji:
Scoot: A Perceptual Metric for Facial Sketches. 5611-5621 - Yawei Li
, Shuhang Gu, Luc Van Gool, Radu Timofte
:
Learning Filter Basis for Convolutional Neural Network Compression. 5622-5631 - Daniel Gehrig
, Antonio Loquercio
, Konstantinos G. Derpanis, Davide Scaramuzza
:
End-to-End Learning of Representations for Asynchronous Event-Based Data. 5632-5642 - Guoqing Wang, Changming Sun
, Arcot Sowmya:
ERL-Net: Entangled Representation Learning for Single Image De-Raining. 5643-5651 - Oleg Voynov, Alexey Artemov
, Vage Egiazarian
, Alexandr Notchenko, Gleb Bobrovskikh, Evgeny Burnaev
, Denis Zorin:
Perceptual Deep Depth Super-Resolution. 5652-5662
Scene Understanding
- Iro Armeni, Zhi-Yang He, Amir Zamir, JunYoung Gwak, Jitendra Malik, Martin Fischer
, Silvio Savarese:
3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera. 5663-5672 - Cheng Lin, Changjian Li, Wenping Wang:
Floorplan-Jigsaw: Jointly Estimating Scene Layout and Aligning Partial Scans. 5673-5682 - Wei Yin, Yifan Liu
, Chunhua Shen, Youliang Yan:
Enforcing Geometric Constraints of Virtual Normal for Depth Prediction. 5683-5692 - Tiancai Wang, Rao Muhammad Anwer
, Muhammad Haris Khan
, Fahad Shahbaz Khan
, Yanwei Pang, Ling Shao
, Jorma Laaksonen
:
Deep Contextual Attention for Human-Object Interaction Detection. 5693-5701 - Wenguan Wang
, Zhijie Zhang, Siyuan Qi, Jianbing Shen, Yanwei Pang, Ling Shao
:
Learning Compositional Neural Information Fusion for Human Parsing. 5702-5712 - Anran Zhang, Lei Yue, Jiayi Shen, Fan Zhu, Xiantong Zhen, Xianbin Cao, Ling Shao
:
Attentional Neural Fields for Crowd Counting. 5713-5722 - Lifeng Fan, Wenguan Wang
, Song-Chun Zhu, Xinyu Tang, Siyuan Huang:
Understanding Human Gaze Communication by Spatio-Temporal Graph Reasoning. 5723-5732 - Jean-Baptiste Alayrac, João Carreira, Relja Arandjelovic, Andrew Zisserman:
Controllable Attention for Structured Layered Video Decomposition. 5733-5742 - Lore Goetschalckx, Alex Andonian, Aude Oliva, Phillip Isola:
GANalyze: Toward Visual Definitions of Cognitive Image Properties. 5743-5752
Language & Reasoning
- Zhong Ji, Haoran Wang, Jungong Han, Yanwei Pang:
Saliency-Guided Attention Network for Image-Sentence Matching. 5753-5762 - Zihao Wang, Xihui Liu, Hongsheng Li
, Lu Sheng
, Junjie Yan, Xiaogang Wang, Jing Shao:
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval. 5763-5772 - Yan Huang, Liang Wang:
ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching. 5773-5782 - Mohamed Elhoseiny, Mohamed Elfeki:
Creativity Inspired Zero-Shot Learning. 5783-5792 - Mikihiro Tanaka, Takayuki Itamochi, Kenichi Narioka, Ikuro Sato, Yoshitaka Ushiku
, Tatsuya Harada:
Generating Easy-to-Understand Referring Expressions for Target Identifications. 5793-5802 - Jonatas Wehrmann, Maurício Armani Lopes, Douglas M. Souza, Rodrigo C. Barros
:
Language-Agnostic Visual-Semantic Embeddings. 5803-5812 - Nikolaos Sarafianos, Xiang Xu, Ioannis A. Kakadiaris:
Adversarial Representation Learning for Text-to-Image Matching. 5813-5823 - Peng Gao, Haoxuan You
, Zhanpeng Zhang, Xiaogang Wang, Hongsheng Li
:
Multi-Modality Latent Interaction Network for Visual Question Answering. 5824-5834
3D From Multiview & Sensors
- Axel Barroso Laguna, Edgar Riba, Daniel Ponsa
, Krystian Mikolajczyk:
Key.Net: Keypoint Detection by Handcrafted and Learned CNN Filters. 5835-5843 - Jiahui Zhang, Dawei Sun, Zixin Luo, Anbang Yao, Lei Zhou, Tianwei Shen, Yurong Chen
, Hongen Liao, Long Quan:
Learning Two-View Correspondences and Geometry Using Order-Aware Network. 5844-5853 - Michael Bloesch, Tristan Laidlow, Ronald Clark, Stefan Leutenegger, Andrew J. Davison:
Learning Meshes for Dense Visual SLAM. 5854-5863 - Michael Strecke
, Jörg Stückler:
EM-Fusion: Dynamic Object-Level SLAM With Probabilistic Data Association. 5864-5873 - Jiahui Huang, Sheng Yang, Zishuo Zhao, Yu-Kun Lai, Shimin Hu:
ClusterSLAM: A SLAM Backend for Simultaneous Rigid Body Clustering and Motion Estimation. 5874-5883 - Uttaran Bhattacharya
, Venu Madhav Govindu:
Efficient and Robust Registration on the 3D Special Euclidean Group. 5884-5893 - Yoni Kasten, Amnon Geifman, Meirav Galun, Ronen Basri:
Algebraic Characterization of Essential Matrices and Their Averaging in Multiview Settings. 5894-5902
Image & Video Synthesis
- Wen Liu
, Zhixin Piao, Jie Min, Wenhan Luo
, Lin Ma, Shenghua Gao:
Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis. 5903-5912 - Yu-Jing Lin, Po-Wei Wu, Che-Han Chang, Edward Y. Chang, Shih-Wei Liao:
RelGAN: Multi-Domain Image-to-Image Translation via Relative Attributes. 5913-5921 - Ruizheng Wu, Xin Tao
, Xiaodong Gu, Xiaoyong Shen, Jiaya Jia
:
Attribute-Driven Spontaneous Motion in Unpaired Image Translation. 5922-5931 - Caroline Chan, Shiry Ginosar, Tinghui Zhou, Alexei A. Efros
:
Everybody Dance Now. 5932-5941 - Yulun Zhang
, Chen Fang, Yilin Wang, Zhaowen Wang, Zhe Lin, Yun Fu, Jimei Yang:
Multimodal Style Transfer via Graph Cuts. 5942-5950 - Ming Lu, Hao Zhao, Anbang Yao, Yurong Chen
, Feng Xu, Li Zhang:
A Closed-Form Solution to Universal Style Transfer. 5951-5960 - Jingyuan Li, Fengxiang He, Lefei Zhang, Bo Du, Dacheng Tao:
Progressive Reconstruction of Visual Structure for Image Inpainting. 5961-5970
Oral 3.2A
Recognition, Detection, & Re-Identification
- Samarth Sinha, Sayna Ebrahimi, Trevor Darrell:
Variational Adversarial Active Learning. 5971-5980 - Yang Zou, Zhiding Yu, Xiaofeng Liu, B. V. K. Vijaya Kumar
, Jinsong Wang:
Confidence Regularized Self-Training. 5981-5990 - Serim Ryou, Seong-Gyun Jeong, Pietro Perona:
Anchor Loss: Modulating Loss Scale Based on Prediction Difficulty. 5991-6000 - Chengxu Zhuang, Alex Lin Zhai, Daniel Yamins:
Local Aggregation for Unsupervised Learning of Visual Embeddings. 6001-6011 - Zhennan Wang, Wenbin Zou, Chen Xu:
PR Product: A Substitute for Inner Product in Neural Networks. 6012-6021 - Sangdoo Yun, Dongyoon Han, Sanghyuk Chun, Seong Joon Oh, Youngjoon Yoo, Junsuk Choe
:
CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features. 6022-6031 - Tianfu Wu
, Xi Song:
Towards Interpretable Object Detection by Unfolding Latent Structures. 6032-6042 - Jason Kuen, Federico Perazzi, Zhe Lin, Jianming Zhang, Yap-Peng Tan:
Scaling Object Detection by Transferring Classification Weights. 6043-6052 - Yanghao Li, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang:
Scale-Aware Trident Networks for Object Detection. 6053-6062 - Satoshi Kosugi, Toshihiko Yamasaki, Kiyoharu Aizawa:
Object-Aware Instance Labeling for Weakly Supervised Object Detection. 6063-6071 - Lanlan Liu, Michael Muelly, Jia Deng, Tomas Pfister, Li-Jia Li
:
Generative Modeling for Small-Data Object Detection. 6072-6080 - Shafin Rahman
, Salman H. Khan, Nick Barnes
:
Transductive Learning for Zero-Shot Object Detection. 6081-6090 - Seunghyeon Kim, Jaehoon Choi
, Taekyung Kim
, Changick Kim:
Self-Training and Adversarial Background Regularization for Unsupervised Domain Adaptive One-Stage Object Detection. 6091-6100 - Suichan Li, Dapeng Chen, Bin Liu, Nenghai Yu, Rui Zhao:
Memory-Based Neighbourhood Embedding for Visual Recognition. 6101-6110 - Yang Fu
, Yunchao Wei, Guanshuo Wang, Yuqian Zhou, Honghui Shi, Thomas S. Huang:
Self-Similarity Grouping: A Simple Unsupervised Cross Domain Adaptation Approach for Person Re-Identification. 6111-6120 - Zimo Liu, Jingya Wang, Shaogang Gong, Dacheng Tao, Huchuan Lu:
Deep Reinforcement Active Learning for Human-in-the-Loop Person Re-Identification. 6121-6130 - Pirazh Khorramshahi, Amit Kumar, Neehar Peri, Sai Saketh Rambhatla, Jun-Cheng Chen
, Rama Chellappa:
A Dual-Path Model With Adaptive Attention for Vehicle Re-Identification. 6131-6140 - Zhiheng Ma
, Xing Wei, Xiaopeng Hong, Yihong Gong:
Bayesian Loss for Crowd Count Estimation With Point Supervision. 6141-6150 - Zhi-Qi Cheng
, Jun-Xiu Li, Qi Dai, Xiao Wu, Alexander G. Hauptmann:
Learning Spatial Awareness to Improve Crowd Counting. 6151-6160
Oral 3.2B
Video & Action Understanding
- Peixia Li, Boyu Chen, Wanli Ouyang
, Dong Wang, Xiaoyun Yang, Huchuan Lu:
GradNet: Gradient-Guided Network for Visual Object Tracking. 6161-6170 - Peng Chu, Haibin Ling:
FAMNet: Joint Learning of Feature, Affinity and Multi-Dimensional Assignment for Online Multiple Object Tracking. 6171-6180 - Goutam Bhat, Martin Danelljan, Luc Van Gool, Radu Timofte
:
Learning Discriminative Model Prediction for Tracking. 6181-6190