default search action
WACV 2021: Waikoloa, HI, USA
- IEEE Winter Conference on Applications of Computer Vision, WACV 2021, Waikoloa, HI, USA, January 3-8, 2021. IEEE 2021, ISBN 978-1-6654-0477-8
Human Applications: Faces, Driving, Etc.
- Hao Chen, Benoit Lagadec, François Brémond:
Enhancing Diversity in Teacher-Student Networks via Asymmetric branches for Unsupervised Person Re-identification. 1-10 - Harsimran Kaur, Roberto Manduchi:
Subject Guided Eye Image Synthesis with Application to Gaze Redirection. 11-20 - Siwei Zhang, Zhiwu Huang, Danda Pani Paudel, Luc Van Gool:
Facial Emotion Recognition with Noisy Multi-task Annotations. 21-31 - Yang Liu, Alexandros Neophytou, Sunando Sengupta, Eric Sommerlade:
Relighting Images in the Wild with a Self-Supervised Siamese Auto-Encoder. 32-40 - Alexander Richard, Colin Lea, Shugao Ma, Juergen Gall, Fernando De la Torre, Yaser Sheikh:
Audio- and Gaze-driven Facial Animation of Codec Avatars. 41-50 - Abdelhak Loukkal, Yves Grandvalet, Tom Drummond, You Li:
Driving among Flatmobiles: Bird-Eye-View occupancy grids from a monocular camera for holistic trajectory planning. 51-60 - Varun Ravi Kumar, Marvin Klingner, Senthil Kumar Yogamani, Stefan Milz, Tim Fingscheidt, Patrick Mäder:
SynDistNet: Self-Supervised Monocular Fisheye Camera Distance Estimation Synergized with Semantic Segmentation for Autonomous Driving. 61-71 - Heng Zhang, Élisa Fromont, Sébastien Lefèvre, Bruno Avignon:
Guided Attentive Feature Fusion for Multispectral Pedestrian Detection. 72-80 - Matthew Shere, Hansung Kim, Adrian Hilton:
Temporally Consistent 3D Human Pose Estimation Using Dual 360° Cameras. 81-90 - Okan Köpüklü, Jiapeng Zheng, Hang Xu, Gerhard Rigoll:
Driver Anomaly Detection: A Dataset and Contrastive Learning Approach. 91-100
3D, Domain Adaptation, Video, etc.
- Tobias Ringwald, Rainer Stiefelhagen:
Adaptiope: A Modern Benchmark for Unsupervised Domain Adaptation. 101-110 - Peri Akiva, Matthew Purri, Kristin J. Dana, Beth Tellman, Tyler Anderson:
H2O-Net: Self-Supervised Flood Segmentation via Adversarial Domain Adaptation and Label Refinement. 111-122 - Idan Achituve, Haggai Maron, Gal Chechik:
Self-Supervised Learning for Domain Adaptation on Point Clouds. 123-133 - Zhangsihao Yang, Or Litany, Tolga Birdal, Srinath Sridhar, Leonidas J. Guibas:
Continuous Geodesic Convolutions for Learning on 3D Shapes. 134-144 - Lê Minh Ngô, Wei Wang, Burak Mandira, Sezer Karaoglu, Henri Bouma, Hamdi Dibeklioglu, Theo Gevers:
Identity Unbiased Deception Detection by 2D-to-3D Face Reconstruction. 145-154 - Yang Wang, Gedas Bertasius, Tae-Hyun Oh, Abhinav Gupta, Minh Hoai, Lorenzo Torresani:
Supervoxel Attention Graphs for Long-Range Video Modeling. 155-166 - Xiang Hao, Kripa Chettiar, Ben Cheung, Vernon Germano, Raffay Hamid:
Intro and Recap Detection for Movies and TV Series. 167-176 - Rob Romijnders, Aravindh Mahendran, Michael Tschannen, Josip Djolonga, Marvin Ritter, Neil Houlsby, Mario Lucic:
Representation learning from videos in-the-wild: An object-centric approach. 177-187 - Gil Ben-Artzi:
Separable Four Points Fundamental Matrix. 188-196 - René Schuster, Oliver Wasenmüller, Christian Unger, Didier Stricker:
SSGP: Sparse Spatial Guided Propagation for Robust and Generic Interpolation. 197-206
Synthesis, Reconstruction, Recognition, Learning
- Jacob Shermeyer, Thomas Hossler, Adam Van Etten, Daniel Hogan, Ryan Lewis, Daeil Kim:
RarePlanes: Synthetic Data Takes Flight. 207-217 - Abhijith Punnappurath, Michael S. Brown:
Spatially Aware Metadata for Raw Reconstruction. 218-226 - Yash Patel, Srikar Appalaraju, R. Manmatha:
Saliency Driven Perceptual Image Compression. 227-236 - Jing Yu Koh, Jason Baldridge, Honglak Lee, Yinfei Yang:
Text-to-Image Generation Grounded by Fine-Grained User Attention. 237-246 - René Schuster, Christian Unger, Didier Stricker:
A Deep Temporal Fusion Framework for Scene Flow Using a Learnable Motion Model and Occlusions. 247-255 - David Peer, Sebastian Stabinger, Antonio Jose Rodríguez-Sánchez:
Conflicting Bundles: Adapting Architectures Towards the Improved Training of Deep Neural Networks. 256-265 - Jinglun Feng, Liang Yang, Haiyan Wang, Yingli Tian, Jizhong Xiao:
Subsurface Pipes Detection Using DNN-based Back Projection on GPR Data. 266-275 - Daniel Stanley Tan, Yi-Chun Chen, Trista Pei-Chun Chen, Wei-Chao Chen:
TrustMAE: A Noise-Resilient Defect Classification Framework using Memory-Augmented Auto-Encoders with Trust Regions. 276-285 - Dvir Samuel, Yuval Atzmon, Gal Chechik:
From generalized zero-shot learning to long-tail with class descriptors. 286-295 - Zeqian Li, Michael Mozer, Jacob Whitehill:
Compositional Embeddings for Multi-Label One-Shot Learning. 296-304
Segmentation, Image Manipulation, Image Processing
- Jun Hao Liew, Scott Cohen, Brian L. Price, Long Mai, Jiashi Feng:
Deep Interactive Thin Object Selection. 305-314 - Kratarth Goel, Praveen Srinivasan, Sarah Tariq, James Philbin:
QuadroNet: Multi-Task Learning for Real-Time Semantic Depth Aware Instance Segmentation. 315-324 - Tianyu Ma, Hang Zhang, Hanley Ong, Amar Vora, Thanh D. Nguyen, Ajay Gupta, Yi Wang, Mert R. Sabuncu:
Ensembling Low Precision Models for Binary Biomedical Image Segmentation. 325-334 - Anqi Yang, Feng Pan, Vishwanath Saragadam, Duy Dao, Zhuo Hui, Jen-Hao Rick Chang, Aswin C. Sankaranarayanan:
SliceNets - A Scalable Approach for Object Detection in 3D CT Scans. 335-344 - Zichen Liu, Jun Hao Liew, Xiangyu Chen, Jiashi Feng:
DANCE : A Deep Attentive Contour Model for Efficient Instance Segmentation. 345-354 - Weimin Chen, Yuqing Ma, Xianglong Liu, Yi Yuan:
Hierarchical Generative Adversarial Networks for Single Image Super-Resolution. 355-364 - He Zhang, Jianming Zhang, Federico Perazzi, Zhe Lin, Vishal M. Patel:
Deep Image Compositing. 365-374 - Myung-Joon Kwon, In-Jae Yu, Seung-Hun Nam, Heung-Kyu Lee:
CAT-Net: Compression Artifact Tracing Network for Detection and Localization of Image Splicing. 375-384 - Chang Liu, Henghui Ding, Xudong Jiang:
Towards Enhancing Fine-grained Details for Image Matting. 385-393 - Mahdiar Nekoui, Fidel Omar Tito Cruz, Li Cheng:
EAGLE-Eye: Extreme-pose Action Grader using detaiL bird's-Eye view. 394-402 - Joshua D. Rego, Karthik Kulkarni, Suren Jayasuriya:
Robust Lensless Image Reconstruction via PSF Estimation. 403-412 - Aditya Mehta, Harsh Sinha, Murari Mandal, Pratik Narang:
Domain-Aware Unsupervised Hyperspectral Reconstruction for Aerial Image Dehazing. 413-422 - Thomas Hartley, Kirill A. Sidorov, Christopher Willis, A. David Marshall:
SWAG: Superpixels Weighted by Average Gradients for Explanations of CNNs. 423-432 - Chenhao Li, Yuta Taniguchi, Min Lu, Shin'ichi Konomi:
Few-shot Font Style Transfer between Different Languages. 433-442 - Tunai Porto Marques, Alexandra Branzan Albu, Patrick O'Hara, Norma Serra, Ben Morrow, Lauren McWhinnie, Rosaline Canessa:
Size-invariant Detection of Marine Vessels From Visual Time Series. 443-453
Domain Adaptation, Saliency, Segmentation, Captioning, Tracking, Image Processing
- Tongxin Wang, Zhengming Ding, Wei Shao, Haixu Tang, Kun Huang:
Towards Fair Cross-Domain Adaptation via Generative Learning. 454-463 - Pengfei Fang, Pan Ji, Lars Petersson, Mehrtash Harandi:
Set Augmented Triplet Loss for Video Person Re-Identification. 464-473 - Hao-Wei Yeh, Baoyao Yang, Pong C. Yuen, Tatsuya Harada:
SoFA: Source-data-free Feature Alignment for Unsupervised Domain Adaptation. 474-483 - Yifeng Zhang, Ming Jiang, Qi Zhao:
Saliency Prediction with External Knowledge. 484-493 - Philipp Benz, Chaoning Zhang, Adil Karjauv, In So Kweon:
Revisiting Batch Normalization for Improving Corruption Robustness. 494-503 - Yizhou Wang, Zhongyu Jiang, Xiangyu Gao, Jenq-Neng Hwang, Guanbin Xing, Hui Liu:
RODNet: Radar Object Detection using Cross-Modal Supervision. 504-513 - Jinyu Yang, Weizhi An, Chaochao Yan, Peilin Zhao, Junzhou Huang:
Context-Aware Domain Adaptation in Semantic Segmentation. 514-524 - Haochen Wang, Yandan Yang, Xianbin Cao, Xiantong Zhen, Cees Snoek, Ling Shao:
Variational Prototype Inference for Few-Shot Semantic Segmentation. 525-534 - Laura Sevilla-Lara, Shengxin Zha, Zhicheng Yan, Vedanuj Goswami, Matt Feiszli, Lorenzo Torresani:
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling. 535-544 - Xianyu Chen, Ming Jiang, Qi Zhao:
Self-Distillation for Few-Shot Image Captioning. 545-555 - Camilo Pestana, Wei Liu, David G. Glance, Ajmal Mian:
Defense-friendly Images in Adversarial Attacks: Dataset and Metrics for Perturbation Difficulty. 556-565 - Heng Fan, Haibin Ling:
MART: Motion-Aware Recurrent Neural Network for Robust Visual Tracking. 566-575 - Badri N. Patro, Mayank Lunayach, Deepankar Srivastava, Sarvesh, Hunar Singh, Vinay P. Namboodiri:
Multimodal Humor Dataset: Predicting Laughter tracks for Sitcoms. 576-585 - Ge Liu, Linglan Zhao, Wei Li, Dashan Guo, Xiangzhong Fang:
Class-wise Metric Scaling for Improved Few-Shot Classification. 586-595 - Jinsoo Choi, Jaesik Park, In So Kweon:
High-quality Frame Interpolation via Tridirectional Inference. 596-604
Domain Adaptation, Representation, Visual Analytics, Uncertainty and Attention
- Taotao Jing, Zhengming Ding:
Adversarial Dual Distinct Classifiers for Unsupervised Domain Adaptation. 605-614 - Vinod K. Kurmi, Venkatesh K. Subramanian, Vinay P. Namboodiri:
Domain Impression: A Source Data Free Domain Adaptation Method. 615-625 - Yandan Yang, Lu Sheng, Xiaolong Jiang, Haochen Wang, Dong Xu, Xianbin Cao:
IncreACO: Incrementally Learned Automatic Check-out with Photorealistic Exemplar Augmentation. 626-634 - Youshan Zhang, Hui Ye, Brian D. Davison:
Adversarial Reinforcement Learning for Unsupervised Domain Adaptation. 635-644 - Or Litany, Ari Morcos, Srinath Sridhar, Leonidas J. Guibas, Judy Hoffman:
Representation Learning Through Latent Canonicalizations. 645-654 - Wenhu Chen, Zhe Gan, Linjie Li, Yu Cheng, William Yang Wang, Jingjing Liu:
Meta Module Network for Compositional Visual Reasoning. 655-664 - Jianan Wang, Boyang Li, Xiangyu Fan, Jing Lin, Yanwei Fu:
Data-efficient Alignment of Multimodal Sequences by Aligning Gradient Updates and Internal Feature Distributions. 665-675 - Olga Moskvyak, Frédéric Maire, Feras Dayoub, Mahsa Baktashmotlagh:
Keypoint-Aligned Embeddings for Image Retrieval and Re-identification. 676-685 - Hao Guo, Brian Dolhansky, Eric Hsin, Phong Dinh, Cristian Canton-Ferrer, Song Wang:
Deep Poisoning: Towards Robust Image Data Sharing against Visual Disclosure. 686-696 - Xinyi Zheng, Douglas Burdick, Lucian Popa, Xu Zhong, Nancy Xin Ru Wang:
Global Table Extractor (GTE): A Framework for Joint Table Identification and Cell Structure Recognition Using Visual Context. 697-706 - Yichen Shen, Zhilu Zhang, Mert R. Sabuncu, Lin Sun:
Real-Time Uncertainty Estimation in Computer Vision via Uncertainty-Aware Distribution Distillation. 707-716 - Saurabh Satish Desai, Stefan Lee:
Auxiliary Tasks for Efficient Learning of Point-Goal Navigation. 717-725 - Badri N. Patro, G. S. Kasturi, Ansh Jain, Vinay P. Namboodiri:
Self Supervision for Attention Networks. 726-735 - Vinod K. Kurmi, Badri N. Patro, Venkatesh K. Subramanian, Vinay P. Namboodiri:
Do not Forget to Attend to Uncertainty while Mitigating Catastrophic Forgetting. 736-745 - Jeya Maria Jose Valanarasu, Vishal M. Patel:
Overcomplete Deep Subspace Clustering Networks. 746-755
Rectification and Tracking, 3D and Action, Motion and Tracking
- Sijie Zhu, Taojiannan Yang, Chen Chen:
Revisiting Street-to-Aerial View Image Geo-localization and Orientation Estimation. 756-765 - Michal Uricár, Ganesh Sistu, Hazem Rashed, Antonín Vobecký, Varun Ravi Kumar, Pavel Krízek, Fabian Bürger, Senthil Kumar Yogamani:
Let's Get Dirty: GAN Based Data Augmentation for Camera Lens Soiling Detection in Autonomous Driving. 766-775 - Luis Bermudez, Nadine L. Dabby, Yingxi Adelle Lin, Sara Hilmarsdottir, Narayan Sundararajan, Swarnendu Kar:
A Learning-Based Approach to Parametric Rotoscoping of Multi-Shape Systems. 776-785 - Pranav Verma, Dominique E. Meyer, Hanyang Xu, Falko Kuester:
Splatty- A Unified Image Demosaicing and Rectification Method. 786-795 - Hung Tran, Vuong Le, Truyen Tran:
Goal-driven Long-Term Trajectory Prediction. 796-805 - Rodrigo Santa Cruz, Léo Lebrat, Pierrick Bourgeat, Clinton Fookes, Jurgen Fripp, Olivier Salvado:
DeepCSR: A 3D Deep Learning Approach for Cortical Surface Reconstruction. 806-815 - Yu Lin, Yigong Wang, Yifan Li, Yang Gao, Zhuoyi Wang, Latifur Khan:
Attention-Based Spatial Guidance for Image-to-Image Translation. 816-825 - Chenxi Xiao, Juan P. Wachs:
Triangle-Net: Towards Robustness in Point Cloud Learning. 826-835 - Liangjian Chen, Shih-Yao Lin, Yusheng Xie, Yen-Yu Lin, Xiaohui Xie:
MVHM: A Large-Scale Multi-View Hand Mesh Benchmark for Accurate 3D Hand Pose Estimation. 836-845 - Yizhak Ben-Shabat, Xin Yu, Fatemeh Sadat Saleh, Dylan Campbell, Cristian Rodriguez Opazo, Hongdong Li, Stephen Gould:
The IKEA ASM Dataset: Understanding People Assembling Furniture through Actions, Objects and Pose. 846-858 - Kakani Katija, Paul L. D. Roberts, Joost Daniels, Alexandra Lapides, Kevin Barnard, Mike Risi, Ben Y. Ranaan, Benjamin G. Woodward, Jonathan Takahashi:
Visual tracking of deepwater animals using machine learning-controlled robotic underwater vehicles. 859-868 - Shuo-Diao Yang, Hung-Ting Su, Winston H. Hsu, Wen-Chin Chen:
Class-agnostic Few-shot Object Counting. 869-877 - Neeraj Battan, Yudhik Agrawal, Sai Soorya Rao, Aman Goel, Avinash Sharma:
GlocalNet: Class-aware Long-term Human Motion Synthesis. 878-887
Detection and Recognition, Segmentation and Tracking, Low-Level Vision
- Kai Yang, Zihao Xu, Jingjing Fei:
DualSANet: Dual Spatial Attention Network for Iris Recognition. 888-896 - Jongmin Lee, Yoonwoo Jeong, Seungwook Kim, Juhong Min, Minsu Cho:
Learning to Distill Convolutional Features into Compact Local Descriptors. 897-907 - Yanguang Bi, Zhiqiang Hu:
Disentangled Contour Learning for Quadrilateral Text Detection. 908-917 - Ayush Jaiswal, Yue Wu, Pradeep Natarajan, Premkumar Natarajan:
Class-agnostic Object Detection. 918-927 - Myungchul Kim, Sanghyun Woo, Dahun Kim, In So Kweon:
The Devil is in the Boundary: Exploiting Boundary Representation for Basis-based Instance Segmentation. 928-937 - Hao Tang, Xingwei Liu, Kun Han, Xiaohui Xie, Xuming Chen, Qian Huang, Yong Liu, Shanlin Sun, Narisu Bai:
Spatial Context-Aware Self-Attention Model For Multi-Organ Segmentation. 938-948 - Yimian Dai, Yiquan Wu, Fei Zhou, Kobus Barnard:
Asymmetric Contextual Modulation for Infrared Small Target Detection. 949-958 - Wei He, Meiqing Wu, Mingfu Liang, Siew-Kei Lam:
CAP: Context-Aware Pruning for Semantic Segmentation. 959-968 - Heng Fan, Fan Yang, Peng Chu, Yuewei Lin, Lin Yuan, Haibin Ling:
TracKlinic: Diagnosis of Challenge Factors in Visual Tracking. 969-978 - Mehrdad Hosseinzadeh, Yang Wang:
Video Captioning of Future Frames. 979-988 - Alireza Shafaei, James J. Little, Mark Schmidt:
AutoRetouch: Automatic Professional Face Retouching. 989-997 - Satish Kumar, A. S. M. Iftekhar, Michael Goebel, Tom Bullock, Mary H. MacLean, Michael B. Miller, Tyler Santander, Barry Giesbrecht, Scott T. Grafton, B. S. Manjunath:
StressNet: Detecting Stress in Thermal Videos. 998-1008 - Sadbhavana Babar, Sukhendu Das:
Where to Look?: Mining Complementary Image Regions for Weakly Supervised Object Localization. 1009-1018 - Jaedong Hwang, Seohyun Kim, Jeany Son, Bohyung Han:
Weakly Supervised Instance Segmentation by Deep Community Learning. 1019-1028 - Kangning Liu, Shuhang Gu, Andrés Romero, Radu Timofte:
Unsupervised Multimodal Video-to-Video Translation via Self-Supervised Learning. 1029-1039
3D, Video Processsing, Detection and Recognition
- Arwen Bradley, Jason Klivington, Joseph Triscari, Rudolph van der Merwe:
Cinematic-L1 Video Stabilization with a Log-Homography Model. 1040-1048 - Liangjian Chen, Shih-Yao Lin, Yusheng Xie, Yen-Yu Lin, Xiaohui Xie:
Temporal-Aware Self-Supervised Learning for 3D Hand Pose and Mesh Estimation in Videos. 1049-1058 - Kellie Corona, Katie Osterdahl, Roderic Collins, Anthony Hoogs:
MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity Detection. 1059-1067 - Kyle Min, Jason J. Corso:
Integrating Human Gaze into Attention for Egocentric Activity Recognition. 1068-1077 - Cristian Rodriguez Opazo, Edison Marrese-Taylor, Basura Fernando, Hongdong Li, Stephen Gould:
DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video. 1078-1087 - Xide Xia, Tianfan Xue, Wei-Sheng Lai, Zheng Sun, Abby Chang, Brian Kulis, Jiawen Chen:
Real-time Localized Photorealistic Video Style Transfer. 1088-1097 - Simon Niklaus, Long Mai, Oliver Wang:
Revisiting Adaptive Convolutions for Video Frame Interpolation. 1098-1108 - Longlong Jing, Toufiq Parag, Zhe Wu, Yingli Tian, Hongcheng Wang:
VideoSSL: Semi-Supervised Learning for Video Classification. 1109-1118 - Zhenqiang Li, Weimin Wang, Zuoyue Li, Yifei Huang, Yoichi Sato:
Towards Visually Explaining Video Understanding Networks with Perturbation. 1119-1128 - Shaojie Wang, Wentian Zhao, Ziyi Kou, Jing Shi, Chenliang Xu:
How to Make a BLT Sandwich? Learning VQA towards Understanding Web Instructional Videos. 1129-1138 - Muhammad Umer Anwaar, Egor Labintcev, Martin Kleinsteuber:
Compositional Learning of Image-Text Query for Image Retrieval. 1139-1148 - Ahmed-Shehab Khan, Zhiyuan Li, Jie Cai, Yan Tong:
Regional Attention Networks with Context-aware Fusion for Group Emotion Recognition. 1149-1158 - Yuqi Gong, Xuehui Yu, Yao Ding, Xiaoke Peng, Jian Zhao, Zhenjun Han:
Effective Fusion Factor in FPN for Tiny Object Detection. 1159-1167 - Xinyue Zhang, Jiahao Ding, Maoqiang Wu, Stephen T. C. Wong, Hien Van Nguyen, Miao Pan:
Adaptive Privacy Preserving Deep Learning Algorithms for Medical Data. 1168-1177 - Ajian Liu, Zichang Tan, Jun Wan, Sergio Escalera, Guodong Guo, Stan Z. Li:
CASIA-SURF CeFA: A Benchmark for Multi-modal Cross-ethnicity Face Anti-spoofing. 1178-1186
Face, Head, Action, GANs
- Zhiwen Cao, Zongcheng Chu, Dongfang Liu, Yingjie Victor Chen:
A Vector-based Representation to Enhance Head Pose Estimation. 1187-1196 - Bo Zhao, Shixiang Tang, Dapeng Chen, Hakan Bilen, Rui Zhao:
Continual Representation Learning for Biometric Identification. 1197-1207 - Jia-Ren Chang, Yong-Sheng Chen:
Exploiting Spatial Relation for Reducing Distortion in Style Transfer. 1208-1216 - François Robert Hogan, Michael Jenkin, Sahand Rezaei-Shoshtari, Yogesh A. Girdhar, David Meger, Gregory Dudek:
Seeing Through your Skin: Recognizing Objects with a Novel Visuotactile Sensor. 1217-1226 - Suresh Kirthi Kumaraswamy, Miaojing Shi, Ewa Kijak:
Detecting Human-Object Interaction with Mixed Supervision. 1227-1236 - Rosaura G. VidalMata, Walter J. Scheirer, Anna Kukleva, David D. Cox, Hilde Kuehne:
Joint Visual-Temporal Embedding for Unsupervised Learning of Actions in Untrimmed Sequences. 1237-1246 - Koichiro Niinuma, Itir Önal Ertugrul, Jeffrey F. Cohn, László A. Jeni:
Synthetic Expressions are Better Than Real for Learning to Detect Facial Actions. 1247-1256 - Iuliia Kotseruba, Amir Rasouli, John K. Tsotsos:
Benchmark for Evaluating Pedestrian Action Prediction. 1257-1267 - Guillaume Vaudaux-Ruth, Adrien Chan-Hon-Tong, Catherine Achard:
SALAD: Self-Assessment Learning for Action Detection. 1268-1277 - Zachary Wharton, Ardhendu Behera, Yonghuai Liu, Nik Bessis:
Coarse Temporal Attention Network (CTA-Net) for Driver's Activity Recognition. 1278-1288 - Ilya Kavalerov, Wojciech Czaja, Rama Chellappa:
A Multi-Class Hinge Loss for Conditional GANs. 1289-1298 - Tobias Hinz, Matthew Fisher, Oliver Wang, Stefan Wermter:
Improved Techniques for Training Single-Image GANs. 1299-1308 - Rajat Arora, Yong Jae Lee:
SinGAN-GIF: Learning a Generative Video Model from a Single GIF. 1309-1318 - Patrick Tinsley, Adam Czajka, Patrick J. Flynn:
This Face Does Not Exist... But It Might Be Yours! Identity Leakage in Generative Models. 1319-1327 - Soumya Tripathy, Juho Kannala, Esa Rahtu:
FACEGAN: Facial Attribute Controllable rEenactment GAN. 1328-1337
Learning
- Le Thanh Nguyen-Meidine, Atif Belal, Madhu Kiran, Jose Dolz, Louis-Antoine Blais-Morin, Eric Granger:
Unsupervised Multi-Target Domain Adaptation Through Knowledge Distillation. 1338-1346 - Vivek Sharma, Naila Murray, Diane Larlus, M. Saquib Sarfraz, Rainer Stiefelhagen, Gabriela Csurka:
Unsupervised Meta-Domain Adaptation for Fashion Retrieval. 1347-1356 - Marco Toldo, Umberto Michieli, Pietro Zanuttigh:
Unsupervised Domain Adaptation in Semantic Segmentation via Orthogonal and Clustered Embeddings. 1357-1367 - Viktor Olsson, Wilhelm Tranheden, Juliano Pinto, Lennart Svensson:
ClassMix: Segmentation-Based Data Augmentation for Semi-Supervised Learning. 1368-1377 - Wilhelm Tranheden, Viktor Olsson, Juliano Pinto, Lennart Svensson:
DACS: Domain Adaptation via Cross-domain Mixed Sampling. 1378-1388 - An Zhao, Mingyu Ding, Zhiwu Lu, Tao Xiang, Yulei Niu, Jiechao Guan, Ji-Rong Wen:
Domain-Adaptive Few-Shot Learning. 1389-1398 - Tal Ridnik, Hussam Lawen, Asaf Noy, Emanuel Ben Baruch, Gilad Sharir, Itamar Friedman:
TResNet: High Performance GPU-Dedicated Architecture. 1399-1408 - Kumara Kahatapitiya, Ranga Rodrigo:
Exploiting the Redundancy in Convolutional Filters for Parameter Reduction. 1409-1419 - Artur Jordão, Maiko M. I. Lie, Victor Hugo Cunha de Melo, William Robson Schwartz:
Covariance-free Partial Least Squares: An Incremental Dimensionality Reduction Method. 1420-1428 - Gaurav Kumar Nayak, Konda Reddy Mopuri, Anirban Chakraborty:
Effectiveness of Arbitrary Transfer Sets for Data-free Knowledge Distillation. 1429-1437 - Joseph Bethge, Christian Bartz, Haojin Yang, Ying Chen, Christoph Meinel:
MeliusNet: An Improved Network Architecture for Binary Neural Networks. 1438-1447 - Dóra Babicz, Soma Kontár, Márk Peto, András Fülöp, Gergely Szabó, András Horváth:
Receptive Field Size Optimization with Continuous Time Pooling. 1448-1457 - Steve Dias Da Cruz, Bertram Taetz, Thomas Stifter, Didier Stricker:
Illumination Normalization by Partially Impossible Encoder-Decoder Cost Function. 1458-1467 - Rick Groenendijk, Sezer Karaoglu, Theo Gevers, Thomas Mensink:
Multi-Loss Weighting with Coefficient of Variations. 1468-1477 - Byungju Kim, Hyeong Gwon Hong, Junmo Kim:
De-biasing Neural Networks with Estimated Offset for Class Imbalanced Learning. 1478-1486
Objects, Detection, Segmentation
- Siavash Khodadadeh, Saeid Motiian, Zhe Lin, Ladislau Bölöni, Shabnam Ghadar:
Automatic Object Recoloring Using Adversarial Learning. 1487-1495 - Xiaowen Ying, Xin Li, Mooi Choo Chuah:
Weakly-supervised Object Representation Learning for Few-shot Semantic Segmentation. 1496-1505 - Jean-Philippe Mercier, Mathieu Garon, Philippe Giguère, Jean-François Lalonde:
Deep Template-based Object Instance Detection. 1506-1515 - Ikki Kishida, Hong Chen, Masaki Baba, Jiren Jin, Ayako Amma, Hideki Nakayama:
Object Recognition with Continual Open Set Domain Adaptation for Home Robot. 1516-1525 - Ramin Nabati, Hairong Qi:
CenterFusion: Center-based Radar and Camera Fusion for 3D Object Detection. 1526-1535 - Abeba Birhane, Vinay Uday Prabhu:
Large image datasets: A pyrrhic win for computer vision? 1536-1546 - Kimmo Kärkkäinen, Jungseock Joo:
FairFace: Face Attribute Dataset for Balanced Race, Gender, and Age for Bias Measurement and Mitigation. 1547-1557 - Domenick Poster, Matthew Thielke, Robert Nguyen, Srinivasan Rajaraman, Xing Di, Cedric Nimpa Fondje, Vishal M. Patel, Nathaniel J. Short, Benjamin S. Riggan, Nasser M. Nasrabadi, Shuowen Hu:
A Large-Scale, Time-Synchronized Visible and Thermal Face Dataset. 1558-1567 - Francesco Ragusa, Antonino Furnari, Salvatore Livatino, Giovanni Maria Farinella:
The MECCANO Dataset: Understanding Human-Object Interactions from Egocentric Videos in an Industrial-like Domain. 1568-1577 - Hoang-An Le, Thomas Mensink, Partha Das, Sezer Karaoglu, Theo Gevers:
EDEN: Multimodal Synthetic Dataset of Enclosed GarDEN Scenes. 1578-1588 - Mohammad Saeed Rad, Thomas Yu, Claudiu Musat, Hazim Kemal Ekenel, Behzad Bozorgtabar, Jean-Philippe Thiran:
Benefiting from Bicubically Down-Sampled Images for Learning Real-World Image Super-Resolution. 1589-1598 - Prasan A. Shedligeri, Anupama S, Kaushik Mitra:
A Unified Framework for Compressive Video Recovery from Coded Exposure Techniques. 1599-1608 - Sebastian Lutz, Aljosa Smolic:
Foreground color prediction through inverse compositing. 1609-1618 - Konstantin Sofiiuk, Polina Popenova, Anton Konushin:
Foreground-aware Semantic Representations for Image Harmonization. 1619-1628 - Mohammad Emad, Maurice Peemen, Henk Corporaal:
DualSR: Zero-Shot Dual Learning for Real-World Super-Resolution. 1629-1638
Motion, Classification, Recognition
- Sandro Hauri, Nemanja Djuric, Vladan Radosavljevic, Slobodan Vucetic:
Multi-Modal Trajectory Prediction of NBA Players. 1639-1648 - Davide Modolo, Bing Shuai, Rahul Rama Varior, Joseph Tighe:
Understanding the impact of mistakes on background regions in crowd counting. 1649-1658 - Matthew Moynihan, Susana Ruano, Rafael Pagés, Aljosa Smolic:
Autonomous Tracking For Volumetric Video Sequences. 1659-1668 - Nadine Behrmann, Juergen Gall, Mehdi Noroozi:
Unsupervised Video Representation Learning by Bidirectional Feature Prediction. 1669-1678 - Shubhika Garg, Vidit Goel:
Mask Selection and Propagation for Unsupervised Video Object Segmentation. 1679-1689 - Saypraseuth Mounsaveng, Issam H. Laradji, Ismail Ben Ayed, David Vázquez, Marco Pedersoli:
Learning Data Augmentation with Online Bilevel Optimization for Image Classification. 1690-1699 - Mert Kilickaya, Arnold W. M. Smeulders:
Structured Visual Search via Composition-aware Learning. 1700-1709 - Heng Zhao, Kim-Hui Yap, Alex ChiChung Kot:
Fusion Learning using Semantics and Graph Convolutional Network for Visual Food Recognition. 1710-1719 - Dawid Rymarczyk, Adriana Borowa, Jacek Tabor, Bartosz Zielinski:
Kernel Self-Attention for Weakly-supervised Image Classification using Deep Multiple Instance Learning. 1720-1729 - Sobhan Soleymani, Ali Dabouei, Fariborz Taherkhani, Jeremy M. Dawson, Nasser M. Nasrabadi:
Mutual Information Maximization on Disentangled Representations for Differential Morph Detection. 1730-1740 - Shujon Naha, Qingyang Xiao, Prianka Banik, Md. Alimoor Reza, David J. Crandall:
Part Segmentation of Unseen Objects using Keypoint Guidance. 1741-1749 - Marcus Valtonen Örnhag, Patrik Persson, Mårten Wadenbäck, Kalle Åström, Anders Heyden:
Efficient Real-Time Radial Distortion Correction for UAVs. 1750-1759
3D and Pose
- Fangwen Shu, Paul Lesur, Yaxu Xie, Alain Pagani, Didier Stricker:
SLAM in the Field: An Evaluation of Monocular Mapping and Localization on Challenging Dynamic Agricultural Environment. 1760-1770 - Yahui Zhang, Shaodi You, Theo Gevers:
Automatic Calibration of the Fisheye Camera for Egocentric 3D Human Pose Estimation from a Single Image. 1771-1780 - Shrutimoy Das, Siddhant Katyan, Pawan Kumar:
A Deflation based Fast and Robust Preconditioner for Bundle Adjustment. 1781-1788 - Jacek Komorowski:
MinkLoc3D: Point Cloud Based Large-Scale Place Recognition. 1789-1798 - Yara Ali Alnaggar, Mohamed Afifi, Karim Amer, Mohamed ElHelw:
Multi Projection Fusion for Real-time Semantic Segmentation of 3D LiDAR Point Clouds. 1799-1808 - Sergey Prokudin, Michael J. Black, Javier Romero:
SMPLpix: Neural Avatars from 3D Human Models. 1809-1818 - Matthias Domnik, Pedro F. Proença, Jeff Delaune, Jörg Thiem, Roland Brockers:
Dense 3D-Reconstruction from Monocular Image Sequences for Computationally Constrained UAS∗. 1819-1827 - Stefan Lionar, Daniil Emtsev, Dusan Svilarkovic, Songyou Peng:
Dynamic Plane Convolutional Occupancy Networks. 1828-1837 - Sohee Kim Park, Minh Hoai, Arani Bhattacharya, Samir R. Das:
Adaptive Streaming of 360-Degree Videos with Reinforcement Learning. 1838-1847 - Lars Haalck, Benjamin Risse:
Embedded Dense Camera Trajectories in Multi-Video Image Mosaics by Geodesic Interpolation-based Reintegration. 1848-1857 - Alexander Mathis, Thomas Biasi, Steffen Schneider, Mert Yüksekgönül, Byron Rogers, Matthias Bethge, Mackenzie W. Mathis:
Pretraining boosts out-of-domain robustness for pose estimation. 1858-1867 - Ruslan Rakhimov, Emil Bogomolov, Alexandr Notchenko, Fung Mao, Alexey Artemov, Denis Zorin, Evgeny Burnaev:
Making DensePose fast and light. 1868-1876 - Meghal Dani, Karan Narain, Ramya Hebbalaguppe:
3DPoseLite: A Compact 3D Pose Estimation Using Node Embeddings. 1877-1886 - Zhongguo Li, Magnus Oskarsson, Anders Heyden:
3D Human Pose and Shape Estimation Through Collaborative Learning and Multi-view Model-fitting. 1887-1896 - Gabriel Moreira, Manuel Marques, João Paulo Costeira:
Fast Pose Graph Optimization via Krylov-Schur and Cholesky Factorization. 1897-1905
Applications
- Marco Rudolph, Bastian Wandt, Bodo Rosenhahn:
Same Same But DifferNet: Semi-Supervised Defect Detection with Normalizing Flows. 1906-1915 - Junyu Luo, Zekun Li, Jinpeng Wang, Chin-Yew Lin:
ChartOCR: Data Extraction from Charts Images via a Deep Hybrid Framework. 1916-1924 - Sindhu B. Hegde, K. R. Prajwal, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar:
Visual Speech Enhancement Without A Real Visual Stream. 1925-1934 - Xiaohan Nie, Shixing Chen, Raffay Hamid:
A Robust and Efficient Framework for Sports-Field Registration. 1935-1943 - Trevor Seets, Atul Ingle, Martin Laurenzis, Andreas Velten:
Motion Adaptive Deblurring with Single-Photon Cameras. 1944-1953 - Gal Sadeh Kenigsfield, Ran El-Yaniv:
TranstextNet: Transducing Text for Recognizing Unseen Visual Relationships. 1954-1963 - Kanish Garg, Swati Bhugra, Brejesh Lall:
Automatic Quantification of Plant Disease from Field Image Data Using Deep Learning. 1964-1971 - Loc Trinh, Michael Tsang, Sirisha Rambhatla, Yan Liu:
Interpretable and Trustworthy Deepfake Detection via Dynamic Prototypes. 1972-1982 - Mohsen Jafarzadeh, Touqeer Ahmad, Akshay Raj Dhamija, Chunchun Li, Steve Cruz, Terrance E. Boult:
Automatic Open-World Reliability Assessment. 1983-1992 - Tobias Nickchen, Stefan Heindorf, Gregor Engels:
Generating Physically Sound Training Data for Image Recognition of Additively Manufactured Parts. 1993-2001 - Masoud PourReza, Bahram Mohammadi, Mostafa Khaki, Samir Bouindour, Hichem Snoussi, Mohammad Sabokrou:
G2D: Generate to Detect Anomaly. 2002-2011 - Gonçalo Mordido, Julian Niedermeier, Christoph Meinel:
Assessing Image and Text Generation with Topological Analysis and Fuzzy Logic. 2012-2021 - Xiaoyu Zhu, Junwei Liang, Alexander G. Hauptmann:
MSNet: A Multilevel Instance Segmentation Network for Natural Disaster Damage Assessment in Aerial Videos. 2022-2031 - Ya-Chu Chang, Chia-Ni Lu, Chia-Chi Cheng, Wei-Chen Chiu:
Single Image Reflection Removal with Edge Guidance, Reflection Classifier, and Recurrent Decomposition. 2032-2041 - Adwaye Rambojun, William Tillett, Tony Shardlow, Neill D. F. Campbell:
Active Latent Space Shape Model: A Bayesian Treatment of Shape Model Adaptation with an Application to Psoriatic Arthritis Radiographs. 2042-2051
Video and Computational Photography
- Reza Ghoddoosian, Saif Iftekar Sayed, Vassilis Athitsos:
Action Duration Prediction for Segment-Level Alignment of Weakly-Labeled Videos. 2052-2061 - Mohamed Chaabane, Lionel Gueguen, Ameni Trabelsi, J. Ross Beveridge, Stephen O'Hara:
End-to-end Learning Improves Static Object Geo-localization from Video. 2062-2071 - Yuta Kayatani, Zekun Yang, Mayu Otani, Noa Garcia, Chenhui Chu, Yuta Nakashima, Haruo Takemura:
The Laughing Machine: Predicting Humor in Video. 2072-2081 - Reuben Tan, Huijuan Xu, Kate Saenko, Bryan A. Plummer:
LoGAN: Latent Graph Co-Attention Network for Weakly-Supervised Video Moment Retrieval. 2082-2091 - Suyoung Lee, Myungsub Choi, Kyoung Mu Lee:
DynaVSR: Dynamic Adaptive Blind Video Super-Resolution. 2092-2101 - Yuzhi Zhao, Lai-Man Po, Tingyu Lin, Xuehui Wang, Kangcheng Liu, Yujia Zhang, Wing Yin Yu, Pengfei Xian, Jingjing Xiong:
Legacy Photo Editing with Learned Noise Prior. 2102-2111 - Man M. Ho, Jinjia Zhou:
Deep Preset: Blending and Retouching Photos with Color Style Transfer. 2112-2120 - Kyunghun Kim, Yeohun Yun, Keon-Woo Kang, Kyeongbo Kong, Siyeong Lee, Suk-Ju Kang:
Painting Outside as Inside: Edge Guided Image Outpainting via Bidirectional Rearrangement with Progressive Step Learning. 2121-2129 - Wesley Khademi, Sonia Rao, Clare Minnerath, Guy Hagen, Jonathan Ventura:
Self-Supervised Poisson-Gaussian Denoising. 2130-2138 - Yijun Li, Lu Jiang, Ming-Hsuan Yang:
Controllable and Progressive Image Extrapolation. 2139-2148
Aerial Imagery and 3D, Vision and Language
- Jingru Yi, Pengxiang Wu, Bo Liu, Qiaoying Huang, Hui Qu, Dimitris N. Metaxas:
Oriented Object Detection in Aerial Images with Box Boundary-Aware Vectors. 2149-2158 - Xueqing Deng, Yi Zhu, Yuxin Tian, Shawn D. Newsam:
Scale Aware Adaptation for Land-Cover Classification in Remote Sensing Imagery. 2159-2168 - Tao Hu, Geng Lin, Zhizhong Han, Matthias Zwicker:
Learning to Generate Dense Point Clouds with Textures on Multiple Categories. 2169-2178 - Miguel Ángel Bautista, Walter Talbott, Shuangfei Zhai, Nitish Srivastava, Joshua M. Susskind:
On the generalization of learning-based 3D reconstruction. 2179-2188 - Sara Mousavi, Dylan Lee, Tatianna Griffin, Kelley Cross, Dawnie W. Steadman, Audris Mockus:
SChISM: Semantic Clustering via Image Sequence Merging for Images of Human-Decomposition. 2189-2198 - Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar:
DocVQA: A Dataset for VQA on Document Images. 2199-2208 - Haidong Zhu, Arka Sadhu, Zhaoheng Zheng, Ram Nevatia:
Utilizing Every Image Object for Semi-supervised Phrase Grounding. 2209-2218 - Andrés Mafla, Rafael Sampaio de Rezende, Lluís Gómez, Diane Larlus, Dimosthenis Karatzas:
StacMR: Scene-Text Aware Cross-Modal Retrieval. 2219-2229 - Sang Jun Lee, Deokhwa Kim, Sung Soo Hwang, Donghwan Lee:
Local to Global: Efficient Visual Localization for a Monocular Camera. 2230-2239
Object Detection, Segmentation and 0/1-Shot Learning
- Vinay Kumar Verma, Ashish Mishra, Anubha Pandey, Hema A. Murthy, Piyush Rai:
Towards Zero-Shot Learning with Fewer Seen Class Examples. 2240-2250 - Chenxi Xiao, Naveen Madapana, Juan P. Wachs:
One-Shot Image Recognition Using Prototypical Encoders with Reduced Hubness. 2251-2260 - Suwichaya Suwanwimolkul, Satoshi Komorita, Kazuyuki Tasaka:
Learning of low-level feature keypoints for accurate and robust detection. 2261-2270 - Hazem Rashed, Eslam Mohamed, Ganesh Sistu, Varun Ravi Kumar, Ciarán Eising, Ahmad El Sallab, Senthil Kumar Yogamani:
Generalized Object Detection on Fisheye Cameras for Autonomous Driving: Dataset, Representations and Baseline. 2271-2279 - Rishabh Dabral, Srijon Sarkar, Sai Praneeth Reddy, Ganesh Ramakrishnan:
Exploration of Spatial and Temporal Modeling Alternatives for HOI. 2280-2289 - Peng Tang, Chetan Ramaiah, Yan Wang, Ran Xu, Caiming Xiong:
Proposal Learning for Semi-Supervised Object Detection. 2290-2300 - Prashant W. Patil, Akshay Dudhane, Subrahmanyam Murala:
Multi-frame Recurrent Adversarial Network for Moving Object Segmentation. 2301-2310 - Tatsuro Koizumi, William A. P. Smith:
Shape from semantic segmentation via the geometric Rényi divergence. 2311-2320 - Yuchi Ishikawa, Seito Kasai, Yoshimitsu Aoki, Hirokatsu Kataoka:
Alleviating Over-segmentation Errors by Detecting Action Boundaries. 2321-2330 - Muhammad Shahid, Cigdem Beyan, Vittorio Murino:
S-VVAD: Visual Voice Activity Detection by Motion Segmentation. 2331-2340
Pose Estimation, Humans and Actions
- Suhas Lohit, Rushil Anirudh, Pavan K. Turaga:
Recovering Trajectories of Unmarked Joints in 3D Human Actions Using Latent Space Optimization. 2341-2350 - Behnoosh Parsa, Ashis G. Banerjee:
A Multi-Task Learning Approach for Human Activity Segmentation and Ergonomics Risk Assessment. 2351-2361 - Di Yang, Rui Dai, Yaohui Wang, Rupayan Mallick, Luca Minciullo, Gianpiero Francesca, François Brémond:
Selective Spatio-Temporal Aggregation Based Pose Refinement System: Towards Understanding Human Activities in Real-World Videos. 2362-2371 - Fanqing Lin, Connor Wilhelm, Tony R. Martinez:
Two-hand Global 3D Pose Estimation using Monocular RGB. 2372-2380 - Ameni Trabelsi, Mohamed Chaabane, Nathaniel Blanchard, J. Ross Beveridge:
A Pose Proposal and Refinement Network for Better 6D Object Pose Estimation. 2381-2390 - Rumeysa Bodur, Binod Bhattarai, Tae-Kyun Kim:
3D Dense Geometry-Guided Facial Expression Synthesis by Adversarial Learning. 2391-2400 - Amir Hossein Farzaneh, Xiaojun Qi:
Facial Expression Recognition in the Wild via Deep Attentive Center Loss. 2401-2410 - Shivangi Yadav, Arun Ross:
CIT-GAN: Cyclic Image Translation Generative Adversarial Network With Application in Iris Presentation Attack Detection. 2411-2420 - Kshitij Nikhal, Benjamin S. Riggan:
Unsupervised Attention Based Instance Discriminative Learning for Person Re-Identification. 2421-2430 - Yu-Jhe Li, Xinshuo Weng, Kris M. Kitani:
Learning Shape Representations for Person Re-Identification under Clothing Change. 2431-2440
Medical, Risk, Bias, Uncertainty and Defects
- Jia-Hong Huang, Chao-Han Huck Yang, Fangyu Liu, Meng Tian, Yi-Chieh Liu, Ting-Wei Wu, I-Hung Lin, Kang Wang, Hiromasa Morikawa, Hernghua Chang, Jesper Tegnér, Marcel Worring:
DeepOpht: Medical Report Generation for Retinal Images via Deep Models and Visual Explanation. 2441-2451 - Issam H. Laradji, Pau Rodríguez, Oscar Mañas, Keegan Lensink, Marco Law, Lironne Kurzman, William Parker, David Vázquez, Derek Nowrouzezahrai:
A Weakly Supervised Consistency-based Learning Method for COVID-19 Segmentation in CT Images. 2452-2461 - Subba Reddy Oota, Vijay Rowtula, Shahid Saleem Mohammed, Jeffrey Galitz, Minghsun Liu, Manish Gupta:
HealTech - A System for Predicting Patient Hospitalization Risk and Wound Progression in Old Patients. 2462-2471 - Jerry W. Wei, Arief A. Suriawinata, Bing Ren, Xiaoying Liu, Mikhail Lisovsky, Louis J. Vaickus, Charles Brown, Michael Baker, Mustafa Nasir-Moin, Naofumi Tomita, Lorenzo Torresani, Jason Wei, Saeed Hassanpour:
Learn like a Pathologist: Curriculum Learning by Annotator Agreement for Histopathology Image Classification. 2472-2482 - Murat Sensoy, Maryam Saleki, Simon Julier, Reyhan Aydogan, John Reid:
Misclassification Risk and Uncertainty Quantification in Deep Classifiers. 2483-2491 - Peri Akiva, Benjamin Planche, Aditi Roy, Kristin J. Dana, Peter Oudemans, Michael Mars:
AI on the Bog: Monitoring and Evaluating Cranberry Crop Risk. 2492-2501 - Logan Frank, Christopher Wiegman, Jim Davis, Scott A. Shearer:
Confidence-Driven Hierarchical Classification of Cultivated Plant Stresses. 2502-2511 - Ehsan Adeli-Mosabbeb, Qingyu Zhao, Adolf Pfefferbaum, Edith V. Sullivan, Li Fei-Fei, Juan Carlos Niebles, Kilian M. Pohl:
Representation Learning with Statistical Independence to Mitigate Bias. 2512-2522 - Gongjie Zhang, Kaiwen Cui, Tzu-Yi Hung, Shijian Lu:
Defect-GAN: High-Fidelity Defect Synthesis for Automated Defect Inspection. 2523-2533
Deep Learning and Generative Networks
- Rushil Anirudh, Suhas Lohit, Pavan K. Turaga:
Generative Patch Priors for Practical Compressive Image Recovery. 2534-2544 - Xu Ouyang, Ying Chen, Gady Agam:
Accelerated WGAN update strategy with loss change rate balancing. 2545-2554 - Diogo C. Luvizon, Gustavo Sutter P. Carvalho, Andreza A. dos Santos, Jhonatas S. Conceição, Jose L. Flores-Campana, Luis G. L. Decker, Marcos Roberto e Souza, Hélio Pedrini, Antonio Joia, Otávio A. B. Penatti:
Adaptive Multiplane Image Generation from a Single Internet Picture. 2555-2564 - Zi Wang:
Learning Fast Converging, Effective Conditional Generative Adversarial Networks with a Mirrored Auxiliary Classifier. 2565-2574 - Suryabhan Singh Hada, Miguel Á. Carreira-Perpiñán:
Style Transfer by Rigid Alignment in Neural Net Feature Space. 2575-2584 - Hanchen Xie, Mohamed E. Hussein, Aram Galstyan, Wael Abd-Almageed:
MUSCLE: Strengthening Semi-Supervised Learning Via Concurrent Unsupervised Learning Using Mutual Information Maximization. 2585-2594 - Lukas Enderich, Fabian Timm, Wolfram Burgard:
Holistic Filter Pruning for Efficient Deep Neural Networks. 2595-2604 - Daiki Ikami, Go Irie, Takashi Shibata:
Constrained Weight Optimization for Learning without Activation Normalization. 2605-2613 - Takumi Kobayashi:
Group Softmax Loss with Discriminative Feature Grouping. 2614-2623 - Takumi Kobayashi:
Phase-wise Parameter Aggregation For Improving SGD Optimization. 2624-2633
Low-Shot Learning, Computational Photography, Super-Resolution
- James Charles, Stefano Bucciarelli, Roberto Cipolla:
Scaling digital screen reading with one-shot learning and re-identification. 2634-2642 - Frederik Pahde, Mihai Marian Puscas, Tassilo Klein, Moin Nabi:
Multimodal Prototypical Networks for Few-shot Learning. 2643-2652 - Pratik Mazumder, Pravendra Singh, Vinay P. Namboodiri:
Improving Few-Shot Learning using Composite Rotation based Auxiliary Task. 2653-2662 - Pratik Mazumder, Pravendra Singh, Vinay P. Namboodiri:
RNNP: A Robust Few-Shot Learning Approach. 2663-2672 - Reza Azad, Abdur R. Fayjie, Claude Kauffmann, Ismail Ben Ayed, Marco Pedersoli, Jose Dolz:
On the Texture Bias for Few-Shot CNN Segmentation. 2673-2682 - Min-Yuan Tseng, Yen-Chung Chen, Yi-Lun Lee, Wei-Sheng Lai, Yi-Hsuan Tsai, Wei-Chen Chiu:
Dual-Stream Fusion Network for Spatiotemporal Video Super-Resolution. 2683-2692 - Parichehr Behjati, Pau Rodríguez, Armin Mehri, Isabelle Hupont, Carles Fernández Tena, Jordi Gonzàlez:
OverNet: Lightweight Multi-Scale Super-Resolution with Overscaling Network. 2693-2702 - Armin Mehri, Parichehr B. Ardakani, Ángel D. Sappa:
MPRNet: Multi-Path Residual Network for Lightweight Image Super Resolution. 2703-2712 - Jireh Jam, Connah Kendrick, Vincent Drouard, Kevin Walker, Gee-Sern Hsu, Moi Hoon Yap:
R-MNet: A Perceptual Adversarial Network for Image Inpainting. 2713-2722 - Valéry Dewil, Jérémy Anger, Axel Davy, Thibaud Ehret, Gabriele Facciolo, Pablo Arias:
Self-supervised training for blind multi-frame video denoising. 2723-2733
Human Action, Tracking, Pose
- Jinmiao Cai, Nianjuan Jiang, Xiaoguang Han, Kui Jia, Jiangbo Lu:
JOLO-GCN: Mining Joint-Centered Light-Weight Information for Skeleton-Based Action Recognition. 2734-2743 - Ayush Srivastava, Oshin Dutta, Jigyasa Gupta, Sumeet Agarwal, Prathosh AP:
A Variational Information Bottleneck Based Method to Compress Sequential Networks for Human Action Recognition. 2744-2753 - Nuno Cruz Garcia, Sarah Adel Bargal, Vitaly Ablavsky, Pietro Morerio, Vittorio Murino, Stan Sclaroff:
Distillation Multiple Choice Learning for Multimodal Action Recognition. 2754-2763 - Ivan Sosnovik, Artem Moskalev, Arnold W. M. Smeulders:
Scale Equivariance Improves Siamese Tracking. 2764-2773 - Monika Jain, A. Venkata Subramanyam, Simon Denman, Sridha Sridharan, Clinton Fookes:
IGSSTRCF: Importance Guided Sparse Spatio-Temporal Regularized Correlation Filters For Tracking. 2774-2783 - Maya Aghaei, Matteo Bustreo, Yiming Wang, Gian Luca Bailo, Pietro Morerio, Alessio Del Bue:
Single Image Human Proxemics Estimation for Visual Social Distancing. 2784-2794 - Wen Guo, Enric Corona, Francesc Moreno-Noguer, Xavier Alameda-Pineda:
PI-Net: Pose Interacting Network for Multi-Person Monocular 3D Pose Estimation. 2795-2805 - Renat Bashirov, Anastasia Ianina, Karim Iskakov, Yevgeniy Kononenko, Valeriya Strizhkova, Victor Lempitsky, Alexander Vakhitov:
Real-time RGBD-based Extended Body Pose Estimation. 2806-2815 - Adrian Sandru, Georgian-Emilian Duta, Mariana-Iuliana Georgescu, Radu Tudor Ionescu:
SuPEr - SAM: Using the Supervision Signal from a Pose Estimator to Train a Spatial Attention Module for Personal Protective Equipment Recognition. 2816-2825 - Weidong Yin, Ziwei Liu, Leonid Sigal:
Person-in-Context Synthesis with Compositional Structural Space. 2826-2835
Applications, Misc.
- Amin Nejatbakhsh, Erdem Varol:
Neuron matching in C. elegans with robust approximate linear regression without correspondence. 2836-2845 - Aradhya Neeraj Mathur, Apoorv Khattar, Ojaswa Sharma:
2D to 3D Medical Image Colorization. 2846-2855 - Pingchuan Ma, Yujiang Wang, Jie Shen, Stavros Petridis, Maja Pantic:
Lip-reading with Densely Connected Temporal Convolutional Networks. 2856-2865 - Alexandros Rotsidis, Christof Lutteroth, Peter Hall, Christian Richardt:
ExMaps: Long-Term Localization in Dynamic Scenes using Exponential Decay. 2866-2875 - Marc Kassubeck, Florian Bürgel, Susana Castillo, Sebastian Stiller, Marcus A. Magnor:
Shape from Caustics: Reconstruction of 3D-Printed Glass from Simulated Caustic Images. 2876-2885 - Yaroslava Lochman, Oles Dobosevych, Rostyslav Hryniv, James Pritts:
Minimal Solvers for Single-View Lens-Distorted Camera Auto-Calibration. 2886-2895 - Indra Deep Mastan, Shanmuganathan Raman:
DeepCFL: Deep Contextual Features Learning from a Single Image. 2896-2905 - Yevhen Kuznietsov, Marc Proesmans, Luc Van Gool:
CoMoDA: Continuous Monocular Depth Adaptation Using Past Experiences. 2906-2916 - Gabriele Moreno Berton, Valerio Paolicelli, Carlo Masone, Barbara Caputo:
Adaptive-Attentive Geolocalization from few queries: a hybrid approach. 2917-2926 - Eric Müller-Budack, Matthias Springstein, Sherzod Hakimov, Kevin Mrutzek, Ralph Ewerth:
Ontology-driven Event Type Classification in Images. 2927-2937
Recognition, Detection, Classification
- Luca Minciullo, Fabian Manhardt, Kei Yoshikawa, Sven Meier, Federico Tombari, Norimasa Kobori:
DB-GAN: Boosting Object Recognition Under Strong Lighting Conditions. 2938-2948 - Ozan Unal, Luc Van Gool, Dengxin Dai:
Improving Point Cloud Semantic Segmentation by Learning 3D Object Detection. 2949-2958 - Aayush Jung Rana, Yogesh S. Rawat:
We don't Need Thousand Proposals: Single Shot Actor-Action Detection in Videos. 2959-2968 - Rui Dai, Srijan Das, Luca Minciullo, Lorenzo Garattoni, Gianpiero Francesca, François Brémond:
PDAN: Pyramid Dilated Attention Network for Action Detection. 2969-2978 - Alexey Sidnev, Alexander Krapivin, Alexey Trushkov, Ekaterina Krasikova, Maxim Kazakov, Mikhail Viryasov:
DeepMark++: Real-time Clothing Detection at the Edge. 2979-2987 - Zhizhong Li, Linjie Luo, Sergey Tulyakov, Qieyun Dai, Derek Hoiem:
Task-Assisted Domain Adaptation with Anchor Tasks. 2988-2997 - Ming Tang, Linyu Zheng, Bin Yu, Jinqiao Wang:
Fast Kernelized Correlation Filter without Boundary Effect. 2998-3007 - Elahe Arani, Shabbir Marzban, Andrei Pata, Bahram Zonooz:
RGPNet: A Real-Time General Purpose Semantic Segmentation. 3008-3017 - Qifei Wang, Junjie Ke, Joshua Greaves, Grace Chu, Gabriel Bender, Luciano Sbaiz, Alec Go, Andrew G. Howard, Ming-Hsuan Yang, Jeff Gilbert, Peyman Milanfar, Feng Yang:
Multi-path Neural Networks for On-device Multi-domain Visual Classification. 3018-3027 - Théo Ayral, Marco Pedersoli, Simon Bacon, Eric Granger:
Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition. 3028-3037
Vision/Language, Video, Zero-Shot Learning
- Jesus Perez-Martin, Benjamin Bustos, Jorge Pérez:
Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*. 3038-3048 - Sebastiano Vascon, Sinem Aslan, Gianluca Bigaglia, Lorenzo Giudice, Marcello Pelillo:
Transductive Visual Verb Sense Disambiguation. 3049-3058 - Paul Voigtlaender, Lishu Luo, Chun Yuan, Yong Jiang, Bastian Leibe:
Reducing the Annotation Effort for Video Object Segmentation Datasets. 3059-3068 - Alina Kuznetsova, Aakrati Talati, Yiwen Luo, Keith Simmons, Vittorio Ferrari:
Efficient video annotation with visual interpolation and frame selection guidance. 3069-3078 - Ryan Szeto, Mostafa El-Khamy, Jungwon Lee, Jason J. Corso:
HyperCon: Image-To-Video Model Transfer for Video-To-Video Translation Tasks. 3079-3088 - Pratik Mazumder, Pravendra Singh, Kranti Kumar Parida, Vinay P. Namboodiri:
AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings. 3089-3098 - Shivam Chandhok, Vineeth N. Balasubramanian:
Two-Level Adversarial Visual-Semantic Coupling for Generalized Zero-shot Learning. 3099-3107 - Federico Marmoreo, Jacopo Cavazza, Vittorio Murino:
Transductive Zero-Shot Learning by Decoupled Feature Generation. 3108-3117
Learning, Deep Learning, Generative Approaches
- Yuxin Hou, Arno Solin, Juho Kannala:
Novel View Synthesis via Depth-guided Skip Connections. 3118-3127 - Elahe Arani, Fahad Sarfraz, Bahram Zonooz:
Noise as a Resource for Learning in Knowledge Distillation. 3128-3137 - Diganta Misra, Trikay Nalamada, Ajay Uppili Arasanipalai, Qibin Hou:
Rotate to Attend: Convolutional Triplet Attention Module. 3138-3147 - Jinyong Hou, Jeremiah D. Deng, Stephen Cranefield, Xuejie Ding:
Cross-Domain Latent Modulation for Variational Transfer Learning. 3148-3157 - Fahad Sarfraz, Elahe Arani, Bahram Zonooz:
Noisy Concurrent Training for Efficient Learning under Label Noise. 3158-3167 - Yanlin Qian, Miaojing Shi, Joni-Kristian Kämäräinen, Jiri Matas:
Fast Fourier Intrinsic Network. 3168-3177 - Andrés Muñoz, Mohammadreza Zolfaghari, Max Argus, Thomas Brox:
Temporal Shift GAN for Large Scale Video Generation. 3178-3187 - Parth Patel, Nupur Kumari, Mayank Singh, Balaji Krishnamurthy:
LT-GAN: Self-Supervised GAN with Latent Transformation Detection. 3188-3197
Image and Video Understanding
- Zhikai Chen, Lingxi Xie, Shanmin Pang, Yong He, Qi Tian:
Appending Adversarial Frames for Universal Video Attack. 3198-3207 - Lianbo Zhang, Shaoli Huang, Wei Liu:
Intra-class Part Swapping for Fine-Grained Image Classification. 3208-3217 - Qiuhong Ke, Mario Fritz, Bernt Schiele:
Future Moment Assessment for Action Query. 3218-3227 - Menglin Wang, Baisheng Lai, Haokun Chen, Jianqiang Huang, Xiaojin Gong, Xian-Sheng Hua:
Towards Precise Intra-camera Supervised Person Re-Identification. 3228-3237 - Zutong Li, Lei Yang:
Weakly Supervised Deep Reinforcement Learning for Video Summarization With Semantically Meaningful Reward. 3238-3246 - Bin Zhu, Qing Song, Lu Yang, Zhihui Wang, Chun Liu, Mengjie Hu:
CPM R-CNN: Calibrating Point-guided Misalignment in Object Detection. 3247-3256 - Weiping Yu, Taojiannan Yang, Chen Chen:
Towards Resolving the Challenge of Long-tail Distribution in UAV Images for Object Detection. 3257-3266 - Jie Shao, Xin Wen, Bingchen Zhao, Xiangyang Xue:
Temporal Context Aggregation for Video Retrieval with Contrastive Learning. 3267-3277 - Mathieu Pagé Fortin, Brahim Chaib-draa:
Towards Contextual Learning in Few-shot Object Classification. 3278-3287 - Akshay Chawla, Hongxu Yin, Pavlo Molchanov, José M. Álvarez:
Data-free Knowledge Distillation for Object Detection. 3288-3297 - Xiaoli Xu, Yao Lu, Zhiwu Lu, Tao Xiang:
Vid2Int: Detecting Implicit Intention from Long Dialog Videos. 3298-3307 - Matthew Gwilliam, Adam Teuscher, Connor Anderson, Ryan Farrell:
Fair Comparison: Quantifying Variance in Results for Fine-grained Visual Categorization. 3308-3317 - Alejandro Pardo, Humam Alwassel, Fabian Caba Heilbron, Ali K. Thabet, Bernard Ghanem:
RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization. 3318-3327 - Yuan Cheng, Yuchao Yang, Hai-Bao Chen, Ngai Wong, Hao Yu:
S3-Net: A Fast and Lightweight Video Scene Understanding Network by Single-shot Segmentation. 3328-3336 - Soufiane Belharbi, Ismail Ben Ayed, Luke McCaffrey, Eric Granger:
Deep Active Learning for Joint Classification & Segmentation with Weak Annotator. 3337-3346
Humans and Faces
- Shehzeen Hussain, Paarth Neekhara, Malhar Jere, Farinaz Koushanfar, Julian J. McAuley:
Adversarial Deepfakes: Evaluating Vulnerability of Deepfake Detectors to Adversarial Examples. 3347-3356 - Yunus Can Bilge, Mehmet Kerim Yücel, Ramazan Gokberk Cinbis, Nazli Ikizler-Cinbis, Pinar Duygulu:
Red Carpet to Fight Club: Partially-supervised Domain Transfer for Face Recognition in Violent Videos. 3357-3368 - Pu Ge, Qiushi Huang, Wei Xiang, Xue Jing, Yule Li, Yiyong Li, Zhun Sun:
Focus and retain: Complement the Broken Pose in Human Image Synthesis. 3369-3378 - Tianren Wang, Teng Zhang, Brian C. Lovell:
Faces à la Carte: Text-to-Face Generation via Attribute Disentanglement. 3379-3387 - Shitala Prasad, Yiqun Li, Dongyun Lin, Sheng Dong:
maskedFaceNet: A Progressive Semi-Supervised Masked Face Detector. 3388-3397 - Satoshi Tsutsui, Yanwei Fu, David J. Crandall:
Whose hand is this? Person Identification from Egocentric Hand Gestures. 3398-3407 - Vinoj Jayasundara, Debaditya Roy, Basura Fernando:
FlowCaps: Optical Flow Estimation with Capsule Networks For Action Recognition. 3408-3417 - Razvan Caramalau, Binod Bhattarai, Tae-Kyun Kim:
Active Learning for Bayesian 3D Hand Pose Estimation. 3418-3427 - Al Amin Hosain, Panneer Selvam Santhalingam, Parth H. Pathak, Huzefa Rangwala, Jana Kosecká:
Hand Pose Guided 3D Pooling for Word-level Sign Language Recognition. 3428-3438 - Ellen Yi-Ge, Rui Fan, Zechun Liu, Zhiqiang Shen:
Conditional Link Prediction of Category-Implicit Keypoint Detection. 3439-3448 - Chengxin Wang, Shaofeng Cai, Gary Tan:
GraphTCN: Spatio-Temporal Interaction Modeling for Human Trajectory Prediction. 3449-3458 - Chi Xu, Yasushi Makihara, Ruochen Liao, Hirotaka Niitsuma, Xiang Li, Yasushi Yagi, Jianfeng Lu:
Real-Time Gait-Based Age Estimation and Gender Classification from a Single Image. 3459-3469
Learning
- Wenlin Wang, Hongteng Xu, Guoyin Wang, Wenqi Wang, Lawrence Carin:
Zero-Shot Recognition via Optimal Transport. 3470-3480 - Jianhong Zhang, Manli Zhang, Zhiwu Lu, Tao Xiang:
AdarGCN: Adaptive Aggregation GCN for Few-Shot Learning. 3481-3490 - Yan Zuo, Gil Avraham, Tom Drummond:
Improved Training of Generative Adversarial Networks Using Decision Forests. 3491-3500 - Ruchika Chavhan, Ankit Jha, Biplab Banerjee, Subhasis Chaudhuri:
ADA-AT/DT: An Adversarial Approach for Cross-Domain and Cross-Task Knowledge Transfer. 3501-3510 - Samarth Shukla, Andrés Romero, Luc Van Gool, Radu Timofte:
Zero-Pair Image to Image Translation using Domain Conditional Normalization. 3511-3518 - Keren Ye, Mingda Zhang, Adriana Kovashka:
Breaking Shortcuts by Masking for Robust Visual Reasoning. 3519-3529 - Zhuoran Shen, Mingyuan Zhang, Haiyu Zhao, Shuai Yi, Hongsheng Li:
Efficient Attention: Attention with Linear Complexities. 3530-3538 - Naeha Sharif, Mohammed Bennamoun, Wei Liu, Syed Afaq Ali Shah:
SubICap: Towards Subword-informed Image Captioning. 3539-3540 - Chaoning Zhang, Philipp Benz, Dawit Mureja Argaw, Seokju Lee, Junsik Kim, François Rameau, Jean-Charles Bazin, In So Kweon:
ResNet or DenseNet? Introducing Dense Shortcuts to ResNet. 3549-3558 - Yimian Dai, Fabian Gieseke, Stefan Oehmcke, Yiquan Wu, Kobus Barnard:
Attentional Feature Fusion. 3559-3568 - Dimity Miller, Niko Sünderhauf, Michael Milford, Feras Dayoub:
Class Anchor Clustering: A Loss for Distance-based Open Set Recognition. 3569-3577 - Youngrock Oh, Hyungsik Jung, Jeonghyung Park, Min Soo Kim:
EVET: Enhancing Visual Explanations of Deep Neural Networks Using Image Transformations. 3578-3586 - Shaofeng Cai, Yao Shu, Wei Wang:
Dynamic Routing Networks. 3587-3596 - Ziyi Kou, Guofeng Cui, Shaojie Wang, Wentian Zhao, Chenliang Xu:
Improve CAM with Auto-adapted Segmentation and Co-supervised Augmentation. 3597-3605 - Ragav Sachdeva, Filipe R. Cordeiro, Vasileios Belagiannis, Ian D. Reid, Gustavo Carneiro:
EvidentialMix: Learning with Combined Open-set and Closed-set Noisy Labels. 3606-3614
Applications
- Raghav Brahmadesam Venkataramaiyer, Abhishek Joshi, Saisha Narang, Vinay P. Namboodiri:
SHAD3S: A model to Sketch, Shade and Shadow. 3615-3624 - Cong Chen, Amos Lynn Abbott, Daniel J. Stilwell:
Multi-Level Generative Chaotic Recurrent Network for Image Inpainting. 3625-3634 - Tangqing Li, Zheng Wang, Siying Liu, Wen-Yan Lin:
Deep Unsupervised Anomaly Detection. 3635-3644 - Zongze Wu, Dani Lischinski, Eli Shechtman:
Fine-grained Foreground Retrieval via Teacher-Student Learning. 3645-3653 - Yujia Zhang, Qianzhong Li, Xiaoguang Zhao, Min Tan:
TB-Net: A Three-Stream Boundary-Aware Network for Fine-Grained Pavement Disease Segmentation. 3654-3663 - Jingjing Chen, Jichao Zhang, Enver Sangineto, Tao Chen, Jiayuan Fan, Nicu Sebe:
Coarse-to-Fine Gaze Redirection with Numerical and Pictorial Guidance. 3664-3673 - Liangzi Rong, Chunping Li:
Coarse- and Fine-grained Attention Network with Background-aware Loss for Crowd Density Map Estimation. 3674-3683 - Yang Liu, Zhen Zhu, Xiang Bai:
WDNet: Watermark-Decomposition Network for Visible Watermark Removal. 3684-3692 - Ruijin Liu, Zejian Yuan, Tie Liu, Zhiliang Xiong:
End-to-end Lane Shape Prediction with Transformers. 3693-3701 - Connor Anderson, Adam Teuscher, Elizabeth Anderson, Alysia Larsen, Josh Shirley, Ryan Farrell:
Have Fun Storming the Castle(s)! 3702-3711 - Simon Niklaus, Xuaner Cecilia Zhang, Jonathan T. Barron, Neal Wadhwa, Rahul Garg, Feng Liu, Tianfan Xue:
Learned Dual-View Reflection Removal. 3712-3721 - Atsushi Kawasaki, Akihito Seki:
Multimodal Trajectory Predictions for Autonomous Driving without a Detailed Prior Map. 3722-3731 - Mohammad Mahdi Kazemi Moghaddam, Qi Wu, Ehsan Abbasnejad, Javen Shi:
Optimistic Agent: Accurate Graph-Based Value Estimation for More Successful Visual Navigation. 3732-3741 - Tianqi Tang, Xin Yu, Xuanyi Dong, Yi Yang:
Auto-Navigator: Decoupled Neural Architecture Search for Visual Navigation. 3742-3751 - Royston Rodrigues, Masahiro Tani:
Are These from the Same Place? Seeing the Unseen in Cross-View Image Geo-Localization. 3752-3760
3D and Applications
- Haiyan Wang, Liang Yang, Xuejian Rong, Jinglun Feng, Yingli Tian:
Self-supervised 4D Spatio-temporal Feature Learning via Order Prediction of Sequential Point Cloud Clips. 3761-3770 - Ming Zhu, Chao Ma, Pan Ji, Xiaokang Yang:
Cross-Modality 3D Object Detection. 3771-3780 - Xudong Zhang, Yutao Hu, Haochen Wang, Xianbin Cao, Baochang Zhang:
Long-range Attention Network for Multi-View Stereo. 3781-3790 - Gao Peng, Bo Pang, Cewu Lu:
Efficient 3D Video Engine Using Frame Redundancy. 3791-3801 - Hiroaki Aizawa, Hirokatsu Kataoka, Yutaka Satoh, Kunihito Kato:
Viewpoint-agnostic Image Rendering. 3802-3811 - Shi Qiu, Saeed Anwar, Nick Barnes:
Dense-Resolution Network for Point Cloud Classification and Segmentation. 3812-3821 - Gongjie Zhang, Kaiwen Cui, Rongliang Wu, Shijian Lu, Yonghong Tian:
PNPDet: Efficient Few-shot Detection without Forgetting via Plug-and-Play Sub-networks. 3822-3831 - Yawen Lu, Guoyu Lu:
An Alternative of LiDAR in Nighttime: Unsupervised Depth Estimation Based on Single Thermal Image. 3832-3842 - Bin Li, Mu Hu, Shuling Wang, Lianghao Wang, Xiaojin Gong:
Self-supervised Visual-LiDAR Odometry with Flip Consistency. 3843-3851 - Faraz Saeedan, Stefan Roth:
Boosting Monocular Depth with Panoptic Segmentation Maps. 3852-3861 - Alice Xue:
End-to-End Chinese Landscape Painting Creation Using Generative Adversarial Networks. 3862-3870 - Qian Zhang, Bo Wang, Wei Wen, Hai Li, Junhui Liu:
Line Art Correlation Matching Feature Transfer Network for Automatic Animation Colorization. 3871-3880 - Chuan Wen, Yujie Pan, Jie Chang, Ya Zhang, Siheng Chen, Yanfeng Wang, Mei Han, Qi Tian:
Handwritten Chinese Font Generation with Collaborative Stroke Refinement. 3881-3890 - Shenyi Pan, Shuxian Fan, Samuel W. K. Wong, James V. Zidek, Helge Rhodin:
Ellipse Detection and Localization with Applications to Knots in Sawn Lumber Images. 3891-3900 - Peng Kang, Jianping Zhang, Chen Ma, Guiling Sun:
ATM: Attentional Text Matting. 3901-3910 - Gourav Wadhwa, Abhinav Dhall, Subrahmanyam Murala, Usman Tariq:
Hyperrealistic Image Inpainting with Hypergraphs. 3911-3920
Learning, Medical and other Applications
- Aritra Ghosh, Andrew S. Lan:
Do We Really Need Gold Samples for Sample Weighting under Label Noise? 3921-3930 - Yifan Ding, Liqiang Wang, Boqing Gong:
Analyzing Deep Neural Network's Transferability via Fréchet Distance. 3931-3940 - Kwot Sin Lee, Ngoc-Trung Tran, Ngai-Man Cheung:
InfoMax-GAN: Improved Adversarial Image Generation via Information Maximization and Contrastive Learning. 3941-3951 - Souvik Kundu, Gourav Datta, Massoud Pedram, Peter A. Beerel:
Spike-Thrift: Towards Energy-Efficient Deep Spiking Neural Networks by Limiting Spiking Activity via Attention-Guided Compression. 3952-3961 - Qinxuan Luo, Lingfeng Wang, Jingguo Lv, Shiming Xiang, Chunhong Pan:
Few-Shot Learning via Feature Hallucination with Variational Inference. 3962-3971 - Minkyo Seo, Dongkeun Kim, Kyungmoon Lee, Seunghoon Hong, Jae Seok Bae, Jung Hoon Kim, Suha Kwak:
Neural Contrast Enhancement of CT Image. 3972-3981 - Sahil Chelaramani, Manish Gupta, Vipul Agarwal, Prashant Gupta, Ranya Habash:
Multi-Task Knowledge Distillation for Eye Disease Prediction. 3982-3992 - Xuan Gong, Shuyan Chen, Baochang Zhang, David S. Doermann:
Style Consistent Image Generation for Nuclei Instance Segmentation. 3993-4002 - Xuan Gong, Xin Xia, Wentao Zhu, Baochang Zhang, David S. Doermann, Li'an Zhuo:
Deformable Gabor Feature Networks for Biomedical Image Classification. 4003-4011 - Bin Duan, Hao Tang, Wei Wang, Ziliang Zong, Guowei Yang, Yan Yan:
Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention. 4012-4021 - Andrés Mafla, Sounak Dey, Ali Furkan Biten, Lluís Gómez, Dimosthenis Karatzas:
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval. 4022-4032 - Augusto Figueiredo, Johnata Brayan, Renan Oliveira Reis, Raphael C. Prates, William Robson Schwartz:
MoRe: A Large-Scale Motorcycle Re-Identification Dataset. 4033-4042 - Soumya Roy, Bharat Bhusan Sau:
Can Selfless Learning improve accuracy of a single classification task? 4043-4051 - Srinivas Anumasa, P. K. Srijith:
Improving Robustness and Uncertainty Modelling in Neural Ordinary Differential Equations. 4052-4060
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.