default search action
Andrew Zisserman
Person information
- affiliation: University of Oxford, UK
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j105]Jaesung Huh, Joon Son Chung, Arsha Nagrani, Andrew Brown, Jee-weon Jung, Daniel Garcia-Romero, Andrew Zisserman:
The VoxCeleb Speaker Recognition Challenge: A Retrospective. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3850-3866 (2024) - [c478]Akam Rahimi, Triantafyllos Afouras, Andrew Zisserman:
Voicevector: Multimodal Enrolment Vectors for Speaker Separation. ICASSP Workshops 2024: 785-789 - [c477]Bruno Korbar, Jaesung Huh, Andrew Zisserman:
Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling. ICASSP 2024: 2975-2979 - [c476]Vladimir Iashin, Weidi Xie, Esa Rahtu, Andrew Zisserman:
Synchformer: Efficient Synchronization From Sparse Cues. ICASSP 2024: 5325-5329 - [c475]Andreea-Maria Oncescu, João F. Henriques, Andrew Zisserman, Samuel Albanie, A. Sophia Koepke:
A Sound Approach: Using Large Language Models to Generate Audio Descriptions for Egocentric Text-Audio Retrieval. ICASSP 2024: 7300-7304 - [i225]Ragav Sachdeva, Andrew Zisserman:
The Manga Whisperer: Automatically Generating Transcriptions for Comics. CoRR abs/2401.10224 (2024) - [i224]Bruno Korbar, Jaesung Huh, Andrew Zisserman:
Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling. CoRR abs/2401.12039 (2024) - [i223]Vladimir Iashin, Weidi Xie, Esa Rahtu, Andrew Zisserman:
Synchformer: Efficient Synchronization from Sparse Cues. CoRR abs/2401.16423 (2024) - [i222]Carl Doersch, Yi Yang, Dilara Gokay, Pauline Luc, Skanda Koppula, Ankush Gupta, Joseph Heyward, Ross Goroshin, João Carreira, Andrew Zisserman:
BootsTAP: Bootstrapped Training for Tracking-Any-Point. CoRR abs/2402.00847 (2024) - [i221]Andreea-Maria Oncescu, João F. Henriques, Andrew Zisserman, Samuel Albanie, A. Sophia Koepke:
A SOUND APPROACH: Using Large Language Models to generate audio descriptions for egocentric text-audio retrieval. CoRR abs/2402.19106 (2024) - [i220]Yash Bhalgat, Iro Laina, João F. Henriques, Andrew Zisserman, Andrea Vedaldi:
N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields. CoRR abs/2403.10997 (2024) - [i219]Debidatta Dwibedi, Vidhi Jain, Jonathan Tompson, Andrew Zisserman, Yusuf Aytar:
FlexCap: Generating Rich, Localized, and Flexible Captions in Images. CoRR abs/2403.12026 (2024) - [i218]Jacob Chalk, Jaesung Huh, Evangelos Kazakos, Andrew Zisserman, Dima Damen:
TIM: A Time Interval Machine for Audio-Visual Action Recognition. CoRR abs/2404.05559 (2024) - [i217]Junyu Xie, Charig Yang, Weidi Xie, Andrew Zisserman:
Moving Object Segmentation: All You Need Is SAM (and Flow). CoRR abs/2404.12389 (2024) - [i216]Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman:
AutoAD III: The Prequel - Back to the Pixels. CoRR abs/2404.14412 (2024) - [i215]Charig Yang, Weidi Xie, Andrew Zisserman:
Made to Order: Discovering monotonic temporal changes via self-supervised video ordering. CoRR abs/2404.16828 (2024) - [i214]Charles Raude, K. R. Prajwal, Liliane Momeni, Hannah Bull, Samuel Albanie, Andrew Zisserman, Gül Varol:
A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision. CoRR abs/2405.10266 (2024) - [i213]Mark Hamilton, Andrew Zisserman, John R. Hershey, William T. Freeman:
Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language. CoRR abs/2406.05629 (2024) - [i212]Niki Amini-Naieni, Tengda Han, Andrew Zisserman:
CountGD: Multi-Modal Open-World Counting. CoRR abs/2407.04619 (2024) - [i211]Skanda Koppula, Ignacio Rocco, Yi Yang, Joseph Heyward, João Carreira, Andrew Zisserman, Gabriel Brostow, Carl Doersch:
TAPVid-3D: A Benchmark for Tracking Any Point in 3D. CoRR abs/2407.05921 (2024) - [i210]Junyu Xie, Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman:
AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description. CoRR abs/2407.15850 (2024) - [i209]Debidatta Dwibedi, Yusuf Aytar, Jonathan Tompson, Andrew Zisserman:
OVR: A Dataset for Open Vocabulary Temporal Repetition Counting in Videos. CoRR abs/2407.17085 (2024) - [i208]Ragav Sachdeva, Gyungin Shin, Andrew Zisserman:
Tails Tell Tales: Chapter-Wide Manga Transcriptions with Character Names. CoRR abs/2408.00298 (2024) - 2023
- [j104]Michael P. J. Camilleri, Li Zhang, Rasneer S. Bains, Andrew Zisserman, Christopher K. I. Williams:
Persistent animal identification leveraging non-visual markers. Mach. Vis. Appl. 34(4): 68 (2023) - [c474]Sindhu B. Hegde, Andrew Zisserman:
GestSync: Determining who is speaking without a talking head. BMVC 2023: 506-509 - [c473]Niki Amini-Naieni, Kiana Amini-Naieni, Tengda Han, Andrew Zisserman:
Open-world Text-specifed Object Counting. BMVC 2023: 510 - [c472]Yash Bhalgat, João F. Henriques, Andrew Zisserman:
A Light Touch Approach to Teaching Transformers Multi-view Geometry. CVPR 2023: 4958-4969 - [c471]Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman:
AutoAD: Movie Description in Context. CVPR 2023: 18930-18940 - [c470]Jaesung Huh, Jacob Chalk, Evangelos Kazakos, Dima Damen, Andrew Zisserman:
Epic-Sounds: A Large-Scale Dataset of Actions that Sound. ICASSP 2023: 1-5 - [c469]Hala Lamdouar, Weidi Xie, Andrew Zisserman:
The Making and Breaking of Camouflage. ICCV 2023: 832-842 - [c468]Carl Doersch, Yi Yang, Mel Vecerík, Dilara Gokay, Ankush Gupta, Yusuf Aytar, João Carreira, Andrew Zisserman:
TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement. ICCV 2023: 10027-10038 - [c467]Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman:
AutoAD II: The Sequel - Who, When, and What in Movie Audio Description. ICCV 2023: 13599-13609 - [c466]Chuhan Zhang, Ankush Gupta, Andrew Zisserman:
Helping Hands: An Object-Aware Ego-Centric Video Recognition Model. ICCV 2023: 13855-13866 - [c465]Liliane Momeni, Mathilde Caron, Arsha Nagrani, Andrew Zisserman, Cordelia Schmid:
Verbs in Action: Improving verb understanding in video-language models. ICCV 2023: 15533-15545 - [c464]Ragav Sachdeva, Andrew Zisserman:
The Change You Want to See (Now in 3D). ICCV (Workshops) 2023: 2052-2061 - [c463]Prannay Kaul, Weidi Xie, Andrew Zisserman:
Multi-Modal Classifiers for Open-Vocabulary Object Detection. ICML 2023: 15946-15969 - [c462]Max Bain, Jaesung Huh, Tengda Han, Andrew Zisserman:
WhisperX: Time-Accurate Speech Transcription of Long-Form Audio. INTERSPEECH 2023: 4489-4493 - [c461]Emmanuelle Bourigault, Amir Jamaludin, Emma Clark, Jeremy Fairbank, Timor Kadir, Andrew Zisserman:
3D Shape Analysis of Scoliosis. ShapeMI@MICCAI 2023: 271-286 - [c460]Rhydian Windsor, Amir Jamaludin, Timor Kadir, Andrew Zisserman:
Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime. MIDL 2023: 53-73 - [c459]Jonathan Campbell, Mitchell Dawson, Andrew Zisserman, Weidi Xie, Christoffer Nellåker:
Deep Facial Phenotyping with Mixup Augmentation. MIUA 2023: 133-144 - [c458]Yash Bhalgat, Iro Laina, João F. Henriques, Andrea Vedaldi, Andrew Zisserman:
Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion. NeurIPS 2023 - [c457]Viorica Patraucean, Lucas Smaira, Ankush Gupta, Adrià Recasens, Larisa Markeeva, Dylan Banarse, Skanda Koppula, Joseph Heyward, Mateusz Malinowski, Yi Yang, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Alexandre Fréchette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman, João Carreira:
Perception Test: A Diagnostic Benchmark for Multimodal Video Models. NeurIPS 2023 - [c456]Sagar Vaze, Andrea Vedaldi, Andrew Zisserman:
No Representation Rules Them All in Category Discovery. NeurIPS 2023 - [c455]Ragav Sachdeva, Andrew Zisserman:
The Change You Want to See. WACV 2023: 3982-3991 - [i207]Adrià Recasens, Jason Lin, João Carreira, Andrew Jaegle, Luyu Wang, Jean-Baptiste Alayrac, Pauline Luc, Antoine Miech, Lucas Smaira, Ross Hemsley, Andrew Zisserman:
Zorro: the masked multimodal transformer. CoRR abs/2301.09595 (2023) - [i206]Jaesung Huh, Jacob Chalk, Evangelos Kazakos, Dima Damen, Andrew Zisserman:
Epic-Sounds: A Large-scale Dataset of Actions That Sound. CoRR abs/2302.00646 (2023) - [i205]Jaesung Huh, Andrew Brown, Jee-weon Jung, Joon Son Chung, Arsha Nagrani, Daniel Garcia-Romero, Andrew Zisserman:
VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge. CoRR abs/2302.10248 (2023) - [i204]Max Bain, Jaesung Huh, Tengda Han, Andrew Zisserman:
WhisperX: Time-Accurate Speech Transcription of Long-Form Audio. CoRR abs/2303.00747 (2023) - [i203]Relja Arandjelovic, Alex Andonian, Arthur Mensch, Olivier J. Hénaff, Jean-Baptiste Alayrac, Andrew Zisserman:
Three ways to improve feature alignment for open vocabulary detection. CoRR abs/2303.13518 (2023) - [i202]Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman:
AutoAD: Movie Description in Context. CoRR abs/2303.16899 (2023) - [i201]Rhydian Windsor, Amir Jamaludin, Timor Kadir, Andrew Zisserman:
Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime. CoRR abs/2303.17644 (2023) - [i200]Liliane Momeni, Mathilde Caron, Arsha Nagrani, Andrew Zisserman, Cordelia Schmid:
Verbs in Action: Improving verb understanding in video-language models. CoRR abs/2304.06708 (2023) - [i199]Viorica Patraucean, Lucas Smaira, Ankush Gupta, Adrià Recasens Continente, Larisa Markeeva, Dylan Banarse, Skanda Koppula, Joseph Heyward, Mateusz Malinowski, Yi Yang, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Alexandre Fréchette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman, João Carreira:
Perception Test: A Diagnostic Benchmark for Multimodal Video Models. CoRR abs/2305.13786 (2023) - [i198]Niki Amini-Naieni, Kiana Amini-Naieni, Tengda Han, Andrew Zisserman:
Open-world Text-specified Object Counting. CoRR abs/2306.01851 (2023) - [i197]Yash Bhalgat, Iro Laina, João F. Henriques, Andrew Zisserman, Andrea Vedaldi:
Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion. CoRR abs/2306.04633 (2023) - [i196]Prannay Kaul, Weidi Xie, Andrew Zisserman:
Multi-Modal Classifiers for Open-Vocabulary Object Detection. CoRR abs/2306.05493 (2023) - [i195]Carl Doersch, Yi Yang, Mel Vecerík, Dilara Gokay, Ankush Gupta, Yusuf Aytar, João Carreira, Andrew Zisserman:
TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement. CoRR abs/2306.08637 (2023) - [i194]Jaesung Huh, Max Bain, Andrew Zisserman:
OxfordVGG Submission to the EGO4D AV Transcription Challenge. CoRR abs/2307.09006 (2023) - [i193]Chuhan Zhang, Ankush Gupta, Andrew Zisserman:
Helping Hands: An Object-Aware Ego-Centric Video Recognition Model. CoRR abs/2308.07918 (2023) - [i192]Ragav Sachdeva, Andrew Zisserman:
The Change You Want to See (Now in 3D). CoRR abs/2308.10417 (2023) - [i191]Hala Lamdouar, Weidi Xie, Andrew Zisserman:
The Making and Breaking of Camouflage. CoRR abs/2309.03899 (2023) - [i190]Sindhu B. Hegde, Andrew Zisserman:
GestSync: Determining who is speaking without a talking head. CoRR abs/2310.05304 (2023) - [i189]Guanqi Zhan, Chuanxia Zheng, Weidi Xie, Andrew Zisserman:
What Does Stable Diffusion Know about the 3D Scene? CoRR abs/2310.06836 (2023) - [i188]Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman:
AutoAD II: The Sequel - Who, When, and What in Movie Audio Description. CoRR abs/2310.06838 (2023) - [i187]Jianbo Jiao, Mohammad Alsharid, Lior Drukker, Aris T. Papageorghiou, Andrew Zisserman, J. Alison Noble:
Show from Tell: Audio-Visual Modelling in Clinical Settings. CoRR abs/2310.16477 (2023) - [i186]Amir Jamaludin, Timor Kadir, Emma Clark, Andrew Zisserman:
Predicting Spine Geometry and Scoliosis from DXA Scans. CoRR abs/2311.09424 (2023) - [i185]Sagar Vaze, Andrea Vedaldi, Andrew Zisserman:
No Representation Rules Them All in Category Discovery. CoRR abs/2311.17055 (2023) - [i184]João Carreira, Michael King, Viorica Patraucean, Dilara Gokay, Catalin Ionescu, Yi Yang, Daniel Zoran, Joseph Heyward, Carl Doersch, Yusuf Aytar, Dima Damen, Andrew Zisserman:
Learning from One Continuous Video Stream. CoRR abs/2312.00598 (2023) - [i183]Pinelopi Papalampidi, Skanda Koppula, Shreya Pathak, Justin Chiu, Joseph Heyward, Viorica Patraucean, Jiajun Shen, Antoine Miech, Andrew Zisserman, Aida Nematzadeh:
A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames. CoRR abs/2312.07395 (2023) - [i182]Junyu Xie, Weidi Xie, Andrew Zisserman:
Appearance-based Refinement for Object-Centric Motion Segmentation. CoRR abs/2312.11463 (2023) - [i181]Bruno Korbar, Yongqin Xian, Alessio Tonioni, Andrew Zisserman, Federico Tombari:
Text-Conditioned Resampler For Long Form Video Understanding. CoRR abs/2312.11897 (2023) - [i180]Joseph Heyward, João Carreira, Dima Damen, Andrew Zisserman, Viorica Patraucean:
Perception Test 2023: A Summary of the First Challenge And Outcome. CoRR abs/2312.13090 (2023) - [i179]Guanqi Zhan, Chuanxia Zheng, Weidi Xie, Andrew Zisserman:
Amodal Ground Truth and Completion in the Wild. CoRR abs/2312.17247 (2023) - 2022
- [j103]Gül Varol, Liliane Momeni, Samuel Albanie, Triantafyllos Afouras, Andrew Zisserman:
Scaling Up Sign Spotting Through Sign Language Dictionaries. Int. J. Comput. Vis. 130(6): 1416-1439 (2022) - [j102]Manuel J. Marín-Jiménez, Vicky Kalogeiton, Pablo Medina-Suarez, Andrew Zisserman:
LAEO-Net++: Revisiting People Looking at Each Other in Videos. IEEE Trans. Pattern Anal. Mach. Intell. 44(6): 3069-3081 (2022) - [j101]Kai Han, Sylvestre-Alvise Rebuffi, Sébastien Ehrhardt, Andrea Vedaldi, Andrew Zisserman:
AutoNovel: Automatically Discovering and Learning Novel Visual Categories. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6767-6781 (2022) - [j100]Triantafyllos Afouras, Joon Son Chung, Andrew W. Senior, Oriol Vinyals, Andrew Zisserman:
Deep Audio-Visual Speech Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 44(12): 8717-8727 (2022) - [c454]Chuhan Zhang, Ankush Gupta, Andrew Zisserman:
Is an Object-Centric Video Representation Beneficial for Transfer? ACCV (4) 2022: 379-397 - [c453]Olivia Wiles, João Carreira, Iain Barr, Andrew Zisserman, Mateusz Malinowski:
Compressed Vision for Efficient Video Understanding. ACCV (7) 2022: 679-695 - [c452]Guanqi Zhan, Weidi Xie, Andrew Zisserman:
A Tri-Layer Plugin to Improve Occluded Detection. BMVC 2022: 250 - [c451]Chang Liu, Yujie Zhong, Andrew Zisserman, Weidi Xie:
CounTR: Transformer-based Generalised Visual Counting. BMVC 2022: 370 - [c450]Vladimir Iashin, Weidi Xie, Esa Rahtu, Andrew Zisserman:
Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors. BMVC 2022: 395 - [c449]K. R. Prajwal, Hannah Bull, Liliane Momeni, Samuel Albanie, Gül Varol, Andrew Zisserman:
Weakly-supervised Fingerspelling Recognition in British Sign Language Videos. BMVC 2022: 609 - [c448]Tengda Han, Weidi Xie, Andrew Zisserman:
Turbo Training with Token Dropout. BMVC 2022: 622 - [c447]Bruno Korbar, Andrew Zisserman:
Personalised CLIP or: how to find your vacation videos. BMVC 2022: 639 - [c446]Charig Yang, Weidi Xie, Andrew Zisserman:
It's About Time: Analog Clock Reading in the Wild. CVPR 2022: 2498-2507 - [c445]Tengda Han, Weidi Xie, Andrew Zisserman:
Temporal Alignment Networks for Long-term Video. CVPR 2022: 2896-2906 - [c444]K. R. Prajwal, Triantafyllos Afouras, Andrew Zisserman:
Sub-word Level Lip Reading With Visual Attention. CVPR 2022: 5152-5162 - [c443]Wang Yifan, Carl Doersch, Relja Arandjelovic, João Carreira, Andrew Zisserman:
Input-level Inductive Biases for 3D Reconstruction. CVPR 2022: 6166-6176 - [c442]Sagar Vaze, Kai Han, Andrea Vedaldi, Andrew Zisserman:
Generalized Category Discovery. CVPR 2022: 7482-7491 - [c441]Akam Rahimi, Triantafyllos Afouras, Andrew Zisserman:
Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation. CVPR 2022: 10483-10492 - [c440]Prannay Kaul, Weidi Xie, Andrew Zisserman:
Label, Verify, Correct: A Simple Few Shot Object Detection Method. CVPR 2022: 14217-14227 - [c439]Olivier J. Hénaff, Skanda Koppula, Evan Shelhamer, Daniel Zoran, Andrew Jaegle, Andrew Zisserman, João Carreira, Relja Arandjelovic:
Object Discovery and Representation Networks. ECCV (27) 2022: 123-143 - [c438]Liliane Momeni, Hannah Bull, K. R. Prajwal, Samuel Albanie, Gül Varol, Andrew Zisserman:
Automatic Dense Annotation of Large-Vocabulary Sign Language Videos. ECCV (35) 2022: 671-690 - [c437]Andrew Jaegle, Sebastian Borgeaud, Jean-Baptiste Alayrac, Carl Doersch, Catalin Ionescu, David Ding, Skanda Koppula, Daniel Zoran, Andrew Brock, Evan Shelhamer, Olivier J. Hénaff, Matthew M. Botvinick, Andrew Zisserman, Oriol Vinyals, João Carreira:
Perceiver IO: A General Architecture for Structured Inputs & Outputs. ICLR 2022 - [c436]Sagar Vaze, Kai Han, Andrea Vedaldi, Andrew Zisserman:
Open-Set Recognition: A Good Closed-Set Classifier is All You Need. ICLR 2022 - [c435]Rhydian Windsor, Amir Jamaludin, Timor Kadir, Andrew Zisserman:
Context-Aware Transformers for Spinal Cancer Detection and Radiological Grading. MICCAI (3) 2022: 271-281 - [c434]Jean-Baptiste Alayrac, Jeff Donahue, Pauline Luc, Antoine Miech, Iain Barr, Yana Hasson, Karel Lenc, Arthur Mensch, Katherine Millican, Malcolm Reynolds, Roman Ring, Eliza Rutherford, Serkan Cabi, Tengda Han, Zhitao Gong, Sina Samangooei, Marianne Monteiro, Jacob L. Menick, Sebastian Borgeaud, Andy Brock, Aida Nematzadeh, Sahand Sharifzadeh, Mikolaj Binkowski, Ricardo Barreira, Oriol Vinyals, Andrew Zisserman, Karén Simonyan:
Flamingo: a Visual Language Model for Few-Shot Learning. NeurIPS 2022 - [c433]Carl Doersch, Ankush Gupta, Larisa Markeeva, Adrià Recasens, Lucas Smaira, Yusuf Aytar, João Carreira, Andrew Zisserman, Yi Yang:
TAP-Vid: A Benchmark for Tracking Any Point in a Video. NeurIPS 2022 - [c432]Erika Lu, Forrester Cole, Weidi Xie, Tali Dekel, Bill Freeman, Andrew Zisserman, Michael Rubinstein:
Associating Objects and Their Effects in Video through Coordination Games. NeurIPS 2022 - [c431]Junyu Xie, Weidi Xie, Andrew Zisserman:
Segmenting Moving Objects via an Object-Centric Layered Representation. NeurIPS 2022 - [i178]Sagar Vaze, Kai Han, Andrea Vedaldi, Andrew Zisserman:
Generalized Category Discovery. CoRR abs/2201.02609 (2022) - [i177]Andrew Brown, Jaesung Huh, Joon Son Chung, Arsha Nagrani, Andrew Zisserman:
VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge. CoRR abs/2201.04583 (2022) - [i176]João Carreira, Skanda Koppula, Daniel Zoran, Adrià Recasens, Catalin Ionescu, Olivier J. Hénaff, Evan Shelhamer, Relja Arandjelovic, Matthew M. Botvinick, Oriol Vinyals, Karen Simonyan, Andrew Zisserman, Andrew Jaegle:
Hierarchical Perceiver. CoRR abs/2202.10890 (2022) - [i175]Olivier J. Hénaff, Skanda Koppula, Evan Shelhamer, Daniel Zoran, Andrew Jaegle, Andrew Zisserman, João Carreira, Relja Arandjelovic:
Object discovery and representation networks. CoRR abs/2203.08777 (2022) - [i174]Tengda Han, Weidi Xie, Andrew Zisserman:
Temporal Alignment Networks for Long-term Video. CoRR abs/2204.02968 (2022) - [i173]Jean-Baptiste Alayrac, Jeff Donahue, Pauline Luc, Antoine Miech, Iain Barr, Yana Hasson, Karel Lenc, Arthur Mensch, Katie Millican, Malcolm Reynolds, Roman Ring, Eliza Rutherford, Serkan Cabi, Tengda Han, Zhitao Gong, Sina Samangooei, Marianne Monteiro, Jacob Menick, Sebastian Borgeaud, Andrew Brock, Aida Nematzadeh, Sahand Sharifzadeh, Mikolaj Binkowski, Ricardo Barreira, Oriol Vinyals, Andrew Zisserman, Karen Simonyan:
Flamingo: a Visual Language Model for Few-Shot Learning. CoRR abs/2204.14198 (2022) - [i172]Rhydian Windsor, Amir Jamaludin, Timor Kadir, Andrew Zisserman:
SpineNetV2: Automated Detection, Labelling and Radiological Grading Of Clinical MR Scans. CoRR abs/2205.01683 (2022) - [i171]Gül Varol, Liliane Momeni, Samuel Albanie, Triantafyllos Afouras, Andrew Zisserman:
Scaling up sign spotting through sign language dictionaries. CoRR abs/2205.04152 (2022) - [i170]Max Bain, Arsha Nagrani, Gül Varol, Andrew Zisserman:
A CLIP-Hitchhiker's Guide to Long Video Retrieval. CoRR abs/2205.08508 (2022) - [i169]Rhydian Windsor, Amir Jamaludin, Timor Kadir, Andrew Zisserman:
Context-Aware Transformers For Spinal Cancer Detection and Radiological Grading. CoRR abs/2206.13173 (2022) - [i168]Junyu Xie, Weidi Xie, Andrew Zisserman:
Segmenting Moving Objects via an Object-Centric Layered Representation. CoRR abs/2207.02206 (2022) - [i167]Chuhan Zhang, Ankush Gupta, Andrew Zisserman:
Is an Object-Centric Video Representation Beneficial for Transfer? CoRR abs/2207.10075 (2022) - [i166]