


default search action
Fahad Shahbaz Khan
Fahad Khan 0001
Person information
- affiliation: Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, AUE
- affiliation: Linköping University, Sweden
Other persons with the same name
- Fahad Khan — disambiguation page
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
- [j54]Omkar Thawakar, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Salman H. Khan, Jorma Laaksonen, Mubarak Shah, Fahad Shahbaz Khan:
Video Instance Segmentation in an Open-World. Int. J. Comput. Vis. 133(1): 398-409 (2025) - [j53]Saeed Anwar, Muhammad Tahir, Chongyi Li, Ajmal Mian, Fahad Shahbaz Khan, Abdul Wahab Muzaffar:
Image colorization: A survey and dataset. Inf. Fusion 114: 102720 (2025) - [j52]Ge Li
, Jiale Cao
, Hanqing Sun
, Rao Muhammad Anwer
, Jin Xie
, Fahad Khan
, Yanwei Pang
:
Video Instance Segmentation Without Using Mask and Identity Supervision. IEEE Trans. Multim. 27: 224-235 (2025) - [c198]Umair Nawaz, Muhammad Awais, Hanan Gani, Muzammal Naseer, Fahad Shahbaz Khan, Salman H. Khan, Rao Muhammad Anwer:
AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment. COLING 2025: 9630-9639 - 2024
- [j51]Shahina K. Kunhimon
, Abdelrahman M. Shaker
, Muzammal Naseer
, Salman H. Khan, Fahad Shahbaz Khan:
Learnable weight initialization for volumetric medical image segmentation. Artif. Intell. Medicine 151: 102863 (2024) - [j50]Jyoti Kini
, Fahad Shahbaz Khan, Salman Khan, Mubarak Shah:
CT-VOS: Cutout prediction and tagging for self-supervised video object segmentation. Comput. Vis. Image Underst. 238: 103860 (2024) - [j49]Florinel-Alin Croitoru, Nicolae-Catalin Ristea, Dana Dascalescu, Radu Tudor Ionescu
, Fahad Shahbaz Khan, Mubarak Shah:
Lightning fast video anomaly detection via multi-scale adversarial distillation. Comput. Vis. Image Underst. 247: 104074 (2024) - [j48]Yaxing Wang
, Abel Gonzalez-Garcia, Chenshen Wu, Luis Herranz, Fahad Shahbaz Khan, Shangling Jui, Jian Yang, Joost van de Weijer
:
MineGAN++: Mining Generative Models for Efficient Knowledge Transfer to Limited Data Domains. Int. J. Comput. Vis. 132(2): 490-514 (2024) - [j47]Mohammed Hassanin, Saeed Anwar, Ibrahim Radwan, Fahad Shahbaz Khan, Ajmal Mian
:
Visual attention methods in deep learning: An in-depth survey. Inf. Fusion 108: 102417 (2024) - [j46]Long Li
, Junwei Han
, Nian Liu
, Salman H. Khan
, Hisham Cholakkal
, Rao Muhammad Anwer
, Fahad Khan
:
Robust Perception and Precise Segmentation for Scribble-Supervised RGB-D Saliency Detection. IEEE Trans. Pattern Anal. Mach. Intell. 46(1): 479-496 (2024) - [j45]Neelu Madan
, Nicolae-Catalin Ristea
, Radu Tudor Ionescu
, Kamal Nasrollahi
, Fahad Shahbaz Khan
, Thomas B. Moeslund
, Mubarak Shah
:
Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection. IEEE Trans. Pattern Anal. Mach. Intell. 46(1): 525-542 (2024) - [j44]Lei Huang
, Yunhao Ni
, Xi Weng
, Rao Muhammad Anwer
, Salman Khan
, Ming-Hsuan Yang
, Fahad Khan
:
Understanding Whitening Loss in Self-Supervised Learning. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 9479-9492 (2024) - [j43]Mustansar Fiaz
, Mubashir Noman, Hisham Cholakkal, Rao Muhammad Anwer, Jacob Hanna, Fahad Shahbaz Khan:
Guided-attention and gated-aggregation network for medical image segmentation. Pattern Recognit. 156: 110812 (2024) - [j42]Mubashir Noman, Mustansar Fiaz
, Hisham Cholakkal
, Salman H. Khan
, Fahad Shahbaz Khan
:
ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection. IEEE Trans. Geosci. Remote. Sens. 62: 1-11 (2024) - [j41]Mubashir Noman
, Mustansar Fiaz
, Hisham Cholakkal
, Sanath Narayan
, Rao Muhammad Anwer
, Salman Khan
, Fahad Khan
:
Remote Sensing Change Detection With Transformers Trained From Scratch. IEEE Trans. Geosci. Remote. Sens. 62: 1-14 (2024) - [j40]Abdelrahman M. Shaker
, Muhammad Maaz, Hanoona Abdul Rasheed, Salman H. Khan
, Ming-Hsuan Yang
, Fahad Shahbaz Khan
:
UNETR++: Delving Into Efficient and Accurate 3D Medical Image Segmentation. IEEE Trans. Medical Imaging 43(9): 3377-3390 (2024) - [j39]Muzammal Naseer
, Salman H. Khan
, Fatih Porikli
, Fahad Shahbaz Khan:
Guidance Through Surrogate: Toward a Generic Diagnostic Attack. IEEE Trans. Neural Networks Learn. Syst. 35(2): 2042-2053 (2024) - [j38]Yao Jiang
, Xinyu Yan
, Ge-Peng Ji
, Keren Fu
, Meijun Sun
, Huan Xiong
, Deng-Ping Fan
, Fahad Shahbaz Khan
:
Effectiveness assessment of recent large vision-language models. Vis. Intell. 2(1): 17 (2024) - [c197]Sahal Shaji Mullappilly, Abhishek Singh Gehlot, Rao Muhammad Anwer, Fahad Shahbaz Khan, Hisham Cholakkal:
Semi-supervised Open-World Object Detection. AAAI 2024: 4305-4314 - [c196]Sheng Zhang, Muzammal Naseer, Guangyi Chen
, Zhiqiang Shen, Salman H. Khan, Kun Zhang, Fahad Khan:
S3A: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment. AAAI 2024: 7278-7286 - [c195]Hashmat Shadab Malik
, Muhammad Huzaifa, Muzammal Naseer
, Salman Khan
, Fahad Khan
:
ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes. ACCV (5) 2024: 400-417 - [c194]Muhammad Maaz, Hanoona Abdul Rasheed, Salman Khan, Fahad Khan:
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models. ACL (1) 2024: 12585-12602 - [c193]Omkar Chakradhar Thawakar, Abdelrahman M. Shaker, Sahal Shaji Mullappilly, Hisham Cholakkal, Rao Muhammad Anwer, Salman H. Khan, Jorma Laaksonen, Fahad Khan:
XrayGPT: Chest Radiographs Summarization using Large Medical Vision-Language Models. BioNLP@ACL 2024: 440-448 - [c192]Amaya Dharmasiri, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
Cross-Modal Self-Training: Aligning Images and Pointclouds to learn Classification without Labels. CVPR Workshops 2024: 708-717 - [c191]Bin Xie, Jiale Cao, Jin Xie, Fahad Shahbaz Khan, Yanwei Pang:
SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation. CVPR 2024: 3426-3436 - [c190]Hanoona Abdul Rasheed, Muhammad Maaz, Sahal Shaji Mullappilly, Abdelrahman M. Shaker, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer, Eric P. Xing, Ming-Hsuan Yang, Fahad Shahbaz Khan:
GLaMM: Pixel Grounding Large Multimodal Model. CVPR 2024: 13009-13018 - [c189]Nicolae-Catalin Ristea, Florinel-Alin Croitoru, Radu Tudor Ionescu, Marius Popescu, Fahad Shahbaz Khan, Mubarak Shah:
Self-Distilled Masked Auto-Encoders are Efficient Video Anomaly Detectors. CVPR 2024: 15984-15995 - [c188]Ziyang Luo, Nian Liu, Wangbo Zhao, Xuguang Yang, Dingwen Zhang, Deng-Ping Fan, Fahad Khan, Junwei Han:
VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning. CVPR 2024: 17169-17180 - [c187]Syed Talal Wasim, Muzammal Naseer, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
VideoGrounding-DINO: Towards Open-Vocabulary Spatio- Temporal Video Grounding. CVPR 2024: 18909-18918 - [c186]Wenjin Hou, Shiming Chen, Shuhuang Chen, Ziming Hong, Yan Wang, Xuetao Feng, Salman H. Khan, Fahad Shahbaz Khan, Xinge You:
Visual-Augmented Dynamic Semantic Prototype for Generative Zero-Shot Learning. CVPR 2024: 23627-23637 - [c185]Shiming Chen, Wenjin Hou, Salman H. Khan, Fahad Shahbaz Khan:
Progressive Semantic-Guided Vision Transformer for Zero-Shot Learning. CVPR 2024: 23964-23974 - [c184]Omkar Thawakar, Muzammal Naseer, Rao Muhammad Anwer, Salman H. Khan, Michael Felsberg, Mubarak Shah, Fahad Shahbaz Khan:
Composed Video Retrieval via Enriched Context and Discriminative Embeddings. CVPR 2024: 26886-26896 - [c183]Mubashir Noman, Muzammal Naseer, Hisham Cholakkal, Rao Muhammad Anwer, Salman H. Khan, Fahad Shahbaz Khan:
Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery. CVPR 2024: 27811-27819 - [c182]Kartik Kuckreja, Muhammad Sohail Danish, Muzammal Naseer, Abhijit Das, Salman Khan, Fahad Shahbaz Khan:
GeoChat: Grounded Large Vision-Language Model for Remote Sensing. CVPR 2024: 27831-27840 - [c181]Jin Zhang
, Ruiheng Zhang
, Yanjiao Shi
, Zhe Cao
, Nian Liu
, Fahad Shahbaz Khan
:
Learning Camouflaged Object Detection from Noisy Pseudo Label. ECCV (1) 2024: 158-174 - [c180]Long Li
, Nian Liu
, Dingwen Zhang
, Zhongyu Li, Salman Khan, Rao Muhammad Anwer, Hisham Cholakkal
, Junwei Han
, Fahad Shahbaz Khan
:
CONDA: Condensed Deep Association Learning for Co-salient Object Detection. ECCV (50) 2024: 287-303 - [c179]Mohamed El Amine Boudjoghra
, Jean Lahoud
, Hisham Cholakkal
, Rao Muhammad Anwer
, Salman Khan
, Fahad Shahbaz Khan
:
Continual Learning and Unknown Object Discovery in 3D Scenes via Self-distillation. ECCV (73) 2024: 416-431 - [c178]Sara Pieri, Sahal Shaji Mullappilly, Fahad Shahbaz Khan, Rao Muhammad Anwer, Salman H. Khan, Timothy Baldwin, Hisham Cholakkal:
BiMediX: Bilingual Medical Mixture of Experts LLM. EMNLP (Findings) 2024: 16984-17002 - [c177]Yang Bai, Xinxing Xu, Yong Liu, Salman Khan, Fahad Khan, Wangmeng Zuo, Rick Siow Mong Goh, Chun-Mei Feng:
Sentence-level Prompts Benefit Composed Image Retrieval. ICLR 2024 - [c176]Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang:
Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models. ICLR 2024 - [c175]Xi Weng, Yunhao Ni, Tengwei Song, Jie Luo, Rao Muhammad Anwer, Salman Khan, Fahad Khan, Lei Huang:
Modulate Your Spectrum in Self-Supervised Learning. ICLR 2024 - [c174]Yuanwei Liu, Junwei Han, Xiwen Yao, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Nian Liu, Fahad Shahbaz Khan:
Bidirectional Reciprocative Information Communication for Few-Shot Semantic Segmentation. ICML 2024 - [c173]Jean Lahoud, Fahad Shahbaz Khan, Hisham Cholakkal, Rao Muhammad Anwer, Salman H. Khan:
Long-Tailed 3D Semantic Segmentation with Adaptive Weight Constraint and Sampling. ICRA 2024: 5037-5044 - [c172]Shahina K. Kunhimon, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
Language Guided Domain Generalized Medical Image Segmentation. ISBI 2024: 1-5 - [c171]Hasindri Watawana, Kanchana Ranasinghe, Tariq Mahmood, Muzammal Naseer, Salman H. Khan, Fahad Shahbaz Khan:
Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning. MICCAI (4) 2024: 167-177 - [c170]Hanan Gani, Muzammal Naseer, Fahad Khan, Salman H. Khan:
MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation. MICCAI (12) 2024: 229-239 - [c169]Asif Hanif, Fahad Shamshad, Muhammad Awais, Muzammal Naseer, Fahad Shahbaz Khan, Karthik Nandakumar, Salman H. Khan, Rao Muhammad Anwer:
BAPLe: Backdoor Attacks on Medical Foundational Models Using Prompt Learning. MICCAI (12) 2024: 443-453 - [c168]Chao Qin, Jiale Cao, Huazhu Fu, Fahad Shahbaz Khan, Rao Muhammad Anwer:
DB-SAM: Delving into High Quality Universal Medical Image Segmentation. MICCAI (12) 2024: 498-508 - [c167]Jiahua Dong, Wenqi Liang, Hongliu Li, Duzhen Zhang, Meng Cao, Henghui Ding, Salman H. Khan, Fahad Shahbaz Khan:
How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization? NeurIPS 2024 - [c166]Taihang Hu, Linxuan Li, Joost van de Weijer, Hongcheng Gao, Fahad Shahbaz Khan, Jian Yang, Ming-Ming Cheng, Kai Wang, Yaxing Wang:
Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis. NeurIPS 2024 - [c165]Senmao Li, Taihang Hu, Joost van de Weijer, Fahad Shahbaz Khan, Tao Liu, Linxuan Li, Shiqi Yang, Yaxing Wang, Ming-Ming Cheng, Jian Yang:
Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference. NeurIPS 2024 - [c164]Haotian Qian, Yinda Chen, Shengtao Lou, Fahad Shahbaz Khan, Xiaogang Jin, Deng-Ping Fan:
MaskFactory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation. NeurIPS 2024 - [i220]Syed Talal Wasim, Muzammal Naseer, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding. CoRR abs/2401.00901 (2024) - [i219]Dmitry Demidov, Roba Al Majzoub, Amandeep Kumar, Fahad Shahbaz Khan:
Distilling Local Texture Features for Colorectal Tissue Classification in Low Data Regimes. CoRR abs/2401.01164 (2024) - [i218]Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang:
Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models. CoRR abs/2402.05375 (2024) - [i217]Sara Pieri, Sahal Shaji Mullappilly, Fahad Shahbaz Khan, Rao Muhammad Anwer, Salman H. Khan, Timothy Baldwin, Hisham Cholakkal:
BiMediX: Bilingual Medical Mixture of Experts LLM. CoRR abs/2402.13253 (2024) - [i216]Muhammad Maaz, Hanoona Abdul Rasheed, Abdelrahman M. Shaker, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer, Tim Baldwin, Michael Felsberg, Fahad Shahbaz Khan:
PALO: A Polyglot Large Multimodal Model for 5B People. CoRR abs/2402.14818 (2024) - [i215]Sahal Shaji Mullappilly, Abhishek Singh Gehlot, Rao Muhammad Anwer, Fahad Shahbaz Khan, Hisham Cholakkal:
Semi-supervised Open-World Object Detection. CoRR abs/2402.16013 (2024) - [i214]Omkar Thawakar, Ashmal Vayani, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer, Michael Felsberg, Tim Baldwin, Eric P. Xing, Fahad Shahbaz Khan:
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT. CoRR abs/2402.16840 (2024) - [i213]Hanan Gani
, Muzammal Naseer, Fahad Khan, Salman Khan:
MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation. CoRR abs/2402.17725 (2024) - [i212]Yao Jiang, Xinyu Yan, Ge-Peng Ji, Keren Fu, Meijun Sun, Huan Xiong, Deng-Ping Fan, Fahad Shahbaz Khan:
Effectiveness Assessment of Recent Large Vision-Language Models. CoRR abs/2403.04306 (2024) - [i211]Hashmat Shadab Malik, Muhammad Huzaifa, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes. CoRR abs/2403.04701 (2024) - [i210]Mubashir Noman, Muzammal Naseer, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan:
Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery. CoRR abs/2403.05419 (2024) - [i209]Yuning Cui, Syed Waqas Zamir, Salman H. Khan, Alois Knoll, Mubarak Shah, Fahad Shahbaz Khan:
AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation. CoRR abs/2403.14614 (2024) - [i208]Hasindri Watawana, Kanchana Ranasinghe, Tariq Mahmood, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning. CoRR abs/2403.14616 (2024) - [i207]Ahmad Mahmood, Ashmal Vayani, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding. CoRR abs/2403.14743 (2024) - [i206]Omkar Thawakar, Muzammal Naseer, Rao Muhammad Anwer, Salman H. Khan, Michael Felsberg, Mubarak Shah, Fahad Shahbaz Khan:
Composed Video Retrieval via Enriched Context and Discriminative Embeddings. CoRR abs/2403.16997 (2024) - [i205]Mubashir Noman, Mustansar Fiaz
, Hisham Cholakkal, Salman Khan, Fahad Shahbaz Khan:
ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection. CoRR abs/2403.17909 (2024) - [i204]Abdelrahman M. Shaker, Syed Talal Wasim, Martin Danelljan, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Efficient Video Object Segmentation via Modulated Cross-Attention Memory. CoRR abs/2403.17937 (2024) - [i203]Shahina K. Kunhimon, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
Language Guided Domain Generalized Medical Image Segmentation. CoRR abs/2404.01272 (2024) - [i202]Akshay Dudhane, Omkar Thawakar, Syed Waqas Zamir, Salman H. Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration. CoRR abs/2404.02154 (2024) - [i201]Shiming Chen, Wenjin Hou, Salman H. Khan, Fahad Shahbaz Khan:
Progressive Semantic-Guided Vision Transformer for Zero-Shot Learning. CoRR abs/2404.07713 (2024) - [i200]Amaya Dharmasiri, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
Cross-Modal Self-Training: Aligning Images and Pointclouds to Learn Classification without Labels. CoRR abs/2404.10146 (2024) - [i199]Wenjin Hou, Shiming Chen, Shuhuang Chen, Ziming Hong, Yan Wang, Xuetao Feng, Salman Khan, Fahad Shahbaz Khan, Xinge You:
Visual-Augmented Dynamic Semantic Prototype for Generative Zero-Shot Learning. CoRR abs/2404.14808 (2024) - [i198]Muhammad Uzair Khattak, Muhammad Ferjad Naeem, Jameel Hassan, Muzammal Naseer, Federico Tombari, Fahad Shahbaz Khan, Salman H. Khan:
How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs. CoRR abs/2405.03690 (2024) - [i197]Jiahua Dong, Hui Yin, Hongliu Li, Wenbo Li, Yulun Zhang, Salman Khan, Fahad Shahbaz Khan:
Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging. CoRR abs/2406.00449 (2024) - [i196]Mohamed El Amine Boudjoghra, Angela Dai, Jean Lahoud, Hisham Cholakkal, Rao Muhammad Anwer, Salman H. Khan, Fahad Shahbaz Khan:
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation. CoRR abs/2406.02548 (2024) - [i195]Yuhao Li, Muzammal Naseer, Jiale Cao, Yu Zhu, Jinqiu Sun, Yanning Zhang, Fahad Shahbaz Khan:
Multi-Granularity Language-Guided Multi-Object Tracking. CoRR abs/2406.04844 (2024) - [i194]Hashmat Shadab Malik, Numan Saeed, Asif Hanif, Muzammal Naseer, Mohammad Yaqub, Salman H. Khan, Fahad Shahbaz Khan:
On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models. CoRR abs/2406.08486 (2024) - [i193]Hashmat Shadab Malik, Fahad Shamshad, Muzammal Naseer, Karthik Nandakumar, Fahad Shahbaz Khan, Salman H. Khan:
Towards Evaluating the Robustness of Visual State Space Models. CoRR abs/2406.09407 (2024) - [i192]Muhammad Maaz, Hanoona Abdul Rasheed, Salman Khan, Fahad Shahbaz Khan:
VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding. CoRR abs/2406.09418 (2024) - [i191]Rohit K. Bharadwaj, Hanan Gani, Muzammal Naseer, Fahad Shahbaz Khan, Salman H. Khan:
VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs. CoRR abs/2406.10326 (2024) - [i190]Akshita Gupta, Aditya Arora, Sanath Narayan, Salman Khan, Fahad Shahbaz Khan, Graham W. Taylor:
Open-Vocabulary Temporal Action Localization using Multimodal Guidance. CoRR abs/2406.15556 (2024) - [i189]Jin Zhang, Ruiheng Zhang, Yanjiao Shi, Zhe Cao, Nian Liu, Fahad Shahbaz Khan:
Learning Camouflaged Object Detection from Noisy Pseudo Label. CoRR abs/2407.13157 (2024) - [i188]Abdelrahman M. Shaker, Syed Talal Wasim, Salman Khan, Juergen Gall, Fahad Shahbaz Khan:
GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model. CoRR abs/2407.13772 (2024) - [i187]Yasheng Sun, Bohan Li, Mingchen Zhuge, Deng-Ping Fan, Salman H. Khan, Fahad Shahbaz Khan, Hideki Koike:
Connecting Dreams with Visual Brainstorming Instruction. CoRR abs/2408.07317 (2024) - [i186]Asif Hanif, Fahad Shamshad, Muhammad Awais, Muzammal Naseer, Fahad Shahbaz Khan, Karthik Nandakumar, Salman H. Khan, Rao Muhammad Anwer:
BAPLe: Backdoor Attacks on Medical Foundational Models using Prompt Learning. CoRR abs/2408.07440 (2024) - [i185]Lin Sun, Jiale Cao, Jin Xie, Fahad Shahbaz Khan, Yanwei Pang:
iSeg: An Iterative Refinement-based Framework for Training-free Segmentation. CoRR abs/2409.03209 (2024) - [i184]Muhammad Akhtar Munir, Fahad Shahbaz Khan, Salman H. Khan:
Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region. CoRR abs/2409.07585 (2024) - [i183]Mubashir Noman, Noor Ahsan, Muzammal Naseer, Hisham Cholakkal, Rao Muhammad Anwer, Salman H. Khan, Fahad Shahbaz Khan:
CDChat: A Large Multimodal Model for Remote Sensing Change Description. CoRR abs/2409.16261 (2024) - [i182]Umair Nawaz, Muhammad Awais, Hanan Gani, Muzammal Naseer, Fahad Shahbaz Khan, Salman Khan, Rao Muhammad Anwer:
AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment. CoRR abs/2410.01407 (2024) - [i181]Ayesha Ishaq, Mohamed El Amine Boudjoghra, Jean Lahoud, Fahad Shahbaz Khan, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer:
Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking. CoRR abs/2410.01678 (2024) - [i180]Chao Qin, Jiale Cao, Huazhu Fu, Fahad Shahbaz Khan, Rao Muhammad Anwer:
DB-SAM: Delving into High Quality Universal Medical Image Segmentation. CoRR abs/2410.04172 (2024) - [i179]Ge-Peng Ji, Jing Liu, Peng Xu, Nick Barnes, Fahad Shahbaz Khan, Salman H. Khan, Deng-Ping Fan:
Frontiers in Intelligent Colonoscopy. CoRR abs/2410.17241 (2024) - [i178]Jiahua Dong, Wenqi Liang, Hongliu Li, Duzhen Zhang, Meng Cao, Henghui Ding, Salman H. Khan, Fahad Shahbaz Khan:
How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization? CoRR abs/2410.17594 (2024) - [i177]Sara Ghaboura, Ahmed Heakl, Omkar Thawakar, Ali Husain Salem Abdulla Alharthi, Ines Riahi, Abduljalil Saif, Jorma Laaksonen, Fahad Shahbaz Khan, Salman H. Khan, Rao Muhammad Anwer:
CAMEL-Bench: A Comprehensive Arabic LMM Benchmark. CoRR abs/2410.18976 (2024) - [i176]Shehan Munasinghe, Hanan Gani, Wenqi Zhu, Jiale Cao, Eric P. Xing, Fahad Shahbaz Khan, Salman Khan:
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos. CoRR abs/2411.04923 (2024) - [i175]Taihang Hu, Linxuan Li, Joost van de Weijer, Hongcheng Gao, Fahad Shahbaz Khan, Jian Yang, Ming-Ming Cheng, Kai Wang, Yaxing Wang:
Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis. CoRR abs/2411.07132 (2024) - [i174]Dubing Chen, Jin Fang, Wencheng Han, Xinjing Cheng, Jun Yin, Chenzhong Xu, Fahad Shahbaz Khan, Jianbing Shen:
ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction. CoRR abs/2411.07725 (2024) - [i173]Ashmal Vayani, Dinura Dissanayake, Hasindri Watawana, Noor Ahsan, Nevasini Sasikumar, Omkar Thawakar, Henok Biadglign Ademtew, Yahya Hmaiti, Amandeep Kumar, Kartik Kuckreja, Mykola Maslych, Wafa Al Ghallabi, Mihail Mihaylov, Chao Qin, Abdelrahman M. Shaker, Mike Zhang, Mahardika Krisna Ihsani, Amiel Esplana, Monil Gokani, Shachar Mirkin, Harsh Singh, Ashay Srivastava, Endre Hamerlik, Fathinah Asma Izzati, Fadillah Adamsyah Maani, Sebastian Cavada, Jenny Chim, Rohit Gupta, Sanjay Manjunath, Kamila Zhumakhanova, Feno Heriniaina Rabevohitra, Azril Amirudin, Muhammad Ridzuan, Daniya Kareem, Ketan More, Kunyang Li, Pramesh Shakya, Muhammad Saad, Amirpouya Ghasemaghaei, Amirbek Djanibekov, Dilshod Azizov, Branislava Jankovic, Naman Bhatia, Alvaro Cabrera, Johan S. Obando-Ceron, Olympiah Otieno, Fabian Farestam, Muztoba Rabbani, Sanoojan Baliah, Santosh Sanjeev, Abduragim Shtanchaev, Maheen Fatima, Thao Nguyen, Amrin Kareem, Toluwani Aremu, Nathan Xavier, Amit Bhatkal, Hawau Toyin, Aman Chadha, Hisham Cholakkal, Rao Muhammad Anwer, Michael Felsberg, Jorma Laaksonen, Thamar Solorio, Monojit Choudhury, Ivan Laptev, Mubarak Shah, Salman Khan, Fahad Khan:
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages. CoRR abs/2411.16508 (2024) - [i172]Muhammad Sohail Danish, Muhammad Akhtar Munir, Syed Roshaan Ali Shah, Kartik Kuckreja, Fahad Shahbaz Khan, Paolo Fraccaro, Alexandre Lacoste, Salman H. Khan:
GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks. CoRR abs/2411.19325 (2024) - [i171]Florinel-Alin Croitoru, Andrei-Iulian Hiji, Vlad Hondru, Nicolae-Catalin Ristea, Paul Irofti, Marius Popescu, Cristian Rusu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Mubarak Shah:
Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook. CoRR abs/2411.19537 (2024) - [i170]Sahal Shaji Mullappilly, Mohammed Irfan Kurpath, Sara Pieri, Saeed Yahya Alseiari, Shanavas Cholakkal, Khaled Aldahmani, Fahad Khan, Rao Muhammad Anwer, Salman Khan, Timothy Baldwin, Hisham Cholakkal:
BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities. CoRR abs/2412.07769 (2024) - [i169]Muhammad Uzair Khattak, Shahina K. Kunhimon, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalities. CoRR abs/2412.10372 (2024) - [i168]Sagar Soni, Akshay Dudhane, Hiyam Debary, Mustansar Fiaz, Muhammad Akhtar Munir, Muhammad Sohail Danish, Paolo Fraccaro, Campbell D. Watson, Levente J. Klein, Fahad Shahbaz Khan, Salman H. Khan:
EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues. CoRR abs/2412.15190 (2024) - [i167]Dingjie Fu, Wenjin Hou, Shiming Chen, Shuhuang Chen, Xinge You, Salman H. Khan, Fahad Shahbaz Khan:
Discriminative Image Generation with Diffusion Models for Zero-Shot Learning. CoRR abs/2412.17219 (2024) - [i166]Haotian Qian, YD Chen, Shengtao Lou, Fahad Shahbaz Khan, Xiaogang Jin, Deng-Ping Fan:
Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation. CoRR abs/2412.19080 (2024) - 2023
- [j37]Antonio Barbalau, Radu Tudor Ionescu
, Mariana-Iuliana Georgescu, Jacob V. Dueholm
, Bharathkumar Ramachandra
, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund
, Mubarak Shah:
SSMTL++: Revisiting self-supervised multi-task learning for video anomaly detection. Comput. Vis. Image Underst. 229: 103656 (2023) - [j36]Haotong Qin
, Ge-Peng Ji
, Salman Khan
, Deng-Ping Fan
, Fahad Shahbaz Khan
,