default search action
Ming-Hsuan Yang 0001
Person information
- affiliation: University of California, Merced, CA, USA
Other persons with the same name
- Ming-Hsuan Yang
- Ming-Hsuan Yang 0002 — Arizona University, Tucson, AZ, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j167]Ling Yang, Zhilong Zhang, Yang Song, Shenda Hong, Runsheng Xu, Yue Zhao, Wentao Zhang, Bin Cui, Ming-Hsuan Yang:
Diffusion Models: A Comprehensive Survey of Methods and Applications. ACM Comput. Surv. 56(4): 105:1-105:39 (2024) - [j166]Zhiwei Lin, Tingting Liang, Taihong Xiao, Yongtao Wang, Ming-Hsuan Yang:
FlowNAS: Neural Architecture Search for Optical Flow Estimation. Int. J. Comput. Vis. 132(4): 1055-1074 (2024) - [j165]Wenqi Ren, Senyou Deng, Kaihao Zhang, Fenglong Song, Xiaochun Cao, Ming-Hsuan Yang:
Fast Ultra High-Definition Video Deblurring via Multi-scale Separable Network. Int. J. Comput. Vis. 132(5): 1817-1834 (2024) - [j164]Guorong Li, Hanhua Ye, Yuankai Qi, Shuhui Wang, Laiyun Qing, Qingming Huang, Ming-Hsuan Yang:
Learning Hierarchical Modular Networks for Video Captioning. IEEE Trans. Pattern Anal. Mach. Intell. 46(2): 1049-1064 (2024) - [j163]Lu Zhang, Lu Qi, Xu Yang, Hong Qiao, Ming-Hsuan Yang, Zhiyong Liu:
Automatically Discovering Novel Visual Categories With Adaptive Prototype Learning. IEEE Trans. Pattern Anal. Mach. Intell. 46(4): 2533-2544 (2024) - [j162]Jun Luo, Yunfeng Nie, Wenqi Ren, Xiaochun Cao, Ming-Hsuan Yang:
Correcting Optical Aberration via Depth-Aware Point Spread Functions. IEEE Trans. Pattern Anal. Mach. Intell. 46(8): 5541-5555 (2024) - [j161]Ziheng Yan, Yuankai Qi, Guorong Li, Xinyan Liu, Weigang Zhang, Ming-Hsuan Yang, Qingming Huang:
Progressive Multi-Resolution Loss for Crowd Counting. IEEE Trans. Circuits Syst. Video Technol. 34(5): 3232-3244 (2024) - [j160]Kaihao Zhang, Tao Wang, Wenhan Luo, Wenqi Ren, Björn Stenger, Wei Liu, Hongdong Li, Ming-Hsuan Yang:
MC-Blur: A Comprehensive Benchmark for Image Deblurring. IEEE Trans. Circuits Syst. Video Technol. 34(5): 3755-3767 (2024) - [j159]Abdelrahman M. Shaker, Muhammad Maaz, Hanoona Abdul Rasheed, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
UNETR++: Delving Into Efficient and Accurate 3D Medical Image Segmentation. IEEE Trans. Medical Imaging 43(9): 3377-3390 (2024) - [j158]Xin Li, Wenjie Pei, Yaowei Wang, Zhenyu He, Huchuan Lu, Ming-Hsuan Yang:
Self-Supervised Tracking via Target-Aware Data Synthesis. IEEE Trans. Neural Networks Learn. Syst. 35(7): 9186-9197 (2024) - [c379]Youming Deng, Xueting Li, Sifei Liu, Ming-Hsuan Yang:
Physics-based Indirect Illumination for Inverse Rendering. 3DV 2024: 1249-1258 - [c378]Zhiwei Lin, Yongtao Wang, Shengxiang Qi, Nan Dong, Ming-Hsuan Yang:
BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios. AAAI 2024: 3531-3539 - [c377]Hao Zhang, Fang Li, Lu Qi, Ming-Hsuan Yang, Narendra Ahuja:
CSL: Class-Agnostic Structure-Constrained Learning for Segmentation Including the Unseen. AAAI 2024: 7078-7086 - [c376]Gaoxiang Cong, Yuankai Qi, Liang Li, Amin Beheshti, Zhedong Zhang, Anton van den Hengel, Ming-Hsuan Yang, Chenggang Yan, Qingming Huang:
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing. ACL (Findings) 2024: 6767-6779 - [c375]Junyi Zhang, Charles Herrmann, Junhwa Hur, Eric Chen, Varun Jampani, Deqing Sun, Ming-Hsuan Yang:
Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence. CVPR 2024: 3076-3085 - [c374]Lu Qi, Lehan Yang, Weidong Guo, Yu Xu, Bo Du, Varun Jampani, Ming-Hsuan Yang:
UniGS: Unified Representation for Image Generation and Segmentation. CVPR 2024: 6305-6315 - [c373]Kelvin C. K. Chan, Yang Zhao, Xuhui Jia, Ming-Hsuan Yang, Huisheng Wang:
Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance. CVPR 2024: 6733-6742 - [c372]Yuanze Lin, Yi-Wen Chen, Yi-Hsuan Tsai, Lu Jiang, Ming-Hsuan Yang:
Text-Driven Image Editing via Learnable Regions. CVPR 2024: 7059-7068 - [c371]Hsin-Ying Lee, Hung-Yu Tseng, Hsin-Ying Lee, Ming-Hsuan Yang:
Exploiting Diffusion Prior for Generalizable Dense Prediction. CVPR 2024: 7861-7871 - [c370]Hanoona Abdul Rasheed, Muhammad Maaz, Sahal Shaji Mullappilly, Abdelrahman M. Shaker, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer, Eric P. Xing, Ming-Hsuan Yang, Fahad Shahbaz Khan:
GLaMM: Pixel Grounding Large Multimodal Model. CVPR 2024: 13009-13018 - [c369]Tsai-Shien Chen, Aliaksandr Siarohin, Willi Menapace, Ekaterina Deyneka, Hsiang-wei Chao, Byung Eun Jeon, Yuwei Fang, Hsin-Ying Lee, Jian Ren, Ming-Hsuan Yang, Sergey Tulyakov:
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers. CVPR 2024: 13320-13331 - [c368]Kuan-Chih Huang, Weijie Lyu, Ming-Hsuan Yang, Yi-Hsuan Tsai:
PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection. CVPR 2024: 14938-14947 - [c367]Syed Talal Wasim, Muzammal Naseer, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
VideoGrounding-DINO: Towards Open-Vocabulary Spatio- Temporal Video Grounding. CVPR 2024: 18909-18918 - [c366]Yuqing Huang, Xin Li, Zikun Zhou, Yaowei Wang, Zhenyu He, Ming-Hsuan Yang:
RTracker: Recoverable Tracking via PN Tree Structured Memory. CVPR 2024: 19038-19047 - [c365]Xinyan Liu, Guorong Li, Yuankai Qi, Ziheng Yan, Zhenjun Han, Anton van den Hengel, Ming-Hsuan Yang, Qingming Huang:
Weakly Supervised Video Individual Counting. CVPR 2024: 19228-19237 - [c364]Xiaoyu Zhou, Zhiwei Lin, Xiaojun Shan, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang:
DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes. CVPR 2024: 21634-21643 - [c363]Chengxu Liu, Xuan Wang, Xiangyu Xu, Ruhao Tian, Shuai Li, Xueming Qian, Ming-Hsuan Yang:
Motion-Adaptive Separable Collaborative Filters for Blind Motion Deblurring. CVPR 2024: 25595-25605 - [c362]Yu-Ju Tsai, Jin-Cheng Jhang, Jingjing Zheng, Wei Wang, Albert Y. C. Chen, Min Sun, Cheng-Hao Kuo, Ming-Hsuan Yang:
No More Ambiguity in 360° Room Layout via Bi-Layout Estimation. CVPR 2024: 28056-28065 - [c361]Yu-Ju Tsai, Yu-Lun Liu, Lu Qi, Kelvin C. K. Chan, Ming-Hsuan Yang:
Dual Associated Encoder for Face Restoration. ICLR 2024 - [c360]Yuanhao Xiong, Long Zhao, Boqing Gong, Ming-Hsuan Yang, Florian Schroff, Ting Liu, Cho-Jui Hsieh, Liangzhe Yuan:
Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding. ICLR 2024 - [c359]Lijun Yu, José Lezama, Nitesh Bharadwaj Gundavarapu, Luca Versari, Kihyuk Sohn, David Minnen, Yong Cheng, Agrim Gupta, Xiuye Gu, Alexander G. Hauptmann, Boqing Gong, Ming-Hsuan Yang, Irfan Essa, David A. Ross, Lu Jiang:
Language Model Beats Diffusion - Tokenizer is key to visual generation. ICLR 2024 - [c358]Long Zhao, Nitesh Bharadwaj Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang, David A. Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu, Boqing Gong:
VideoPrism: A Foundational Visual Encoder for Video Understanding. ICML 2024 - [c357]Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Grant Schindler, Rachel Hornung, Vighnesh Birodkar, Jimmy Yan, Ming-Chang Chiu, Krishna Somandepalli, Hassan Akbari, Yair Alon, Yong Cheng, Joshua V. Dillon, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, Mikhail Sirotenko, Kihyuk Sohn, Xuan Yang, Hartwig Adam, Ming-Hsuan Yang, Irfan Essa, Huisheng Wang, David A. Ross, Bryan Seybold, Lu Jiang:
VideoPoet: A Large Language Model for Zero-Shot Video Generation. ICML 2024 - [c356]Zhaoliang Wan, Yonggen Ling, Senlin Yi, Lu Qi, Wang Wei Lee, Minglei Lu, Sicheng Yang, Xiao Teng, Peng Lu, Xu Yang, Ming-Hsuan Yang, Hui Cheng:
VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception. ICML 2024 - [c355]Xiaoyu Zhou, Xingjian Ran, Yajiao Xiong, Jinlin He, Zhiwei Lin, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang:
GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting. ICML 2024 - [c354]Christoph Mayer, Martin Danelljan, Ming-Hsuan Yang, Vittorio Ferrari, Luc Van Gool, Alina Kuznetsova:
Beyond SOT: Tracking Multiple Generic Objects at Once. WACV 2024: 6812-6822 - [i303]Syed Talal Wasim, Muzammal Naseer, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding. CoRR abs/2401.00901 (2024) - [i302]Shilin Xu, Haobo Yuan, Qingyu Shi, Lu Qi, Jingbo Wang, Yibo Yang, Yining Li, Kai Chen, Yunhai Tong, Bernard Ghanem, Xiangtai Li, Ming-Hsuan Yang:
RAP-SAM: Towards Real-Time All-Purpose Segment Anything. CoRR abs/2401.10228 (2024) - [i301]Tao Wang, Wanglong Lu, Kaihao Zhang, Wenhan Luo, Tae-Kyun Kim, Tong Lu, Hongdong Li, Ming-Hsuan Yang:
PromptRR: Diffusion Models as Prompt Generators for Single Image Reflection Removal. CoRR abs/2402.02374 (2024) - [i300]Lu Qi, Yi-Wen Chen, Lehan Yang, Tiancheng Shen, Xiangtai Li, Weidong Guo, Yu Xu, Ming-Hsuan Yang:
Generalizable Entity Grounding via Assistance of Large Language Model. CoRR abs/2402.02555 (2024) - [i299]Xiaoyu Zhou, Xingjian Ran, Yajiao Xiong, Jinlin He, Zhiwei Lin, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang:
GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting. CoRR abs/2402.07207 (2024) - [i298]Divin Yan, Lu Qi, Vincent Tao Hu, Ming-Hsuan Yang, Meng Tang:
Training Class-Imbalanced Diffusion Model Via Overlap Optimization. CoRR abs/2402.10821 (2024) - [i297]Gaoxiang Cong, Yuankai Qi, Liang Li, Amin Beheshti, Zhedong Zhang, Anton van den Hengel, Ming-Hsuan Yang, Chenggang Yan, Qingming Huang:
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing. CoRR abs/2402.12636 (2024) - [i296]Long Zhao, Nitesh Bharadwaj Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang, David A. Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu, Boqing Gong:
VideoPrism: A Foundational Visual Encoder for Video Understanding. CoRR abs/2402.13217 (2024) - [i295]Zhengxue Wang, Zhiqiang Yan, Ming-Hsuan Yang, Jinshan Pan, Jian Yang, Ying Tai, Guangwei Gao:
Scene Prior Filtering for Depth Map Super-Resolution. CoRR abs/2402.13876 (2024) - [i294]Hankyul Kang, Ming-Hsuan Yang, Jongbin Ryu:
Interactive Multi-Head Self-Attention with Linear Complexity. CoRR abs/2402.17507 (2024) - [i293]Tsai-Shien Chen, Aliaksandr Siarohin, Willi Menapace, Ekaterina Deyneka, Hsiang-wei Chao, Byung Eun Jeon, Yuwei Fang, Hsin-Ying Lee, Jian Ren, Ming-Hsuan Yang, Sergey Tulyakov:
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers. CoRR abs/2402.19479 (2024) - [i292]Abdelrahman M. Shaker, Syed Talal Wasim, Martin Danelljan, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Efficient Video Object Segmentation via Modulated Cross-Attention Memory. CoRR abs/2403.17937 (2024) - [i291]Yuqing Huang, Xin Li, Zikun Zhou, Yaowei Wang, Zhenyu He, Ming-Hsuan Yang:
RTracker: Recoverable Tracking via PN Tree Structured Memory. CoRR abs/2403.19242 (2024) - [i290]Akshay Dudhane, Omkar Thawakar, Syed Waqas Zamir, Salman H. Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration. CoRR abs/2404.02154 (2024) - [i289]Zhongyu Xia, Zhiwei Lin, Xinhao Wang, Yongtao Wang, Yun Xing, Shengxiang Qi, Nan Dong, Ming-Hsuan Yang:
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras. CoRR abs/2404.02517 (2024) - [i288]Pin-Hung Kuo, Jinshan Pan, Shao-Yi Chien, Ming-Hsuan Yang:
Mansformer: Efficient Transformer of Mixed Attention for Image Deblurring and Beyond. CoRR abs/2404.06135 (2024) - [i287]Deshui Miao, Xin Li, Zhenyu He, Huchuan Lu, Ming-Hsuan Yang:
Spatial-Temporal Multi-level Association for Video Object Segmentation. CoRR abs/2404.06265 (2024) - [i286]Bohao Peng, Zhuotao Tian, Shu Liu, Ming-Hsuan Yang, Jiaya Jia:
Scalable Language Model with Generalized Continual Learning. CoRR abs/2404.07470 (2024) - [i285]Weijie Lyu, Xueting Li, Abhijit Kundu, Yi-Hsuan Tsai, Ming-Hsuan Yang:
Gaga: Group Any Gaussians via 3D-aware Memory Bank. CoRR abs/2404.07977 (2024) - [i284]Yu-Ju Tsai, Jin-Cheng Jhang, Jingjing Zheng, Wei Wang, Albert Y. C. Chen, Min Sun, Cheng-Hao Kuo, Ming-Hsuan Yang:
No More Ambiguity in 360° Room Layout via Bi-Layout Estimation. CoRR abs/2404.09993 (2024) - [i283]Chieh Hubert Lin, Changil Kim, Jia-Bin Huang, Qinbo Li, Chih-Yao Ma, Johannes Kopf, Ming-Hsuan Yang, Hung-Yu Tseng:
Taming Latent Diffusion Model for Neural Radiance Field Inpainting. CoRR abs/2404.09995 (2024) - [i282]Hao-Wei Chen, Yu-Syuan Xu, Kelvin C. K. Chan, Hsien-Kai Kuo, Chun-Yi Lee, Ming-Hsuan Yang:
AdaIR: Exploiting Underlying Similarities of Image Restoration Tasks with Adapters. CoRR abs/2404.11475 (2024) - [i281]Chengxu Liu, Xuan Wang, Xiangyu Xu, Ruhao Tian, Shuai Li, Xueming Qian, Ming-Hsuan Yang:
Motion-adaptive Separable Collaborative Filters for Blind Motion Deblurring. CoRR abs/2404.13153 (2024) - [i280]Kelvin C. K. Chan, Yang Zhao, Xuhui Jia, Ming-Hsuan Yang, Huisheng Wang:
Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance. CoRR abs/2405.01356 (2024) - [i279]I-Hsiang Chen, Wei-Ting Chen, Yu-Wei Liu, Ming-Hsuan Yang, Sy-Yen Kuo:
Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance. CoRR abs/2405.10589 (2024) - [i278]Lingshun Kong, Jiangxin Dong, Ming-Hsuan Yang, Jinshan Pan:
Efficient Visual State Space Model for Image Deblurring. CoRR abs/2405.14343 (2024) - [i277]Kuan-Chih Huang, Xiangtai Li, Lu Qi, Shuicheng Yan, Ming-Hsuan Yang:
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model. CoRR abs/2405.17427 (2024) - [i276]Bin Ren, Yawei Li, Jingyun Liang, Rakesh Ranjan, Mengyuan Liu, Rita Cucchiara, Luc Van Gool, Ming-Hsuan Yang, Nicu Sebe:
Sharing Key Semantics in Transformer Makes Efficient Image Restoration. CoRR abs/2405.20008 (2024) - [i275]Chaoyang Wang, Xiangtai Li, Lu Qi, Henghui Ding, Yunhai Tong, Ming-Hsuan Yang:
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow. CoRR abs/2405.20282 (2024) - [i274]Deshui Miao, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang:
1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation. CoRR abs/2406.04600 (2024) - [i273]Henghui Ding, Chang Liu, Yunchao Wei, Nikhila Ravi, Shuting He, Song Bai, Philip Torr, Deshui Miao, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang, Zhensong Xu, Jiangtao Yao, Chengjing Wu, Ting Liu, Luoqi Liu, Xinyu Liu, Jing Zhang, Kexin Zhang, Yuting Yang, Licheng Jiao, Shuyuan Yang, Mingqi Gao, Jingnan Luo, Jinyu Yang, Jungong Han, Feng Zheng, Bin Cao, Yisi Zhang, Xuanxu Lin, Xingjian He, Bo Zhao, Jing Liu, Feiyu Pan, Hao Fang, Xiankai Lu:
PVUW 2024 Challenge on Complex Video Understanding: Methods and Results. CoRR abs/2406.17005 (2024) - [i272]Haobo Yuan, Xiangtai Li, Lu Qi, Tao Zhang, Ming-Hsuan Yang, Shuicheng Yan, Chen Change Loy:
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model. CoRR abs/2406.19369 (2024) - [i271]Shuangkang Fang, Yufeng Wang, Yi-Hsuan Tsai, Yi Yang, Wenrui Ding, Shuchang Zhou, Ming-Hsuan Yang:
Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts. CoRR abs/2407.06842 (2024) - [i270]Xin Li, Deshui Miao, Zhenyu He, Yaowei Wang, Huchuan Lu, Ming-Hsuan Yang:
Learning Spatial-Semantic Features for Robust Video Object Segmentation. CoRR abs/2407.07760 (2024) - [i269]Mingyang Zhao, Xiaohong Jia, Lei Ma, Yuke Shi, Jingen Jiang, Qizhai Li, Ming-Hsuan Yang, Tiejun Huang:
A Bayesian Approach Toward Robust Multidimensional Ellipsoid-Specific Fitting. CoRR abs/2407.19269 (2024) - [i268]Shilin Xu, Xiangtai Li, Haobo Yuan, Lu Qi, Yunhai Tong, Ming-Hsuan Yang:
LLAVADI: What Matters For Multimodal Large Language Models Distillation. CoRR abs/2407.19409 (2024) - [i267]Seung Hyun Lee, Junjie Ke, Yinxiao Li, Junfeng He, Steven Hickson, Katie Datsenko, Sangpil Kim, Ming-Hsuan Yang, Irfan Essa, Feng Yang:
Cropper: Vision-Language Model for Image Cropping through In-Context Learning. CoRR abs/2408.07790 (2024) - [i266]Xin Lin, Yuyan Zhou, Jingtong Yue, Chao Ren, Kelvin C. K. Chan, Lu Qi, Ming-Hsuan Yang:
Re-boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration. CoRR abs/2408.09241 (2024) - [i265]Deshui Miao, Yameng Gu, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang:
Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS. CoRR abs/2408.16431 (2024) - 2023
- [j157]Yan-Bo Lin, Hung-Yu Tseng, Hsin-Ying Lee, Yen-Yu Lin, Ming-Hsuan Yang:
Unsupervised sound localization via iterative contrastive learning. Comput. Vis. Image Underst. 227: 103602 (2023) - [j156]Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao:
Learning Enriched Features for Fast Image Restoration and Enhancement. IEEE Trans. Pattern Anal. Mach. Intell. 45(2): 1934-1948 (2023) - [j155]Weitao Wan, Cheng Yu, Jiansheng Chen, Tong Wu, Yuanyi Zhong, Ming-Hsuan Yang:
Shaping Deep Feature Space Towards Gaussian Mixture for Visual Classification. IEEE Trans. Pattern Anal. Mach. Intell. 45(2): 2430-2444 (2023) - [j154]Weihao Xia, Yulun Zhang, Yujiu Yang, Jing-Hao Xue, Bolei Zhou, Ming-Hsuan Yang:
GAN Inversion: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 45(3): 3121-3138 (2023) - [j153]Shanghua Gao, Zhong-Yu Li, Ming-Hsuan Yang, Ming-Ming Cheng, Junwei Han, Philip H. S. Torr:
Large-Scale Unsupervised Semantic Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 45(6): 7457-7476 (2023) - [j152]Jie Cao, Mandi Luo, Junchi Yu, Ming-Hsuan Yang, Ran He:
ScoreMix: A Scalable Augmentation Strategy for Training GANs With Limited Data. IEEE Trans. Pattern Anal. Mach. Intell. 45(7): 8920-8935 (2023) - [j151]Jinshan Pan, Boming Xu, Haoran Bai, Jinhui Tang, Ming-Hsuan Yang:
Cascaded Deep Video Deblurring Using Temporal Sharpness Prior and Non-Local Spatial-Temporal Similarity. IEEE Trans. Pattern Anal. Mach. Intell. 45(8): 9411-9425 (2023) - [j150]Salman H. Khan, Fahad Shahbaz Khan, Ashish Vaswani, Niki Parmar, Ming-Hsuan Yang, Mubarak Shah:
Guest Editorial Introduction to the Special Section on Transformer Models in Vision. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 12721-12725 (2023) - [j149]Yunfan Liu, Qi Li, Qiyao Deng, Zhenan Sun, Ming-Hsuan Yang:
GAN-Based Facial Attribute Manipulation. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 14590-14610 (2023) - [j148]Zheheng Jiang, Zhihua Liu, Long Chen, Lei Tong, Xiangrong Zhang, Xiangyuan Lan, Danny Crookes, Ming-Hsuan Yang, Huiyu Zhou:
Detecting and Tracking of Multiple Mice Using Part Proposal Networks. IEEE Trans. Neural Networks Learn. Syst. 34(12): 9806-9820 (2023) - [c353]Chun-Han Yao, Wei-Chih Hung, Yuanzhen Li, Michael Rubinstein, Ming-Hsuan Yang, Varun Jampani:
Hi-LASSIE: High-Fidelity Articulated Shape and Skeleton Discovery from Sparse Image Ensemble. CVPR 2023: 4853-4862 - [c352]Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Burstormer: Burst Image Restoration and Enhancement Transformer. CVPR 2023: 5703-5712 - [c351]Lijun Yu, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa, Lu Jiang:
MAGVIT: Masked Generative Video Transformer. CVPR 2023: 10459-10469 - [c350]Yunhao Ge, Jie Ren, Andrew Gallagher, Yuxiao Wang, Ming-Hsuan Yang, Hartwig Adam, Laurent Itti, Balaji Lakshminarayanan, Jiaping Zhao:
Improving Zero-shot Generalization and Robustness of Multi-Modal Models. CVPR 2023: 11093-11101 - [c349]Hsin-Ping Huang, Charles Herrmann, Junhwa Hur, Erika Lu, Kyle Sargent, Austin Stone, Ming-Hsuan Yang, Deqing Sun:
Self-supervised AutoFlow. CVPR 2023: 11412-11421 - [c348]Gaoxiang Cong, Liang Li, Yuankai Qi, Zheng-Jun Zha, Qi Wu, Wenyu Wang, Bin Jiang, Ming-Hsuan Yang, Qingming Huang:
Learning to Dub Movies via Hierarchical Prosody Models. CVPR 2023: 14687-14697 - [c347]Chen Zhang, Guorong Li, Yuankai Qi, Shuhui Wang, Laiyun Qing, Qingming Huang, Ming-Hsuan Yang:
Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detection. CVPR 2023: 16271-16280 - [c346]Botao Ye, Sifei Liu, Xueting Li, Ming-Hsuan Yang:
Self-Supervised Super-Plane for Neural 3D Reconstruction. CVPR 2023: 21415-21424 - [c345]William Chettleburgh, Zhishen Huang, Ming-Hsuan Yang:
Fast Robust Principle Component Analysis Using Gauss-Newton Iterations. ICASSP 2023: 1-5 - [c344]Lu Qi, Jason Kuen, Tiancheng Shen, Jiuxiang Gu, Wenbo Li, Weidong Guo, Jiaya Jia, Zhe Lin, Ming-Hsuan Yang:
High Quality Entity Segmentation. ICCV 2023: 4024-4033 - [c343]Kuan-Chih Huang, Ming-Hsuan Yang, Yi-Hsuan Tsai:
Delving into Motion-Aware Matching for Monocular 3D Object Tracking. ICCV 2023: 6886-6895 - [c342]Long Zhao, Liangzhe Yuan, Boqing Gong, Yin Cui, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu:
Unified Visual Relationship Detection with Vision and Language Models. ICCV 2023: 6939-6950 - [c341]Amandeep Kumar, Ankan Kumar Bhunia, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Generative Multiplane Neural Radiance for 3D-Aware Image Generation. ICCV 2023: 7354-7364 - [c340]Xin Li, Yuqing Huang, Zhenyu He, Yaowei Wang, Huchuan Lu, Ming-Hsuan Yang:
CiteTracker: Correlating Image and Text for Visual Tracking. ICCV 2023: 9940-9949 - [c339]Joungbin An, Hyolim Kang, Su Ho Han, Ming-Hsuan Yang, Seon Joo Kim:
MiniROAD: Minimal RNN Framework for Online Action Detection. ICCV 2023: 10307-10316 - [c338]Muhammad Uzair Khattak, Syed Talal Wasim, Muzammal Naseer, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Self-regulating Prompts: Foundational Model Adaptation without Forgetting. ICCV 2023: 15144-15154 - [c337]Abdelrahman M. Shaker, Muhammad Maaz, Hanoona Abdul Rasheed, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications. ICCV 2023: 17379-17390 - [c336]Yunhao Ge, Yuecheng Li, Shuo Ni, Jiaping Zhao, Ming-Hsuan Yang, Laurent Itti:
CLR: Channel-wise Lightweight Reprogramming for Continual Learning. ICCV 2023: 18752-18762 - [c335]Chieh Hubert Lin, Hsin-Ying Lee, Willi Menapace, Menglei Chai, Aliaksandr Siarohin, Ming-Hsuan Yang, Sergey Tulyakov:
InfiniCity: Infinite-Scale City Synthesis. ICCV 2023: 22751-22761 - [c334]Xiaoyu Zhou, Zhiwei Lin, Xiaojun Shan, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang:
SAMPLING: Scene-adaptive Hierarchical Multiplane Images Representation for Novel View Synthesis from a Single Image. ICCV 2023: 22773-22783 - [c333]