


Image and Vision Computing, Volume 161
Volume 161, 2025
- Wei Xu, Yi Wan, Dong Zhao, Long Zhang:
Efficient Mamba: Overcoming the visual limitations of Mamba with innovative structures. 105569
- Zihao Zhao, Dinghui Wu, Qibing Zhu, Hao Wang, Yuxi Ge, Shudong Hu:
Mmi-Unet: Colorectal cancer CT image segmentation based on multi-modal information interaction. 105583
- Yanliang Ge, Jiaxue Chen, Taichuan Liang, Yuxi Zhong, Hongbo Bi, Qiao Zhang:
Consensus exploration and detail perception for co-salient object detection in optical remote sensing images. 105586
- Wangyu Choi, Jiasi Chen, Jongwon Yoon:
ADVC: Adversarial dense video captioning with unsupervised pretraining. 105595
- Chenyu Zhuang, Qing Zhang, Chenxi Zhang, Xinxin Yuan:
Boundary-and-object collaborative learning network for camouflaged object detection. 105596
- Haijun Xiong, Bin Feng, Bang Wang, Xinggang Wang, Wenyu Liu:
MambaGait: Gait recognition approach combining explicit representation and implicit state space model. 105597
- Quentin Jodelet, Xin Liu, Yin Jun Phua, Tsuyoshi Murata:
Memory augmented using diffusion model for class-incremental learning. 105600
- Oladosu Oladimeji, Abdullah-Al-Zubaer Imran, Xiaoqin Wang, Saritha Unnikrishnan:
Deep learning advances in breast medical imaging with a focus on clinical readiness and radiologists' perspective. 105601
- Fei Wu, Ruixuan Zhou, Yang Gao, Yujian Feng, Qinghua Huang, Xiao-Yuan Jing:
Semi-supervised cross-modality person re-identification based on pseudo label learning. 105602
- Amaan Izhar, Norisma Idris, Nurul Japar:
Enhancing radiology report generation: A prior knowledge-aware transformer network for effective alignment and fusion of multi-modal radiological data. 105603
- Thien B. Nguyen-Tat, Anh T. Vu-Xuan, Vuong M. Ngo:
Design of a novel fuzzy ensemble CNN framework for ovarian cancer classification using Tissue Microarray images. 105604
- Guanhua An, Yuhe Geng, Shengyu Fang, Jichang Guo:
SFDFNet: Leveraging spatial-frequency deep fusion for RGB-T semantic segmentation. 105605
- Zhibing Wang, Wenmin Wang, Nannan Li, Qi Chen, Yifan Zhang, Meng Xiao, Haomei Jia, Shenyong Zhang:
Multimodal Sensitive Adaptive Transformer for 3D medical image segmentation. 105606
- Qi Kuang, Ying Chen:
Visual-Aware Text as Query for Referring Video Object Segmentation. 105608
- Xin Song, Wang Tian, Qiqi Zhu, Xianglong Zhang:
VideoMamba++: Integrating state space model with dual attention for enhanced video understanding. 105609
- Jing Zhang, Yi Yu, Yuyao Mao, Yonggong Ren:
Event-level multimodal feature fusion for audio-visual event localization. 105610
- Chhavi Maheshwari, Siddhanth Bhat, Praveen Kumar Shukla, Madhu Oruganti, Vijaypal Singh Dhaka:
Similarity verification of kinship pairs using metricized emphasis. 105619
- Yanliang Ge, Yuxi Zhong, Qiao Zhang, Junchao Ren, Min He, Hongbo Bi:
Research on collaborative camouflaged object detection under dual domain entanglement. 105623
- Qing Tian, Jiashuo Shen, Zixiao Zhou, Jixin Sun, Junyu Shen, Weihua Ou:
Camera information-induced vision transformer for unsupervised person re-identification. 105624
- Yiwen Zhang, Dong An, Dongzhao Yang, Tianxu Xu, Yuxuan He, Qiang Wang, Zhongqi Pan, Yang Yue:
Point-cloud-based hand gesture recognition using principal component analysis and boundary extraction. 105625
- Vishwas Rathi, Abhilasha Sharma, Aditya Venkata Nithin, Amit Kumar Singh, Brij B. Gupta:
A survey of computational techniques for fine art painting classification. 105626
- Tianyi Fu, Hongbin Dong, Benyi Yang, Baosong Deng:
DE-DFNet: Edge enhanced diversity feature fusion guided by differences in remote sensing imagery tiny object detection. 105627
- Liangzun Fu, Jin Chen, Yang Zhang, Xiwei Huang, Lingling Sun:
CNN and Transformer-based deep learning models for automated white blood cell detection. 105631
- Yuanwu Xu, Minxian Li, Qiaolin Ye, Shidong Wang, Lunbo Li, Haofeng Zhang:
CREAM: Few-shot Object Counting with Cross REfinement and Adaptive density Map. 105632
- Uzair Aslam Bhatti, Jinru Liu, Mengxing Huang, Yu Zhang:
FF-UNet: Feature fusion based deep learning-powered enhanced framework for accurate brain tumor segmentation in MRI images. 105635
- Saeed Karimi, Hamdi Dibeklioglu:
SATA: Style Agnostic Test time Adaptation for domain generalization. 105607
- Xu-Hua Yang, Hong-Xiang Hu, Xuanyu Lin:
ACMC: Adaptive cross-modal multi-grained contrastive learning for continuous sign language recognition. 105622
- Xiaobin Hong, Tarmizi Adam, Masitah Ghazali:
UHDNet: Unified multimodal fusion harmonization and hierarchical dependency learning for visible-infrared person re-identification. 105628
- Marco Gagliardi, Danilo Maurmo, Tommaso Ruga, Eugenio Vocaturo, Ester Zumpano:
BrAInVision: A hybrid explainable Artificial Intelligence framework for brain MRI analysis. 105629
- Alice Natalina Caragliano, Filippo Ruffini, Carlo Greco, Edy Ippolito, Michele Fiore, Claudia Tacconi, Lorenzo Nibid, Giuseppe Perrone, Sara Ramella, Paolo Soda, Valerio Guarrasi:
Doctor-in-the-Loop: An explainable, multi-view deep learning framework for predicting pathological response in non-small cell lung cancer. 105630
- Yongzhi Liu, Tongxin Yan:
Vision transformer enhanced with convolutional attention and graph convolution for semantic segmentation. 105633
- Wenzhe Zhai, Mingliang Gao, Gwanggil Jeon, Qiang Zhou, David Camacho:
Composed image retrieval by Multimodal Mixture-of-Expert Synergy. 105634
- Hengyou Wang, Rongxin Ma, Xiang Jiang:
Multiscale contextual joint feature enhancement GAN for semantic image synthesis. 105637
- Yixin Guo, Zhenxue Chen, Xuewen Rong, Chengyun Liu, Lili Song, Yidi Li:
3CNet: Cross-modal cooperative correction network for RGB-T semantic segmentation. 105638
- Shengkun Qi, Bing Liu, Yong Zhou, Peng Liu, Chen Zhang, Siyu Chen:
UniFormer: Consistency regularization-based semi-supervised semantic segmentation via differential dual-branch strongly augmented perturbations. 105640
- Huan Yang, Runtao Liu, Xiaotong Zhou, Yuhui Zheng, Ru Zhao:
Expert-scoring guided global information interaction network for lightweight image super-resolution. 105642
- Shengnan Fan, Zhilei Chai, Zhijun Fang, Yuying Pan, Hui Shen, Xiangyu Cheng, Qin Wu:
MaxSwap-Enhanced Knowledge Consistency Learning for long-tailed recognition. 105643
- Hongchang Zhang, Longtao Wang, Qizhan Zou, Juan Zeng:
DFF-Net: Deep Feature Fusion Network for low-light image enhancement. 105645
- Sini Raj Pulari, Maramreddy Umadevi, Shriram K. Vasudevan:
Optimizing multimodal personalized disease prediction accuracy using generated prompts and large language models. 105649
- Jin Huagang, Zhou Yu:
Multi-scale pyramid convolution transformer for remote-sensing object detection. 105651
- Jordan Daniel Joshua, Young Beom Kim, Jin Young Lee:
DMFASR: Dense multi-feature aggregation-based super-resolution for brain MR images. 105652
- Lakshita Aggarwal, Vikram Ranjan, Ananya Sharma:
Real time image processing and smart healthcare using eXplainable artificial intelligence (XAI). 105653
- P. Joyce Beryl Princess, Salaja Silas, Elijah Blessing Rajsingh, Xiao-Zhi Gao:
Human posture recognition using random search neural architecture for accident injury severity prediction and victim identification. 105654
- Jinyong Cheng, Qinghao Cui, Guohua Lv:
BSMEF: Optimized multi-exposure image fusion using B-splines and Mamba. 105660
