


default search action
Image and Vision Computing, Volume 154
Volume 154, 2025
- Achraf Ouahab, Olfa Ben Ahmed
:
ProtoMed: Prototypical networks with auxiliary regularization for few-shot medical image classification. 105337 - Le Jin
, Guoshun Zhou
, Zherong Liu
, Yuanchao Yu
, Teng Zhang
, Minghui Yang
, Jun Zhou
:
IRPE: Instance-level reconstruction-based 6D pose estimator. 105340 - Meiying Gu
, Jiahe Li, Yuchen Wu, Haonan Luo, Jin Zheng, Xiao Bai:
3D human avatar reconstruction with neural fields: A recent survey. 105341 - Naigong Yu
, Yifan Fu, Qiusheng Xie, Qiming Cheng, Mohammad Mehedi Hasan:
Feature extraction and fusion algorithm for infrared visible light images based on residual and generative adversarial network. 105346 - Biying Fu
, Abdenour Hadid, Naser Damer:
Generative AI in the context of assistive technologies: Trends, limitations and future directions. 105347 - Yehui Wang
, Fang Lei, Baoyan Wang, Qiang Zhang, Xiantong Zhen, Lei Zhang:
De-noising mask transformer for referring image segmentation. 105356 - Quan Tang
, Fagui Liu
, Dengke Zhang
, Jun Jiang
, Xuhao Tang
, C. L. Philip Chen
:
Increase the sensitivity of moderate examples for semantic image segmentation. 105357 - Xian Fang
, Jiatong Chen, Yaming Wang, Mingfeng Jiang, Jianhua Ma, Xin Wang:
EPFDNet: Camouflaged object detection with edge perception in frequency domain. 105358 - Yuewen Zhang, Jiuhang Wang, Hongying Tang, Ronghua Qin:
DALSCLIP: Domain aggregation via learning stronger domain-invariant features for CLIP. 105359 - Liehao Wu, Laihua Wang, Guanghui Wei, Yang Yu:
HPD-Depth: High performance decoding network for self-supervised monocular depth estimation. 105360 - Jinxin Shao
, Haosu Zhang
, Jianming Miao
:
GPLM: Enhancing underwater images with Global Pyramid Linear Modulation. 105361 - Yuxiang Wu, Xiaoyan Wang, Xiaoyan Liu, Yuzhao Gao, Yan Dou:
Pixel integration from fine to coarse for lightweight image super-resolution. 105362 - Hong Zhang, Jianbo Song, Hanyang Liu, Yang Han, Yifan Yang, Huimin Ma:
AwareTrack: Object awareness for visual tracking via templates interaction. 105363 - Yan Zhang, Zenghui Li, Duo Shen, Ke Wang
, Jia Li, Chenxing Xia:
Information gap based knowledge distillation for occluded facial expression recognition. 105365 - Xuezhi Xiang
, Xiankun Zhou, Yingxin Wei, Xi Wang, Yulong Qiao:
Scene flow estimation from point cloud based on grouped relative self-attention. 105368 - Yan Huang, Huixin Luo, Yong Xu, Xianbing Meng
:
A temporally-aware noise-informed invertible network for progressive video denoising. 105369 - Ruchika Sharma
, Rudresh Dwivedi
:
Unmasking deepfakes: Eye blink pattern analysis using a hybrid LSTM and MLP-CNN model. 105370 - Akshat Dhamale, Ratnavel Rajalakshmi, Ananthakrishnan Balasundaram:
Dual multi scale networks for medical image segmentation using contrastive learning. 105371 - Longxin Zhang
, Wenliang Zeng, Peng Zhou, Xiaojun Deng, Jiayu Wu, Hong Wen
:
A fast and lightweight train image fault detection model based on convolutional neural networks. 105380 - Qing Pan, Xiayuan Feng, Nili Tian:
EDCAANet: A lightweight COD network based on edge detection and coordinate attention assistance. 105382 - Shihui Zhang, Gangzheng Zhai
, Kun Chen, Houlin Wang
, Shaojie Han:
CFENet: Context-aware Feature Enhancement Network for efficient few-shot object counting. 105383 - Zhenshan Hu, Bin Ge, Chenxing Xia:
Multiangle feature fusion network for style transfer. 105386 - Alireza Esmaeilzehi, Amir Mohammad Babaei, Farshid Nooshi, Hossein Zaredar, M. Omair Ahmad:
CLBSR: A deep curriculum learning-based blind image super resolution network using geometrical prior. 105364 - Pablo Garcia-Fernandez
, Daniel Cores, Manuel Mucientes:
Enhancing few-shot object detection through pseudo-label mining. 105379 - Siliang Ma, Yong Xu:
FPDIoU Loss: A loss function for efficient bounding box regression of rotated object detection. 105381 - Qiujie Ma, Shuqi Yang, Lijuan Zhang, Qing Lan, Dongdong Yang, Honghan Chen, Ying Tan:
APOVIS: Automated pixel-level open-vocabulary instance segmentation through integration of pre-trained vision-language models and foundational segmentation models. 105384 - Vladislav Li, Ilias Siniosoglou
, Thomai Karamitsou, Anastasios Lytos
, Ioannis D. Moscholios, Sotirios K. Goudos
, Jyoti S. Banerjee, Panagiotis G. Sarigiannidis
, Vasileios Argyriou
:
Enhancing 3D object detection in autonomous vehicles based on synthetic virtual environment analysis. 105385 - Rajat Kumar Arya
, Siddhant Jain, Pratik Chattopadhyay, Rajeev Srivastava:
HSIRMamba: An effective feature learning for hyperspectral image classification using residual Mamba. 105387 - Zhongyue Wang
, Ying Chen
:
Spatial-temporal sequential network for anomaly detection based on long short-term magnitude representation. 105388 - Xiaoyi Xu, Hui Cai, Mingjie Wang, Weiling Chen, Rongxin Zhang, Tiesong Zhao:
Exploring underwater image quality: A review of current methodologies and emerging trends. 105389 - Oscar Ondeng
, Heywood Ouma
, Peter O. Akuon:
Enriching visual feature representations for vision-language tasks using spectral transforms. 105390 - Chen Wang, Huifang Ma, Di Zhang, Xiaolong Li, Zhixin Li:
Enhancing weakly supervised semantic segmentation with efficient and robust neighbor-attentive superpixel aggregation. 105391 - Yufeng Cheng, Dongxue Wang, Shuang Bai, Jingkai Ma, Chen Liang, Kailong Liu, Tao Deng:
Understanding document images by introducing explicit semantic information and short-range information interaction. 105392 - Muxin Liao, Shishun Tian, Yuhang Zhang, Guoguang Hua, Rong You, Wenbin Zou, Xia Li:
Class-discriminative domain generalization for semantic segmentation. 105393 - Youwei Li, Junyong Ye, Xubin Wen, Guangyi Xu, Jingjing Wang, Xinyuan Liu:
PAdapter: Adapter combined with prompt for image and video classification. 105395 - Qihui Li
, Zongtan Li, Lianfang Tian, Qiliang Du, Guoyu Lu:
MD-Mamba: Feature extractor on 3D representation with multi-view depth. 105396 - Zhiguo Liu, Yuqi Chen
, Yuan Gao:
Rotating-YOLO: A novel YOLO model for remote sensing rotating object detection. 105397 - Wai Keung Wong, Hao Liang, Hongkun Sun, Weijun Sun, Haoliang Yuan, Shuping Zhao, Lunke Fei:
Learning to estimate 3D interactive two-hand poses with attention perception. 105398 - Hao Zhai, Peng Chen, Nannan Luo, Qinyu Li, Ping Yu:
CPFusion: A multi-focus image fusion method based on closed-loop regularization. 105399 - Peng Zan, Yuerong Wang, Haohao Hu, Wanjun Zhong, Tianyu Han, Jingwei Yue
:
An Active Transfer Learning framework for image classification based on Maximum Differentiation Classifier. 105401 - Jiaqi Zhu, Bin Li, Xinhua Zhao:
TPSFusion: A Transformer-based pyramid screening fusion network for 6D pose estimation. 105402 - Xingzheng Wang, Jianbin Wu, Shaoyong Wu, Jiahui Li:
SAMNet: Adapting segment anything model for accurate light field salient object detection. 105403 - Ding Gao, Qian Wang, Jian Yang, Junlong Wu:
Domain adaptive object detection via synthetically generated intermediate domain and progressive feature alignment. 105404 - Anshu Singh, Maheshwari Prasad Singh, Amit Kumar Singh:
Adversarially Enhanced Learning (AEL): Robust lightweight deep learning approach for radiology image classification against adversarial attacks. 105405 - Ran Gong
, Anna Zhu, Kun Liu:
Edge guided and Fourier attention-based Dual Interaction Network for scene text erasing. 105406 - Qianhao Wu, Xixi Jiang, Dong Zhang, Yifei Feng, Jinhui Tang:
Cross-set data augmentation for semi-supervised medical image segmentation. 105407 - Jie Zhong, Aiguo Chen, Yizhang Jiang, Chengcheng Sun, Yuheng Peng:
Lightweight and efficient feature fusion real-time semantic segmentation network. 105408 - Qing Li, Xiaojiang Peng, Chuan Yan, Pan Gao, Qi Hao:
Self-ensembling for 3D point cloud domain adaptation. 105409 - Yinghua Fu, Zhaofeng Liu, Jiansheng Peng, Rohit Gupta, Dawei Zhang:
GANSD: A generative adversarial network based on saliency detection for infrared and visible image fusion. 105410 - Yusong Li
, Longwei Xu, Weibin Yang, Dehua Geng, Mingyuan Xu, Zhiqi Dong, Pengwei Wang:
1D kernel distillation network for efficient image super-resolution. 105411 - Yingjie Jin, Xiaofei Zhou
, Zhenjie Zhang, Hao Fang, Ran Shi, Xiaobin Xu:
Hierarchical spatiotemporal Feature Interaction Network for video saliency prediction. 105413 - Yanliang Ge, Jinghuai Pan, Junchao Ren, Min He, Hongbo Bi, Qiao Zhang
:
Co-salient object detection with consensus mining and consistency cross-layer interactive decoding. 105414 - Bin Wang, Yuying Liang, Lei Cai, Huakun Huang, Huanqiang Zeng:
Image re-identification: Where self-supervision meets vision-language learning. 105415 - Rongji Li, Ziqian Wang:
AGSAM-Net: UAV route planning and visual guidance model for bridge surface defect detection. 105416 - Haewon Byeon, Mohammed E. Seno, Divya Nimma, Janjhyam Venkata Naga Ramesh, Abdelhamid Zaïdi, Azzah AlGhamdi, Ismail Keshta, Mukesh Soni, Mohammad Shabaz:
Privacy-preserving explainable AI enable federated learning-based denoising fingerprint recognition model. 105420 - Youming Chen, Ting Tuo, Lijun Guo
, Rong Zhang, Yirui Wang, Shangce Gao:
Robust auxiliary modality is beneficial for video-based cloth-changing person re-identification. 105400 - Chandni, Monika Sachdeva, Alok Kumar Singh Kushwaha:
AI-based intelligent hybrid framework (BO-DenseXGB) for multi- classification of brain tumor using MRI. 105417 - Ahmed Abul Hasanaath, Hamzah Luqman
, Raed Katib, Saeed Anwar:
FSBI: Deepfake detection with frequency enhanced self-blended images. 105418 - M. Raveenthini, R. Lavanya, Raul Benitez:
Grad-CAM based explanations for multiocular disease detection using Xception net. 105419 - Yunfei Chen
, Yitian Long, Zhan Yang, Jun Long:
Correlation embedding semantic-enhanced hashing for multimedia retrieval. 105421 - Ayushi Verma, Tapas Badal, Abhay Bansal:
A novel framework for diverse video generation from a single video using frame-conditioned denoising diffusion probabilistic model and ConvNeXt-V2. 105422 - David Freire-Obregón
, João Neves
, Ziga Emersic, Blaz Meden, Modesto Castrillón Santana, Hugo Proença:
Synthesizing multilevel abstraction ear sketches for enhanced biometric recognition. 105424 - Siyuan Zheng, Weiqun Cao:
EHGFormer: An efficient hypergraph-injected transformer for 3D human pose estimation. 105425 - Hongjuan Pei, Jiaying Chen, Shihao Gao, Taisong Jin, Ke Lu:
Skeleton action recognition via group sparsity constrained variant graph auto-encoder. 105426 - Umar Islam
, Hathal Salamah Alwageed, Saleh Alyahyan, Manal Alghieth, Hanif Ullah
, Naveed Khan
:
An ontological approach to investigate the impact of deep convolutional neural networks in anomaly detection of left ventricular hypertrophy using echocardiography images. 105427 - Yuanlong Wang, Hengtao Jiang, Guanying Chen, Tong Zhang, Jiaqing Zhou, Zezheng Qing, Chunyan Wang, Wanzhong Zhao:
Efficient and robust multi-camera 3D object detection in bird-eye-view. 105428 - Achyut Shankar, Hariprasath Manoharan, Adil Omar Khadidos, Alaa O. Khadidos, Shitharth Selvarajan
, S. B. Goyal:
Transparency and privacy measures of biometric patterns for data processing with synthetic data using explainable artificial intelligence. 105429 - Jiming Yang, Feipeng Da, Ru Hong:
Semantic-aware for point cloud domain adaptation with self-distillation learning. 105430 - Chi Zhang, Yun Gao, Tao Meng, Tao Wang:
Partitioned token fusion and pruning strategy for transformer tracking. 105431 - Rehman Abbas, Naijie Gu, Asma Aldrees, Muhammad Umer
, Abeer Hakeem, Shtwai Alsubai
, Lucia Cascone:
Advancing brain tumor segmentation and grading through integration of FusionNet and IBCO-based ALCResNet. 105432 - Ding Yuan, Sizhe Zhang, Hong Zhang, Yangyan Deng, Yifan Yang:
EMA-GS: Improving sparse point cloud rendering with EMA gradient and anchor upsampling. 105433 - Sicheng Zhu
, Luping Ji
, Shengjia Chen, Weiwei Duan:
Spatial-temporal-channel collaborative feature learning with transformers for infrared small target detection. 105435

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.