


default search action
33rd ICANN 2024: Lugano, Switzerland - Part III
- Michael Wand

, Kristína Malinovská
, Jürgen Schmidhuber
, Igor V. Tetko
:
Artificial Neural Networks and Machine Learning - ICANN 2024 - 33rd International Conference on Artificial Neural Networks, Lugano, Switzerland, September 17-20, 2024, Proceedings, Part III. Lecture Notes in Computer Science 15018, Springer 2024, ISBN 978-3-031-72337-7
Computer Vision: Anomaly Detection
- Junwei Wang, Yunpeng Wang, Jinquan Zeng:

Hybrid Encoder for Anomaly Detection Based on Latent Feature Regularization. 3-13
Computer Vision: Segmentation
- Haoran Yang, Longyi Tang, Tingting Wu, Binyu Yan:

DGFormer: A Dynamic Kernel with Gaussian Fusion Transformer for Semantic Image Segmentation. 17-30 - Qingwei Geng, Xiaodong Gu

:
Integrating Audio-Visual Contexts with Refinement for Segmentation. 31-44 - Manuel Traub, Frederic Becker, Adrian Sauter, Sebastian Otte, Martin V. Butz:

Loci-Segmented: Improving Scene Segmentation Learning. 45-61 - Yao Shen, Chunmeng Liu, Hanlin Chen, Kaiyang Zeng, Guangyao Li:

Measuring Affinity: Similarity-Based Auxiliary Unlabeled Guidance for Few-Shot Segmentation. 62-75 - Guoan Xu

, Wenjing Jia
, Tao Wu, Ligeng Chen, Guangwei Gao:
MFPNet: A Multi-scale Feature Propagation Network for Lightweight Semantic Segmentation. 76-86 - Chen Wang, Di Zhang, Xiaolong Li, Huifang Ma, Zhixin Li:

Weakly-Supervised Semantic Segmentation via Label Re-assignment in Dual-View Framework. 87-99
Computer Vision: Pose Estimation and Tracking
- Zheyan Gao, Jinyan Chen, Yuxin Liu, Yucheng Jin:

DT2S-Pose: A Deeper Temporal-Spatial Skeleton Refine Model for Pedestrian Pose Estimation. 103-117 - Yangliu He, Haoge Deng, Qiwei Shen

, Jianxin Liao
:
DTG: Learning A Dynamic Token Graph for 3D Pose Forecasting. 118-129 - Yingqi He, Jinghua Li, Dehui Kong, Baocai Yin:

Dual-Branch Network with Online Knowledge Distillation for 3D Hand Pose Estimation. 130-143 - Dongyang Yu, Haoyue Zhang, Ruisheng Zhao, Guoqi Chen, Wangpeng An, Yanhong Yang:

MovePose: A High-Performance Human Pose Estimation Algorithm on Mobile and Edge Devices. 144-158 - Rui Li, Jinlong Li:

Siamese Visual Tracking with Correlation and Awareness. 159-173
Computer Vision: Video Processing
- Hong Yu, Yu Zhang, Yuanqiu Liu, Hui Li, Han Liu:

Alignment-Enhanced Network for Temporal Language Grounding in Videos. 177-192 - Fengzhen Yu

, Xiaodong Gu
:
Boundary-Aware Noise-Resistant Video Moment Retrieval. 193-206 - Wei Li, Dezhao Luo, Dongbao Yang, Weiping Wang:

Large Language Model for Action Anticipation. 207-222 - Manuel Traub, Frederic Becker, Sebastian Otte, Martin V. Butz:

Learning Object Permanence from Videos via Latent Imaginations. 223-240 - Jingze Chen, Simiao Zhuang, Qiqin Lin, Junfeng Yao, Lei Li:

SSFlowNet: Semi-supervised Scene Flow Estimation on Point Clouds With Pseudo Label. 241-255 - Yaxin Hu

, Erhardt Barth
:
Video Understanding Using 2D-CNNs on Salient Spatio-Temporal Slices. 256-270
Computer Vision: Generative Methods
- Xinlai Guo, Yanyun Tao, Yuzhen Zhang, Biao Xu, Jianyin Zheng, Guang Ji:

A Robust Image Dehazing Model Using Cycle Generative Adversarial Network with an Improved Atmospheric Scatter Model. 273-286 - Yuankun Chen, Dazhong Rong

, Yi Li
:
CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Ground Image Synthesis. 287-302 - Ziteng Zhang, Peng Qiao, Dou Yong, Sidun Liu, Wenyu Li, Li Cao, Luo Chen:

Dual Dreamer: Extending Single-View Dreamer with Few Shot of Complementary Views. 303-317 - Yuansheng Ma, Dong Zhang, Suyang Zhu, Shoushan Li:

Hair Transfer with Efficient Heuristic Chain of Editing. 318-332 - Jialiang Xu, Weiran Chen

, Lingbing Xu, Weitao Song, Yi Ji
, Ying Li
, Chunping Liu
:
MAGIC: Multi-prompt Any Length Video Generation Model with Controllable Inter-frame Correlation and Low Barrier. 333-348 - Xing Bai, Jun Zhou, Pengyuan Zhang, Ruipeng Hao:

Make Audio Solely Drive Lip in Talking Face Video Synthesis. 349-360 - Mohua Chen, Hanchao Liu, Lanfang Dong:

P2H-GAN: An Effective Method For Generating Handwritten Expressions. 361-376 - Yan Zhang, Yefei Wang

, Jialu Xiong, Jie Zhou, Jinshan Zeng:
SCI-Font: Enhancing Content-Style Representation for Chinese Calligraphy Generation with Skeleton, Contour and Inexact Paired Data. 377-391
Topics in Computer Vision
- Gokul Sudheesh Kumar

, Aparna Raj, Sujala D. Shetty:
Driver Safety System: A Real-Time Sleep Detection and Lane Detection Model Using IoT and Deep Learning. 395-414 - Ting Huang, Jian Huang

:
Gaze Target Detection with Visual Prompt Tuning Based on Attention. 415-429 - Dekun Lin, Tailai Peng, Rui Chen, Xinran Xie, Zhe Cui:

Let Multi-classification Help Deep Imbalanced Regression. 430-447 - Jingqi Hu, Chen Mao, Chong Tan, Hui Li, Hong Liu, Min Zheng:

ProGEO: Generating Prompts Through Image-Text Contrastive Learning for Visual Geo-Localization. 448-462

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














