


MMM 2024, Amsterdam, The Netherlands - Part III
- Stevan Rudinac, Alan Hanjalic, Cynthia C. S. Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata:
  MultiMedia Modeling - 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 - February 2, 2024, Proceedings, Part III. Lecture Notes in Computer Science 14556, Springer 2024, ISBN 978-3-031-53310-5
- Qiang Chen, Fuxiao He, Guoqiang Xiao:
  Global-to-Local Feature Mining Network for RGB-Infrared Person Re-Identification. 1-13
- Lu Chen, Jiawei Tan, Pingan Yang, Hongxing Wang:
  Semantic Transition Detection for Self-supervised Video Scene Segmentation. 14-27
- Xueyang Qin, Lishuang Li, Jing Hao, Meiling Ge, Jiayi Huang, Guangyao Pang:
  Multi-task Collaborative Network for Image-Text Retrieval. 28-42
- Hao-Yuan Ma, Li Zhang, Xiang-Yi Wei:
  FGENet: Fine-Grained Extraction Network for Congested Crowd Counting. 43-56
- Jingjing Xie, Jixuan Hong, Manjin Sheng, Chenhui Yang:
  MSMV-UNet: A 2.5D Stroke Lesion Segmentation Method Based on Multi-slice Feature Fusion. 57-69
- Xiang Gao, Sining Wu, Fan Wang, Xiaopeng Hu:
  Non-Local Spatial-Wise and Global Channel-Wise Transformer for Efficient Image Super-Resolution. 70-85
- Ting Peng, Yihang Zhou, Rong Sun, Yizhi Luo, Yuqi Li:
  MobileViT-FocR: MobileViT with Fixed-One-Centre Loss and Gradient Reversal for Generalised Fake Face Detection. 86-100
- Xiran Zhang, Haiyan Liu, Caixia Liu, Haiyang Zhang, Zhiwei Huo:
  ASF-Conformer: Audio Scoring Conformer with FFC for Speaker Verification in Noisy Environments. 101-111
- Yuanjian He, Weile Zhang, Junyuan Deng, Yulai Cong:
  Prior-Knowledge-Free Video Frame Interpolation with Bidirectional Regularized Implicit Neural Representations. 112-126
- Shengrong Ling, Sisi You, Bing-Kun Bao:
  Two-Stage Reasoning Network with Modality Decomposition for Text VQA. 127-140
- Honglei Zheng, Wenkang Fan, Yinran Chen, Xiongbiao Luo:
  Localization and Local Motion Magnification of Pulsatile Regions in Endoscopic Surgery Videos. 141-154
- Shinichi Ka, Koichi Shinoda:
  Co-speech Gesture Generation with Variational Auto Encoder. 155-168
- Chunyin Sheng, Xiang Gao, Xiaopeng Hu, Fan Wang:
  Differentiable Neural Architecture Search Based on Efficient Architecture for Lightweight Image Super-Resolution. 169-183
- Zhengwei Yang, Yange Wang, Lei Ma, Xiangzheng Li:
  Learning Collaborative Reinforcement Attention for 3D Face Reconstruction and Dense Alignment. 184-197
- Konstantinos Triaridis, Vasileios Mezaris:
  Exploring Multi-modal Fusion for Image Manipulation Detection and Localization. 198-211
- Feifei Xu, Zheng Zhong, Yitao Zhu, Yingchen Zhou, Guangzhen Li:
  Appearance-Motion Dual-Stream Heterogeneous Network for VideoQA. 212-227
- Xiang Li, Ming Lu, Ziming Guo, Xiaoming Zhang:
  Adaptive Token Selection and Fusion Network for Multimodal Sentiment Analysis. 228-241
- Pei Chen, Zhiyong Feng, Meng Xing, Yiming Zhang, Jinqing Zheng:
  Exploring Imperceptible Adversarial Examples in YCbCr Color Space. 242-256
- Liyun Xu, Min Zhang:
  Fractional-Order Image Moments and Applications. 257-269
- Maria Pegia, Ferran Agullo Lopez, Anastasia Moumtzidou, Alberto Gutierrez-Torre, Björn Þór Jónsson, Josep Lluis Berral-Garcia, Ilias Gialampoukidis, Stefanos Vrochidis, Ioannis Kompatsiaris:
  Time-Quality Tradeoff of MuseHash Query Processing Performance. 270-283
- Zhanjie Jin, Anming Dong, Jiguo Yu, Shuxiang Dong, You Zhou:
  Dual-Fisheye Image Stitching via Unsupervised Deep Learning. 284-298
- Junpeng Liu, Hengkang Bao:
  CA-GAN: Conditional Adaptive Generative Adversarial Network for Text-to-Image Synthesis. 299-312
- Dexu Yao, Aimin Li, Deqi Liu, Mengfan Cheng:
  RDC-YOLOv5: Improved Safety Helmet Detection in Adverse Weather. 313-326
- Aril Bernhard Ovesen, Tor-Arne Schmidt Nordmo, Michael Alexander Riegler, Pål Halvorsen, Dag Johansen:
  Sustainable Commercial Fishery Control Using Multimedia Forensics Data from Non-trusted, Mobile Edge Nodes. 327-340
- Shan Cao, Qingfeng Wu:
  MC-TCMNER: A Multi-modal Fusion Model Combining Contrast Learning Method for Traditional Chinese Medicine NER. 341-354
- Xiangyu Chen, Md Ayshik Rahman Khan, Md. Rakibul Hasan, Tom Gedeon, Md. Zakir Hossain:
  C3-PO: A Convolutional Neural Network for COVID Onset Prediction from Cough Sounds. 355-368
- Mingyuan Ge, Jianan Shui, Junyu Chen, Mingyong Li:
  Pseudo-label Based Unsupervised Momentum Representation Learning for Multi-domain Image Retrieval. 369-380
- Jianbo Xiong, Shinan Zou, Jin Tang:
  DFGait: Decomposition Fusion Representation Learning for Multimodal Gait Recognition. 381-395
- Jiangfeng Li, Bowen Wang, Yongrui Qin, Chenxi Zhang, Gang Yu, Qinpei Zhao:
  MoPE: Mixture of Pooling Experts Framework for Image-Text Retrieval. 396-409
- Linzi Xing, Quan Hung Tran, Fabian Caba, Franck Dernoncourt, Seunghyun Yoon, Zhaowen Wang, Trung Bui, Giuseppe Carenini:
  Multi-modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation. 410-424
- Wenlong Lu, Suping Wu, Xitie Zhang, Shengjia Zhang:
  Unsupervised Multi-collaborative Learning Network for 3D Face Reconstruction. 425-436
- Yiru Zhang, Zeke Li, Bijing Liu, Haiwei Fan, Yong Yang, Qun Yang:
  A Region Based Non-overlapping Reference Speech Estimation Method for Speaker Extraction. 437-447
- Pan Li, Suping Wu, Xitie Zhang, Yuxin Peng, Boyang Zhang, Bin Wang:
  Self-supervised Edge Structure Learning for Multi-view Stereo and Parallel Optimization. 448-461
- Shuai Wang, Jiayi Shen, Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring:
  Prototype-Enhanced Hypergraph Learning for Heterogeneous Information Networks. 462-476
- Ali Abdari, Alex Falcon, Giuseppe Serra:
  A Language-Based Solution to Enable Metaverse Retrieval. 477-488
- Chenlin Zhao, Jiabo Ye, Yaguang Song, Ming Yan, Xiaoshan Yang, Changsheng Xu:
  Part-Aware Prompt Tuning for Weakly Supervised Referring Expression Grounding. 489-502
- Sarwar Khan, Jun-Cheng Chen, Wen-Hung Liao, Chu-Song Chen:
  Adversarially Robust Deepfake Detection via Adversarial Feature Similarity Learning. 503-516
- Adriano Baratè, Luca Andrea Ludovico:
  A Multidimensional Taxonomy Model for Music Tangible User Interfaces. 517-531
