


default search action
26th MMM 2020: Daejeon, South Korea
- Yong Man Ro, Wen-Huang Cheng, Junmo Kim, Wei-Ta Chu, Peng Cui, Jung-Woo Choi, Min-Chun Hu, Wesley De Neve:

MultiMedia Modeling - 26th International Conference, MMM 2020, Daejeon, South Korea, January 5-8, 2020, Proceedings, Part I. Lecture Notes in Computer Science 11961, Springer 2020, ISBN 978-3-030-37730-4
Oral Session 1A: Audio and Signal Processing
- Xiuxiu Jing, Yike Ma, Qiang Zhao, Ke Lyu, Feng Dai:

Light Field Reconstruction Using Dynamically Generated Filters. 3-13 - Lili Guo, Longbiao Wang, Jianwu Dang, Zhilei Liu

, Haotian Guan:
Speaker-Aware Speech Emotion Recognition by Fusing Amplitude and Phase Information. 14-25 - Congzhou Tian, Hangyu Li, Deshun Yang, Xiaoou Chen:

Gen-Res-Net: A Novel Generative Model for Singing Voice Separation. 26-36 - Congzhou Tian, Deshun Yang, Xiaoou Chen:

A Distinct Synthesizer Convolutional TasNet for Singing Voice Separation. 37-48 - Daniel Mélo, Nazareno Andrade:

Exploiting the Importance of Personalization When Selecting Music for Relaxation. 49-61
Oral Session 2A: Coding and HVS
- Yunchang Li, Zhijie Huang, Jun Sun:

An Efficient Encoding Method for Video Compositing in HEVC. 65-76 - Hongming Luo, Guangsen Liao, Xianxu Hou, Bozhi Liu, Fei Zhou, Guoping Qiu

:
VHS to HDTV Video Translation Using Multi-task Adversarial Learning. 77-86 - Haibing Yin, Yafen Xing, Guangjing Xia, Xiaofeng Huang, Chenggang Yan:

Improving Just Noticeable Difference Model by Leveraging Temporal HVS Perception Characteristics. 87-98 - Minh-Man Ho

, Gang He, Zheng Wang, Jinjia Zhou
:
Down-Sampling Based Video Coding with Degradation-Aware Restoration-Reconstruction Deep Neural Network. 99-110 - Chengpeng Fu

, Jinqiang Wang, Jitao Sang, Jian Yu, Changsheng Xu:
Beyond Literal Visual Modeling: Understanding Image Metaphor Based on Literal-Implied Concept Mapping. 111-123
Oral Session 3A: Color Processing and Art
- Zhengqing Li, Zhengjun Zha

, Yang Cao:
Deep Palette-Based Color Decomposition for Image Recoloring with Aesthetic Suggestion. 127-138 - Carlos Castellanos, Bello Bello, Hyeryeong Lee

, Mungyu Lee
, Yoo Seok Lee
, In Seop Chang:
On Creating Multimedia Interfaces for Hybrid Biological-Digital Art Installations. 139-150 - Haiyang Wei, Zhixin Li, Canlong Zhang:

Image Captioning Based on Visual and Semantic Attention. 151-162 - Wengang Cheng, Pengli Dou, Dengwen Zhou:

An Illumination Insensitive and Structure-Aware Image Color Layer Decomposition Method. 163-175 - Yugang Chen, Muchun Chen, Chaoyue Song, Bingbing Ni:

CartoonRenderer: An Instance-Based Multi-style Cartoon Image Translator. 176-187
Oral Session 4A: Detection and Classification
- Yiting Cheng, Yankai Wang, Lizhe Qi, Wenqiang Zhang:

Multi-condition Place Generator for Robust Place Recognition. 191-202 - Lingyun Zeng, You Song, Wenhai Wang:

Guided Refine-Head for Object Detection. 203-214 - Yafeng Zhou, Yongtao Wang, Zheqi He

, Zhi Tang, Ching Y. Suen:
Towards Accurate Panel Detection in Manga: A Combined Effort of CNN and Heuristics. 215-226 - Nikolaos Gkalelis

, Vasileios Mezaris
:
Subclass Deep Neural Networks: Re-enabling Neglected Classes in Deep Network Training for Multimedia Classification. 227-238 - Jacob Gately, Ying Liang, Matthew Kolessar Wright, Natasha Kholgade Banerjee, Sean Banerjee, Soumyabrata Dey:

Automatic Material Classification Using Thermal Finger Impression. 239-250
Oral Session 5A: Face
- Hongkong Ge, Jiayuan Dong, Liyan Zhang:

Face Attributes Recognition Based on One-Way Inferential Correlation Between Attributes. 253-265 - Yahui Wang, Huimin Ma, Xinpeng Xing, Zeyu Pan:

Eulerian Motion Based 3DCNN Architecture for Facial Micro-Expression Recognition. 266-277 - Siyi Mo, Wenming Yang, Guijin Wang, Qingmin Liao:

Emotion Recognition with Facial Landmark Heatmaps. 278-289 - Jianli Zhou, Jun Chen, Chao Liang, Jin Chen:

One-Shot Face Recognition with Feature Rectification via Adversarial Learning. 290-302 - Ruolin Zheng, Weixin Li, Yunhong Wang:

Visual Sentiment Analysis by Leveraging Local Regions and Human Faces. 303-314
Oral Session 6A: Image Processing
- Tong Zhang, Xiaolong Li, Wenfa Qi, Zongming Guo:

Prediction-Error Value Ordering for High-Fidelity Reversible Data Hiding. 317-328 - Xin Xu, Xin Teng:

Classroom Attention Analysis Based on Multiple Euler Angles Constraint and Head Pose Estimation. 329-340 - Han Fang, Jun Chen, Qi Tian:

Multi-branch Body Region Alignment Network for Person Re-identification. 341-352 - Wenguang Wang, Zhouhui Lian, Yingmin Tang, Jianguo Xiao:

DeepStroke: Understanding Glyph Structure with Semantic Segmentation and Tabu Search. 353-364 - Abdullah Alfarrarjeh, Zeyu Ma, Seon Ho Kim, Cyrus Shahabi:

3D Spatial Coverage Measurement of Aerial Images. 365-377
Oral Session 7A: Leaning and Knowledge Representation
- Hongkai Li, Cong Bai, Ling Huang, Yu-Gang Jiang, Shengyong Chen:

Instance Image Retrieval with Generative Adversarial Training. 381-392 - Xinjie Feng, Hongxun Yao, Wenbin Che, Shengping Zhang:

An Effective Way to Boost Black-Box Adversarial Attack. 393-404 - Lizi Liao

, Lyndon Kennedy, Lynn Wilcox, Tat-Seng Chua:
Crowd Knowledge Enhanced Multimodal Conversational Assistant in Travel Domain. 405-418 - Haoran Chen, Minghua Zhu, Xuesong Cai, Jufeng Luo, Yunzhou Qiu:

Improved Model Structure with Cosine Margin OIM Loss for End-to-End Person Search. 419-430 - Feng Ni, Xixin Cao:

Effective Barcode Hunter via Semantic Segmentation in the Wild. 431-442
Oral Session 7B: Video Processing
- Qinyu Li, Lijun Chen, Hanli Wang, Xianhui Liu:

Wonderful Clips of Playing Basketball: A Database for Localizing Wonderful Actions. 445-454 - Zefeng Sun, Hanli Wang, Yun Yi, Qinyu Li:

Structural Pyramid Network for Cascaded Optical Flow Estimation. 455-467 - Muchun Chen, Yugang Chen, Truong Tan Loc, Bingbing Ni:

Real-Time Multiple Pedestrians Tracking in Multi-camera System. 468-479 - Ying She

, Yang Yi
:
Learning Multi-feature Based Spatially Regularized and Scale Adaptive Correlation Filters for Visual Tracking. 480-491 - Evlampios Apostolidis

, Eleni Adamantidou, Alexandros I. Metsai, Vasileios Mezaris, Ioannis Patras:
Unsupervised Video Summarization via Attention-Driven Adversarial Learning. 492-504
Poster Session
- Zhijie Huang, Yunchang Li, Jun Sun:

Efficient HEVC Downscale Transcoding Based on Coding Unit Information Mapping. 507-518 - Zikai Song, Junqing Yu, Hengyou Cai, Yangliu Hu, Yi-Ping Phoebe Chen

:
Fine-Grain Level Sports Video Search Engine. 519-531 - Seunghan Yang

, Seungjun Jung
, Heekwang Kang
, Changick Kim
:
The Korean Sign Language Dataset for Action Recognition. 532-542 - Dongqi Tang, Hao Kong, Xi Meng, Ruo-Ze Liu, Tong Lu:

SEE-LPR: A Semantic Segmentation Based End-to-End System for Unconstrained License Plate Detection and Recognition. 543-554 - Changbo Zhai, Le Wang, Qilin Zhang

, Zhanning Gao, Zhenxing Niu, Nanning Zheng, Gang Hua:
Action Co-localization in an Untrimmed Video by Graph Neural Networks. 555-567 - Zhonghan Niu, Yang-Hao Zhou, Yu-Bin Yang, Jiancong Fan:

A Novel Attention Enhanced Dense Network for Image Super-Resolution. 568-580 - Ping Liu, Hongbo Yang, Jingnan Fu:

Marine Biometric Recognition Algorithm Based on YOLOv3-GAN Network. 581-592 - Qiuyuan Han

, Jin Zheng
:
Multi-scale Spatial Location Preference for Semantic Segmentation. 593-604 - Wei Chen

, Ruimin Hu, Xiaochen Wang, Dengshi Li:
HRTF Representation with Convolutional Auto-encoder. 605-616 - Xuan Zhang, Guangxing Han, Wenduo He:

Unsupervised Feature Propagation for Fast Video Object Detection Using Generative Adversarial Networks. 617-627 - Gjorgji Strezoski, Rogier Knoester, Nanne van Noord

, Marcel Worring
:
OmniEyes: Analysis and Synthesis of Artistically Painted Eyes. 628-641 - Xiyue Gao, Jun Chen, Jing Yao, Wenqian Zhu:

LDSNE: Learning Structural Network Embeddings by Encoding Local Distances. 642-652 - Liwen Zhang

, Ziqiang Shi, Jiqing Han, Anyan Shi, Ding Ma:
FurcaNeXt: End-to-End Monaural Speech Separation with Dynamic Gated Dilated Temporal Convolutional Networks. 653-665 - Chenhao Hu, Ruimin Hu, Xiaochen Wang, Tingzhao Wu, Dengshi Li:

Multi-step Coding Structure of Spatial Audio Object Coding. 666-678 - Soumya Chatterjee, Wei-Ta Chu

:
Thermal Face Recognition Based on Transformation by Residual U-Net and Pixel Shuffle Upsampling. 679-689 - Shyi-Chyi Cheng, Ting-Lan Lin, Ping-Yuan Tseng:

K-SVD Based Point Cloud Coding for RGB-D Video Compression Using 3D Super-Point Clustering. 690-701 - Siying Zhai, Xiwei Hu, Xuanhong Chen

, Bingbing Ni, Wenjun Zhang:
Resolution Booster: Global Structure Preserving Stitching Method for Ultra-High Resolution Image Translation. 702-713 - Haiyu Jiang, Yan Song, Jiang He, Xiangbo Shu:

Cross Fusion for Egocentric Interactive Action Recognition. 714-726 - Sun'ao Liu, Hai Xu, Yizhi Liu, Hongtao Xie:

Improving Brain Tumor Segmentation with Dilated Pseudo-3D Convolution and Multi-direction Fusion. 727-738 - Jian Cao

, Na Tang, Jun Wang, Fan Liang:
Texture-Based Fast CU Size Decision and Intra Mode Decision Algorithm for VVC. 739-751 - Siying Liang, Ping Wang:

An Efficient Hierarchical Near-Duplicate Video Detection Algorithm Based on Deep Semantic Features. 752-763 - Wenfeng Song, Shuai Li, Yuting Guo, Shaoqi Li

, Aimin Hao, Hong Qin
, Qinping Zhao:
Meta Transfer Learning for Adaptive Vehicle Tracking in UAV Videos. 764-777 - Ruicong Xu, Li Niu, Liqing Zhang:

Adversarial Query-by-Image Video Retrieval Based on Attention Mechanism. 778-789 - Binxin Yang, Xuejin Chen

, Richang Hong, Zihan Chen, Yuhang Li, Zheng-Jun Zha
:
Joint Sketch-Attribute Learning for Fine-Grained Face Synthesis. 790-801 - Lv Chen, Dengpan Ye, Shunzhi Jiang:

High Accuracy Perceptual Video Hashing via Low-Rank Decomposition and DWT. 802-812 - Dongyang Li, Ruimin Hu, Wenxin Huang, Xiaochen Wang, Dengshi Li, Fei Zheng:

HMM-Based Person Re-identification in Large-Scale Open Scenario. 813-825 - Junchen Deng, Ci Wang, Shiqi Liu:

No Reference Image Quality Assessment by Information Decomposition. 826-838

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














