27th ACM Multimedia 2019: Nice, France
- Laurent Amsaleg, Benoit Huet, Martha A. Larson, Guillaume Gravier, Hayley Hung, Chong-Wah Ngo, Wei Tsang Ooi:
Proceedings of the 27th ACM International Conference on Multimedia, MM 2019, Nice, France, October 21-25, 2019. ACM 2019, ISBN 978-1-4503-6889-6
Keynote I
- Jean Carrive:
Using Artificial Intelligence to Preserve Audiovisual Archives: New Horizons, More Questions. 1-2
Session 1A: Multimodal Fusion & Visual Relations
- Chunxiao Liu, Zhendong Mao, An-An Liu, Tianzhu Zhang, Bin Wang, Yongdong Zhang:
Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching. 3-11
- Tan Wang, Xing Xu, Yang Yang, Alan Hanjalic, Heng Tao Shen, Jingkuan Song:
Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking. 12-20
- Shijie Yang, Liang Li, Shuhui Wang, Dechao Meng, Qingming Huang, Qi Tian:
Structured Stochastic Recurrent Network for Linguistic Video Prediction. 21-29
- Hao Zhou, Chongyang Zhang, Chuanping Hu:
Visual Relationship Detection with Relative Location Mining. 30-38
- Tong Yu, Yilin Shen, Ruiyi Zhang, Xiangyu Zeng, Hongxia Jin:
Vision-Language Recommendation via Attribute Augmented Multimodal Reinforcement Learning. 39-47
- Huafeng Kuang, Rongrong Ji, Hong Liu, Shengchuan Zhang, Xiaoshuai Sun, Feiyue Huang, Baochang Zhang:
Multi-modal Multi-layer Fusion Network with Average Binary Center Loss for Face Anti-spoofing. 48-56
- Yi Hao, Nannan Wang, Xinbo Gao, Jie Li, Xiaoyu Wang:
Dual-alignment Feature Embedding for Cross-modality Person Re-identification. 57-65
- Lan Wang, Jiahao Shi, Yang Wang, Feng Su:
Video Text Detection by Attentive Spatiotemporal Fusion of Deep Convolutional Features. 66-74
- David Semedo, João Magalhães:
Cross-Modal Subspace Learning with Scheduled Adaptive Margin Constraints. 75-83
- Xufeng Qian, Yueting Zhuang, Yimeng Li, Shaoning Xiao, Shiliang Pu, Jun Xiao:
Video Relation Detection with Spatio-Temporal Graph. 84-93
- Xu Sun, Yuan Zi, Tongwei Ren, Jinhui Tang, Gangshan Wu:
Hierarchical Visual Relationship Detection. 94-102
- Wanneng Wang, Yanan Ma, Ke Gao, Juan Cao:
Cost-free Transfer Learning Mechanism: Deep Digging Relationships of Action Categories. 103-111
- Lixi Deng, Jingjing Chen, Qianru Sun, Xiangnan He, Sheng Tang, Zhaoyan Ming, Yongdong Zhang, Tat-Seng Chua:
Mixed-dish Recognition with Contextual Relation Networks. 112-120
- Sipeng Zheng, Shizhe Chen, Qin Jin:
Visual Relation Detection with Multi-Level Attention. 121-129
Session 1B: Affective Computing & Facial Analytics
- Yaochen Zhu, Zhenzhong Chen, Feng Wu:
Multimodal Deep Denoise Framework for Affective Video Content Analysis. 130-138
- Raj Kumar Gupta, Yinping Yang:
Predicting and Understanding News Social Popularity with Emotional Salience Features. 139-147
- Dong Zhang, Shoushan Li, Qiaoming Zhu, Guodong Zhou:
Effective Sentiment-relevant Word Selection for Multi-modal Sentiment Analysis in Spoken Language. 148-156
- Yue Gu, Xinyu Lyu, Weijia Sun, Weitian Li, Shuhong Chen, Xinyu Li, Ivan Marsic:
Mutual Correlation Attentive Factors in Dyadic Fusion Networks for Speech Emotion Recognition. 157-166
- Timothy Greer, Benjamin Ma, Matthew E. Sachs, Assal Habibi, Shrikanth S. Narayanan:
A Multimodal View into Music's Effect on Human Neural, Physiological, and Emotional Experience. 167-175
- Jia-Xin Ma, Hao Tang, Wei-Long Zheng, Bao-Liang Lu:
Emotion Recognition using Multimodal Residual LSTM Network. 176-183
- Yang Zhou, Wanli Yu, Zhu Li, Haibing Yin:
Stereoscopic Visual Discomfort Prediction Using Multi-scale DCT Features. 184-191
- Sicheng Zhao, Zizhou Jia, Hui Chen, Leida Li, Guiguang Ding, Kurt Keutzer:
PDANet: Polarity-consistent Deep Attention Network for Fine-grained Visual Emotion Regression. 192-201
- K. R. Prajwal, C. V. Jawahar, Ponnurangam Kumaraguru:
Towards Increased Accessibility of Meme Images with the Help of Rich Face Emotion Captions. 202-210
- Wenxuan Wang, Qiang Sun, Yanwei Fu, Tao Chen, Chenjie Cao, Ziqi Zheng, Guoqiang Xu, Han Qiu, Yu-Gang Jiang, Xiangyang Xue:
Comp-GAN: Compositional Generative Adversarial Network in Synthesizing and Recognizing Facial Expression. 211-219
- Juntong Cheng, Yi-Ping Phoebe Chen, Minjun Li, Yu-Gang Jiang:
TC-GAN: Triangle Cycle-Consistent GANs for Face Frontalization with Facial Features Preserved. 220-228
- Shiming Ge, Shengwei Zhao, Xindi Gao, Jia Li:
Fewer-Shots and Lower-Resolutions: Towards Ultrafast Face Recognition in the Wild. 229-237
- Can Wang, Shangfei Wang, Guang Liang:
Identity- and Pose-Robust Facial Expression Recognition through Adversarial Feature Learning. 238-246
- Veith Röthlingshöfer, Vivek Sharma, Rainer Stiefelhagen:
Self-supervised Face-Grouping on Graphs. 247-256
Session 1C: Fashion & Human Analysis
- Yunshan Ma, Xun Yang, Lizi Liao, Yixin Cao, Tat-Seng Chua:
Who, Where, and What to Wear?: Extracting Fashion Knowledge from Social Media. 257-265
- Na Zheng, Xuemeng Song, Zhaozheng Chen, Linmei Hu, Da Cao, Liqiang Nie:
Virtually Trying on New Clothing with Arbitrary Poses. 266-274
- Chia-Wei Hsieh, Chieh-Yun Chen, Chien-Lung Chou, Hong-Han Shuai, Jiaying Liu, Wen-Huang Cheng:
FashionOn: Semantic-guided Image-based Virtual Try-on with Detailed Human and Clothing Information. 275-283
- Weijian Ruan, Wu Liu, Qian Bao, Jun Chen, Yuhao Cheng, Tao Mei:
POINet: Pose-Guided Ovonic Insight Network for Multi-Person Pose Tracking. 284-292
- Zhonghua Wu, Guosheng Lin, Qingyi Tao, Jianfei Cai:
M2E-Try On Net: Fashion from Model to Everyone. 293-301
- Xue Dong, Xuemeng Song, Fuli Feng, Peiguang Jing, Xin-Shun Xu, Liqiang Nie:
Personalized Capsule Wardrobe Creation with Garment and User Modeling. 302-310
- Xin Jin, Le Wu, Geng Zhao, Xiaodong Li, Xiaokun Zhang, Shiming Ge, Dongqing Zou, Bin Zhou, Xinghui Zhou:
Aesthetic Attributes Assessment of Images. 311-319
- Xuemeng Song, Xianjing Han, Yunkai Li, Jingyuan Chen, Xin-Shun Xu, Liqiang Nie:
GP-BPR: Personalized Compatibility Modeling for Clothing Matching. 320-328
- Xin Wang, Bo Wu, Yueqi Zhong:
Outfit Compatibility Prediction and Diagnosis with Multi-Layered Comparison Network. 329-337
- Xinchen Liu, Meng Zhang, Wu Liu, Jingkuan Song, Tao Mei:
BraidNet: Braiding Semantics and Details for Accurate Human Parsing. 338-346
- Mang Ye, Xiangyuan Lan, Qingming Leng:
Modality-aware Collaborative Learning for Visible Thermal Person Re-Identification. 347-355
- Yuyu Guo, Lianli Gao, Jingkuan Song, Peng Wang, Wuyuan Xie, Heng Tao Shen:
Adaptive Multi-Path Aggregation for Human DensePose Estimation in the Wild. 356-364
- Yukun Huang, Zheng-Jun Zha, Xueyang Fu, Wei Zhang:
Illumination-Invariant Person Re-Identification. 365-373
- Jianbo Wang, Kai Qiu, Houwen Peng, Jianlong Fu, Jianke Zhu:
AI Coach: Deep Human Pose Estimation and Analysis for Personalized Athletic Training Assistance. 374-382
Session 1D: Live Multimedia Applications & Streaming
- Xiao Liu, Lin Zhang, Ying Shen, Shaoming Zhang, Shengjie Zhao:
Online Camera Pose Optimization for the Surround-view System. 383-391
- Xiang Chen, Tam V. Nguyen, Zhiqi Shen, Mohan S. Kankanhalli:
LiveSense: Contextual Advertising in Live Streaming Videos. 392-400
- Nicholas Diliberti, Chao Peng, Christopher Kaufman, Yangzi Dong, Jeffrey T. Hansberger:
Real-Time Gesture Recognition Using 3D Sensory Data and a Light Convolutional Neural Network. 401-410
- Yuqian Fu, Chengrong Wang, Yanwei Fu, Yu-Xiong Wang, Cong Bai, Xiangyang Xue, Yu-Gang Jiang:
Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent. 411-419
- Rui-Xiao Zhang, Ming Ma, Tianchi Huang, Haitian Pang, Xin Yao, Chenglei Wu, Jiangchuan Liu, Lifeng Sun:
Livesmart: A QoS-Guaranteed Cost-Minimum Framework of Viewer Scheduling for Crowdsourced Live Streaming. 420-428
- Tianchi Huang, Chao Zhou, Rui-Xiao Zhang, Chenglei Wu, Xin Yao, Lifeng Sun:
Comyco: Quality-Aware Adaptive Video Streaming via Imitation Learning. 429-437
- Silas L. Fong, Salma Emara, Baochun Li, Ashish Khisti, Wai-Tian Tan, Xiaoqing Zhu, John G. Apostolopoulos:
Low-Latency Network-Adaptive Error Control for Interactive Streaming. 438-446
- Jounsup Park, Klara Nahrstedt:
Navigation Graph for Tiled Media Streaming. 447-455
- Yu Guan, Xinggong Zhang, Zongming Guo:
CACA: Learning-based Content-aware Cache Admission for Video Content in Edge Caching. 456-464
- Yabin Zhu, Chenglong Li, Bin Luo, Jin Tang, Xiao Wang:
Dense Feature Aggregation and Pruning for RGBT Tracking. 465-472
- Haosheng Chen, Qiangqiang Wu, Yanjie Liang, Xinbo Gao, Hanzi Wang:
Asynchronous Tracking-by-Detection on Adaptive Time Surfaces for Event-based Object Tracking. 473-481
- Gaoang Wang, Yizhou Wang, Haotian Zhang, Renshu Gu, Jenq-Neng Hwang:
Exploit the Connectivity: Multi-Object Tracking with TrackletNet. 482-490
- Yusen Li, Haoyuan Liu, Xiwei Wang, Lingjun Pu, Trent G. Marbach, Shanjiang Tang, Gang Wang, Xiaoguang Liu:
Themis: Efficient and Adaptive Resource Partitioning for Reducing Response Delay in Cloud Gaming. 491-499
- Can Zhang, Yuexian Zou, Guang Chen, Lei Gan:
PAN: Persistent Appearance Network with an Efficient Motion Cue for Fast Action Recognition. 500-509
Keynote II
- Pernille Bjørn, María Menéndez-Blanco:
FemTech: Broadening Participation to Digital Technology Development. 510-511
Session 2A: Knowledge Processing & Action Analysis
- Peng Zhang, Li Su, Liang Li, Bing-Kun Bao, Pamela C. Cosman, Guorong Li, Qingming Huang:
Training Efficient Saliency Prediction Models with Knowledge Distillation. 512-520
- Tao Zhuo, Zhiyong Cheng, Peng Zhang, Yongkang Wong, Mohan S. Kankanhalli:
Explainable Video Action Reasoning via Prior Knowledge and State Transitions. 521-529
- Guohao Li, Xin Wang, Wenwu Zhu:
Perceptual Visual Reasoning with Knowledge Propagation. 530-538
- Xuejing Liu, Liang Li, Shuhui Wang, Zheng-Jun Zha, Li Su, Qingming Huang:
Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding. 539-547
- Xiaowen Huang, Quan Fang, Shengsheng Qian, Jitao Sang, Yan Li, Changsheng Xu:
Explainable Interaction-driven User Modeling over Knowledge Graph for Sequential Recommendation. 548-556
- Lei Meng, Long Chen, Xun Yang, Dacheng Tao, Hanwang Zhang, Chunyan Miao, Tat-Seng Chua:
Learning Using Privileged Information for Food Recognition. 557-565
- Bowen Pan, Shangfei Wang, Bin Xia:
Occluded Facial Expression Recognition Enhanced through Privileged Information. 566-573
- Yanli Ji, Feixiang Xu, Yang Yang, Ning Xie, Heng Tao Shen, Tatsuya Harada:
Attention Transfer (ANT) Network for View-invariant Action Recognition. 574-582
- Ziming Liu, Guangyu Gao, A. Kai Qin, Tong Wu, Chi Harold Liu:
Action Recognition with Bootstrapping based Long-range Temporal Context Attention. 583-591
- Changmao Cheng, Chi Zhang, Yichen Wei, Yu-Gang Jiang:
Sparse Temporal Causal Convolution for Efficient Action Modeling. 592-600
- Xiang Gao, Wei Hu, Jiaxiang Tang, Jiaying Liu, Zongming Guo:
Optimized Skeleton-based Action Recognition via Sparsified Graph Regression. 601-610
- Wanru Xu, Jian Yu, Zhenjiang Miao, Lili Wan, Qiang Ji:
Prediction-CGAN: Human Action Prediction with Conditional Generative Adversarial Networks. 611-619
- Haoze Wu, Zheng-Jun Zha, Xin Wen, Zhenzhong Chen, Dong Liu, Xuejin Chen:
Cross-Fiber Spatial-Temporal Co-enhanced Networks for Video Action Recognition. 620-628
- Dong Li, Ting Yao, Zhaofan Qiu, Houqiang Li, Tao Mei:
Long Short-Term Relation Networks for Video Action Detection. 629-637
Session 2B: Adversarial Learning
- Meijuan Jia, Hongyu Yang, Di Huang, Yunhong Wang:
Attacking Gait Recognition Systems via Silhouette Guided GANs. 638-646
- Yang Chen, Yingwei Pan, Ting Yao, Xinmei Tian, Tao Mei:
Mocycle-GAN: Unpaired Video-to-Video Translation. 647-655
- Zitai Wang, Qianqian Xu, Ke Ma, Yangbangyan Jiang, Xiaochun Cao, Qingming Huang:
Adversarial Preference Learning with Pairwise Comparisons. 656-664
- Jiawei Liu, Zheng-Jun Zha, Richang Hong, Meng Wang, Yongdong Zhang:
Deep Adversarial Graph Attention Convolution Network for Text-Based Person Search. 665-673
- Zhaoyu Zhang, Jun Yu:
STDGAN: ResBlock Based Generative Adversarial Nets Using Spectral Normalization and Two Different Discriminators. 674-682
- Tsai-Ho Sun, Chien-Hsun Lai, Sai-Keung Wong, Yu-Shuen Wang:
Adversarial Colorization of Icons Based on Contour and Color Conditions. 683-691
- Chen Ma, Chenxu Zhao, Hailin Shi, Li Chen, Jun-Hai Yong, Dan Zeng:
MetaAdvDet: Towards Robust Detection of Evolving Adversarial Attacks. 692-701
- Jen-Chun Lin, Wen-Li Wei, Tyng-Luh Liu, C.-C. Jay Kuo, Mark Liao:
Tell Me Where It is Still Blurry: Adversarial Blurred Region Mining and Refining. 702-710
- Rong Chen, Yuan Xie, Xiaotong Luo, Yanyun Qu, Cuihua Li:
Joint-attention Discriminator for Accurate Super-resolution via Adversarial Training. 711-719
- Hsin-Ying Hsieh, Chieh-Yu Chen, Yu-Shuen Wang, Jung-Hong Chuang:
BasketballGAN: Generating Basketball Play Simulation Through Sketching. 720-728
- Shuang Li, Chi Harold Liu, Binhui Xie, Limin Su, Zhengming Ding, Gao Huang:
Joint Adversarial Domain Adaptation. 729-737
- Chengwei Zhang, Yunlu Xu, Zhanzhan Cheng, Yi Niu, Shiliang Pu, Fei Wu, Futai Zou:
Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal Action Localization. 738-746
- Jingjing Li, Erpeng Chen, Zhengming Ding, Lei Zhu, Ke Lu, Zi Huang:
Cycle-consistent Conditional Adversarial Transfer Networks. 747-755
- Peiying Li, Shikui Tu, Lei Xu:
GAN Flexible Lmser for Super-resolution. 756-764
Session 2C: Captioning & Video Analysis
- Longteng Guo, Jing Liu, Jinhui Tang, Jiangwei Li, Wei Luo, Hanqing Lu:
Aligning Linguistic Words and Visual Semantic Units for Image Captioning. 765-773
- Yaosi Hu, Zhenzhong Chen, Zheng-Jun Zha, Feng Wu:
Hierarchical Global-Local Temporal Modeling for Video Captioning. 774-783
- Yuqing Song, Shizhe Chen, Yida Zhao, Qin Jin:
Unpaired Cross-lingual Image Caption Generation with Self-Supervised Rewards. 784-792
- Xinhang Song, Bohan Wang, Gongwei Chen, Shuqiang Jiang:
MUCH: Mutual Coupling Enhancement of Scene Recognition and Dense Captioning. 793-801
- Yongqing Zhu, Shuqiang Jiang:
Attention-based Densely Connected LSTM for Video Captioning. 802-810
- Elaheh Barati, Xuewen Chen:
Critic-based Attention Network for Event-based Video Captioning. 811-817
- Xiangxi Shi, Jianfei Cai, Shafiq R. Joty, Jiuxiang Gu:
Watch It Twice: Video Captioning with a Refocused Video Encoder. 818-826
- Jiaxin Wu, Sheng-Hua Zhong, Yan Liu:
MvsGCN: A Novel Graph Convolutional Network for Multi-video Summarization. 827-835
- Junbo Wang, Wei Wang, Zhiyong Wang, Liang Wang, Dagan Feng, Tieniu Tan:
Stacked Memory Network for Video Summarization. 836-844
- Jingyi Zhang, Zhen Wei, Ionut Cosmin Duta, Fumin Shen, Li Liu, Fan Zhu, Xing Xu, Ling Shao, Heng Tao Shen:
Generative Reconstructive Hashing for Incomplete Video Analysis. 845-854
- Zhanzhan Cheng, Jing Lu, Yi Niu, Shiliang Pu, Fei Wu, Shuigeng Zhou:
You Only Recognize Once: Towards Fast Video Text Spotting. 855-863
- Linxi Jiang, Xingjun Ma, Shaoxiang Chen, James Bailey, Yu-Gang Jiang:
Black-box Adversarial Attacks on Video Recognition Models. 864-872
- Zheng Wang, Xinyu Yan, Yahong Han, Meijun Sun:
Ranking Video Salient Object Detection. 873-881
- Donghyeon Cho, Yunjae Jung, François Rameau, Dahun Kim, Sanghyun Woo, In So Kweon:
Video Retargeting: Trade-off between Content Preservation and Spatio-temporal Consistency. 882-889
Session 2D: 3D Visual Processing
- Tianxin Huang, Yong Liu:
3D Point Cloud Geometry Compression on Deep Learning. 890-898
- Haotian Zhang, Gaoang Wang, Zhichao Lei, Jenq-Neng Hwang:
Eye in the Sky: Drone-Based Object Tracking and 3D Localization. 899-907
- Weizhi Nie, Qi Liang, An-An Liu, Zhendong Mao, Yangyang Li:
MMJN: Multi-Modal Joint Networks for 3D Shape Recognition. 908-916
- Yizhou Wang, Yen-Ting Huang, Jenq-Neng Hwang:
Monocular Visual Object 3D Localization in Road Scenes. 917-925
- Xiheng Zhang, Yongkang Wong, Mohan S. Kankanhalli, Weidong Geng:
Unsupervised Domain Adaptation for 3D Human Pose Estimation. 926-934
- Hongwen Zhang, Jie Cao, Guo Lu, Wanli Ouyang, Zhenan Sun:
DaNet: Decompose-and-aggregate Network for 3D Human Shape and Pose Estimation. 935-944
- Jun Yu, Chang Wen Chen, Zengfu Wang:
3D Singing Head for Music VR: Learning External and Internal Articulatory Synchronicity from Lyric, Audio and Notes. 945-952
- Shan Huang, Zhi Wang, Laizhong Cui, Yong Jiang, Rui Gao:
Fine-grained Fitting Experience Prediction: A 3D-slicing Attention Approach. 953-961
- Dawei Zhong, Lei Han, Lu Fang:
iDFusion: Globally Consistent Dense 3D Reconstruction from RGB-D and Inertial Measurements. 962-970
- Jian Wu, Jianbo Jiao, Qingxiong Yang, Zheng-Jun Zha, Xuejin Chen:
Ground-Aware Point Cloud Semantic Segmentation for Autonomous Driving. 971-979
- Xiao Sun, Zhouhui Lian, Jianguo Xiao:
SRINet: Learning Strictly Rotation-Invariant Representations for Point Cloud Classification and Segmentation. 980-988
- Xinhai Liu, Zhizhong Han, Xin Wen, Yu-Shen Liu, Matthias Zwicker:
L2G Auto-encoder: Understanding Point Clouds by Local-to-Global Reconstruction with Hierarchical Self-Attention. 989-997
- Junnan Li, Jianquan Liu, Yongkang Wong, Shoji Nishimura, Mohan S. Kankanhalli:
Self-supervised Representation Learning Using 360° Data. 998-1006
- Ioannis Agtzidis, Mikhail Startsev, Michael Dorr:
360-degree Video Gaze Behaviour: A Ground-Truth Data Set and a Classification Algorithm for Eye Movements. 1007-1015