default search action
24th ACM Multimedia 2016: Amsterdam, The Netherlands
- Alan Hanjalic, Cees Snoek, Marcel Worring, Dick C. A. Bulterman, Benoit Huet, Aisling Kelliher, Yiannis Kompatsiaris, Jin Li:
Proceedings of the 2016 ACM Conference on Multimedia Conference, MM 2016, Amsterdam, The Netherlands, October 15-19, 2016. ACM 2016, ISBN 978-1-4503-3603-1
Keynote Address
- Dirk Helbing:
A Digital World to Thrive In: How the Internet of Things Can Make the "Invisible Hand" Work. 1
Best Paper
- Shengsheng Qian, Tianzhu Zhang, Changsheng Xu:
Multi-modal Multi-view Topic-opinion Mining for Social Event Analysis. 2-11 - Nic Lupfer, Andruid Kerne, Andrew M. Webb, Rhema Linder:
Patterns of Free-form Curation: Visual Thinking with Web Content. 12-21 - Mengbai Xiao, Viswanathan Swaminathan, Sheng Wei, Songqing Chen:
DASH2M: Exploring HTTP/2 for Internet Streaming to Mobile Devices. 22-31 - Jingjing Chen, Chong-Wah Ngo:
Deep-based Ingredient Recognition for Cooking Recipe Retrieval. 32-41
Posters
- Chris Greenhalgh, Adrian Hazzard, Sean McGrath, Steve Benford:
GeoTracks: Adaptive Music for Everyday Journeys. 42-46 - Xiaoshan Yang, Tianzhu Zhang, Changsheng Xu:
Abnormal Event Discovery in User Generated Photos. 47-51 - Shuhui Jiang, Yue Wu, Yun Fu:
Deep Bi-directional Cross-triplet Embedding for Cross-Domain Clothing Retrieval. 52-56 - Liping Jing, Bo Liu, Jaeyoung Choi, Adam Janin, Julia Bernd, Michael W. Mahoney, Gerald Friedland:
A Discriminative and Compact Audio Representation for Event Detection. 57-61 - Kyeong-Ah Jeong, Hyeon-Jeong Suk:
Jockey Time: Making Video Playback to Enhance Emotional Effect. 62-66 - Hui-Hung Wang, Yi-Ling Chen, Chen-Kuo Chiang:
Discriminative Paired Dictionary Learning for Visual Recognition. 67-71 - Yanhao Zhang, Lei Qin, Qingming Huang, Kuiyuan Yang, Jun Zhang, Hongxun Yao:
From Seed Discovery to Deep Reconstruction: Predicting Saliency in Crowd via Deep Networks. 72-76 - Ke Chen, Joni-Kristian Kämäräinen, Zhaoxiang Zhang:
Facial Age Estimation Using Robust Label Distribution. 77-81 - Sidi Liu, Jinglei Lv, Yimin Hou, Ting Shoemaker, Qinglin Dong, Kaiming Li, Tianming Liu:
What Makes a Good Movie Trailer?: Interpretation from Simultaneous EEG and Eyetracker Recording. 82-86 - Xiaojie Guo:
LIME: A Method for Low-light IMage Enhancement. 87-91 - Rufael Mekuria, Jelte Fennema, Dirk Griffioen:
Multi-Protocol Video Delivery with Late Trans-Muxing. 92-96 - Ravi Kiran Sarvadevabhatla, R. Venkatesh Babu:
Analyzing Structural Characteristics of Object Category Representations From Their Semantic-part Distributions. 97-101 - Pichao Wang, Zhaoyang Li, Yonghong Hou, Wanqing Li:
Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks. 102-106 - Chung-Hua Chu:
Efficient Digital Holographic Image Reconstruction on Mobile Devices. 107-111 - Tetsuaki Mano, Hiroaki Yamane, Tatsuya Harada:
Scene Image Synthesis from Natural Sentences Using Hierarchical Syntactic Analysis. 112-116 - Zan Gao, Deyu Wang, Hua Zhang, Yanbing Xue, Guangping Xu:
A Fast 3D Retrieval Algorithm via Class-Statistic and Pair-Constraint Model. 117-121 - Michael Gygli, Mohammad Soleymani:
Analyzing and Predicting GIF Interestingness. 122-126 - Chen Chen, Zuxuan Wu, Yu-Gang Jiang:
Emotion in Context: Deep Semantic Feature Fusion for Video Emotion Recognition. 127-131 - Ying Li, Xiangwei Kong, Liang Zheng, Qi Tian:
Exploiting Hierarchical Activations of Neural Network for Image Retrieval. 132-136 - Lorenzo Porzi, Samuel Rota Bulò, Elisa Ricci:
A Deeply-Supervised Deconvolutional Network for Horizon Line Detection. 137-141 - Yongqing Sun, Zuxuan Wu, Xi Wang, Hiroyuki Arai, Tetsuya Kinebuchi, Yu-Gang Jiang:
Exploiting Objects with LSTMs for Video Categorization. 142-146 - Jacob Thorn, Rodrigo Pizarro, Bernhard Spanlang, Pablo Bermell-Garcia, Mar González-Franco:
Assessing 3D Scan Quality Through Paired-comparisons Psychophysics. 147-151 - Zhou Zhao, Hanqing Lu, Deng Cai, Xiaofei He, Yueting Zhuang:
Partial Multi-Modal Sparse Coding via Adaptive Similarity Structure Regularization. 152-156 - Hervé Bredin, Gregory Gelly:
Improving Speaker Diarization of TV Series using Talking-Face Detection and Clustering. 157-161 - Jen-Yin Chang, Kuan-Ying Lee, Yu-Lin Wei, Kate Ching-Ju Lin, Winston H. Hsu:
Location-Independent WiFi Action Recognition via Vision-based Methods. 162-166 - Edip Demirbilek, Jean-Charles Grégoire:
INRS Audiovisual Quality Dataset. 167-171 - Hui Wu, Michele Merler, Rosario Uceda-Sosa, John R. Smith:
Learning to Make Better Mistakes: Semantics-aware Visual Food Recognition. 172-176 - Xin-Shun Xu:
Dictionary Learning Based Hashing for Cross-Modal Retrieval. 177-181 - Taylor Zheng, Prem Seetharaman, Bryan Pardo:
SocialFX: Studying a Crowdsourced Folksonomy of Audio Effects Terms. 182-186 - Ravi Kiran Sarvadevabhatla, Shiv Surya, Srinivas S. S. Kruthiventi, R. Venkatesh Babu:
SwiDeN: Convolutional Neural Networks For Depiction Invariant Object Recognition. 187-191 - Jiawei Liu, Zheng-Jun Zha, Q. I. Tian, Dong Liu, Ting Yao, Qiang Ling, Tao Mei:
Multi-Scale Triplet CNN for Person Re-Identification. 192-196 - Masoud Mazloom, Robert Rietveld, Stevan Rudinac, Marcel Worring, Willemijn van Dolen:
Multimodal Popularity Prediction of Brand-related Social Media Posts. 197-201 - Nam Do-Hoang Le, Jean-Marc Odobez:
Learning Multimodal Temporal Representation for Dubbing Detection in Broadcast Media. 202-206 - Zhou Ren, Hailin Jin, Zhe L. Lin, Chen Fang, Alan L. Yuille:
Joint Image-Text Representation by Gaussian Visual-Semantic Embedding. 207-211 - Yazhou Yao, Xian-Sheng Hua, Fumin Shen, Jian Zhang, Zhenmin Tang:
A Domain Robust Approach For Image Dataset Construction. 212-216 - Harsh Jhamtani, Shubham Varma, Midhun Gundapuneni, Siddhartha Kumar Dutta:
A Supervised Approach for Text Illustration. 217-221 - Yang Liu, Yan Liu, Xiang Zhang, Gong Chen, Kejun Zhang:
Learning Music Emotion Primitives via Supervised Dynamic Clustering. 222-226 - Jianfeng He, Bingpeng Ma, Shuhui Wang, Yugui Liu, Qingming Huang:
Cross-modal Retrieval by Real Label Partial Least Squares. 227-231 - Yiru Zhao, Yaoyi Li, Zhiwen Shao, Hongtao Lu:
LSOD: Local Sparse Orthogonal Descriptor for Image Matching. 232-236 - Dekui Ma, Jian Liang, Xiangwei Kong, Ran He:
Frustratingly Easy Cross-Modal Hashing. 237-241 - Joseph P. Robinson, Ming Shao, Yue Wu, Yun Fu:
Families in the Wild (FIW): Large-Scale Kinship Image Database and Benchmarks. 242-246 - Ravi Kiran Sarvadevabhatla, Jogendra Kundu, R. Venkatesh Babu:
Enabling My Robot To Play Pictionary: Recurrent Neural Networks For Sketch Recognition. 247-251 - Payal Bajaj, Sumit Shekhar:
Experience Individualization on Online TV Platforms through Persona-based Account Decomposition. 252-256 - Katsunori Ohnishi, Masatoshi Hidaka, Tatsuya Harada:
Improved Dense Trajectory with Cross Streams. 257-261 - Ye Zhou, Xin Lu, Junping Zhang, James Z. Wang:
Joint Image and Text Representation for Aesthetics Analysis. 262-266 - Laura Cabrera Quiros, Hayley Hung:
Who is where?: Matching People in Video to Wearable Acceleration During Crowded Mingling Events. 267-271 - Yun Gu, Chao Ma, Jie Yang:
Supervised Recurrent Hashing for Large Scale Video Retrieval. 272-276 - Nakamasa Inoue, Koichi Shinoda:
Adaptation of Word Vectors using Tree Structure for Visual Semantics. 277-281 - Min-Kook Choi, Hyun-Gyu Lee, Minseok Song, Sang-Chul Lee:
Adaptive Bitrate Selection for Video Encoding with Reduced Block Artifacts. 282-286 - Miriam Redi, Damon Crockett, Lev Manovich, Simon Osindero:
What Makes Photo Cultures Different? 287-291 - Michal Muszynski, Theodoros Kostoulas, Patrizia Lombardo, Thierry Pun, Guillaume Chanel:
Synchronization among Groups of Spectators for Highlight Detection in Movies. 292-296 - Chao Zhang, Junchi Yan, Changsheng Li, Xiaoguang Rui, Liang Liu, Rongfang Bie:
On Estimating Air Pollution from Photos Using Convolutional Neural Network. 297-301 - Xing Xu, Fumin Shen, Yang Yang, Heng Tao Shen, Li He, Jingkuan Song:
Cross-modal Retrieval with Label Completion. 302-306 - Yuhang Wang, Jing Liu, Yong Li, Junjie Yan, Hanqing Lu:
Objectness-aware Semantic Segmentation. 307-311 - Dimitris Chatzopoulos, Pan Hui:
ReadMe: A Real-Time Recommendation System for Mobile Augmented Reality Ecosystems. 312-316 - Yi Tian, Qiuqi Ruan, Gaoyun An, Yun Fu:
Action Recognition Using Local Consistent Group Sparse Coding with Spatio-Temporal Structure. 317-321 - Haiyi Mao, Yue Wu, Jun Li, Yun Fu:
Super Resolution of the Partial Pixelated Images With Deep Convolutional Neural Network. 322-326 - Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino:
Adaptive Visual Feedback Generation for Facial Expression Improvement with Multi-task Deep Neural Networks. 327-331 - Angelos Katharopoulos, Despoina Paschalidou, Christos Diou, Anastasios Delopoulos:
Fast Supervised LDA for Discovering Micro-Events in Large-Scale Video Datasets. 332-336 - Ryan Stables, Brecht De Man, Sean Enderby, Joshua D. Reiss, György Fazekas, Thomas Wilmering:
Semantic Description of Timbral Transformations in Music Production. 337-341 - Di Hu, Xiaoqiang Lu, Xuelong Li:
Multimodal Learning via Exploring Deep Semantic Similarity. 342-346 - Feifei Zhang, Qirong Mao, Ming Dong, Yongzhao Zhan:
Multi-pose Facial Expression Recognition Using Transformed Dirichlet Process. 347-351 - Botong Wu, Yizhou Wang:
Neighborhood-Preserving Hashing for Large-Scale Cross-Modal Search. 352-356 - Zhao Guo, Lianli Gao, Jingkuan Song, Xing Xu, Jie Shao, Heng Tao Shen:
Attention-based LSTM with Semantic Consistency for Videos Captioning. 357-361 - Keiji Yanai, Ryosuke Tanno, Koichi Okamoto:
Efficient Mobile Implementation of A CNN-based Object Recognition System. 362-366 - Jinxin Zheng, Yongtao Wang, Zhi Tang:
Context-aware Geometric Object Reconstruction for Mobile Education. 367-371 - Jen-Chun Lin, Wen-Li Wei, Hsin-Min Wang:
Automatic Music Video Generation Based on Emotion-Oriented Pseudo Song Prediction and Matching. 372-376 - Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, Hsin-Hsi Chen:
Novel Word Embedding and Translation-based Language Modeling for Extractive Speech Summarization. 377-381 - Dae Hoe Kim, Wissam J. Baddar, Yong Man Ro:
Micro-Expression Recognition with Expression-State Constrained Spatio-Temporal Feature Representations. 382-386 - Yuma Sasaka, Takahiro Ogawa, Miki Haseyama:
Multimodal Interest Level Estimation via Variational Bayesian Mixture of Robust CCA. 387-391 - Toan H. Vu, Le Dung, Jia-Ching Wang:
Transportation Mode Detection on Mobile Devices Using Recurrent Nets. 392-396 - Youbao Tang, Xiangqian Wu, Wei Bu:
Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection. 397-401 - Wei-Ta Chu, Yi-Ling Wu:
Deep Correlation Features for Image Style Classification. 402-406 - Ke Yan, Yaowei Wang, Dawei Liang, Tiejun Huang, Yonghong Tian:
CNN vs. SIFT for Image Retrieval: Alternative or Complementary? 407-411 - Xiaoyu Xiong, Maurizio Filippone, Alessandro Vinciarelli:
Looking Good With Flickr Faves: Gaussian Processes for Finding Difference Makers in Personality Impressions. 412-415 - Dejiang Kong, Fei Wu, Siliang Tang, Yueting Zhuang:
Ad Recommendation for Sponsored Search Engine via Composite Long-Short Term Memory. 416-420 - Zhao Liu, Yuwei Wu, Junsong Yuan, Yap-Peng Tan:
Learning a Multi-class Discriminative Dictionary with Nonredundancy Constraints for Visual Classification. 421-425 - Yuwei Wu, Zhe Wang, Junsong Yuan, Ling-Yu Duan:
A Compact Binary Aggregated Descriptor via Dual Selection for Visual Search. 426-430 - Mengfan Tang, Feiping Nie, Ramesh C. Jain:
Capped Lp-Norm Graph Embedding for Photo Clustering. 431-435 - Yi Bin, Yang Yang, Fumin Shen, Xing Xu, Heng Tao Shen:
Bidirectional Long-Short Term Memory for Video Description. 436-440 - Yashaswi Verma, C. V. Jawahar:
A Robust Distance with Correlated Metric Learning for Multi-Instance Multi-Label Data. 441-445 - Yawei Li, Xiaofeng Li, Zhizhong Fu, Wenli Zhong:
Multiview Video Super-Resolution via Information Extraction and Merging. 446-450 - Darshan Santani, Rui Hu, Daniel Gatica-Perez:
InnerView: Learning Place Ambiance from Social Media Images. 451-455 - Jiewei Cao, Zi Huang, Peng Wang, Chao Li, Xiaoshuai Sun, Heng Tao Shen:
Quartet-net Learning for Visual Instance Retrieval. 456-460 - Stavros Arestis-Chartampilas, Nikolaos Gkalelis, Vasileios Mezaris:
AKSDA-MSVM: A GPU-accelerated Multiclass Learning Framework for Multimedia. 461-465 - Chao Sun, Shuaicheng Liu, Taotao Yang, Bing Zeng, Zhengning Wang, Guanghui Liu:
Automatic Reflection Removal using Gradient Intensity and Motion Cues. 466-470 - Xueting Wang, Kensho Hara, Yu Enokibori, Takatsugu Hirayama, Kenji Mase:
Personal Multi-view Viewpoint Recommendation based on Trajectory Distribution of the Viewing Target. 471-475 - Stefano Alletto, Giuseppe Serra, Rita Cucchiara:
Motion Segmentation using Visual and Bio-mechanical Features. 476-480 - Yuan-Shan Lee, Chien-Yao Wang, Seksan Mathulaprangsan, Jia Hao Zhao, Jia-Ching Wang:
Locality-preserving K-SVD Based Joint Dictionary and Classifier Learning for Object Recognition. 481-485 - Huy Phan, Lars Hertel, Marco Maaß, Philipp Koch, Alfred Mertins:
Label Tree Embeddings for Acoustic Scene Classification. 486-490 - Yoann Baveye, Romain Cohendet, Matthieu Perreira Da Silva, Patrick Le Callet:
Deep Learning for Image Memorability Prediction: the Emotional Bias. 491-495 - Zhengzhong Zhou, Jingjin Zhou, Liqing Zhang:
Demand-adaptive Clothing Image Retrieval Using Hybrid Topic Model. 496-500 - Foteini Markatopoulou, Vasileios Mezaris, Ioannis Patras:
Deep Multi-task Learning with Label Correlation Constraint for Video Concept Detection. 501-505 - Raheeb Muzaffar, Evsen Yanmaz, Christian Bettstetter, Andrea Cavallaro:
Application-Layer Rate-Adaptive Multicast Video Streaming over 802.11 for Mobile Devices. 506-510 - Xing Wang, Jie Liang:
Scalable Compression of Deep Neural Networks. 511-515 - Jiahui Yu, Yuning Jiang, Zhangyang Wang, Zhimin Cao, Thomas S. Huang:
UnitBox: An Advanced Object Detection Network. 516-520 - Wenxuan Mou, Hatice Gunes, Ioannis Patras:
Alone versus In-a-group: A Comparative Analysis of Facial Affect Recognition. 521-525 - Meng Wang, Yi Fang:
Local Diffusion Map Signature for Symmetry-aware Non-rigid Shape Correspondence. 526-530 - Francesco Barbieri, Germán Kruszewski, Francesco Ronzano, Horacio Saggion:
How Cosmopolitan Are Emojis?: Exploring Emojis Usage and Meaning over Different Languages with Distributional Semantics. 531-535 - Hanhe Lin, Jeremiah D. Deng, Brendon J. Woodford, Ahmad Shahi:
Online Weighted Clustering for Real-time Abnormal Event Detection in Video Surveillance. 536-540 - Peisong Wang, Jian Cheng:
Accelerating Convolutional Neural Networks for Mobile Applications. 541-545 - Raghvendra Kannao, Durgaprasad Dandi, Swamy Yellapu, Prithwijit Guha:
News Program Detection in TV Broadcast Videos. 546-550 - Wenyi Huang, Dafang He, Xiao Yang, Zihan Zhou, Daniel Kifer, C. Lee Giles:
Detecting Arbitrary Oriented Text in the Wild with a Visual Attention Model. 551-555 - Meng Wang, Yi Fang:
Global Consistent Shape Correspondence for Efficient and Effective Active Shape Models. 556-560 - Pin-Chun Wang, Ching-Ling Fan, Chun-Ying Huang, Kuan-Ta Chen, Cheng-Hsin Hsu:
Towards Ultra-Low-Bitrate Video Conferencing Using Facial Landmarks. 561-565 - Niluthpol Chowdhury Mithun, Rameswar Panda, Amit K. Roy-Chowdhury:
Generating Diverse Image Datasets with Limited Labeling. 566-570 - Shizhe Chen, Qin Jin:
Multi-modal Conditional Attention Fusion for Dimensional Emotion Prediction. 571-575 - Shohei Yamamoto, Tatsuya Harada:
Video Generation Using 3D Convolutional Neural Network. 576-580 - Weiwei Sun, Jiantao Zhou, Ran Lyu, Shuyuan Zhu:
Processing-Aware Privacy-Preserving Photo Sharing over Online Social Networks. 581-585 - Xirong Li, Yujia Huo, Qin Jin, Jieping Xu:
Detecting Violence in Video using Subclasses. 586-590 - Yachuang Feng, Yuan Yuan, Xiaoqiang Lu:
Deep Representation for Abnormal Event Detection in Crowded Scenes. 591-595 - Sanket Khanwalkar, Shonali Balakrishna, Ramesh C. Jain:
Exploration of Large Image Corpuses in Virtual Reality. 596-600 - Alireza Zare, Alireza Aminlou, Miska M. Hannuksela, Moncef Gabbouj:
HEVC-compliant Tile-based Streaming of Panoramic Video for Virtual Reality Applications. 601-605