


23rd ACM Multimedia 2015: Brisbane, Australia
- Xiaofang Zhou, Alan F. Smeaton, Qi Tian, Dick C. A. Bulterman, Heng Tao Shen, Ketan Mayer-Patel, Shuicheng Yan:
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26 - 30, 2015. ACM 2015, ISBN 978-1-4503-3459-4
Keynote 1
- Judy Kay:
Harnessing Big Personal Data, with Scrutable User Modelling for Privacy and Control. 1-2
Keynote 2
- Zhengyou Zhang:
Vision-enhanced Immersive Interaction and Remote Collaboration with Large Touch Displays. 3-4
Best Paper Session
- Xavier Alameda-Pineda, Yan Yan, Elisa Ricci, Oswald Lanz, Nicu Sebe:
Analyzing Free-standing Conversational Groups: A Multimodal Approach. 5-14
- Michael Stengel, Steve Grogorick, Martin Eisemann, Elmar Eisemann, Marcus A. Magnor:
An Affordable Solution for Binocular Eye Tracking and Calibration in Head-mounted Displays. 15-24
- Wei Wang, Gang Chen, Tien Tuan Anh Dinh, Jinyang Gao, Beng Chin Ooi, Kian-Lee Tan, Sheng Wang:
SINGA: Putting Deep Learning in the Hands of Multimedia Users. 25-34
- Xiangbo Shu, Guo-Jun Qi, Jinhui Tang, Jingdong Wang:
Weakly-Shared Deep Transfer Networks for Heterogeneous-Domain Knowledge Propagation. 35-44
Panel 1
- Shih-Fu Chang, Matthew Cooper, Denver Dash, Funda Kivran-Swaine, Jia Li, David A. Shamma:
Opportunities and Challenges of Industry-Academic Collaborations in Multimedia Research. 45
Panel 2
- Joanna Batstone, Touradj Ebrahimi, Tiejun Huang, Yung-Hsiang Lu, Yonggang Wen:
Opportunities and Challenges of Global Network Cameras. 47-48
Session 1: Multimedia Indexing and Search
- Lu Jiang, Shoou-I Yu, Deyu Meng, Yi Yang, Teruko Mitamura, Alexander G. Hauptmann:
Fast and Accurate Content-based Semantic Search in 100M Internet Videos. 49-58
- Yang Yang, Hanwang Zhang, Mingxing Zhang, Fumin Shen, Xuelong Li:
Visual Coding in a Semantic Hierarchy. 59-68
- Xinyang Jiang, Fei Wu, Xi Li, Zhou Zhao, Weiming Lu, Siliang Tang, Yueting Zhuang:
Deep Compositional Cross-modal Learning to Rank via Local-Global Alignment. 69-78
- Yang Wang, Xuemin Lin, Lin Wu, Wenjie Zhang:
Effective Multi-Query Expansions: Robust Landmark Retrieval. 79-88
Session 2: Social Multimedia
- Hongyun Cai, Yang Yang, Xuefei Li, Zi Huang:
What are Popular: Exploring Twitter Features for Event Detection, Tracking and Visualization. 89-98
- Shengsheng Qian, Tianzhu Zhang, Richang Hong, Changsheng Xu:
Cross-Domain Collaborative Learning in Social Multimedia. 99-108
- Shaowei Liu, Peng Cui, Wenwu Zhu, Shiqiang Yang:
Learning Socially Embedded Visual Representation from Scratch. 109-118
- Jiewei Cao, Zi Huang, Yang Yang:
Spatial-aware Multimodal Location Estimation for Social Images. 119-128
Session 3: Emotional and Social Signals in Multimedia
- Yang Hu, Xi Yi, Larry S. Davis:
Collaborative Fashion Recommendation: A Functional Tensor Factorization Approach. 129-138
- Lorenzo Porzi, Samuel Rota Bulò, Bruno Lepri, Elisa Ricci:
Predicting and Understanding Urban Perception with Convolutional Neural Networks. 139-148
- Maarten Brilman, Stefan Scherer:
A Multimodal Predictive Model of Successful Debaters or How I Learned to Sway Votes. 149-158
- Brendan Jou, Tao Chen, Nikolaos Pappas, Miriam Redi, Mercan Topkara, Shih-Fu Chang:
Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology. 159-168
Session: Multimedia Grand Challenge
- Qiang Song, Sixie Yu, Cong Leng, Jiaxiang Wu, Qinghao Hu, Jian Cheng:
Learning Deep Features For MSR-bing Information Retrieval Challenge. 169-172
- Jianfeng Dong, Xirong Li, Shuai Liao, Jieping Xu, Duanqing Xu, Xiaoyong Du:
Image Retrieval by Cross-Media Relevance Fusion. 173-176
- Kuan-Ting Chen, Kezhen Chen, Peizhong Cong, Winston H. Hsu, Jiebo Luo:
Who are the Devils Wearing Prada in New York City? 177-180
- Hsun-Ping Hsieh, Tzu-Chi Yen, Cheng-Te Li:
What Makes New York So Noisy?: Reasoning Noise Pollution by Mining Multimodal Geo-Social Big Data. 181-184
- Rajiv Ratn Shah, Anwar Dilawar Shaikh, Yi Yu, Wenjing Geng, Roger Zimmermann, Gangshan Wu:
EventBuilder: Real-time Multimedia Event Summarization by Visualizing Social Media. 185-188
- Manos Schinas, Symeon Papadopoulos, Georgios Petkos, Yiannis Kompatsiaris, Pericles A. Mitkas:
Multimodal Graph-based Event Detection and Summarization in Social Media Streams. 189-192
- Jaeyoung Choi, Eungchan Kim, Martha A. Larson, Gerald Friedland, Alan Hanjalic:
Evento 360: Social Event Discovery from Web-scale Multimedia Collection. 193-196
- Wen-Yu Lee, Yin-Hsi Kuo, Peng-Ju Hsieh, Wen-Feng Cheng, Ting-Hsuan Chao, Hui-Lan Hsieh, Chieh-En Tsai, Hsiao-Ching Chang, Jia-Shin Lan, Winston H. Hsu:
Unsupervised Latent Aspect Discovery for Diverse Event Summarization. 197-200
Session: Brave New Ideas
- Claudio Martella, Ekin Gedik, Laura Cabrera Quiros, Gwenn Englebienne, Hayley Hung:
How Was It?: Exploiting Smartphone Sensing to Measure Implicit Audience Responses to Live Performances. 201-210
- Darshan Santani, Daniel Gatica-Perez:
Loud and Trendy: Crowdsourcing Impressions of Social Ambiance in Popular Indoor Urban Places. 211-220
- Laleh Jalali, Ramesh C. Jain:
Bringing Deep Causality to Multimedia Data Streams. 221-230
- Jan Zahálka, Stevan Rudinac, Marcel Worring:
Analytic Quality: Evaluation of Performance and Insight in Multimedia Collection Analysis. 231-240
Session 4: Multimedia and Vision
- I-Kao Chiang, Ian Spiro, Seungkyu Lee, Alyssa Lees, Jingchen Liu, Chris Bregler, Yanxi Liu:
Dancing with Turks. 241-250
- Antonio Robles-Kelly:
Single Image Spectral Reconstruction for Multimedia Applications. 251-260
- Xiangyun Meng, Wei Wang, Ben Leong:
SkyStitch: A Cooperative Multi-UAV-based Real-time Video Surveillance System with Stitching. 261-270
- Ravi Kiran Sarvadevabhatla, R. Venkatesh Babu:
Eye of the Dragon: Exploring Discriminatively Minimalist Sketch-based Abstractions for Object Categories. 271-280
Session 5: Multimedia Art, Entertainment and Culture
- Douglas L. Williams, Ian C. Kegel, Marian Florin Ursu, Pablo César, Jack Jansen, Erik Geelhoed, Andras Horti, Michael Frantzis, Bill Scott:
A Distributed Theatre Experiment with Shakespeare. 281-290
- Jia Chen, Qin Jin, Yong Yu, Alexander G. Hauptmann:
Image Profiling for History Events on the Fly. 291-300
- Zihan Zhou, Siqiong He, Jia Li, James Ze Wang:
Modeling Perspective Effects in Photographic Composition. 301-310
- Andreza Sartori, Dubravko Culibrk, Yan Yan, Nicu Sebe:
Who's Afraid of Itten: Using the Art Theory of Color Combination to Analyze Emotions in Abstract Paintings. 311-320
Session 6: Telepresence, Virtual, and Augmented Reality
- Xiaowu Chen, Jianwei Li, Qing Li, Bo Gao, Dongqing Zou, Qinping Zhao:
Image2Scene: Transforming Style of 3D Room. 321-330
- Kiana Calagari, Mohamed A. Elgharib, Piotr Didyk, Alexandre Kaspar, Wojciech Matusik, Mohamed Hefeeda:
Gradient-based 2D-to-3D Conversion for Soccer Videos. 331-340
- Zhanpeng Huang, Weikai Li, Pan Hui:
Ubii: Towards Seamless Interaction between Digital and Physical Worlds. 341-350
- Chun-Ying Huang, Chih-Fan Hsu, Tsung-Han Tsai, Ching-Ling Fan, Cheng-Hsin Hsu, Kuan-Ta Chen:
Smart Beholder: An Open-Source Smart Lens for Mobile Photography. 351-360
Session 7: Actions and Events
- Yunpeng Wu, Yangdong Ye, Chenyang Zhao:
Coherent Motion Detection with Collective Density Clustering. 361-370
- Chen Sun, Sanketh Shetty, Rahul Sukthankar, Ram Nevatia:
Temporal Localization of Fine-Grained Actions in Videos by Domain Transfer from Web Images. 371-380
- Sébastien Poullot, Shunsuke Tsukatani, Phuong Anh Nguyen, Hervé Jégou, Shin'ichi Satoh:
Temporal Matching Kernel with Explicit Feature Maps. 381-390
- Gregory D. Castañón, Yuting Chen, Ziming Zhang, Venkatesh Saligrama:
Efficient Activity Retrieval through Semantic Graph Queries. 391-400
Session 8: Video Systems
- Charles D. Estes, Ketan Mayer-Patel:
Video Killed The Data Store: Extending the n-Dimensional Display Interface for Full Screen Video. 401-410
- Mohammad Reza Zakerinasab, Mea Wang:
Dependency-Aware Unequal Error Protection for Layered Video Coding. 411-420
- Shahid Akhtar, Andre Beck, Ivica Rimac:
HiFi: A Hierarchical Filtering Algorithm for Caching of Online Video. 421-430
- Zhisheng Yan, Qian Liu, Tong Zhang, Chang Wen Chen:
Exploring QoE for Power Efficiency: A Field Study on Mobile Videos with LCD Displays. 431-440
Session 9: Deep Learning and Multimedia
- Yalong Bai, Kuiyuan Yang, Wei Yu, Chang Xu, Wei-Ying Ma, Tiejun Zhao:
Automatic Image Dataset Construction from Click-through Logs Using Deep Neural Network. 441-450
- Zhangyang Wang, Jianchao Yang, Hailin Jin, Eli Shechtman, Aseem Agarwala, Jonathan Brandt, Thomas S. Huang:
DeepFont: Identify Your Font from An Image. 451-459
- Zuxuan Wu, Xi Wang, Yu-Gang Jiang, Hao Ye, Xiangyang Xue:
Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification. 461-470
- Guangnan Ye, Yitong Li, Hongliang Xu, Dong Liu, Shih-Fu Chang:
EventNet: A Large Scale Structured Concept Library for Complex Event Detection in Video. 471-480
Session 10: Multimedia Quality Perception
- Michael James Scott, Sharath Chandra Guntuku, Huan Yang, Weisi Lin, Gheorghita Ghinea:
Modelling Human Factors in Perceptual Multimedia Quality: On The Role of Personality and Culture. 481-490
- Luming Zhang, Meng Wang, Liqiang Nie, Richang Hong, Yingjie Xia, Roger Zimmermann:
Biologically Inspired Media Quality Modeling. 491-500
- Wei Song, Yao Xiao, Dian Tjondronegoro, Antonio Liotta:
QoE Modelling for VP9 and H.265 Videos on Mobile Devices. 501-510
- Bilei Zhu, Wei Li, Linwei Li:
Towards Solving the Bottleneck of Pitch-based Singing Voice Separation. 511-520
Session 11: Multimedia Networking
- Mohammed Shatnawi, Mohamed Hefeeda:
Enhancing the Quality of Interactive Multimedia Services by Proactive Monitoring and Failure Prediction. 521-530
- Fanxin Kong, Xingjian Lu, Mingyuan Xia, Xue Liu, Haibing Guan:
Distributed Optimal Datacenter Bandwidth Allocation for Dynamic Adaptive Video Streaming. 531-540
- Rafael Huysegems, Tom Bostoen, Patrice Rondao-Alface, Jeroen van der Hooft, Stefano Petrangeli, Tim Wauters, Filip De Turck:
HTTP/2-Based Methods to Improve the Live Experience of Adaptive Streaming. 541-550
- Vengatanathan Krishnamoorthi, Niklas Carlsson, Derek L. Eager, Anirban Mahanti, Nahid Shahmehri:
Bandwidth-aware Prefetching for Proactive Multi-video Preloading and Improved HAS Performance. 551-560
Session 12: Data Imperfectness for Multimedia
- Qilin Zhang, Gang Hua:
Multi-View Visual Recognition of Imperfect Testing Data. 561-570
- Pravin Kakar, Alex Yong Sang Chia:
If You Can't Beat Them, Join Them: Learning with Noisy Data. 571-580
- Xiaojun Chang, Yaoliang Yu, Yi Yang, Alexander G. Hauptmann:
Searching Persuasively: Joint Event Detection and Evidence Recounting with Limited Supervision. 581-590
- Liqiang Nie, Luming Zhang, Yi Yang, Meng Wang, Richang Hong, Tat-Seng Chua:
Beyond Doctors: Future Health Prediction from Multimedia and Multimodal Observations. 591-600
Session 13: Multimedia Experiences and Expectations
- Tian Gan, Yongkang Wong, Bappaditya Mandal, Vijay Chandrasekhar, Mohan S. Kankanhalli:
Multi-sensor Self-Quantification of Presentations. 601-610
- Andreas Girgensohn, Jennifer Marlow, Frank M. Shipman III, Lynn Wilcox:
HyperMeeting: Supporting Asynchronous Meetings with Hypervideo. 611-620
- Arijit Biswas, Ankit Gandhi, Om Deshmukh:
MMToC: A Multimodal Method for Table of Content Creation in Educational Videos. 621-630
- Kai Ruhl, Martin Eisemann, Anna Hilsmann, Peter Eisert, Marcus A. Magnor:
Interactive Scene Flow Editing for Improved Image-based Rendering and Virtual Spacetime Navigation. 631-640
Doctoral Symposium
- Yogesh Singh Rawat:
Real-Time Assistance in Multimedia Capture Using Social Media. 641-644
- Christoph Korinke:
Intuitive Input Methods for Interactive Segmentation on Mobile Touch-Based Devices. 645-648
- Shannon Chen:
Exploiting Contextual Information to Enable Efficient Content Delivery for 3D Tele-Immersion Applications. 649-652
- Yuhui Wang:
Socializing Multimodal Sensors for Information Fusion. 653-656
- Zhenzhong Lan:
Learn to Recognize Actions Through Neural Networks. 657-660
- Yusuke Matsui:
Challenge for Manga Processing: Sketch-based Manga Retrieval. 661-664
- Alexander Patrick Mathews:
Captioning Images Using Different Styles. 665-668
- Jia-Lin Chen:
Weakly Supervised Learning of Part-based Models for Interaction Prediction via LDA. 669-671
Open Source Software Competition
- Christoph Lassner, Rainer Lienhart:
The fertilized forests Decision Forest Library. 681-684
- Beng Chin Ooi, Kian-Lee Tan, Sheng Wang, Wei Wang, Qingchao Cai, Gang Chen, Jinyang Gao, Zhaojing Luo, Anthony K. H. Tung, Yuan Wang, Zhongle Xie, Meihui Zhang, Kaiping Zheng:
SINGA: A Distributed Deep Learning Platform. 685-688
- Andrea Vedaldi, Karel Lenc:
MatConvNet: Convolutional Neural Networks for MATLAB. 689-692
- Christopher Sweeney, Tobias Höllerer, Matthew A. Turk:
Theia: A Fast and Scalable Structure-from-Motion Library. 693-696
- Joël Dumoulin, Diana Affi, Elena Mugellini, Omar Abou Khaled:
eRS: A System to Facilitate Emotion Recognition in Movies. 697-700
- Federico Bartoli, Lorenzo Seidenari, Giuseppe Lisanti, Svebor Karaman, Alberto Del Bimbo:
WATTS: a Web Annotation Tool for Surveillance Scenarios. 701-704
- Mario Guggenberger:
Aurio: Audio Processing, Analysis and Retrieval. 705-708
- Nicolas Hervé, Pierre Letessier, Mathieu Derval, Hakim Nabi:
Amalia.js: An Open-Source Metadata Driven HTML5 Multimedia Player. 709-712
- Britta Meixner, Stefan John, Christian Handschigl:
SIVA Suite: Framework for Hypervideo Creation, Playback and Management. 713-716
Art Exhibit
- Alinta Krauth:
Using Handmade Controllers for Interactive Projection Mapping. 717-719
- He-Lin Luo, I-Chun Chen, Yi-Ping Hung:
3D Printing and Camera Mapping: Dialectic of Virtual and Reality. 721-722
- James She, Carmen Ng, Desmond Leung:
Drag A Star: The Social Media in Outer Space. 723-726
- Oksana Krzyhanivska, Simon Fay, Jeffrey E. Boyd:
Disturbed System: Recreating Sculptor's Experience of Their Medium With Haptics and Generated Sound. 727-730
- David S. Monaghan, Noel E. O'Connor, Anne Cleary, Denis Connolly:
The Real Time Rolling Shutter. 731-734
Videos/Demos 1
- Spencer Cappallo, Thomas Mensink, Cees G. M. Snoek:
Query-by-Emoji Video Search. 735-736
- Daisuke Ochi, Kenta Niwa, Akio Kameda, Yutaka Kunita, Akira Kojima:
Dive into Remote Events: Omnidirectional Video Streaming with Acoustic Immersion. 737-738
- Joël Dumoulin, Diana Affi, Elena Mugellini, Omar Abou Khaled, Marco Bertini, Alberto Del Bimbo:
Movie's Affect Communication Using Multisensory Modalities. 739-740
- Chao Zhou, Lifeng Sun, Wenming Shi, Shiqiang Yang:
QOEYE: A Data Driven Platform for QoE Visualization and System Performance Monitoring. 741-742
- Hui Liang, Junsong Yuan, Daniel Thalmann, Nadia Magnenat-Thalmann:
AR in Hand: Egocentric Palm Pose Tracking and Gesture Recognition for Augmented Reality Applications. 743-744
- Changcheng Xiao, Changhu Wang, Liqing Zhang:
PPTLens: Create Digital Objects with Sketch Images. 745-746
- François Destelle, Amin Ahmadi, Kieran Moran, Noel E. O'Connor, Nikolaos Zioulis, Anargyros Chatzitofis, Dimitrios Zarpalas, Petros Daras, Luis Unzueta, Jon Goenetxea, Mikel Rodriguez, María Teresa Linaza, Yvain Tisserand, Nadia Magnenat-Thalmann:
A Multi-Modal 3D Capturing Platform for Learning and Preservation of Traditional Sports and Games. 747-748
- Thomas Röggla, Chen Wang, Pablo César:
Analysing Audience Response to Performing Events: A Web Platform for Interactive Exploration of Physiological Sensor Data. 749-750
- Jean Le Feuvre, Cyril Concolato, Nassima Bouzakaria, Viet-Thanh-Trung Nguyen:
MPEG-DASH for Low Latency and Hybrid Streaming Services. 751-752
- Jheng-Wei Peng, Shih-Wei Sun, Wen-Huang Cheng, Yi-Hsuan Yang:
eMosic: Mobile Media Pushing through Social Emotion Sensing. 753-754
- Andrea Ferracani, Daniele Pezzatini, Andrea Benericetti, Marco Guiducci, Alberto Del Bimbo:
PITAGORA: Recommending Users and Local Experts in an Airport Social Network. 755-756
- Andrea Ferracani, Daniele Pezzatini, Marco Bertini, Saverio Meucci, Alberto Del Bimbo:
A System for Video Recommendation using Visual Saliency, Crowdsourced and Automatic Annotations. 757-758
- Faizan Ur Rehman, Ahmed Lbath, Abdullah Murad, Md. Abdur Rahman, Bilal Sadiq, Akhlaq Ahmad, Ahmad M. Qamar, Saleh M. Basalamah:
A Semantic Geo-Tagged Multimedia-Based Routing in a Crowdsourced Big Data Environment. 759-760
- Bilal Sadiq, Md. Abdur Rahman, Abdullah Murad, Muhammad Shahid, Faizan Ur Rehman, Ahmed Lbath, Akhlaq Ahmad, Ahmad M. Qamar:
Crowdsourced Multimedia Enhanced Spatio-temporal Constraint Based on-Demand Social Network for Group Mobility. 761-762
- Ahmad M. Qamar, Abdullah Murad, Mohamed Abdur Rahman, Faizan Ur Rehman, Akhlaq Ahmad, Bilal Sadiq, Saleh M. Basalamah:
A Multi-sensory Gesture-Based Login Environment. 763-764
- Xiong Lv, Shuqiang Jiang, Luis Herranz, Shuang Wang:
Hand-Object Sense: A Hand-held Object Recognition System Based on RGB-D Information. 765-766
- Chao Chen, Fuhai Chen, Donglin Cao, Rongrong Ji:
A Cross-media Sentiment Analytics Platform For Microblog. 767-769
- Chao Liang, Bingyue Huang, Ruimin Hu, Chunjie Zhang, Xiao-Yuan Jing, Jing Xiao:
A Unsupervised Person Re-identification Method Using Model Based Representation and Ranking. 771-774
- Tony Dunnigan, John Doherty, Daniel Avrahami, Jacob T. Biehl, Patrick Chiu, Chelhwon Kim, Qiong Liu, Henry Tang, Lynn Wilcox:
Evolution of a Tabletop Telepresence System through Art and Technology. 775-776
- Tom Z. J. Fu, Jianbing Ding, Richard T. B. Ma, Marianne Winslett, Yin Yang, Zhenjie Zhang, Yong Pei, Bingbing Ni:
LiveTraj: Real-Time Trajectory Tracking over Live Video Streams. 777-780
- Zhuo Wei, Swee-Won Lo, Yu Liang, Tieyan Li, Jialie Shen, Robert H. Deng:
Automatic Accident Detection and Alarm System. 781-784