


default search action
22nd ACM Multimedia 2014: Orlando, FL, USA
- Kien A. Hua, Yong Rui, Ralf Steinmetz, Alan Hanjalic, Apostol Natsev, Wenwu Zhu:

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03 - 07, 2014. ACM 2014, ISBN 978-1-4503-3063-3
Keynote 1
- Harry Shum:

Bing, the fastest growing image search engine. 1
Keynote 2
- Rosalind W. Picard:

Affective media and wearables: surprising findings. 3-4
Keynote 3
- Klara Nahrstedt:

Back and to the future: quality provisioning for multimedia content delivery. 5
Best Paper Session
- Fangxiang Feng, Xiaojie Wang, Ruifan Li:

Cross-modal Retrieval with Correspondence Autoencoder. 7-16 - AmirHossein Habibian, Thomas Mensink

, Cees G. M. Snoek:
VideoStory: A New Multimedia Embedding for Few-Example Recognition and Translation of Events. 17-26 - Yelin Kim, Emily Mower Provost

:
Say Cheese vs. Smile: Reducing Speech-Related Variability for Facial Emotion Recognition. 27-36
Multimedia Art and Entertainment
- Javier Villegas, Angus Graeme Forbes

:
Analysis/synthesis approaches for creatively processing video signals. 37-46 - Sicheng Zhao, Yue Gao, Xiaolei Jiang, Hongxun Yao, Tat-Seng Chua, Xiaoshuai Sun:

Exploring Principles-of-Art Features For Image Emotion Recognition. 47-56 - Jiajia Li, Grace Ngai

, Stephen Chi-fai Chan
, Kien A. Hua, Hong Va Leong
, Alvin T. S. Chan:
From Writing to Painting: A Kinect-Based Cross-Modal Chinese Painting Generation System. 57-66 - Charles Roberts, Matthew Wright, JoAnn Kuchera-Morin, Tobias Höllerer:

Gibber: Abstractions for Creative Multimedia Programming. 67-76
Action, Activity, and Event Recognition
- Zhigang Ma, Yi Yang, Nicu Sebe

, Alexander G. Hauptmann:
Multiple Features But Few Labels?: A Symbiotic Solution Exemplified for Video Analysis. 77-86 - Chengcheng Jia, Yu Kong, Zhengming Ding, Yun Raymond Fu:

Latent Tensor Transfer Learning for RGB-D Action Recognition. 87-96 - Keze Wang

, Xiaolong Wang, Liang Lin, Meng Wang, Wangmeng Zuo:
3D Human Activity Recognition with Reconfigurable Convolutional Neural Networks. 97-106 - Pei Xu, Mao Ye, Xue Li

, Qihe Liu, Yi Yang, Jian Ding:
Dynamic Background Learning through Deep Auto-encoder Networks. 107-116
Music, Speech and Audio
- Bin Wu, Erheng Zhong, Andrew Horner, Qiang Yang:

Music Emotion Recognition by Multi-label Multi-layer Multi-instance Multi-view Learning. 117-126 - Kuang Mao, Ju Fan, Lidan Shou, Gang Chen, Mohan S. Kankanhalli

:
Song Recommendation for Social Singing Community. 127-136 - Hervé Bredin, Anindya Roy, Nicolas Pécheux, Alexandre Allauzen:

"Sheldon speaking, Bonjour!": Leveraging Multilingual Tracks for (Weakly) Supervised Speaker Identification. 137-146 - Kai Li

, Jun Ye, Kien A. Hua:
What's Making that Sound? 147-156
Deep Learning for Multimedia
- Ji Wan, Dayong Wang, Steven Chu-Hong Hoi

, Pengcheng Wu, Jianke Zhu, Yongdong Zhang, Jintao Li:
Deep Learning for Content-Based Image Retrieval: A Comprehensive Study. 157-166 - Zuxuan Wu, Yu-Gang Jiang, Jun Wang, Jian Pu, Xiangyang Xue:

Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification. 167-176 - Tianjun Xiao, Jiaxing Zhang, Kuiyuan Yang, Yuxin Peng, Zheng Zhang:

Error-Driven Incremental Learning in Deep Convolutional Neural Network for Large-Scale Image Classification. 177-186 - Hanwang Zhang

, Yang Yang, Huan-Bo Luan, Shuicheng Yan, Tat-Seng Chua:
Start from Scratch: Towards Automatically Identifying, Modeling, and Naming Visual Attributes. 187-196
Multimedia Grand Challenge
- Shintami Chusnul Hidayati, Kai-Lung Hua

, Wen-Huang Cheng, Shih-Wei Sun
:
What are the Fashion Trends in New York? 197-200 - Yin-Hsi Kuo, Yan-Ying Chen, Bor-Chun Chen, Wen-Yu Lee, Chun-Che Wu, Chia-Hung Lin, Yu-Lin Hou, Wen-Feng Cheng, Yi-Chih Tsai, Chung-Yen Hung, Liang-Chi Hsieh, Winston H. Hsu:

Discovering the City by Mining Diverse and Multimodal Data Streams. 201-204 - Jan Zahálka

, Stevan Rudinac, Marcel Worring
:
New Yorker Melange: Interactive Brew of Personalized Venue Recommendations. 205-208 - Rajiv Ratn Shah

, Yi Yu, Anwar Dilawar Shaikh, Suhua Tang, Roger Zimmermann
:
ATLAS: Automatic Temporal Segmentation and Annotation of Lecture Videos Based on Modelling Transition Time. 209-212 - Brendan Jou, Subhabrata Bhattacharya, Shih-Fu Chang:

Predicting Viewer Perceived Emotions in Animated GIFs. 213-216 - Yogesh Singh Rawat, Mohan S. Kankanhalli

:
Context-Based Photography Learning using Crowdsourced Images and Social Media. 217-220 - Mei-Chen Yeh

, Hsiao-Wei Lin:
Virtual Portraitist: Aesthetic Evaluation of Selfies Based on Angle. 221-224 - Jian Wang, Cuicui Kang, Yonghao He, Shiming Xiang, Chunhong Pan:

Cross Modal Deep Model and Gaussian Process Based Model for MSR-Bing Challenge. 225-228 - Yalong Bai

, Wei Yu, Tianjun Xiao, Chang Xu
, Kuiyuan Yang, Wei-Ying Ma
, Tiejun Zhao:
Bag-of-Words Based Deep Neural Network for Image Retrieval. 229-232 - Yingwei Pan

, Ting Yao, Xinmei Tian, Houqiang Li, Chong-Wah Ngo
:
Click-through-based Subspace Learning for Image Search. 233-236
Multimedia HCI and QoE
- Luming Zhang, Yue Gao, Chao Zhang

, Hanwang Zhang
, Qi Tian, Roger Zimmermann
:
Perception-Guided Multimodal Feature Fusion for Photo Aesthetics Assessment. 237-246 - Hiromi Nemoto, Philippe Hanhart, Pavel Korshunov, Touradj Ebrahimi

:
Impact of Ultra High Definition on Visual Attention. 247-256 - Jiangyang Zhang, C.-C. Jay Kuo

:
An Objective Quality of Experience (QoE) Assessment Index for Retargeted Images. 257-266 - Wei Song, Dian Tjondronegoro

, Ivan Himawan
:
Acceptability-based QoE Management for User-centric Mobile Video Delivery: A Field Study Evaluation. 267-276
Multimedia Analysis and Mining
- Wenxuan Xie, Yuxin Peng, Jianguo Xiao:

Weakly-Supervised Image Parsing via Constructing Semantic Graphs and Hypergraphs. 277-286 - Xiaopeng Zhang, Hongkai Xiong

, Wengang Zhou, Qi Tian:
Fused one-vs-all mid-level features for fine-grained visual categorization. 287-296 - Wei Zhang, Hongzhi Li, Chong-Wah Ngo

, Shih-Fu Chang:
Scalable Visual Instance Mining with Threads of Features. 297-306 - Yanfei Wang, Fei Wu, Jun Song, Xi Li, Yueting Zhuang:

Multi-modal Mutual Topic Reinforce Modeling for Cross-media Retrieval. 307-316
Multimedia Systems
- Vengatanathan Krishnamoorthi, Niklas Carlsson, Derek L. Eager, Anirban Mahanti, Nahid Shahmehri:

Quality-adaptive Prefetching for Interactive Branched Video using HTTP-based Adaptive Streaming. 317-326 - Benjamin Rainer, Christian Timmerer:

Self-Organized Inter-Destination Multimedia Synchronization For Adaptive Media Streaming. 327-336 - Kiana Calagari, Krzysztof Templin, Tarek Elgamal, Khaled Diab, Piotr Didyk

, Wojciech Matusik, Mohamed Hefeeda
:
Anahita: A System for 3D Video Streaming with Depth Customization. 337-346 - Li Lin, Xiaofei Liao, Guang Tan, Hai Jin, Xiaobin Yang, Wei Zhang, Bo Li:

LiveRender: A Cloud Gaming System Based on Compressed Graphics Streaming. 347-356
Emotional and Social Signals in Multimedia
- Enver Sangineto

, Gloria Zen, Elisa Ricci
, Nicu Sebe
:
We are not All Equal: Personalizing Models for Facial Expression Analysis with Transductive Parameter Transfer. 357-366 - Tao Chen, Felix X. Yu, Jiawei Chen, Yin Cui, Yan-Ying Chen, Shih-Fu Chang:

Object-Based Visual Sentiment Concept Analysis and Application. 367-376 - Florian Lingenfelser, Johannes Wagner, Elisabeth André

, Gary McKeown
, William Curran:
An Event Driven Fusion Approach for Enjoyment Recognition in Real-time. 377-386 - John R. Zhang, Jason Sherwin, Jacek Dmochowski, Paul Sajda, John R. Kender:

Correlating Speaker Gestures in Political Debates with Audience Engagement Measured via EEG. 387-396
High Risks High Rewards
- Michael Riegler, Martha A. Larson, Mathias Lux, Christoph Kofler:

How 'How' Reflects What's What: Content-based Exploitation of How Users Frame Social Images. 397-406 - Miaojing Shi, Teddy Furon, Hervé Jégou:

A Group Testing Framework for Similarity Search in High-dimensional Spaces. 407-416 - Eva Mohedano, Graham Healy

, Kevin McGuinness
, Xavier Giró-i-Nieto
, Noel E. O'Connor
, Alan F. Smeaton
:
Object Segmentation in Images using EEG Signals. 417-426 - Oche Ejembi, Saleem N. Bhatti

:
Help Save The Planet: Please Do Adjust Your Picture. 427-436
Multimedia Applications
- Kenta Kusumoto, Teemu Kinnunen, Jari Kätsyri, Heikki Lindroos, Pirkko Oittinen:

Media Experience of Complementary Information and Tweets on a Second Screen. 437-446 - Pradeep Kumar Jayaraman, Chi-Wing Fu

:
Interactive Line Drawing Recognition and Vectorization with Commodity Camera. 447-456 - Xin Lu, Zhe Lin, Hailin Jin, Jianchao Yang, James Z. Wang

:
RAPID: Rating Pictorial Aesthetics using Deep Learning. 457-466 - Si Liu, Xiaodan Liang, Luoqi Liu, Ke Lu, Liang Lin, Shuicheng Yan:

Fashion Parsing with Video Context. 467-476
Privacy, Health and Well-being
- Andrey Bogomolov, Bruno Lepri, Michela Ferron

, Fabio Pianesi, Alex Pentland:
Daily Stress Recognition from Mobile Phone Data, Weather Conditions and Individual Traits. 477-486 - Shenggao Zhu, Robert J. Ellis

, Gottfried Schlaug, Yee Sien Ng
, Ye Wang
:
Validating an iOS-based Rhythmic Auditory Cueing Evaluation (iRACE) for Parkinson's Disease. 487-496 - Zhan Qin

, Jingbo Yan, Kui Ren
, Chang Wen Chen
, Cong Wang
:
Towards Efficient Privacy-preserving Image Feature Extraction in Cloud Computing. 497-506 - Huijie Lin, Jia Jia, Quan Guo, Yuanyuan Xue, Qi Li, Jie Huang, Lianhong Cai, Ling Feng:

User-level psychological stress detection from social media using deep neural network. 507-516
Multimedia Search and Indexing
- Jianfeng Wang, Heng Tao Shen, Shuicheng Yan, Nenghai Yu, Shipeng Li

, Jingdong Wang
:
Optimized Distances for Binary Code Ranking. 517-526 - Yao Hu, Zhongming Jin, Hongyi Ren

, Deng Cai, Xiaofei He:
Iterative Multi-View Hashing for Cross Media Indexing. 527-536 - Xiaopeng Yang, Tao Mei, Yongdong Zhang:

Rescue Tail Queries: Learning to Image Search Re-rank via Click-wise Multimodal Fusion. 537-546 - Lu Jiang, Deyu Meng, Teruko Mitamura, Alexander G. Hauptmann:

Easy Samples First: Self-paced Reranking for Zero-Example Multimedia Search. 547-556
Social Media and Crowd
- Ming Yan, Jitao Sang, Changsheng Xu:

Mining Cross-network Association for YouTube Video Promotion. 557-566 - Xue Geng, Hanwang Zhang

, Zheng Song, Yang Yang, Huan-Bo Luan, Tat-Seng Chua:
One of a Kind: User Profiling by Social Curation. 567-576 - Axel Carlier, Lilian Calvet, Duong-Trung-Dung Nguyen, Wei Tsang Ooi

, Pierre Gurdjos, Vincent Charvillat:
3D Interest Maps From Simultaneous Video Recordings. 577-586 - Prem Seetharaman, Bryan Pardo:

Crowdsourcing a Reverberation Descriptor Map. 587-596
Multimedia Recommendations
- Peng Cui, Zhiyu Wang, Zhou Su:

What Videos Are Similar with You?: Learning a Common Attributed Representation for Video Recommendation. 597-606 - Rajiv Ratn Shah

, Yi Yu, Roger Zimmermann
:
ADVISOR: Personalized Video Soundtrack Recommendation by Late Fusion with Heuristic Rankings. 607-616 - Shaowei Liu, Peng Cui, Wenwu Zhu, Shiqiang Yang, Qi Tian:

Social Embedding Image Distance Learning. 617-626 - Xinxi Wang, Ye Wang

:
Improving Content-based and Hybrid Music Recommendation using Deep Learning. 627-636
Doctoral Symposium 1
- Mario Taschwer:

Medical case retrieval. 639-642 - Stefan Wilk, Wolfgang Effelsberg:

Mobile Video Broadcasting Services: Combining Video Composition and Network Efficient Transmission. 643-646 - David Grunberg:

Music-information retrieval in environments containing acoustic noise. 647-650 - Jeffrey J. Scott:

Automated Multi-Track Mixing and Analysis of Instrument Mixtures. 651-654
Doctoral Symposium 2
- Jichao Sun:

Local Selection of Features for Image Search and Annotation. 655-658 - Manfred Jürgen Primus:

Segmentation and Indexing of Endoscopic Videos. 659-662 - Desara Xhura:

Learning recognition of semantically relevant video segments from endoscopy videos contributed and edited in a private social network. 663-666 - Mario Guggenberger:

Multimodal Alignment of Videos. 667-670
Open Source Software Competition 1
- Xin Yang, Chong Huang, Kwang-Ting (Tim) Cheng

:
libLDB: a library for extracting ultrafast and distinctive binary feature description. 671-674 - Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross B. Girshick, Sergio Guadarrama, Trevor Darrell:

Caffe: Convolutional Architecture for Fast Feature Embedding. 675-678 - Joan Alabort-i-Medina, Epameinondas Antonakos, James Booth

, Patrick Snape, Stefanos Zafeiriou:
Menpo: A Comprehensive Platform for Parametric Image Alignment and Visual Deformable Models. 679-682
Open Source Software Competition 2
- Jack Jansen

:
VideoLat: An Extensible Tool for Multimedia Delay Measurements. 683-686 - Matthijs Douze, Hervé Jégou:

The Yael Library. 687-690 - Giuseppe Becchi, Marco Bertini, Lorenzo Cioni, Alberto Del Bimbo, Andrea Ferracani, Daniele Pezzatini, Mathias Lux:

Loki+Lire: a framework to create web-based multimedia search engines. 691-694
Art Exhibit
- Parag Kumar Mital:

Audiovisual Resynthesis in an Augmented Reality. 695-698 - Charles Roberts:

Sound-Light Giblet. 699-700 - Michael Riegler, Mathias Lux, Christian Zellot, Lukas Knoch, Horst Schnattler, Sabrina Napetschnig, Julian Kogler, Claus Degendorfer, Norbert Spot, Manuel Zoderer:

Gone: an interactive experience for two people. 701-704 - Sarah Linebaugh:

Circles and Sounds. 705-708 - Yuan-Yi Fan:

Qi Visualizer: An Interactive Pulse Spectrogram Visualization using Mobile Participatory Biometrics. 709-712 - F. Myles Sciotto, Jean-Michel Crettaz:

Stoicheia: Architecture, Sound and Tesla's Apotheosis. 713-716 - Lonce Wyse:

States of Diffusion for n+1 devices. 717-719
Demos 1: Searching and Finding
- Julien Champ

, Alexis Joly, Pierre Bonnet
:
Fine-grained Visual Faceted Search. 721-722 - André F. Araújo, David M. Chen, Peter Vajda, Bernd Girod:

Real-time query-by-image video search system. 723-724 - Vamsidhar Reddy Gaddam, Ragnar Langseth, Håkon Kvale Stensland, Carsten Griwodz, Pål Halvorsen, Øystein Landsverk:

Automatic Real-Time Zooming and Panning on Salient Objects from a Panoramic Video. 725-726 - Hao-Kai Wen, Wei-Che Chang, Chia-Hu Chang

, Yin-Tzu Lin, Ja-Ling Wu
:
Event Detection in Broadcasting Video for Halfpipe Sports. 727-728 - Jianquan Liu, Shoji Nishimura, Takuya Araki:

Wally: A Scalable Distributed Automated Video Surveillance System with Rich Search Functionalities. 729-730 - Junshi Huang, Wei Xia, Shuicheng Yan:

Deep Search with Attribute-aware Deep Network. 731-732 - Rene Kaiser

, Wolfgang Weiss, Manolis Falelakis, Marian Florin Ursu:
Virtual Director Adapting Visual Presentation to Conversation Context in Group Videoconferencing: An Interactive Demo. 733-734 - Jie Wu, Changhu Wang, Liqing Zhang, Yong Rui:

SmartVisio: Interactive Sketch Recognition with Natural Correction and Editing. 735-736
Demos 2: Senses and Sensors
- Nimesha Ranasinghe, Kuan-Yi Lee, Gajan Suthokumar, Ellen Yi-Luen Do

:
Taste+: Digitally Enhancing Taste Sensations of Food and Beverages. 737-738 - Prem Seetharaman, Bryan Pardo:

Reverbalize: A Crowdsourced Reverberation Controller. 739-740 - Mark Cartwright, Bryan Pardo:

SynthAssist: an audio synthesizer programmed with vocal imitation. 741-742 - Yong-Xiang Wang, Li-Yun Lo, Min-Chun Hu:

Eat as much as you can: a kinect-based facial rehabilitation game based on mouth and tongue movements. 743-744 - Ahmad M. Qamar, Imad Afyouni, Delwar Hossain, Faizan Ur Rehman

, Asad H. Toonsi, Mohamed Abdur Rahman
, Saleh M. Basalamah
:
A Multimedia E-Health Framework Towards An Interactive And Non-Invasive Therapy Monitoring Environment. 745-746 - Hongyun Cai, Zhongxian Tang, Yang Yang, Zi Huang

:
EventEye: Monitoring Evolving Events from Tweet Streams. 747-748 - Yuan Tian, Suraj Raghuraman, Yin Yang, Xiaohu Guo, Balakrishnan Prabhakaran:

3D Immersive Cardiopulmonary Resuscitation (CPR) Trainer. 749-750 - Mei-Chen Yeh

, Hsiao-Wei Lin:
Taking good selfies on your phone. 751-752
Demos 3: Systems
- Peng Wang

, Yang Yang, Zi Huang
, Jiewei Cao
, Heng Tao Shen:
WeMash: An Online System for Web Video Mashup. 753-754 - Zhenhuan Gao, Chien-Nan (Shannon) Chen, Klara Nahrstedt:

FreeViewer: An Intelligent Director for 3D Tele-Immersion System. 755-756 - Jun Chen, Chaokun Wang, Lei Yang, Qingfu Wen, Xu Wang:

MiSCon: a hot plugging tool for real-time motion-based system control. 757-758 - Zhineng Chen, Jinfeng Bai, Chong-Wah Ngo

, Bailan Feng, Bo Xu:
CeleLabel: an interactive system for annotating celebrities in web videos. 759-760 - Yoshiyuki Kawano, Keiji Yanai

:
FoodCam-256: A Large-scale Real-time Mobile Food RecognitionSystem employing High-Dimensional Features and Compression of Classifier Weights. 761-762 - Daisuke Ochi, Yutaka Kunita, Kensaku Fujii, Akira Kojima, Shinnosuke Iwaki, Junichi Hirose:

HMD Viewing Spherical Video Streaming System. 763-764 - Duong-Trung-Dung Nguyen, Axel Carlier, Wei Tsang Ooi

, Vincent Charvillat:
Jiku director 2.0: a mobile video mashup system with zoom and pan using motion maps. 765-766 - Mario Guggenberger, Mathias Lux, László Böszörményi:

ClockDrift: a mobile application for measuring drift in multimedia devices. 767-768
Posters 1
- Guangxin Ren, Junjie Cai, Shipeng Li

, Nenghai Yu, Qi Tian:
Salable Image Search with Reliable Binary Code. 769-772 - Kota Yamaguchi

, Tamara L. Berg, Luis E. Ortiz
:
Chic or Social: Visual Popularity Analysis in Online Fashion Networks. 773-776 - Nakamasa Inoue, Koichi Shinoda:

n-gram Models for Video Semantic Indexing. 777-780 - Wei-Ta Chu

, Ying-Chieh Chao:
Line-Based Drawing Style Description for Manga Classification. 781-784 - Fatih Çakir, Stan Sclaroff:

Supervised hashing with error correcting codes. 785-788 - Yubin Deng, Ping Luo, Chen Change Loy, Xiaoou Tang:

Pedestrian Attribute Recognition At Far Distance. 789-792 - Yin-Tzu Lin, Po-Nien Chen, Chia-Hu Chang

, Ja-Ling Wu
:
MSVA: Musical Street View Animator: An Effective and Efficient Way to Enjoy the Street Views of Your Journey. 793-796 - Parvez Ahammad, Brian Kennedy, Padmapani Ganti, Hariharan Kolam:

QoE-driven Unsupervised Image Categorization for Optimized Web Delivery: Short Paper. 797-800 - Zhengwei Huang, Ming Dong, Qirong Mao, Yongzhao Zhan:

Speech Emotion Recognition Using CNN. 801-804 - Shuyang Wang, Ming Shao, Yun Fu:

Attractive or Not?: Beauty Prediction with Attractiveness-Aware Encoders and Robust Late Fusion. 805-808 - Chin-Chia Michael Yeh, Ping-Keng Jao, Yi-Hsuan Yang:

AWtoolbox: Characterizing Audio Information Using Audio Words. 809-812 - Chih-Fan Hsu, De-Yu Chen, Chun-Ying Huang

, Cheng-Hsin Hsu, Kuan-Ta Chen:
Screencast in the Wild: Performance and Limitations. 813-816 - Wei Jiang, Zhenyu Wu, John Wus, Hong Heather Yu:

One-Pass Video Stabilization on Mobile Devices. 817-820 - Bo Zhang

, Yan Yan, Nicola Conci
, Nicu Sebe
:
You Talkin' to Me?: Recognizing Complex Human Interactions in Unconstrained Videos. 821-824 - Shoou-I Yu, Lu Jiang, Alexander G. Hauptmann:

Instructional Videos for Unsupervised Harvesting and Learning of Action Examples. 825-828 - Xiaobo Wang, Xiaochun Cao, Xiaojie Guo, Zhanjie Song:

Beautifying Fisheye Images using Orientation and Shape Cues. 829-832 - Jianfeng Xu, Shigeyuki Sakazawa:

Temporal Fusion Approach Using Segment Weight for Affect Recognition from Body Movements. 833-836 - Chun-Te Chu, Jaeyeon Jung, Zhicheng Liu, Ratul Mahajan:

sTrack: Secure Tracking in Community Surveillance. 837-840 - Na Zhao

, Richang Hong, Meng Wang, Xuegang Hu, Tat-Seng Chua:
Searching for Recent Celebrity Images in Microblog Platform. 841-844 - Jiajun Wang, Yu-Gang Jiang, Qiang Wang, Kuiyuan Yang, Chong-Wah Ngo

:
Organizing Video Search Results to Adapted Semantic Hierarchies for Topic-based Browsing. 845-848 - Xi Wang, Yu-Gang Jiang, Zhenhua Chai, Zichen Gu, Xinyu Du, Dong Wang:

Real-time summarization of user-generated videos based on semantic recognition. 849-852 - Yunhang Shen

, Rongrong Ji, Donglin Cao, Min Wang:
Hacking Chinese Touclick CAPTCHA by Multi-Scale Corner Structure Model with Fast Pattern Matching. 853-856 - Kai Zhu, Dihong Gong, Zhifeng Li, Xiaoou Tang:

Orthogonal Gaussian Process for Automatic Age Estimation. 857-860 - Ying Zhang, Roger Zimmermann

, Luming Zhang, David A. Shamma:
Points of Interest Detection from Multiple Sensor-Rich Videos in Geo-Space. 861-864 - Arindam Ghosh

, Giuseppe Riccardi:
Recognizing Human Activities from Smartphone Sensor Signals. 865-868 - Honglin Yu, Lexing Xie

, Scott Sanner:
Twitter-driven YouTube Views: Beyond Individual Influencers. 869-872 - Shijie Zhao, Xi Jiang

, Junwei Han, Xintao Hu
, Dajiang Zhu, Jinglei Lv
, Tuo Zhang, Lei Guo, Tianming Liu:
Decoding Auditory Saliency from FMRI Brain Imaging. 873-876 - Huiyuan Fu, Huadong Ma, Hongtian Xiao:

Crowd Counting via Head Detection and Motion Flow Estimation. 877-880 - Prasanth Lade, Troy McDaniel

, Sethuraman Panchanathan:
Semantic feature projection for continuous emotion analysis. 881-884 - Huiyuan Fu, Huadong Ma:

Real-time crowd detection based on gradient magnitude entropy model. 885-888 - Yang Mu, Henry Z. Lo, Wei Ding

, Dacheng Tao
:
Face Recognition from Multiple Images per Subject. 889-892 - Zhisheng Yan, Chang Wen Chen

, Bin Liu:
Admission Control for Wireless Adaptive HTTP Streaming: An Evidence Theory Based Approach. 893-896 - Zheng Yang, Yao Hu, Haifeng Liu, Huajun Chen, Zhaohui Wu:

Matrix Completion for Cross-view Pairwise Constraint Propagation. 897-900 - Yueting Zhuang, Zhou Yu

, Wei Wang, Fei Wu, Siliang Tang
, Jian Shao:
Cross-Media Hashing with Neural Networks. 901-904 - Jie Nie, Peng Cui, Yan Yan, Lei Huang, Zhen Li, Zhiqiang Wei:

How Your Portrait Impresses People?: Inferring Personality Impressions from Portrait Contents. 905-908 - Bart Thomee, José G. Moreno, David A. Shamma:

Who's Time Is It Anyway?: Investigating the Accuracy of Camera Timestamps. 909-912 - Hanqi Wang, Fei Wu, Xi Li, Siliang Tang

, Jian Shao, Yueting Zhuang:
Jointly Discovering Fine-grained and Coarse-grained Sentiments via Topic Modeling. 913-916 - Hao Kuang, Benjamin Guthier, Mukesh Kumar Saini, Dwarikanath Mahapatra, Abdulmotaleb El-Saddik

:
A Real-Time Smart Assistant for Video Surveillance Through Handheld Devices. 917-920
Posters 2
- Jun Chen, Chaokun Wang, Jianmin Wang

:
Modeling the Interest-Forgetting Curve for Music Recommendation. 921-924 - Bahetiyaer Bare, Ke Li, Weiyi Wang, Bo Yan:

Learning to Assess Image Retargeting. 925-928 - Bo Yan, Xiaochu Yang, Ke Li:

Efficient Image Retargeting via Adaptive Pixel Fusion. 929-932 - Haoqiang Fan, Mu Yang, Zhimin Cao, Yuning Jiang, Qi Yin:

Learning Compact Face Representation: Packing a Face into an int32. 933-936 - Yuanlu Xu

, Bingpeng Ma, Rui Huang, Liang Lin:
Person Search in a Scene by Jointly Modeling People Commonness and Person Uniqueness. 937-940 - Tao Zhuo, Peng Zhang, Yanning Zhang, Wei Huang, Hichem Sahli:

Object Tracking using Reformative Transductive Learning with Sample Variational Correspondence. 941-944 - Di Wu, Ling Shao

:
Multimodal Dynamic Networks for Gesture Recognition. 945-948 - Keisuke Doman, Taishi Tomita, Ichiro Ide

, Daisuke Deguchi
, Hiroshi Murase:
Event Detection based on Twitter Enthusiasm Degree for Generating a Sports Highlight Video. 949-952 - Hong Zhang, Junsong Yuan

, Xingyu Gao
, Zhenyu Chen
:
Boosting cross-media retrieval via visual-auditory feature analysis and relevance feedback. 953-956 - Hui-Tang Chang, Yu-Chiang Frank Wang, Ming-Syan Chen

:
Transfer in Photography Composition. 957-960 - Heysem Kaya

, Albert Ali Salah
:
Eyes Whisper Depression: A CCA based Multimodal Approach. 961-964 - Yehia Elkhatib

, Rebecca Killick
, Mu Mu, Nicholas J. P. Race
:
Just Browsing?: Understanding User Journeys in Online TV. 965-968 - Xinyu Ou, Lingyu Yan, Hefei Ling, Cong Liu, Maolin Liu:

Inductive Transfer Deep Hashing for Image Retrieval. 969-972 - Jianguang Zhang, Yahong Han, Jinhui Tang

, Qinghua Hu, Jianmin Jiang:
What Can We Learn about Motion Videos from Still Images? 973-976 - Felix X. Yu, Liangliang Cao, Michele Merler

, Noel C. F. Codella, Tao Chen, John R. Smith, Shih-Fu Chang:
Modeling Attributes from Category-Attribute Proportions. 977-980 - Yang Wang, Xuemin Lin

, Lin Wu
, Wenjie Zhang
, Qing Zhang:
Exploiting Correlation Consensus: Towards Subspace Clustering for Multi-modal Data. 981-984 - Xinyan Lu, Fei Wu, Xi Li, Yin Zhang, Weiming Lu, Donghui Wang, Yueting Zhuang:

Learning Multimodal Neural Network with Ranking Examples. 985-988 - Viet Anh Nguyen, Jiwen Lu

, Minh N. Do
:
Supervised Discriminative Hashing for Compact Binary Codes. 989-992 - Zhenxing Niu, Shiliang Zhang, Xinbo Gao, Qi Tian:

Personalized Visual Vocabulary Adaption for Social Image Retrieval. 993-996 - Xiaochun Cao, Yupeng Cheng, Zhiqiang Tao, Huazhu Fu

:
Co-Saliency Detection via Base Reconstruction. 997-1000 - Hong-Wun Jheng, Bor-Chun Chen, Yan-Ying Chen, Winston H. Hsu:

Automatic Facial Image Annotation and Retrieval by Integrating Voice Label and Visual Appearance. 1001-1004 - Tianxu Ji, Xianglong Liu, Cheng Deng, Lei Huang, Bo Lang:

Query-Adaptive Hash Code Ranking for Fast Nearest Neighbor Search. 1005-1008 - Stefan Wilk, Wolfgang Effelsberg:

Systematic Assessment of the Video Recording Position for User-generated Event Videos. 1009-1012 - Hanhui Li, Donghui Li, Xiaonan Luo:

BAP: Bimodal Attribute Prediction for Zero-Shot Image Categorization. 1013-1016 - Michael Xuelin Huang, Tiffany C. K. Kwok, Grace Ngai

, Hong Va Leong
, Stephen C. F. Chan
:
Building a Self-Learning Eye Gaze Model from User Interaction Data. 1017-1020 - Alexandru-Lucian Gînsca, Adrian Popescu, Bogdan Ionescu, Anil Armagan, Ioannis Kanellos:

Toward an Estimation of User Tagging Credibility for Social Image Retrieval. 1021-1024 - Sicheng Zhao, Hongxun Yao, You Yang, Yanhao Zhang:

Affective Image Retrieval via Multi-Graph Learning. 1025-1028 - Valentin Leveau, Alexis Joly, Olivier Buisson, Pierre Letessier, Patrick Valduriez:

Recognizing Thousands of Legal Entities through Instance-based Visual Classification. 1029-1032 - Evlampios Apostolidis

, Vasileios Mezaris, Mathilde Sahuguet, Benoit Huet
, Barbora Cervenková, Daniel Stein, Stefan Eickeler, José Luis Redondo García, Raphaël Troncy, Lukás Pikora:
Automatic fine-grained hyperlinking of videos within a closed collection using scene segmentation. 1033-1036 - Rui Hu, Carlos Pallan Gayol, Guido Krempel, Jean-Marc Odobez

, Daniel Gatica-Perez
:
Automatic Maya hieroglyph retrieval using shape and context information. 1037-1040 - Justin Salamon

, Christopher Jacoby, Juan Pablo Bello
:
A Dataset and Taxonomy for Urban Sound Research. 1041-1044 - Lin Chen, Peng Zhang, Baoxin Li:

Instructive Video Retrieval Based on Hybrid Ranking and Attribute Learning: A Case Study on Surgical Skill Training. 1045-1048 - Shu Shi, John W. Barrus:

A Real-Time Smart Display Detection System. 1049-1052 - Shuang Ma, Yangyu Fan, Chang Wen Chen

:
Pose Maker: A Pose Recommendation System for Person in the Landscape Photographing. 1053-1056 - Matthew Prockup, Jeffrey J. Scott, Youngmoo E. Kim

:
Representing Musical Patterns via the Rhythmic Style Histogram Feature. 1057-1060 - Kolbeinn Karlsson, Wei Jiang, Dong-Qing Zhang:

Mobile Photo Album Management with Multiscale Timeline. 1061-1064 - Lonce Wyse:

Interactive Audio Web Development Workflow. 1065-1068 - Yang Liu, Yan Liu, Yu Zhao, Kien A. Hua:

What Strikes the Strings of Your Heart?: Multi-Label Dimensionality Reduction for Music Emotion Analysis. 1069-1072 - Bing Xu, Xiaogang Wang, Xiaoou Tang:

Fusing Music and Video Modalities Using Multi-timescale Shared Representations. 1073-1076
Posters 3
- Yang Zhou, Weiyao Lin

, Hang Su, Jianxin Wu, Jinjun Wang, Yu Zhou
:
Representing And Recognizing Motion Trajectories: A Tube And Droplet Approach. 1077-1080 - Huizhong Chen, Matthew Cooper, Dhiraj Joshi, Bernd Girod:

Multi-modal Language Models for Lecture Video Retrieval. 1081-1084 - Hokuto Kagaya, Kiyoharu Aizawa, Makoto Ogawa

:
Food Detection and Recognition Using Convolutional Neural Network. 1085-1088 - Kang Zhao, Hongtao Lu, Yangcheng He, Shaokun Feng:

Locality Preserving Discriminative Hashing. 1089-1092 - Xiaochun Cao, Xingxing Wei, Xiaojie Guo, Yahong Han, Jinhui Tang

:
Augmented Image Retrieval using Multi-order Object Layout with Attributes. 1093-1096 - Klaus Schoeffmann:

The Stack-of-Rings Interface for Large-Scale Image Browsing on Mobile Touch Devices. 1097-1100 - Fabio Celli, Elia Bruni, Bruno Lepri:

Automatic Personality and Interaction Style Recognition from Facebook Profile Pictures. 1101-1104 - Chen Fang, Zhe Lin, Radomír Mech, Xiaohui Shen:

Automatic Image Cropping using Visual Composition, Boundary Simplicity and Content Preservation Models. 1105-1108 - Kezhen Teng, Jinqiao Wang, Min Xu

, Hanqing Lu:
Mask Assisted Object Coding with Deep Learning for Object Retrieval in Surveillance Videos. 1109-1112 - Huiying Liu, Min Xu

, Xiangjian He
, Jinqiao Wang:
Estimate Gaze Density by Incorporating Emotion. 1113-1116 - Chun-Chieh Hsu, Hua-Tsung Chen, Chien-Li Chou, Chien-Peng Ho, Suh-Yin Lee:

Trajectory Based Jump Pattern Recognition in Broadcast Volleyball Videos. 1117-1120 - Masakazu Iwamura

, Nobuaki Matozaki, Koichi Kise:
Fast Instance Search Based on Approximate Bichromatic Reverse Nearest Neighbor Search. 1121-1124 - Xufang Pang

, Ying Cao, Rynson W. H. Lau
, Antoni B. Chan
:
A Robust Panel Extraction Method for Manga. 1125-1128 - Song Wu, Michael S. Lew:

RIFF: Retina-inspired Invariant Fast Feature Descriptor. 1129-1132 - Sabrina Schulte, Chien-Nan (Shannon) Chen, Klara Nahrstedt:

Stevens' Power Law in 3D Tele-immersion: Towards Subjective Modeling of Multimodal Cyber Interaction. 1133-1136 - Zhiqiang Zuo, Yong Luo, Dacheng Tao

, Chao Xu:
Multi-view Multi-task Feature Extraction for Web Image Classification. 1137-1140 - Shanmin Pang, Jianru Xue, Zhanning Gao, Qi Tian:

Image Re-ranking with an Alternating Optimization. 1141-1144 - Edgar Roman-Rangel

, Stéphane Marchand-Maillet:
Automatic Removal of Visual Stop-Words. 1145-1148 - Sho Inaba, Asako Kanezaki, Tatsuya Harada:

Automatic Image Synthesis from Keywords Using Scene Context. 1149-1152 - Noura Al Moubayed

, Yolanda Vazquez-Alvarez, Alex McKay, Alessandro Vinciarelli
:
Face-Based Automatic Personality Perception. 1153-1156 - Chia-Hung Lin, Yan-Ying Chen, Bor-Chun Chen, Yu-Lin Hou, Winston H. Hsu:

Facial Attribute Space Compression by Latent Human Topic Discovery. 1157-1160 - Mohammad Soleymani, Anna Aljanaki, Yi-Hsuan Yang, Michael N. Caro, Florian Eyben, Konstantin Markov, Björn W. Schuller, Remco C. Veltkamp, Felix Weninger, Frans Wiering:

Emotional Analysis of Music: A Comparison of Methods. 1161-1164 - Masaru Mizuochi, Asako Kanezaki, Tatsuya Harada:

Clothing Retrieval Based on Local Similarity with Multiple Images. 1165-1168 - Markus Koskela, Jorma Laaksonen

:
Convolutional Network Features for Scene Recognition. 1169-1172 - Andrew Hines

, Eoin Gillen, Damien Kelly, Jan Skoglund
, Anil C. Kokaram
, Naomi Harte
:
Perceived Audio Quality for Streaming Stereo Music. 1173-1176 - Christos Georgakis

, Stavros Petridis, Maja Pantic:
Discriminating Native from Non-Native Speech Using Fusion of Visual Cues. 1177-1180 - Guoyu Lan, Heng Qi, Keqiu Li, Kai Lin, Wenyu Qu, Zhiyang Li:

A Framework of Mobile Visual Search Based on the Weighted Matching of Dominant Descriptor. 1181-1184 - Che-Chun Lee, Yin-Hsi Kuo, Winston H. Hsu, Shin'ichi Satoh, Sebastian Agethen:

Efficient Cross-Domain Image Retrieval by Multi-Level Matching and Spatial Verification for Structural Similarity. 1185-1188 - Emre Yilmaz

, Konstantinos Rematas, Tinne Tuytelaars
, Hugo Van hamme
:
Learning Like a Toddler: Watching Television Series to Learn Vocabulary from Images and Audio. 1189-1192 - Yu Bao, Jing Yang, Liangliang Cao, Haojie Li, Jinhui Tang

:
Cuteness Recognition and Localization in the Photos of Animals. 1193-1196 - Jun Wu, Wenjing Qiao, Cailiang Kuang, Zhenbao Liu, Shuhui Bu, Junwei Han:

A 3D Fingertips Detecting and Tracking Algorithm based on the Sliding Window. 1197-1200 - Dubravko Culibrk, Nicu Sebe

:
Temporal Dropout of Changes Approach to Convolutional Learning of Spatio-Temporal Features. 1201-1204 - Lorenz Kellerer, Vamsidhar Reddy Gaddam, Ragnar Langseth, Håkon Kvale Stensland, Carsten Griwodz, Dag Johansen, Pål Halvorsen:

Real-Time HDR Panorama Video. 1205-1208 - Yunhua Deng, Siqi Shen, Zhe Huang, Alexandru Iosup

, Rynson W. H. Lau
:
Dynamic Resource Management in Cloud-based Distributed Virtual Environments. 1209-1212 - Masayuki Furukawa, Yasuhiro Akagi, Yukiko Kawai, Hiroshi Kawasaki:

Interactive 3D Animation Creation and Viewing System based on Motion Graph and Pose Estimation Method. 1213-1216 - Kentaro Yamada, Hiroshi Sankoh, Sei Naito:

Color Transfer based on Spatial Structure for Telepresence. 1217-1220 - Zhen-Peng Bian, Junhui Hou

, Lap-Pui Chau
, Nadia Magnenat-Thalmann
:
Human Computer Interface for Quadriplegic People Based on Face Position/gesture Detection. 1221-1224 - Lasse Farnung Laursen, Masataka Goto

, Takeo Igarashi:
A Multi-Touch DJ Interface with Remote Audience Feedback. 1225-1228
Tutorials
- Wanmin Wu, Cha Zhang:

Immersive 3D Communication. 1229-1230 - Christian Timmerer

, Ali C. Begen
:
Over-the-Top Content Delivery: State of the Art and Challenges Ahead. 1231-1232 - Yi Yu, Kiyoharu Aizawa, Toshihiko Yamasaki, Roger Zimmermann

:
Emerging Topics on Personalized and Localized Multimedia Information Systems. 1233-1234 - Lexing Xie

, Haixun Wang:
Learning Knowledge Bases for Text and Multimedia. 1235-1236 - Peng Cui, Lexing Xie

, Jitao Sang, Changsheng Xu:
Social multimedia computing. 1237-1238 - Vasileios Mezaris, Benoit Huet

:
Video hyperlinking. 1239-1240 - David A. Shamma, Daragh Byrne:

An Introduction to Arts and Digital Culture Inside Multimedia. 1241-1242
Workshop Summaries
- Michel F. Valstar

, Björn W. Schuller
, Jarek Krajewski, Roddy Cowie
, Maja Pantic:
AVEC 2014: the 4th international audio/visual emotion challenge and workshop. 1243-1244 - Fabio Celli, Bruno Lepri, Joan-Isaac Biel, Daniel Gatica-Perez

, Giuseppe Riccardi, Fabio Pianesi:
The Workshop on Computational Personality Recognition 2014. 1245-1246 - Judith A. Redi, Mathias Lux:

CrowdMM14 - 2014 International ACM Workshop on Crowdsourcing for Multimedia. 1247-1248 - M. Anwar Hossain

, Abdulmotaleb El-Saddik
:
EMASC14: 1st International Workshop on Emerging Multimedia Applications and Services for Smart Cities. 1249-1250 - Liangliang Cao, Gerald Friedland, Lexing Xie

:
GeoMM 2014: the third ACM multimedia workshop ongeotagging and its applications in multimedia. 1251-1252 - Ansgar Scherp

, Vasileios Mezaris, Bogdan Ionescu, Francesco G. B. De Natale:
HuEvent'14: 2014 workshop on human-centered event understanding from multimedia. 1253-1254 - Teresa Chambel

, Paula Viana
, V. Michael Bove Jr., Sharon Strover
, Graham Thomas
:
ImmersiveMe'14: 2nd ACM international workshop on immersive media experiences. 1255-1256 - Roger Zimmermann, Yi Yu:

WISMM'14 - First ACM International Workshop on Internet-Scale Multimedia Management. 1257-1258 - Pablo César

, David A. Shamma, Matthew Cooper, Aisling Kelliher
:
3rd International Workshop on Socially-Aware Multimedia (SAM'14). 1259-1260 - Concetto Spampinato, Vasileios Mezaris, Marco Cristani:

Summary Abstract for the 3rd ACM International Workshop on Multimedia Analysis for Ecological Data. 1261-1262 - Hari Kalva, Homer H. Chen

, Gerardo Fernández-Escribano, Velibor Adzic:
PIVP 2014: First International Workshop on Perception Inspired Video Processing. 1263-1264 - Wolfgang Effelsberg, Stefan Göbel:

Serious Games 2014: International Workshop on Serious Games. 1265-1266

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














