20th ISM 2018: Taichung, Taiwan
- 2018 IEEE International Symposium on Multimedia, ISM 2018, Taichung, Taiwan, December 10-12, 2018. IEEE Computer Society 2018, ISBN 978-1-5386-6857-3
Session 1: Bio-Related Applications
- Tor Jan Derek Berstad, Michael Riegler, Håvard Espeland, Thomas de Lange, Pia Helen Smedsrud, Konstantin Pogorelov, Håkon Kvale Stensland, Pål Halvorsen:
Tradeoffs Using Binary and Multiclass Neural Network Classification for Medical Multidisease Detection. 1-8
- Yuting Yang, Peisong Shen, Chi Chen:
A Robust Iris Segmentation Using Fully Convolutional Network with Dilated Convolutions. 9-16
- Meghna Ayyar, Puneet Mathur, Rajiv Ratn Shah, Shree G. Sharma:
Harnessing AI for Kidney Glomeruli Classification. 17-20
- Shimaa A. Abdelrahman, Moataz M. Abdelwahab, Mohammed Sharaf Sayed:
Malignancy Classification of Lung Nodule Based on Accumulated Multi Planar Views and Canonical Correlation Analysis. 21-24
Session 2: Image, Video, and Other Applications
- Wen-Cheng Lai, Sheng-Lyang Jang, Jyun-Jhih Wang:
Voltage Controlled Oscillator with Impedance Spectroscopy for Non-invasive Glucose Application. 25-28
- Martin Oelsch, Basak Gülecyüz, Eckehard G. Steinbach:
MID: A Novel Contrast Metric for the MSER Detector. 29-35
- Akinori Sato, Takatsugu Hirayama, Keisuke Doman, Yasutomo Kawanishi, Ichiro Ide, Daisuke Deguchi, Hiroshi Murase:
Gaze-Inspired Learning for Estimating the Attractiveness of a Food Photo. 36-43
- Wen-Chih Lo, Chih-Yuan Huang, Cheng-Hsin Hsu:
Edge-Assisted Rendering of 360° Videos Streamed to Head-Mounted Virtual Reality. 44-51
- Petra Budíková, Michal Batko, Pavel Zezula:
Multi-modal Image Retrieval for Search-Based Image Annotation with RF. 52-60
- Divya Sharma, Nitin Gupta, Chiranjoy Chattopadhyay, Sameep Mehta:
REXplore: A Sketch Based Interactive Explorer for Real Estates Using Building Floor Plan Images. 61-64
- Robert Skupin, Yago Sanchez, Lei Jiao, Cornelius Hellge, Thomas Schierl:
Tile-Based Rate Assignment for 360-Degree Video Based on Spatio-Temporal Activity Metrics. 65-68
- Qi Lou, Somdeb Sarkhel, Saayan Mitra, Viswanathan Swaminathan:
Content-Based Effectiveness Prediction of Video Advertisements. 69-72
Session 3: Best Paper Candidates
- Li Ren, Kien A. Hua:
Improved Image Description Via Embedded Object Structure Graph and Semantic Feature Matching. 73-80
- Mattis Jeppsson, Håvard Espeland, Tomas Kupka, Ragnar Langseth, Andreas Petlund, Peng Qiaoqiao, Chuansong Xue, Konstantin Pogorelov, Michael Riegler, Dag Johansen, Carsten Griwodz, Pål Halvorsen:
Efficient Live and on-Demand Tiled HEVC 360 VR Video Streaming. 81-88
- Mariem Ben Yahia, Yannick Le Louédec, Gwendal Simon, Loutfi Nuaymi:
HTTP/2-Based Streaming Solutions for Tiled Omnidirectional Videos. 89-96
Session 4: Deep Learning
- Chun-Fu Chen, Jinwook Oh, Quanfu Fan, Marco Pistoia:
SC-Conv: Sparse-Complementary Convolution for Efficient Model Utilization on CNNs. 97-100
- Zheng Wu, Naimul Mefraz Khan, Lei Gao, Ling Guan:
Deep Reinforcement Learning with Parameterized Action Space for Object Detection. 101-104
- Gabriel Mittag, Sebastian Möller:
Non-intrusive Estimation of Packet Loss Rates in Speech Communication Systems Using Convolutional Neural Networks. 105-109
- Ehab M. Ibrahim, Emad Badry, Ahmed M. Abdelsalam, Ibrahim L. Abdalla, Mohammed Sayed, Hossam M. H. Shalaby:
Neural Networks Based Fractional Pixel Motion Estimation for HEVC. 110-113
Session 5: Video Encoding & Quality
- Saeed Shafiee Sabet, Steven Schmidt, Saman Zadtootaghaj, Carsten Griwodz, Sebastian Möller:
Towards Applying Game Adaptation to Decrease the Impact of Delay on Quality of Experience. 114-121
- Falk Ralph Schiffner, Vladimir Bondarenko, Sebastian Möller:
Investigation of Video Quality Dimensions for Different Type of Video Content. 122-126
- Ramin Ghaznavi Youvalari, Alireza Aminlou:
Geometry-Based Motion Vector Scaling for Omnidirectional Video Coding. 127-130
- Saman Zadtootaghaj, Nabajeet Barman, Steven Schmidt, Maria G. Martini, Sebastian Möller:
NR-GVQM: A No Reference Gaming Video Quality Metric. 131-134
- Santiago De-Luxán-Hernández, Heiko Schwarz, Detlev Marpe, Thomas Wiegand:
Fast Line-Based Intra Prediction for Video Coding. 135-138
- Yusuke Sakamoto, Shintaro Saika, Masaru Takeuchi, Tatsuya Nagashima, Zhengxue Cheng, Kenji Kanai, Jiro Katto, Kaijin Wei, Ju Zengwei, Xu Wei:
Light-Weight Video Coding Based on Perceptual Video Quality for Live Streaming. 139-142
- Donghuo Zeng, Yi Yu, Keizo Oyama:
Audio-Visual Embedding for Cross-Modal Music Video Retrieval through Supervised Deep CCA. 143-150
Session 6: Audio, Music, Speech
- Katunobu Itou, Daiki Tanaka:
Automatic Electronic Organ Reduction System Based on Melody Clustering Considering Melodic and Instrumental Characteristics. 151-158
- Yaman Kumar, Rohit Jain, Khwaja Mohd. Salik, Rajiv Ratn Shah, Roger Zimmermann, Yifang Yin:
MyLipper: A Personalized System for Speech Reconstruction using Multi-view Visual Feeds. 159-166
- Ryuka Nanzaka, Tsuyoshi Kitamura, Tetsuya Takiguchi, Yuji Adachi, Kiyoto Tai:
Spectrum Enhancement of Singing Voice Using Deep Learning. 167-170
- Yohei Fuse, Yusuke Yasumi, Tetsuya Takiguchi:
Sound Recovery Considering the Vibration Direction of an Object in a Video. 171-174
- Rafael Zequeira Jiménez, Gabriel Mittag, Sebastian Möller:
Effect of Number of Stimuli on Users Perception of Different Speech Degradations. A Crowdsourcing Case Study. 175-179
3-Day Poster Showcases
- Takuya Kobayashi, Akira Kubota, Yusuke Suzuki:
Audio Feature Extraction Based on Sub-Band Signal Correlations for Music Genre Classification. 180-181
- M.-H. Kim, S.-H. Chae, J.-S. Kim:
A Burn-in Potential Region Detection Method for the OLED panel displays. 182-183
- Joni Rasanen, Marko Viitanen, Jarno Vanne, Timo D. Hämäläinen:
Live Demonstration: Kvazzup 4K HEVC Video Call. 184-185
- Joose Sainio, Arttu Ylä-Outinen, Marko Viitanen, Jarno Vanne, Timo Hämäläinen:
Eye-Controlled Region of Interest HEVC Encoding. 186-187
- Yi Yu, Samuel Beuret, Donghuo Zeng, Keizo Oyama:
Deep Learning of Human Perception in Audio Event Classification. 188-189
- Florian Schniederjann, Jana Krahe, Tobias Guth, Johanna Wendel, Robert Mertens:
Using Linear and Non-linear Magnifiers in Eyetracking-Based Human Computer Interaction. 190-191
- Fan Liu, Zewen Li, Xueyi Li, Tanyue Lv:
A Text-Based CAPTCHA Cracking System with Generative Adversarial Networks. 192-193
Workshop: The Second IEEE International Workshop on Machine Learning and Computing for Visual Semantic Analysis (MLCSA)
- Pin-Hsien Liu, Zhen-You Lian, Chih-Yang Lin, Cheng-Hung Chuang, Chung-Lin Huang, Yuan-Yu Tsai:
Two Staged Machine Learning Network for Spine Segmentation and Recognition. 194-197
- Minori Uno, Xian-Hua Han, Yen-Wei Chen:
Comprehensive Study of Multiple CNNs Fusion for Fine-Grained Dog Breed Categorization. 198-203
- Honoka Kakimoto, Yuanyuan Wang, Yukiko Kawai, Kazutoshi Sumiya:
Extraction of Movie Trailer Biases Based on Editing Features for Trailer Generation. 204-208
- Ashok Shrestha, Truong X. Tran, Ramazan S. Aygün, Marc L. Pusey:
Mobile Scanner for Protein Crystallization Plates. 209-214
Workshop: Multimodal Representation, Retrieval, and Analysis of Multimedia Content in Social Media (MR2ARMC)
- Yue Jiang, Mun-Cheon Kang, Ming Fan, Sung-Ho Chae, Sung-Jea Ko:
A Novel Relative Camera Motion Estimation Algorithm with Applications to Visual Odometry. 215-216
- Yaman Kumar, Agniv Sharma, Abhigyan Khaund, Akash Kumar, Ponnurangam Kumaraguru, Rajiv Ratn Shah, Roger Zimmermann:
IceBreaker: Solving Cold Start Problem for Video Recommendation Engines. 217-222
- Zeeshan Ahmad, Naimul Mefraz Khan:
Towards Improved Human Action Recognition Using Convolutional Neural Networks and Multimodal Fusion of Depth and Inertial Sensor Data. 223-230
- Márcio Ferreira Moreno, Wallas Henrique Sousa dos Santos, Rodrigo Costa Mesquita Santos, Patricia Torres Pereira Carrion, Renato Fontoura de Gusmão Cerqueira:
Supporting Multimedia Retrieval in Annotated Content using Hyperknowledge. 231-238
- Márcio Ferreira Moreno, Wallas Henrique Sousa dos Santos, Rodrigo Costa Mesquita Santos, Renato Fontoura de Gusmão Cerqueira:
Supporting Knowledge Creation through HAS: The Hyperknowledge Annotation System. 239-246
Workshop: Multimedia Technologies for E-Learning (MTEL)
- Florian Schimanke, Robert Mertens, Bettina Sophie Huck:
Player Types in Mobile Learning Games - Playing Patterns and Motivation. 247-252
- Hanjian Song, Lihua Tian, Chen Li:
3D Convolutional Network Based Foreground Feature Fusion. 253-258
- Setia Budi, Oscar Karnalim, Erico D. Handoyo, Sulaeman Santoso, Hapnes Toba, Huyen Nguyen, Vishv Malhotra:
IBAtS - Image Based Attendance System: A Low Cost Solution to Record Student Attendance in a Classroom. 259-266
Workshop: The First IEEE International Workshop on State-of-the-art Speech Technologies in Multimedia and Mobile Environments (STeMME)
- Jun-Xiang Xu, Tzu-Ching Lin, Tsai-Ching Yu, Tzu-Chiang Tai, Pao-Chi Chang:
Acoustic Scene Classification Using Reduced MobileNet Architecture. 267-270
- Sylvio Barbon Junior, Victor G. Turrisi da Costa, Shi-Huang Chen, Rodrigo Capobianco Guido:
U-Healthcare System for Pre-Diagnosis of Parkinson's Disease from Voice Signal. 271-274
- Nguyen Le, Sih-Huei Chen, Tzu-Chiang Tai, Jia-Ching Wang:
Single-Channel Speech Separation Based on Gaussian Process Regression. 275-278
- Khoa Pho, Muhamad Kamal Mohammed Amin, Atsuo Yoshitaka:
Segmentation-Driven RetinaNet for Protozoa Detection. 279-286
- Christian Herglotz, David Muller, Andreas Weinlich, Frank Bauer, Michael Ortner, Marc Stamminger, André Kaup:
Improving HEVC Encoding of Rendered Video Data Using True Motion Information. 287-290
- Salah Rabba, Matthew J. Kyan, Lei Gao, Azhar Quddus, Ali Shahidi Zandi, Ling Guan:
Discriminative Robust Gaze Estimation Using Kernel-DMCCA Fusion. 291-298
- Kari Siivonen, Joose Sainio, Marko Viitanen, Jarno Vanne, Timo D. Hämäläinen:
Open framework for error-compensated gaze data collection with eye tracking glasses. 299-302