


ICME 2019: Shanghai, China
- IEEE International Conference on Multimedia and Expo, ICME 2019, Shanghai, China, July 8-12, 2019. IEEE 2019, ISBN 978-1-5386-9552-4

Oral Sessions
Best Paper Session
- Yu Hao, Yanwei Fu, Yu-Gang Jiang, Qi Tian: An End-to-End Architecture for Class-Incremental Object Detection with Knowledge Distillation. 1-6
- Zunjie Zhu, Feng Xu, Chenggang Yan, Xinhong Hao, Xiangyang Ji, Yongdong Zhang, Qionghai Dai: Real-time Indoor Scene Reconstruction with RGBD and Inertial Input. 7-12
- Changde Du, Changying Du, Huiguang He: Doubly Semi-Supervised Multimodal Adversarial Learning for Classification, Generation and Retrieval. 13-18
- Yihang Lou, Ling-Yu Duan, Yong Luo, Ziqian Chen, Tongliang Liu, Shiqi Wang, Wen Gao: Towards Digital Retina in Smart Cities: A Model Generation, Utilization and Communication Paradigm. 19-24
O-01: Content Recommendation and Cross-modal Hashing
- Zhenhua Tan, Danke Wu, Liangliang He, Qiuyun Chang, Bin Zhang: SDP: An Improved Baseline Estimation Model Based On Standard Deviation Proportion. 25-30
- Jie Chen, Yang Liu, Shu Zhao, Yanping Zhang: Citation Recommendation Based on Weighted Heterogeneous Information Network Containing Semantic Linking. 31-36
- Li Wang, Lei Zhu, En Yu, Jiande Sun, Huaxiang Zhang: Fusion-Supervised Deep Cross-Modal Hashing. 37-42
- Wei Chen, Nan Pu, Yu Liu, Erwin M. Bakker, Michael S. Lew: Domain Uncertainty Based On Information Theory for Cross-Modal Hash Retrieval. 43-48
O-02: Development of Multimedia Standards and Related Research
- Eurico Lopes, João Ascenso, Catarina Brites, Fernando Pereira: Adaptive Plane Projection for Video-Based Point Cloud Coding. 49-54
- Ting Fu, Hao Zhang, Fan Mu, Huanbang Chen: Fast CU Partitioning Algorithm for H.266/VVC Intra-Frame Coding. 55-60
- Ting Fu, Hao Zhang, Fan Mu, Huanbang Chen: Two-Stage Fast Multiple Transform Selection Algorithm for VVC Intra Coding. 61-66
- Junru Li, Meng Wang, Li Zhang, Kai Zhang, Hongbin Liu, Shiqi Wang, Siwei Ma, Wen Gao: History-Based Motion Vector Prediction for Future Video Coding. 67-72
O-03: Classification and Low Shot Learning
- Jingcai Guo, Song Guo: AMS-SFE: Towards an Alignment of Manifold Structures via Semantic Feature Expansion for Zero-shot Learning. 73-78
- Xuefeng Du, Dexing Zhong, Pengna Li: Low-Shot Palmprint Recognition Based on Meta-Siamese Network. 79-84
- Zihan Ye, Fan Lyu, Linyan Li, Qiming Fu, Jinchang Ren, Fuyuan Hu: SR-GAN: Semantic Rectifying Generative Adversarial Network for Zero-shot Learning. 85-90
- Huaxi Huang, Junjie Zhang, Jian Zhang, Qiang Wu, Jingsong Xu: Compare More Nuanced: Pairwise Alignment Bilinear Network for Few-Shot Fine-Grained Learning. 91-96
O-04: 3D Media Computing
- Gerasimos Arvanitis, Aris S. Lalos, Konstantinos Moustakas: Feature-Aware and Content-wise Denoising of 3D Static and Dynamic Meshes using Deep Autoencoders. 97-102
- Xinyu Wei, Jun Huang, Xiaoyuan Ma: Real-Time Monocular Visual SLAM by Combining Points and Lines. 103-108
- Chuanpu Li, Xin Jin, Junke Li, Qionghai Dai: F-Number Adaptation for Maximizing the Sensor Usage of Light Field Cameras. 109-114
- Xufu Sun, Xin Jin, Pei Wang, Yanqin Chen, Qionghai Dai: Blind Calibration for Focused Plenoptic Cameras. 115-120
O-05: Special Session "Pedestrian Detection, Tracking and Reidentification in Videos"
- Peizhen Zhang, Feng Zheng, Junlong Du, Jun Zhang, Xiaowei Guo, Wei-Shi Zheng: Particle Swarm Loss for Lightweight Object Detection. 121-126
- Qiang Fu, Linsen Dong, Ziyuan Liu, Yong Luo, Yonggang Wen, Ying Li, Ling-Yu Duan: Incorporating Category Taxonomy in Deep Reinforcement Learning Based Image Hashing. 127-132
- Ji Hu, Chenggang Yan, Xin Liu, Jiyong Zhang, Dongliang Peng, Yi Yang: Truncated Gradient Confidence-Weighted Based Online Learning for Imbalance Streaming Data. 133-138
- Mohamed A. Kassab, Ali Maher, Fathy Elkazzaz, Baochang Zhang: UAV Target Tracking By Detection via Deep Neural Networks. 139-144
O-06: Special Session "Multimedia Technologies Empowering Retail Experiences"
- Shan An, Zhibiao Huang, Guangfu Che, Xianglong Liu, Xin Ma, Yu Chen: Quarter-Point Codeword Expansion for Product Quantization. 145-150
- Minghui Zhang, Yumeng Liang, Huadong Ma: Context-Aware Affective Graph Reasoning for Emotion Recognition. 151-156
- Weibo Zhang, Fuqing Zhu, Jiao Dai, Songlin Hu, Jizhong Han, Tao Guo: SPL: Exploiting Unlabeled Data for Multi-label Image Classification. 157-162
- Yu Zhou, Shancheng Fang, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang: MLTS: A Multi-Language Scene Text Spotter. 163-168
O-07: 3D and Low Level Vision
- Xinchen Ye, Mingliang Zhang, Rui Xu, Wei Zhong, Xin Fan, Zhu Liu, Jiaao Zhang: Unsupervised Monocular Depth Estimation Based on Dual Attention Mechanism and Depth-Aware Loss. 169-174
- Gang Fu, Qing Zhang, Chunxia Xiao: Towards High-Quality Intrinsic Images in the Wild. 175-180
- Shuosen Guan, Haoxin Li, Wei-Shi Zheng: Unsupervised Learning for Optical Flow Estimation Using Pyramid Convolution LSTM. 181-186
- Yuan Gao, Robert Bregovic, Atanas P. Gotchev, Reinhard Koch: MAST: Mask-Accelerated Shearlet Transform for Densely-Sampled Light Field Reconstruction. 187-192
O-08: Object Detection I
- Li Wang, Yongbo Li, Xiangyang Xue: CODA: Counting Objects via Scale-Aware Adversarial Density Adaption. 193-198
- Chunbiao Zhu, Xing Cai, Kan Huang, Thomas H. Li, Ge Li: PDNet: Prior-Model Guided Depth-Enhanced Network for Salient Object Detection. 199-204
- Qi Yuan, Bingwang Zhang, Haojie Li, Zhihui Wang, Zhongxuan Luo, Wei Zhong: Continuous Scale Adaption for Efficient Box-Based Scene Text Detection. 205
- Xiaobao Guo, Jinxing Li, Bingzhi Chen, Guangming Lu: Mask-Most Net: Mask Approximation Based Multi-oriented Scene Text Detection Network. 206-211
O-09: Emerging Applications of Deep Learning
- Junhao Huang, Lin Zhang, Ying Shen, Huijuan Zhang, Shengjie Zhao, Yukai Yang: DMPR-PS: A Novel Approach for Parking-Slot Detection Using Directional Marking-Point Regression. 212-217
- Yong-Xiang Lin, Daniel Stanley Tan, Wen-Huang Cheng, Kai-Lung Hua: Adapting Semantic Segmentation of Urban Scenes via Mask-Aware Gated Discriminator. 218-223
- Maomao Li, Chun Yuan, Zhihui Lin, Zhuobin Zheng, Yangyang Cheng: Stochastic Video Generation with Disentangled Representations. 224-229
- Jianjin Zhang, Yunbo Wang, Mingsheng Long, Jianmin Wang, Philip S. Yu: Z-Order Recurrent Neural Networks for Video Prediction. 230-235
O-10: Multimedia Quality Assessment and Enhancement
- Yingru Liu, Dongliang Xie, Xin Wang: Energy-Based Recurrent Model for Stochastic Modeling of Music. 236-241
- Huaixuan Zhang, Yuhai Lan, Tao Dai, Ruizhi Qiao, Ying Xu, Yao Yao, Shu-Tao Xia: Residual Frame for Noisy Video Classification According to Perceptual Quality in Convolutional Neural Networks. 242-247
- Guanqun Hou, Yujiu Yang, Jing-Hao Xue: Residual Dilated Network with Attention for Image Blind Denoising. 248-253
- Zhuopeng Li, Xiaoyan Zhang: Collaborative Deep Reinforcement Learning for Image Cropping. 254-259
O-11: Multimedia for Society and Health
- Penghui Sun, Hao Liu, Xin Wang, Zhenhua Yu, Suping Wu: Similarity-Aware Deep Adversarial Learning for Facial Age Estimation. 260-265
- Yinghong Liao, Bin Qiu, Zhuo Su, Ruomei Wang, Xiangjian He: Learning Transmission Filtering Network for Image-Based PM2.5 Estimation. 266-271
- Yuan Tian, Xiongkuo Min, Guangtao Zhai, Zhiyong Gao: Video-Based Early ASD Detection via Temporal Pyramid Networks. 272-277
- Ying Zhang, Yinjia Zhang, Qinpei Zhao, Weixiong Rao: Automatic User Categorization Through Large Transaction Data. 278-283
O-12: Immersive Media
- Junkun Qi, Wei Hu, Zongming Guo: Feature Preserving and Uniformity-Controllable Point Cloud Simplification on Graph. 284-289
- Jun Fu, Xiaoming Chen, Zhizheng Zhang, Shilin Wu, Zhibo Chen: 360SRL: A Sequential Reinforcement Learning Approach for ABR Tile-Based 360 Video Streaming. 290-295
- Falah Jabar, João Ascenso, Maria Paula Queluz: Content-Aware Perspective Projection Optimization for Viewport Rendering of 360° Images. 296-301
- Ziming Wu, Jiabin Guo, Shuangli Zhang, Chen Zhao, Xiaojuan Ma: An AR Benchmark System for Indoor Planar Object Tracking. 302-307
O-13: 3D and Stereo Computing
- Zhenchao Wu, Kun Li, Yu-Kun Lai, Jingyu Yang: Global as-Conformal-as-Possible Non-Rigid Registration of Multi-view Scans. 308-313
- Zhengning Wang, Longfei Feng, Fanwei Zeng, Guang Hu, Xiang Zhang, Xia Lv, Fengjun Zhang: A Light-Weighted Network for Facial Landmark Detection via Combined Heatmap and Coordinate Regression. 314-319
- Xianzhe Xu, Yonghong Hou, Pichao Wang, Zhongyu Jiang, Wanqing Li: Light Weight Stereo Matching via Deep Extraction and Integration of Low and High Level Information. 320-325
- Hongxin Lin, Zelin Xiao, Yang Tan, Hongyang Chao, Shengyong Ding: Justlookup: One Millisecond Deep Feature Extraction for Point Clouds By Lookup Tables. 326-331
O-14: Machine Learning Applications in Image and Video Coding I
- Bo Jiang, Xingyue Jiang, Jin Tang, Bin Luo, Shilei Huang: Multiple Graph Convolutional Networks for Co-Saliency Detection. 332-337
- Lahiru D. Chamain, Sen-ching Samson Cheung, Zhi Ding: Quannet: Joint Image Compression and Classification Over Channels with Limited Bandwidth. 338-343
- Jiawen Gu, Bichuan Guo, Jiangtao Wen: High Efficiency Light Field Compression via Virtual Reference and Hierarchical MV-HEVC. 344-349
- Youfa Liu, Bo Du, Lefei Zhang: Self-Paced Subspace Clustering. 350-355
O-15: Vision, Language and Text Processing
- Xuri Ge, Fuhai Chen, Chen Shen, Rongrong Ji: Colloquial Image Captioning. 356-361
- Yike Wu, Shiwan Zhao, Jia Chen, Ying Zhang, Xiaojie Yuan, Zhong Su: Improving Captioning for Low-Resource Languages by Cycle Consistency. 362-367
- Zhuo Lei, Chao Zhang, Qian Zhang, Guoping Qiu: FrameRank: A Text Processing Approach to Video Summarization. 368-373
- Anna Zhu, Qiyang Zhang, Xiongbo Lu, Shengwu Xiong: Character Image Synthesis Based on Selected Content and Referenced Style Embedding. 374-379
O-16: Media Classification and Segmentation II
- Yujia Liu, Weiming Zhang, Nenghai Yu: Query-Free Embedding Attack Against Deep Learning. 380-386
- Zongmin Li, Jun Zhang, Guanlin Li, Yujie Liu, Siyuan Li: Graph Attention Neural Networks for Point Cloud Recognition. 387-392
- Lu Li, Yang Li, Xiangxiang Xu, Shao-Lun Huang, Lin Zhang: Maximal Correlation Embedding Network for Multilabel Learning with Missing Labels. 393-398
- Zengyuan Guo, Xinzhu Ma, Haojie Li, Zhihui Wang, Pengbo Zhang: Self-Adaption Multi-classifier Fusion Networks for Image Recognition. 399-405
O-17: AI for Human Understanding
- Baohan Xu, Yingbin Zheng, Hao Ye, Caili Wu, Heng Wang, Gufei Sun: Video Emotion Recognition with Concept Selection. 406-411
- Han Zhang, Yonghong Song, Yuanlin Zhang: Graph Convolutional LSTM Model for Skeleton-Based Action Recognition. 412-417
- Zhongwei Qiu, Kai Qiu, Jianlong Fu, Dongmei Fu: Learning Recurrent Structure-Guided Attention Network for Multi-person Pose Estimation. 418-423
- Zhenying Fang, Suguo Zhu, Jun Yu, Qi Tian: PCPCAD: Proposal Complementary Action Detector. 424-429
O-18: Image Quality Metrics
- Leida Li, Hancheng Zhu, Sicheng Zhao, Guiguang Ding, Hongyan Jiang, Allen Tan: Personality Driven Multi-task Learning for Image Aesthetic Assessment. 430-435
- Chen Bai, Amy R. Reibman: Video Quality Temporal Pooling using a Visibility Measure. 436-441
- Yuming Fang, Yan Zeng, Hanwei Zhu, Guangtao Zhai: Image Quality Assessment of Multi-exposure Image Fusion for Both Static and Dynamic Scenes. 442-447
- Sumei Li, Jianwei Xue, Yongtian Han: No-Reference Stereoscopic Image Quality Assessment Based on Local to Global Feature Regression. 448-453
O-19: Multimedia Recommendations
- Wenmian Yang, Wenyuan Gao, Xiaojie Zhou, Weijia Jia, Shaohua Zhang, Yutao Luo: Herding Effect Based Attention for Personalized Time-Sync Video Recommendation. 454-459
- Shang Liu, Zhenzhong Chen: Sequential Behavior Modeling for Next Micro-Video Recommendation with Collaborative Transformer. 460-465
- Dawei Liu, Ying Cao, Rynson W. H. Lau, Antoni B. Chan: ButtonTips: Design Web Buttons with Suggestions. 466-471
- Shengjie Ma, Zheng-Jun Zha, Feng Wu: Knowing User Better: Jointly Predicting Click-Through and Playtime for Micro-Video. 472-477
O-20: Search and Retrieval
- Xin Wen, Zhizhong Han, Xinyu Yin, Yu-Shen Liu: Adversarial Cross-Modal Retrieval via Learning and Transferring Single-Modal Similarities. 478-483
- Zekun Li, Zeyu Cui, Shu Wu, Xiaoyu Zhang, Liang Wang: Semi-Supervised Compatibility Learning Across Categories for Clothing Matching. 484-489
- Kevin Lin, Fan Yang, Qiaosong Wang, Robinson Piramuthu: Adversarial Learning for Fine-Grained Image Search. 490-495
- Lei Qi, Jing Huo, Lei Wang, Yinghuan Shi, Yang Gao: A Mask Based Deep Ranking Neural Network for Person Retrieval. 496-501
O-21: Media Understanding
- Kunal Swami, Kaushik Raghavan, Nikhilanj Pelluri, Rituparna Sarkar, Pankaj Bajpai: DISCO: Depth Inference from Stereo using Context. 502-507
- Yunian Chen, Yanjie Wang, Yang Zhang, Yanwen Guo: PANet: A Context Based Predicate Association Network for Scene Graph Generation. 508-513
- Aming Wu, Yahong Han, Quanxin Zhang, Xiaohui Kuang: Untargeted Adversarial Attack via Expanding the Semantic Gap. 514-519
- Yen-Wei Chang, Wen-Hsiao Peng: Learning Goal-Oriented Visual Dialog Agents: Imitating and Surpassing Analytic Experts. 520-525
O-22: Super-resolution and Enhancement
- Kui Jiang, Zhongyuan Wang, Peng Yi, Junjun Jiang, Guangcheng Wang, Zhen Han, Tao Lu: GAN-Based Multi-level Mapping Network for Satellite Imagery Super-Resolution. 526-531
- Ren Yang, Xiaoyan Sun, Mai Xu, Wenjun Zeng: Quality-Gated Convolutional LSTM for Enhancing Compressed Video. 532-537
- Risheng Liu, Minjun Hou, Jinyuan Liu, Xin Fan, Zhongxuan Luo: Compounded Layer-Prior Unrolling: A Unified Transmission-Based Image Enhancement Framework. 538-543
- Qiang Fu, Wenhan Yang, Ying Li, Jiaying Liu: Deep Pyramid Variation Learning for Image Interpolation. 544-549
O-23: Pose and Action Recognition II
- Zhangxuan Gu, Jianfu Zhang, Ziqi Pan, Haohua Zhao, Liqing Zhang: Clothes Keypoints Localization and Attribute Recognition via Prior Knowledge. 550-555
- Yong Su, Zhiyong Feng: Spatio-Temporal Multi-Factor Discriminant Analysis for Individual Identification. 556-561
- Jianjun Lei, Yalong Jia, Bo Peng, Qingming Huang: Channel-wise Temporal Attention Network for Video Action Recognition. 562-567
- Qichao Xu, John See, Weiyao Lin: Localization Guided Fight Action Detection in Surveillance Videos. 568-573
O-24: Image and Video Enhancements I
- Yue Lu, Zhuqing Jiang, Guodong Ju, Liangheng Shen, Aidong Men: Recursive Multi-Stage Upscaling Network with Discriminative Fusion for Super-Resolution. 574-579
- Yuanfei Huang, Jie Li, Xinbo Gao, Wen Lu, Yanting Hu: Improving Image Super-Resolution via Feature Re-Balancing Fusion. 580-585
- Jinghui Qin, Ziwei Xie, Yukai Shi, Wushao Wen: Difficulty-Aware Image Super Resolution via Deep Adaptive Dual-Network. 586-591
- Xiaoting Du, Yuan Zhou, Yanfang Chen, Yeda Zhang, Jianxing Yang, Dou Jin: Dense-Connected Residual Network for Video Super-Resolution. 592-597
O-25: Face and Person Analysis
- Zhihao Zhang, Liansheng Zhuang, Wengang Zhou, Houqiang Li: Dynamic Cascaded Regression Network with Reinforcement Learning for Robust Face Alignment. 598-603
- Mengyan Li, Yuechuan Sun, Zhaoyu Zhang, Haonian Xie, Jun Yu: Deep Learning Face Hallucination via Attributes Transfer and Enhancement. 604-609
- Junjie Zhu, Xibin Zhao, Han Hu, Yue Gao: Emotion Recognition from Physiological Signals using Multi-Hypergraph Neural Networks. 610-615
- Yue Liao, Si Liu, Tianrui Hui, Chen Gao, Yao Sun, Hefei Ling, Bo Li: GPS: Group People Segmentation with Detailed Part Inference. 616-621
O-26: Media Classification and Segmentation III
- Zhao-Min Chen, Xiu-Shen Wei, Xin Jin, Yanwen Guo: Multi-Label Image Recognition with Joint Class-Aware Map Disentangling and Label Correlation Embedding. 622-627
- Zhengtao Tan, Bin Liu, Weihai Li, Nenghai Yu: Real Time Compressed Video Object Segmentation. 628-633
- Zhihui Wang, Shijie Wang, Pengbo Zhang, Haojie Li, Bo Liu: Accurate And Fast Fine-Grained Image Classification via Discriminative Learning. 634-639
- Zhong Li, Xin Chen, Wangyiteng Zhou, Yingliang Zhang, Jingyi Yu: Pose2Body: Pose-Guided Human Parts Segmentation. 640-645
O-27: Image and Video Enhancements II
- Zhan Shu, Mengcheng Cheng, Biao Yang, Zhuo Su, Xiangjian He: Residual Magnifier: A Dense Information Flow Network for Super Resolution. 646-651
- Xinyu Li, Wei Zhang, Tong Shen, Tao Mei: Everyone is a Cartoonist: Selfie Cartoonization with Attentive Adversarial Networks. 652-657
- Jichun Li, Ke Li, Bo Yan: Scale-Aware Deep Network with Hole Convolution for Blind Motion Deblurring. 658-663
- Tie Liu, Mai Xu, Zulin Wang: Removing Rain in Videos: A Large-Scale Database and a Two-Stream ConvLSTM Approach. 664-669
O-28: Multimedia Learning and Adaptation
- Zhengyuan Pang, Lifeng Sun, Tianchi Huang, Zhi Wang, Shiqiang Yang: Towards QoS-Aware Cloud Live Transcoding: A Deep Reinforcement Learning Approach. 670-675
- Ding Ma, Xiangqian Wu: High Speed Recurrent Regression Network for Visual Tracking. 676-681
- Yanmin Shang, Zhezhou Kang, Yanan Cao, Dongjie Zhang, Yang Li, Yangxi Li, Yanbing Liu: PAAE: A Unified Framework for Predicting Anchor Links with Adversarial Embedding. 682-687
- Ying Li, Lin Cheng, Yaxin Peng, Zhijie Wen, Shihui Ying: Manifold Alignment and Distribution Adaptation for Unsupervised Domain Adaptation. 688-693
O-29: Person (Re-)Identification and People Detection
- Hui Li, Meng Yang, Zhihui Lai, Weishi Zheng, Zitong Yu: Pedestrian re-Identification Based on Tree Branch Network with Local and Global Learning. 694-699
- Zheng Liu, Jie Qin, Annan Li, Yunhong Wang, Luc Van Gool: Adversarial Binary Coding for Efficient Person Re-Identification. 700-705
- Yingzhi Tang, Xi Yang, Nannan Wang, Xinrui Jiang, Bin Song, Xinbo Gao: Person re-Identification with Gradual Background Suppression. 706-711
- Yingxin Zhu, Xiaoqiang Guo, Jianlei Liu, Zhuqing Jiang: Multi-Branch Context-Aware Network for Person Re-Identification. 712-717
O-30: Multimedia and Language II
- Fenxiao Chen, Angela Wang, C.-C. Jay Kuo: Post-Processing of Word Representations via Variance Normalization and Dynamic Embedding. 718-723
- Dong Zhang, Liangqing Wu, Shoushan Li, Qiaoming Zhu, Guodong Zhou: Multi-Modal Language Analysis with Hierarchical Interaction-Level and Selection-Level Attentions. 724-729
- Dong Zhang, Shoushan Li, Qiaoming Zhu, Guodong Zhou: Modeling the Clause-Level Structure to Multimodal Sentiment Analysis via Reinforcement Learning. 730-735
- Jianming Wang, Wei Deng, Yukuan Sun, Yuanyuan Li, Kai Wang, Guanghao Jin: Twice Opportunity Knocks Syntactic Ambiguity: A Visual Question Answering Model with yes/no Feedback. 736-741
O-31: Multimedia Communications and Localization
- Bin Sun, Chen Chen, Yingying Zhu, Jianmin Jiang: GEOCAPSNET: Ground to Aerial View Image Geo-Localization using Capsule Network. 742-747
- Bo Wang, Fengyuan Ren: Improving Robustness of DASH Against Network Uncertainty. 748-753
- Bo Wang, Fengyuan Ren, Chao Zhou: Hybrid Control-Based ABR: Towards Low-Delay Live Streaming. 754-759
- Zhilin Qiu, Lingbo Liu, Guanbin Li, Qing Wang, Nong Xiao, Liang Lin: Taxi Origin-Destination Demand Prediction with Contextualized Spatial-Temporal Network. 760-765
O-32: Multimedia Security, Privacy and Forensics II
- Sahib Khan, Tiziano Bianchi: Fast Image Clustering Based on Camera Fingerprint Ordering. 766-771
- Xin Xu, Quanwei Cai, Jingqiang Lin, Shiran Pan, Liangqin Ren: Enforcing Access Control in Distributed Version Control Systems. 772-777
- Peixuan He, Kaiping Xue, Jie Xu, Qiudong Xia, Jianqing Liu, Hao Yue: Attribute-Based Accountable Access Control for Multimedia Content with In-Network Caching. 778-783
- Liyue Fan: Practical Image Obfuscation with Provable Privacy. 784-789
O-33: Multimedia Sensing and Signal Processing
- Zhenwen Liang, Dongyang Zhang, Jie Shao: Jointly Solving Deblurring and Super-Resolution Problems with Dual Supervised Network. 790-795
- Michael Gref, Christoph Schmidt, Sven Behnke, Joachim Köhler: Two-Staged Acoustic Modeling Adaption for Robust Speech Recognition by the Example of German Oral History Interviews. 796-801
- Yang Zhang, Huiming Zhang, Yanwen Guo, Kai Lin, Jingwu He: An Adaptive Affinity Graph with Subspace Pursuit for Natural Image Segmentation. 802-807
- Li He, Yi Zhou, Hongqing Liu: Phase Time-Frequency Masking Based Speech Enhancement Algorithm Using Circular Microphone Array. 808-813
O-34: Detection and Recognition
- Yanyan Fang, Biyun Zhan, Wandi Cai, Shenghua Gao, Bo Hu: Locality-Constrained Spatial Transformer Network for Video Crowd Counting. 814-819
- Yixin Li, Shengqin Tang, Yun Ye, Jinwen Ma: Spatial-Aware Non-Local Attention for Fashion Landmark Detection. 820-825
- Wu Zheng, Lin Li, Zhaoxiang Zhang, Yan Huang, Liang Wang: Relational Network for Skeleton-Based Action Recognition. 826-831
- Weipeng Lin, Yidong Li, Xiaoliang Yang, Peixi Peng, Junliang Xing: Multi-View Learning for Vehicle Re-Identification. 832-837
O-35: Multi-modal Media Computing and Human-machine Interaction
- Yi Zhang, Cheng Zeng, Hao Cheng, Chongjun Wang, Lei Zhang: Many Could be Better Than All: A Novel Instance-Oriented Algorithm for Multi-modal Multi-label Problem. 838-843
- Benchao Li, Zhenzhong Chen, Shan Li, Wei-Shi Zheng: Affective Video Content Analyses by Using Cross-Modal Embedding Learning Features. 844-849
- Xiaolong Zhou, Jianing Lin, Jiaqi Jiang, Shengyong Chen: Learning A 3D Gaze Estimator with Improved Itracker Combined with Bidirectional LSTM. 850-855
- Jingda Guo, Xianwei Cheng, Qi Chen, Qing Yang: Detection of Occluded Road Signs on Autonomous Driving Vehicles. 856-861
Poster Sessions
Poster Session 1 & TMM Poster
- Yingyi Zhang, Lin Zhang, Xiao Liu, Shengjie Zhao, Ying Shen, Yukai Yang: Pay By Showing Your Palm: A Study of Palmprint Verification on Mobile Platforms. 862-867
- Yuze Guo, Wenjing Huang, Yajing Chen, Shikui Tu: Regularize Network Skip Connections by Gating Mechanisms for Electron Microscopy Image Segmentation. 868-873
- Xiaohui Lin, Yi Xu, Mingda Wang, Bingbing Ni, Xiaokang Yang, Guangyu Tao, Xiaodan Ye: Cross Modality Alignment of Medical Volumes using Spatio-Semantic Attentive Cycle-GAN. 874-879
- Bin Yuan, Zongqing Lu, Jing-Hao Xue, Qingmin Liao: A New Approach to Automatic Clothing Matting from Mannequins. 880-885
- Jinlin Wu, Shengcai Liao, Zhen Lei, Xiaobo Wang, Yang Yang, Stan Z. Li: Clustering and Dynamic Sampling Based Unsupervised Domain Adaptation for Person Re-Identification. 886-891
- Fanchao Lin, Chuanbin Liu, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang: Semantic-Embedding and Shape-Aware U-Net for Ultrasound Eyeball Segmentation. 892-897
- Chengpei Xu, Ruomei Wang, Shujin Lin, Xiaonan Luo, Baoquan Zhao, Lijie Shao, Mengqiu Hu: Lecture2Note: Automatic Generation of Lecture Notes from Slide-Based Educational Videos. 898-903
- Jianqiang Liu, Jian Yao, Jingmin Tu, Junhao Cheng: Data-Adaptive Packing Method for Compression of Dynamic Point Cloud Sequences. 904-909
- Pengfei Li, Meng Yang: Semantic GAN: Application for Cross-Domain Image Style Transfer. 910-915
- Paras Maharjan, Li Li, Zhu Li, Ning Xu, Chongyang Ma, Yue Li: Improving Extreme Low-Light Image Denoising via Residual Learning. 916-921
- Yang Gao, Jun Tao, Li Zeng, Xiaoming Fang, Qian Fang, Xiaoyan Li: User Profiling with Campus Wi-Fi Access Trace and Network Traffic. 922-927
- Baoquan Zhao, Songhua Xu, Shujin Lin, Ruomei Wang, Xiaonan Luo: A New Visual Interface for Searching and Navigating Slide-Based Lecture Videos. 928-933
- Pin Fang, Yisen Wang, Yuan Luo: Self-Attentive Networks for one-shot Image Recognition. 934-939
- Tianyi Wu, Sheng Tang, Rui Zhang, Juan Cao, Jintao Li: Tree-Structured Kronecker Convolutional Network for Semantic Segmentation. 940-945
- Yixin Zhu, Jun-Yong Zhu, Wei-Shi Zheng: Part-Based Convolutional Network for Imbalanced Age Estimation. 946-951
- Qiuzheng Chen, Ruoyu Yang: Learning to Distinguish: A General Method to Improve Compare-Based one-shot Learning Frameworks for Similar Classes. 952-957
- Xiaokai Chen, Ke Gao, Juan Cao: Predictability Analyzing: Deep Reinforcement Learning for Early Action Recognition. 958-963
- Junming Chen, Jie Shao, Dongyang Zhang, Xuehui Wu: A Fast End-to-End Method with Style Transfer for Room Layout Estimation. 964-969
- Lijyun Huang, Kate Ching-Ju Lin, Yu-Chee Tseng: Resolving Intra-Class Imbalance for GAN-Based Image Augmentation. 970-975
- Weitong Zhang, Qieshi Zhang, Jun Cheng, Cong Bai, Pengyi Hao: End-to-End Panoptic Segmentation with Pixel-Level Non-Overlapping Embedding. 976-981
- Kaixiang Wang: Robust Embedding Framework with Dynamic Hypergraph Fusion for Multi-label Classification. 982-987
Poster Session 2
- Peirui Cheng, Weiqiang Wang, Yuanqiang Cai: Multi-scale Scene Text Detection via Resolution Transform. 988-993
- Haiyan Wang, Xuejian Rong, Yingli Tian: Towards Accurate Instance-Level Text Spotting with Guided Attention. 994-999
- Youming Deng, Xianming Lin, Run Li, Rongrong Ji: Multi-scale Gem Pooling with N-Pair Center Loss for Fine-Grained Image Search. 1000-1005
- Xingzhi Wang, Xin Liu, Zhikai Hu, Nannan Wang, Wentao Fan, Ji-Xiang Du: Semi-Supervised Semantic-Preserving Hashing for Efficient Cross-Modal Retrieval. 1006-1011
- Haitao Wang, Hui Chen, Min Meng, Jigang Wu: Robust Multi-View Hashing for Cross-Modal Retrieval. 1012-1017
- Siwei Wang, Yongtao Wang, Xiaoran Qin, Qijie Zhao, Zhi Tang: Scene Text Recognition via Gated Cascade Attention. 1018-1023
- Yuyang Wang, Feng Su, Ye Qian: Text-Attentional Conditional Generative Adversarial Network for Super-Resolution of Text Images. 1024-1029
- Fan Ma, Haoyun Yang, Haibing Yin, Xiaofeng Huang, Chenggang Yan, Xiang Meng: Online Learning to Rank in a Listwise Approach for Information Retrieval. 1030-1035
- Yang Mi, Song Wang: Recognizing Micro Actions in Videos: Learning Motion Details via Segment-Level Temporal Pyramid. 1036-1041
- Miao Xin, Shuhang Wang, Jian Cheng: Entanglement Loss for Context-Based Still Image Action Recognition. 1042-1047
- Lu Zhou, Yingying Chen, Jinqiao Wang, Ming Tang, Hanqing Lu: Bi-Directional Message Passing Based Scanet for Human Pose Estimation. 1048-1053
- Jingjun Chen, Yonghong Song, Yuanlin Zhang: Spatial Mask ConvLSTM Network and Intra-Class Joint Training Method for Human Action Recognition in Video. 1054-1059
- Renyi Xiao, Yonghong Hou, Zihui Guo, Chuankun Li, Pichao Wang, Wanqing Li: Self-Attention Guided Deep Features for Action Recognition. 1060-1065
- Yanshan Li, Rongjie Xia, Xing Liu, Qinghua Huang: Learning Shape-Motion Representations from Geometric Algebra Spatio-Temporal Model for Skeleton-Based Action Recognition. 1066-1071
- Yang Bai, Weiqiang Wang: ACPNet: Anchor-Center Based Person Network for Human Pose Estimation and Instance Segmentation. 1072-1077
- Jianyu Yang, Chen Zhu, Junsong Yuan: Spatio-Temporal Multi-scale Soft Quantization Learning for Skeleton-Based Human Action Recognition. 1078-1083
- Wei Sun, Yezhao Fan, Xiongkuo Min, Shihao Peng, Siwei Ma, Guangtao Zhai: LPHD: A Large-Scale Head Pose Dataset for RGB Images. 1084-1089
- Zhengyuan Yang, Yixuan Zhang, Jiebo Luo: Human-Centered Emotion Recognition in Animated GIFs. 1090-1095
- Qize Yang, Ancong Wu, Wei-Shi Zheng: Deep Semi-Supervised Person Re-Identification with External Memory. 1096-1101
- Tanzila Rahman, Mrigank Rochan, Yang Wang: Convolutional Temporal Attention Model for Video-Based Person Re-Identification. 1102-1107
- Zhiyuan Li, Shizhong Han, Ahmed-Shehab Khan, Jie Cai, Zibo Meng, James O'Reilly, Yan Tong: Pooling Map Adaptation in Convolutional Neural Network for Facial Expression Recognition. 1108-1113
- Jianheng Li, Fuhang Liang, Yuanxun Li, Wei-Shi Zheng: Fast Person Search Pipeline. 1114-1119
- Gaoqi He, Zhenwei Ma, Binhao Huang, Bin Sheng, Yubo Yuan: Dynamic Region Division for Adaptive Learning Pedestrian Counting. 1120-1125
- Jing Zhang, Han Sun, Zhe Wang, Tong Ruan: Another Dimension: Towards Multi-subnet Neural Network for Image Sentiment Analysis. 1126-1131
- Pilin Dai, Jinna Lv, Bin Wu: Two-Stage Model for Social Relationship Understanding from Videos. 1132-1137
- Junhao Hu, Lei Jin, Shenghuo Gao: FPN++: A Simple Baseline for Pedestrian Detection. 1138-1143
- Fei Ma, Wei Zhang, Yang Li, Shao-Lun Huang, Lin Zhang: An End-to-End Learning Approach for Multimodal Emotion Recognition: Extracting Common and Private Information. 1144-1149
Poster Session 3 & Demo Session 1
- Haoyu Ma, Juncheng Zhang, Shaojun Liu, Qingmin Liao: Boundary Aware Multi-focus Image Fusion Using Deep Neural Network. 1150-1155
- Chenxi Ma, Weimin Tan, Bahetiyaer Bare, Bo Yan: A Multi-level Aggregated Network for Image Restoration. 1156-1161
- Xuehui Wu, Jie Shao, Dongyang Zhang, Junming Chen: Unsupervised Facial Image Synthesis Using Two-Discriminator Adversarial Autoencoder Network. 1162-1167
- Jie Liu, Cheolkon Jung: Facial Image Inpainting Using Multi-level Generative Network. 1168-1173
- Jianyu Wang, Shaohui Liu, Feng Jiang, Xiaoshuai Sun, Yongliang Liu: A Video Post-Filter Deblocking Method Based on Temporal Boosting Residual Networks. 1174-1179
- Xiaopeng Sun, Wen Lu, Rui Wang, Furui Bai: Distilling with Residual Network for Single Image Super Resolution. 1180-1185
- Junyi Wang, Weimin Tan, Xuejing Niu, Bo Yan: RDGAN: Retinex Decomposition Based Adversarial Learning for Low-Light Enhancement. 1186-1191
- Shichao Li, Yonghong Hou, Huanjing Yue, Zihui Guo: Single Image De-Raining via Generative Adversarial Nets. 1192-1197
- Yuanlue Zhu, Mengchao Bai, Linlin Shen, Zhiwei Wen: SwitchGAN for Multi-domain Facial Image Translation. 1198-1203
- Michele Brizzi, Federica Battisti, Alessandro Neri: A Feature-Based Approach for Light Field Video Enhancement. 1204-1209
- Jindong Wang, Yiqiang Chen, Han Yu, Meiyu Huang, Qiang Yang: Easy Transfer Learning By Exploiting Intra-Domain Structures. 1210-1215
- Guyue Hu, Bo Cui, Shan Yu: Skeleton-Based Action Recognition with Synchronous Local and Non-Local Spatio-Temporal Learning and Frequency Attention. 1216-1221
- Meilu Zhu, Daming Shi: Deep Geometry Embedding Networks for Robust Facial Landmark Detection. 1222-1227
- Guangzhen Liu, Jiechao Guan, Manli Zhang, Jianhong Zhang, Zihao Wang, Zhiwu Lu: Joint Projection and Subspace Learning for Zero-Shot Recognition. 1228-1233
- Haoye Dong, Xiaodan Liang, Chenxing Zhou, Hanjiang Lai, Jia Zhu, Jian Yin: Part-Preserving Pose Manipulation for Person Image Synthesis. 1234-1239
- He Chen, Faming Fang: Bregman-Tanimoto Based Method for Contrast Preserving Decolorization. 1240-1245
- Xiaoqiang Li, Yaqin Zhu, Jiayue Han, Jide Li, Weiqin Tong: TDCC: Top-Down Semantic Aggregation for Color Constancy. 1246-1251
- Lin Zhang, Jianbo Zhao, Si Li, Boxin Shi, Ling-Yu Duan: From Market to Dish: Multi-ingredient Image Recognition for Personalized Recipe Recommendation. 1252-1257
- Hongjie Zhang, Ang Li, Xu Han, Zhaoming Chen, Yang Zhang, Yanwen Guo: Improving Open Set Domain Adaptation Using Image-to-Image Translation. 1258-1263
- Chaoqun Wang, Xuejin Chen, Shaobo Min, Feng Wu: Structure Generation and Guidance Network for Unsupervised Monocular Depth Estimation. 1264-1269
- Xinyao Chen, Bichuan Guo, Minhao Tang, Yuxing Han, Jiangtao Wen: A Conditional Bayesian Block Structure Inference Model for Optimized AV1 Encoding. 1270-1275
- Ce Wang, Renjie Wan, Feng Gao, Boxin Shi, Ling-Yu Duan: Learning to Remove Reflections for Text Images. 1276-1281
Poster Session 4 & Demo Session 2
- Hao Zhou

, Wengang Zhou, Houqiang Li:
Dynamic Pseudo Label Decoding for Continuous Sign Language Recognition. 1282-1287 - Yupan Huang, Qi Dai, Yutong Lu:

Decoupling Localization and Classification in Single Shot Temporal Action Detection. 1288-1293 - Zhiming Ma, Chun Yuan, Yangyang Cheng, Xinrui Zhu:

Image-to-Tree: A Tree-Structured Decoder for Image Captioning. 1294-1299 - Liang Sun, Bing Li, Chunfeng Yuan, Zhengjun Zha, Weiming Hu:
Multimodal Semantic Attention Network for Video Captioning. 1300-1305 - Jie Wu, Tianshui Chen, Hefeng Wu, Zhi Yang, Qing Wang, Liang Lin:

Concrete Image Captioning by Integrating Content Sensitive and Global Discriminative Objective. 1306-1311 - Huidong Li, Dandan Song, Lejian Liao, Cuimei Peng:

REVnet: Bring Reviewing Into Video Captioning for a Better Description. 1312-1317 - Xi Meng, Hao Kong, Dongqi Tang, Tong Lu:

Multimodal Image Captioning Through Combining Reinforced Cross Entropy Loss and Stochastic Deprecation. 1318-1323 - Qi Wei, Kai Fan, Wenlin Wang, Tianhang Zheng, Amit Chakraborty, Katherine A. Heller, Changyou Chen, Kui Ren:
InverseNet: Solving Inverse Problems of Multimedia Data with Splitting Networks. 1324-1329 - Shaobo Lin, Long Chen, Qin Zou, Wei Tian:
High-Resolution Driving Scene Synthesis Using Stacked Conditional Gans and Spectral Normalization. 1330-1335 - Yuqi Huo, Jiechao Guan, Jianhong Zhang, Manli Zhang, Ji-Rong Wen, Zhiwu Lu:
Zero-Shot Learning with Few Seen Class Samples. 1336-1341 - Zhihao Ouyang, Yan Feng, Zihao He, Tianbo Hao, Tao Dai, Shu-Tao Xia:

Attentiondrop for Convolutional Neural Networks. 1342-1347 - Yongyong Chen, Xiaolin Xiao, Yicong Zhou:
Multi-view Clustering via Simultaneously Learning Graph Regularized Low-Rank Tensor Representation and Affinity Matrix. 1348-1353 - Boxin He, Shengbei Wang, Weitao Yuan, Jianming Wang, Masashi Unoki:
Data Augmentation for Monaural Singing Voice Separation Based on Variational Autoencoder-Generative Adversarial Network. 1354-1359 - Tao He, Xiaoming Jin, Guiguang Ding, Lan Yi, Chenggang Yan:

Towards Better Uncertainty Sampling: Active Learning with Multiple Views for Deep Convolutional Neural Network. 1360-1365 - Chunbin Gu, Jiajun Bu, Keyue Shi, Zhi Yu, Beidou Wang, Liangcheng Li:

Local Metric Learning Based on Anchor Points for Multimedia Analysis. 1366-1371 - Tung Doan, Atsuhiro Takasu:

Sparse Regression-Based Multiple Sequence Alignment. 1372-1377 - Youzhao Yang, Hong Lu:

Single Image Deraining using a Recurrent Multi-scale Aggregation and Enhancement Network. 1378-1383 - Chunpeng Wang, Jie Zhu:

Neural Network Based Phase Compensation Methods on Monaural Speech Separation. 1384-1389 - Huikai Shao, Dexing Zhong, Yuhan Li:

PalmGAN for Cross-Domain Palmprint Recognition. 1390-1395 - Jianing Li, Siwei Dong, Zhaofei Yu, Yonghong Tian, Tiejun Huang:

Event-Based Vision Enhanced: A Joint Detection Framework in Autonomous Driving. 1396-1401 - Liming Zhai, Lina Wang, Yanzhen Ren:
Multi-domain Embedding Strategies for Video Steganography by Combining Partition Modes and Motion Vectors. 1402-1407 - Hangqing Guo, Nan Zhang, Wenjun Shi, Saeed Ali-AlQarni, Shaoen Wu, Honggang Wang:
Real-Time Indoor 3D Human Imaging Based on MIMO Radar Sensing. 1408-1413 - Shanfa Ke, Ruimin Hu, Gang Li, Tingzhao Wu, Xiaochen Wang, Zhongyuan Wang:

Multi-speakers Speech Separation Based on Modified Attractor Points Estimation and GMM Clustering. 1414-1419 - Jing Zhao, Ruiqin Xiong, Jizheng Xu, Feng Wu, Tiejun Huang:

Learning a Deep Convolutional Network for Subband Image Denoising. 1420-1425 - Zhijie Lin, Sen Jia, Bin Deng:

Multi-Task Embedded Convolutional Neural Network for Hyperspectral Image Classification. 1426-1431 - Lin Zhu, Siwei Dong, Tiejun Huang, Yonghong Tian:
A Retina-Inspired Sampling Method for Visual Texture Reconstruction. 1432-1437 - Zhizheng Zhang, Zhibo Chen, Jianxin Lin, Weiping Li:

Learned Scalable Image Compression with Bidirectional Context Disentanglement Network. 1438-1443 - Jiabao Yao, Li Wang, Fangdong Chen, Chaoyi Lin, Shiliang Pu:

An Attention Residual Neural Network with Recurrent Greedy Approach as Loop Filter for Inter Frames. 1444-1449 - Yuhang Liu, Wenyong Dong, Wanjuan Song, Lei Zhang:

Bayesian Nonnegative Matrix Factorization with a Truncated Spike-and-Slab Prior. 1450-1455 - Chao Huang, Zongju Peng, Fen Chen, Qiuping Jiang, Xin Cui, Gangyi Jiang:

Encoding Complexity Control for Live Video Applications: An Interpretable Machine Learning Approach. 1456-1461 - Risheng Liu, Cheng Yang, Long Ma, Miao Zhang, Xin Fan, Zhongxuan Luo:
Enhanced Residual Dense Intrinsic Network for Intrinsic Image Decomposition. 1462-1467 - Zhipeng Lin, Zhenyu Zhao, Tingjin Luo, Wenjing Yang, Yongjun Zhang, Yuhua Tang:
Non-Convex Transfer Subspace Learning for Unsupervised Domain Adaptation. 1468-1473 - Jianping Gou, Lei Wang, Zhang Yi, Yun-Hao Yuan, Weihua Ou, Qirong Mao:
Discriminative Group Collaborative Competitive Representation for Visual Classification. 1474-1479 - Jiahong Wu, He Zheng, Bo Zhao, Yixin Li, Baoming Yan, Rui Liang, Wenjia Wang, Shipei Zhou, Guosen Lin, Yanwei Fu, Yizhou Wang, Yonggang Wang:
Large-Scale Datasets for Going Deeper in Image Understanding. 1480-1485 - Xuan Shao, Xiao Liu, Lin Zhang, Shengjie Zhao, Ying Shen, Yukai Yang:
Revisit Surround-view Camera System Calibration. 1486-1491 - Vishal Keshav, Tej Pratap G. V. S. L.:

Decoupling Semantic Context and Color Correlation with Multi-class Cross Branch Regularization. 1492-1497 - Zhilin Qiu, Lingbo Liu, Guanbin Li, Qing Wang, Nong Xiao, Liang Lin:

Crowd Counting via Multi-view Scale Aggregation Networks. 1498-1503 - Jia Shao, Bo Du, Chen Wu, Pingkun Yan:
PASiam: Predicting Attention Inspired Siamese Network, for Space-Borne Satellite Video Tracking. 1504-1509 - Wenbo Zheng, Lan Yan, Chao Gou, Wenwen Zhang, Fei-Yue Wang:
A Relation Network Embedded with Prior Features for Few-Shot Caricature Recognition. 1510-1515 - Fenfen Sheng, Zhineng Chen, Tao Mei, Bo Xu:

A Single-Shot Oriented Scene Text Detector with Learnable Anchors. 1516-1521 - Rui Lu, Menghan Zhou, Anlong Ming, Yu Zhou:

Context-Constrained Accurate Contour Extraction for Occlusion Edge Detection. 1522-1527 - Yun-Hao Yuan, Jin Li, Jianping Gou, Yun Li, Jipeng Qiang, Bin Li:
Learning Simultaneous Face Super-Resolution Using Multiset Partial Least Squares. 1528-1533 - Qifeng Lin, Jianhui Zhao, Qianqian Tong, Guian Zhang, Zhiyong Yuan, Gang Fu:

Cropping Region Proposal Network Based Framework for Efficient Object Detection on Large Scale Remote Sensing Images. 1534-1539 - Jinghua Wang, Adrian Hilton, Jianmin Jiang:
Spectral Analysis Network for Deep Representation Learning and Image Clustering. 1540-1545
Poster Session 5 & Grand Challenge
- Chang Tang, Xinzhong Zhu, Xinwang Liu, Pichao Wang:
Salient Object Detection via Recurrently Aggregating Spatial Attention Weighted Cross-Level Deep Features. 1546-1551
- Xiaoshui Huang, Lixin Fan, Qiang Wu, Jian Zhang, Chun Yuan:
Fast Registration for Cross-Source Point Clouds by using Weak Regional Affinity and Pixel-Wise Refinement. 1552-1557 - Cunkuan Yuan, Kun Li, Yu-Kun Lai, Yebin Liu, Jingyu Yang:

3D Face Representation and Reconstruction with Multi-scale Graph Convolutional Autoencoders. 1558-1563 - Qiang Wang, Yahong Han:

Visual Dialog with Targeted Objects. 1564-1569 - Zhengyang Sun, Zongqing Lu, Jing-Hao Xue, Qingmin Liao:

A New Object Scene Flow Algorithm Based on Support Points Selection and Robust Moving Object Proposal. 1570-1575 - Dashan Guo, Wei Li, Ning Xu, Jianhui Sun, Xiangzhong Fang:

Refining Proposals with Neighboring Contexts for Temporal Action Detection. 1576-1581 - Yanjun Chen, Jie Guo, Bingyang Hu, Yanwen Guo, Jingui Pan:

A Data-Driven Framework for Appearance Editing of Measured Materials. 1582-1587 - Yang Zhou, Shuhan Shen, Zhanyi Hu:
Active Semantic Labeling of Street View Point Clouds. 1588-1593 - Qian Wu, Wenmin Wang, Xiongtao Chen, Weimian Li:

Video Prediction with Temporal-Spatial Attention Mechanism and Deep Perceptual Similarity Branch. 1594-1599 - Chongyang Bai, Maksim Bolonkin, Judee K. Burgoon, Chao Chen, Norah E. Dunbar, Bharat Singh, V. S. Subrahmanian, Zhe Wu:

Automatic Long-Term Deception Detection in Group Interaction Videos. 1600-1605 - Yachi Zhang, Zongqing Lu, Jing-Hao Xue, Qingmin Liao:

A New Rotation-Invariant Deep Network for 3D Object Recognition. 1606-1611 - Andreas Kah, Matthias Narroschke:

Local Optical Flow Considering Object Boundaries by Adaptive Window Positioning. 1612-1617 - Meng Zhang, Xinchen Liu, Wu Liu, Anfu Zhou, Huadong Ma, Tao Mei:

Multi-Granularity Reasoning for Social Relation Recognition From Images. 1618-1623 - Xin Chen, Yahong Han:

Multi-Timescale Context Encoding for Scene Parsing Prediction. 1624-1629 - Lingyu Zhu, Tinghuai Wang, Emre Aksu, Joni-Kristian Kamarainen:
Portrait Instance Segmentation for Mobile Devices. 1630-1635 - Pengbo Zhang, Zhihui Wang, Xinzhu Ma, Haojie Li, Jianjun Li:

Learning to Segment Unseen Category Objects using Gradient Gaussian Attention. 1636-1641 - Fei Pan, Yanwen Guo, Zhicheng Yan, Jie Guo:

Temporal Segment Convolutional Kernel Networks for Sequence Modeling of Videos. 1642-1647 - Shaoshuai Li, Fuyan Liu:

SVNet: A Single View Network for 3D Shape Recognition. 1648-1653 - Fei Wang, Shujin Lin, Hefeng Wu, Hanhui Li, Ruomei Wang, Xiaonan Luo, Xiangjian He:
SPFusionNet: Sketch Segmentation Using Multi-modal Data Fusion. 1654-1659 - Mengmeng Jing, Jingjing Li, Ke Lu, Jieyan Liu, Zi Huang:
Adaptive Component Embedding for Unsupervised Domain Adaptation. 1660-1665 - Truc Nguyen, Franz Pernkopf:
Acoustic Scene Classification with Mismatched Recording Devices Using Mixture of Experts Layer. 1666-1671 - Hongchao Gao, Xi Wang, Yujia Li, Jizhong Han, Songlin Hu, Ruixuan Li:
Self-Representation Convolutional Neural Networks. 1672-1677
Poster Session 6
- Tianchi Huang, Xin Yao, Chenglei Wu, Rui-Xiao Zhang, Zhengyuan Pang, Lifeng Sun:

Tiyuntsong: A Self-Play Reinforcement Learning Approach for ABR Video Streaming. 1678-1683 - Venkatraman Balasubramanian, Mu Wang, Martin Reisslein, Changqiao Xu:
Edge-Boost: Enhancing Multimedia Delivery with Mobile Edge Caching in 5G-D2D Networks. 1684-1689 - Hao Wu, Xiaoyan Sun, Jingyu Yang, Feng Wu:

3D Mesh Based Inter-Image Prediction for Image Set Compression. 1690-1695 - Dayong Wang, Yu Sun, Weisheng Li, Ce Zhu, Frédéric Dufaux:
Fast Inter Mode Predictions for SHVC. 1696-1701 - Xing Chen, Lijun He, Shang Xu, Shibo Hu, Qingzhou Li, Guizhong Liu:

Hit Ratio Driven Mobile Edge Caching Scheme for Video on Demand Services. 1702-1707 - Fang Liu, Wei Zhang, Yonggang Wen:

QoE-Driven Mobile Streaming: A Location-Aware Approach. 1708-1713 - Aris S. Lalos, Gerasimos Arvanitis, Evangelos Vlachos, Konstantinos Moustakas:
Energy Efficient Transmission of 3D Meshes Over MMWave-Based Massive MIMO Systems. 1714-1719 - Hao Fan, Xu Tong, Qing Zhang, Tianxiang Zhang, Chenyang Wang, Xiaofei Wang:

Identifying Influential Users in Mobile Device-to-Device Social Networks to Promote Offline Multimedia Content Propagation. 1720-1725 - Yuhao Chen, Min Zhao, Xin Tan, Hong Tang, Dihua Sun:
Accurate and Efficient Object Detection with Context Enhancement Block. 1726-1731 - Yousong Zhu, Chaoyang Zhao, Chenxia Han, Jinqiao Wang, Hanqing Lu:

Mask Guided Knowledge Distillation for Single Shot Detector. 1732-1737 - Yang Wang, Lan Wang, Feng Su, Jiahao Shi:

Video Text Detection with Fully Convolutional Network and Tracking. 1738-1743 - Dongming Yang, Yuexian Zou:

Cascade Region Proposal Networks for Object Detection in the Wild. 1744-1749 - Wenfei Yang, Bin Liu, Weihai Li, Nenghai Yu:

Tracking Assisted Faster Video Object Detection. 1750-1755 - Pengyuan Xie, Jing Xiao, Yang Cao, Jia Zhu, Asad Khan:
RefineText: Refining Multi-oriented Scene Text Detection with a Feature Refinement Module. 1756-1761 - Qi Qi, Sanyuan Zhao, Jianbing Shen, Kin-Man Lam:
Multi-scale Capsule Attention-Based Salient Object Detection with Multi-crossed Layer Connections. 1762-1767 - Donghao Gu, Zhaojing Wen, Wenxue Cui, Rui Wang, Feng Jiang, Shaohui Liu:

Continuous Bidirectional Optical Flow for Video Frame Sequence Interpolation. 1768-1773 - Chunhui Zhang, Shiming Ge, Yingying Hua, Dan Zeng:

Robust Deep Tracking with Two-step Augmentation Discriminative Correlation Filters. 1774-1779 - Yiwu Yao, Bin Dong, Yuke Li, Weiqiang Yang, Haoqi Zhu:

Efficient Implementation of Convolutional Neural Networks with End to End Integer-Only Dataflow. 1780-1785 - Qianqian Wang, Liansheng Zhuang, Ning Wang, Wengang Zhou, Houqiang Li:

Learning Motion-Aware Policies for Robust Visual Tracking. 1786-1791 - Lei Jiang, Wengang Zhou, Houqiang Li:

Knowledge Distillation with Category-Aware Attention and Discriminant Logit Losses. 1792-1797 - Anjie Wang, Yongbin Gao, Zhijun Fang, Xiaoyan Jiang, Shanshe Wang, Siwei Ma, Jenq-Neng Hwang:

Unsupervised Learning of Depth and Ego-Motion with Spatial-Temporal Geometric Constraints. 1798-1803 - Ming-Ya Ko, Jeng-Lin Li, Chi-Chun Lee:
Learning Minimal Intra-Genre Multimodal Embedding from Trailer Content and Reactor Expressions for Box Office Prediction. 1804-1809 - Yangwo Jian, Jing Xiao, Yang Cao, Asad Khan, Jia Zhu:
Deep Pairwise Ranking with Multi-label Information for Cross-Modal Retrieval. 1810-1815 - Luo Xiong, Yanjie Liang, Yan Yan, Hanzi Wang:

Correlation Filter Tracking with Adaptive Proposal Selection for Accurate Scale Estimation. 1816-1821 - Haitao Wang, Min Meng, Hui Chen, Jigang Wu:

Supervised Consistent and Specific Hashing. 1822-1827 - Shengdong Li, Xueqiang Lv:

Momentum Based on Adaptive Bold Driver. 1828-1833 - Meiyu Huang, Xueshuang Xiang, Yao Xu, Yiqiang Chen:
A Lightweight Neural Network Based Human Depth Recovery Method. 1834-1839 - Shiyu Zhao, Lin Zhang, Shuaiyi Huang, Ying Shen, Shengjie Zhao, Yukai Yang:
Evaluation of Defogging: A Real-World Benchmark Dataset, A New Criterion and Baselines. 1840-1845 - Yanan Wang, Haili Wang, Jiaoyang Shang, Hu Tuo:

RESA: A Real-Time Evaluation System for ABR. 1846-1851 - Qingbo Wu, Rui Ma, King Ngi Ngan, Hongliang Li, Fanman Meng:

Blind Image Sharpness Assessment And Enhancement via Deep Auxiliary Learning. 1852-1857 - Jinjian Wu, Jupo Ma, Fuhu Liang, Weisheng Dong, Guangming Shi:

End-to-End Blind Image Quality Assessment with Cascaded Deep Features. 1858-1863 - Chen Huang, Tingting Jiang, Ming Jiang:
Encoding Distortions for Multi-task Full-Reference Image Quality Assessment. 1864-1869 - Yuan Meng, Shenglin Zhang, Zijie Ye, Benliang Wang, Zhi Wang, Yongqian Sun, Qitong Liu, Shuai Yang, Dan Pei:
Causal Analysis of the Unsatisfying Experience in Realtime Mobile Multiplayer Games in the Wild. 1870-1875
