


IEEE Transactions on Multimedia, Volume 24
Volume 24, 2022
- Pan Gao, Pengwei Zhang, Aljosa Smolic:
Quality Assessment for Omnidirectional Video: A Spatio-Temporal Distortion Modeling Approach. 1-16
- Haonan Su, Long Yu, Cheolkon Jung:
Joint Contrast Enhancement and Noise Reduction of Low Light Images Via JND Transform. 17-32
- Yongqiang Gui, Hancheng Lu, Feng Wu, Chang Wen Chen:
LensCast: Robust Wireless Video Transmission Over MmWave MIMO With Lens Antenna Array. 33-48
- Haonan Fan, Hai-Miao Hu, Shuailing Liu, Weiqing Lu, Shiliang Pu:
Correlation Graph Convolutional Network for Pedestrian Attribute Recognition. 49-60
- Chih-Hung Liang, Yu-An Chen, Yueh-Cheng Liu, Winston H. Hsu:
Raw Image Deblurring. 61-72
- Zhenyu Wu, Shuai Li, Chenglizhao Chen, Aimin Hao, Hong Qin:
Deeper Look at Image Salient Object Detection: Bi-Stream Network With a Small Training Dataset. 73-86
- Lei Cao, Huijun Zhang, Ling Feng:
Building and Using Personal Knowledge Graph to Improve Suicidal Ideation Detection on Social Media. 87-102
- Chunxiao Liu, Zhendong Mao, Tianzhu Zhang, An-An Liu, Bin Wang, Yongdong Zhang:
Focus Your Attention: A Focal Attention for Multimodal Learning. 103-115
- Fei Ye, Chaoqin Huang, Jinkun Cao, Maosen Li, Ya Zhang, Cewu Lu:
Attribute Restoration Framework for Anomaly Detection. 116-127
- Shaoyue Song, Zhenjiang Miao, Hongkai Yu, Jianwu Fang, Kang Zheng, Cong Ma, Song Wang:
Deep Domain Adaptation Based Multi-Spectral Salient Object Detection. 128-140
- Haitao Zeng, Xinhang Song, Gongwei Chen, Shuqiang Jiang:
Amorphous Region Context Modeling for Scene Recognition. 141-151
- Xinpeng Huang, Ping An, Yilei Chen, Deyang Liu, Liquan Shen:
Low Bitrate Light Field Compression With Geometry and Content Consistency. 152-165
- Xingyuan Zhang, Fuhai Zhang:
Differentiable Spatial Regression: A Novel Method for 3D Hand Pose Estimation. 166-176
- Xingyu Chen, Jin Li, Xuguang Lan, Nanning Zheng:
Generalized Zero-Shot Learning Via Multi-Modal Aggregated Posterior Aligning Neural Network. 177-187
- Jingjia Huang, Wei Yan, Thomas H. Li, Shan Liu, Ge Li:
Learning the Global Descriptor for 3-D Object Recognition Based on Multiple Views Decomposition. 188-201
- Canqiang Chen, Chunmei Qing, Xiangmin Xu, Patrick Dickinson:
Cross Parallax Attention Network for Stereo Image Super-Resolution. 202-216
- Xun Gong, Zu Yao, Xin Li, Yueqiao Fan, Bin Luo, Jianfeng Fan, Boji Lao:
LAG-Net: Multi-Granularity Network for Person Re-Identification via Local Attention System. 217-229
- Jinwei Wang, Junjie Zhao, Qilin Yin, Xiangyang Luo, Yuhui Zheng, Yun-Qing Shi, Sunil Kr. Jha:
SmsNet: A New Deep Convolutional Neural Network Model for Adversarial Example Detection. 230-244
- Joongchol Shin, Hasil Park, Joonki Paik:
Region-Based Dehazing via Dual-Supervised Triple-Convolutional Network. 245-260
- Yu-Jen Ma, Hong-Han Shuai, Wen-Huang Cheng:
Spatiotemporal Dilated Convolution With Uncertain Matching for Video-Based Crowd Estimation. 261-273
- Che Sun, Hao Song, Xinxiao Wu, Yunde Jia, Jiebo Luo:
Exploiting Informative Video Segments for Temporal Action Localization. 274-287
- Xu Chen, Chenqiang Gao, Chaoyu Li, Yi Yang, Deyu Meng:
Infrared Action Detection in the Dark via Cross-Stream Attention Mechanism. 288-300
- Xuefeng Zhu, Xiaojun Wu, Tianyang Xu, Zhen-Hua Feng, Josef Kittler:
Robust Visual Object Tracking Via Adaptive Attribute-Aware Discriminative Correlation Filters. 301-312
- Xing Zhang, Zuxuan Wu, Yu-Gang Jiang:
SAM: Modeling Scene, Object and Action With Semantics Attention Modules for Video Recognition. 313-322
- Bo Wang, Mingwei Xu, Fengyuan Ren, Jianping Wu:
Improving Robustness of DASH Against Unpredictable Network Variations. 323-337
- Aihua Zheng, Menglan Hu, Bo Jiang, Yan Huang, Yan Yan, Bin Luo:
Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching. 338-351
- Mengyang Zhang, Guohui Tian, Ying Zhang, Peng Duan:
Reinforcement Learning for Logic Recipe Generation: Bridging Gaps From Images to Plans. 352-365
- Mauricio Perez, Jun Liu, Alex C. Kot:
Interaction Relational Network for Mutual Action Recognition. 366-376
- Lê Minh Ngô, Sezer Karaoglu, Theo Gevers:
Self-Supervised Face Image Manipulation by Conditioning GAN on Face Decomposition. 377-385
- Pantelis Maniotis, Nikolaos Thomos:
Viewport-Aware Deep Reinforcement Learning Approach for 360$^\circ$ Video Caching. 386-399
- Xinchao Dong, Liquan Shen, Mei Yu, Hao Yang:
Fast Intra Mode Decision Algorithm for Versatile Video Coding. 400-414
- Yaoyu Li, Hantao Yao, Changsheng Xu:
Intra-Domain Consistency Enhancement for Unsupervised Person Re-Identification. 415-425
- Zhihao Shi, Xiaohong Liu, Kangdi Shi, Linhui Dai, Jun Chen:
Video Frame Interpolation via Generalized Deformable Convolution. 426-439
- Yang Zhang, Moyun Liu, Jingwu He, Fei Pan, Yanwen Guo:
Affinity Fusion Graph-Based Framework for Natural Image Segmentation. 440-450
- Zhuoman Liu, Wei Jia, Ming Yang, Peiyao Luo, Yong Guo, Mingkui Tan:
Deep View Synthesis via Self-Consistent Generative Network. 451-465
- Peng-Fei Zhang, Yang Li, Zi Huang, Xin-Shun Xu:
Aggregation-Based Graph Convolutional Hashing for Unsupervised Cross-Modal Retrieval. 466-479
- Ziqiang Zheng, Zhibin Yu, Haiyong Zheng, Yang Yang, Heng Tao Shen:
One-Shot Image-to-Image Translation via Part-Global Learning With a Multi-Adversarial Framework. 480-491
- Tengpeng Li, Kaihua Zhang, Shiwen Shen, Bo Liu, Qingshan Liu, Zhu Li:
Image Co-Saliency Detection and Instance Co-Segmentation Using Attention Graph Clustering Based Graph Convolutional Network. 492-505
- Xusong Chen, Chenyi Lei, Dong Liu, Guoxin Wang, Haihong Tang, Zheng-Jun Zha, Houqiang Li:
E-Commerce Storytelling Recommendation Using Attentional Domain-Transfer Network and Adversarial Pre-Training. 506-518
- Zhaoqing Pan, Feng Yuan, Jianjun Lei, Wanqing Li, Nam Ling, Sam Kwong:
MIEGAN: Mobile Image Enhancement via a Multi-Module Cascade Neural Network. 519-533
- Liming Xu, Xianhua Zeng, Weisheng Li, Ling Bai:
IDHashGAN: Deep Hashing With Generative Adversarial Nets for Incomplete Data Retrieval. 534-545
- Huafeng Liu, Chuanyi Zhang, Yazhou Yao, Xiu-Shen Wei, Fumin Shen, Zhenmin Tang, Jian Zhang:
Exploiting Web Images for Fine-Grained Visual Recognition by Eliminating Open-Set Noise and Utilizing Hard Examples. 546-557
- Jianjie Lu, Weidong Zhang, Haibing Yin:
Generate and Purify: Efficient Person Data Generation for Re-Identification. 558-566
- Qin Xu, Yiming Mei, Jinpei Liu, Chenglong Li:
Multimodal Cross-Layer Bilinear Pooling for RGBT Tracking. 567-580
- Lijuan Sun, Songhe Feng, Jun Liu, Gengyu Lyu, Congyan Lang:
Global-Local Label Correlation for Partial Multi-Label Learning. 581-593
- Huan Li, Ping Wei, Ping Hu:
AVN: An Adversarial Variation Network Model for Handwritten Signature Verification. 594-608
- Zhongze Chen, Jing Li, Jia Wu, Jun Chang, Yafu Xiao, Xiaoting Wang:
Drift-Proof Tracking With Deep Reinforcement Learning. 609-624
- Rizard Renanda Adhi Pramono, Yie-Tarng Chen, Wen-Hsien Fang:
Spatial-Temporal Action Localization With Hierarchical Self-Attention. 625-639
- Heng Yao, Mian Zou, Chuan Qin, Xinpeng Zhang:
Signal-Dependent Noise Estimation for a Real-Camera Model via Weight and Shape Constraints. 640-654
- Jun Chen, Xuejiao Li, Linbo Luo, Jiayi Ma:
Multi-Focus Image Fusion Based on Multi-Scale Gradients and Image Matting. 655-667
- Linchao Zhu, Hehe Fan, Yawei Luo, Mingliang Xu, Yi Yang:
Temporal Cross-Layer Correlation Mining for Action Recognition. 668-676
- Zhaoyu Zhang, Mengyan Li, Haonian Xie, Jun Yu, Tongliang Liu, Chang Wen Chen:
TWGAN: Twin Discriminator Generative Adversarial Networks. 677-688
- Md. Moniruzzaman, Zhaozheng Yin, Zhihai He, Ruwen Qin, Ming C. Leu:
Human Action Recognition by Discriminative Feature Pooling and Video Segment Attention Model. 689-701
- Meng Chang, Huajun Feng, Zhihai Xu, Qi Li:
Low-Light Image Restoration With Short- and Long-Exposure Raw Pairs. 702-714
- Hanli Wang, Pengjie Tang, Qinyu Li, Meng Cheng:
Emotion Expression With Fact Transfer for Video Description. 715-727
- Sihao Lin, Wenhao Wu, Si Wu, Yong Xu, Hau-San Wong:
Unreliable-to-Reliable Instance Translation for Semi-Supervised Pedestrian Detection. 728-739
- Zhi Zeng, Ting Wang, Fulei Ma, Liang Zhang, Peiyi Shen, Syed Afaq Ali Shah, Mohammed Bennamoun:
Probability-Based Framework to Fuse Temporal Consistency and Semantic Information for Background Segmentation. 740-754
- Yuan-fang Zhang, Jiangbin Zheng, Wenjing Jia, Wenfeng Huang, Long Li, Nian Liu, Fei Li, Xiangjian He:
Deep RGB-D Saliency Detection Without Depth. 755-767
- Hao Zhou, Wengang Zhou, Yun Zhou, Houqiang Li:
Spatial-Temporal Multi-Cue Network for Sign Language Recognition and Translation. 768-779
- Amir Shirian, Subarna Tripathi, Tanaya Guha:
Dynamic Emotion Modeling With Learnable Graphs and Graph Inception Network. 780-790
- Jingkuan Song, Jingqiu Zhang, Lianli Gao, Zhou Zhao, Heng Tao Shen:
AgeGAN++: Face Aging and Rejuvenation With Dual Conditional GANs. 791-804
- Desheng Cai, Shengsheng Qian, Quan Fang, Changsheng Xu:
Heterogeneous Hierarchical Feature Aggregation Network for Personalized Micro-Video Recommendation. 805-818
- Huijing Zhan, Jie Lin, Kenan Emir Ak, Boxin Shi, Ling-Yu Duan, Alex C. Kot:
$A^3$-FKG: Attentive Attribute-Aware Fashion Knowledge Graph for Outfit Preference Prediction. 819-831
- Hongchen Tan, Xiuping Liu, Baocai Yin, Xin Li:
Cross-Modal Semantic Matching Generative Adversarial Networks for Text-to-Image Synthesis. 832-845
- Aite Zhao, Junyu Dong, Jianbo Li, Lin Qi, Huiyu Zhou:
Associated Spatio-Temporal Capsule Network for Gait Recognition. 846-860
- Jiaxu Leng, Ying Liu, Zhihui Wang, Haibo Hu, Xinbo Gao:
CrossNet: Detecting Objects as Crosses. 861-875
- Yangyang Shu, Qian Li, Chang Xu, Shaowu Liu, Guandong Xu:
V-SVR+: Support Vector Regression With Variational Privileged Information. 876-889
- Yifang Yin, Ying Zhang, Zhenguang Liu, Sheng Wang, Rajiv Ratn Shah, Roger Zimmermann:
GPS2Vec: Pre-Trained Semantic Embeddings for Worldwide GPS Coordinates. 890-903
- Huixia Ben, Yingwei Pan, Yehao Li, Ting Yao, Richang Hong, Meng Wang, Tao Mei:
Unpaired Image Captioning With semantic-Constrained Self-Learning. 904-916
- Yanhao Tan, Mohammad Muntasir Rahman, Yanfu Yan, Jian Xue, Ling Shao, Ke Lu:
Fine-Grained Categorization From RGB-D Images. 917-928
- Xiao Luan, Yuanyuan Zhao, Weihua Ou, Linghui Liu, Weisheng Li, Yucheng Shu, Hongmin Geng:
Collaborative Learning With a Multi-Branch Framework for Feature Enhancement. 929-941
- Xinyuan Qian, Alessio Brutti, Oswald Lanz, Maurizio Omologo, Andrea Cavallaro:
Audio-Visual Tracking of Concurrent Speakers. 942-954
- Han Fang, Dongdong Chen, Feng Wang, Zehua Ma, Honggu Liu, Wenbo Zhou, Weiming Zhang, Nenghai Yu:
TERA: Screen-to-Camera Image Code With Transparency, Efficiency, Robustness and Adaptability. 955-967
- Tao Chen, Guo-Sen Xie, Yazhou Yao, Qiong Wang, Fumin Shen, Zhenmin Tang, Jian Zhang:
Semantically Meaningful Class Prototype Learning for One-Shot Image Segmentation. 968-980
- Pandeng Li, Hongtao Xie, Shaobo Min, Zheng-Jun Zha, Yongdong Zhang:
Online Residual Quantization Via Streaming Data Correlation Preserving. 981-994
- Tianze Gao, Huihui Pan, Zidong Wang, Huijun Gao:
A CRF-Based Framework for Tracklet Inactivation in Online Multi-Object Tracking. 995-1007
- Mahesh Kumar Krishna Reddy, Mrigank Rochan, Yiwei Lu, Yang Wang:
AdaCrowd: Unlabeled Scene Adaptation for Crowd Counting. 1008-1019
- Yukun Zuo, Hantao Yao, Liansheng Zhuang, Changsheng Xu:
Seek Common Ground While Reserving Differences: A Model-Agnostic Module for Noisy Domain Adaptation. 1020-1030
- Qi Wang, Weidong Min, Qing Han, Qian Liu, Cheng Zha, Haoyu Zhao, Zitai Wei:
Inter-Domain Adaptation Label for Data Augmentation in Vehicle Re-Identification. 1031-1041
- Tao Chen, Shui-Hua Wang, Qiong Wang, Zheng Zhang, Guo-Sen Xie, Zhenmin Tang:
Enhanced Feature Alignment for Unsupervised Domain Adaptation of Semantic Segmentation. 1042-1054
- Xiuwen Gong, Jiahui Yang, Dong Yuan, Wei Bao:
Generalized Large Margin $k$NN for Partial Label Learning. 1055-1066
- Jing Yi, Zhenzhong Chen:
Multi-Modal Variational Graph Auto-Encoder for Recommendation Systems. 1067-1079
- Zhengning Wu, Xiaobo Xia, Ruxin Wang, Jiatong Li, Jun Yu, Yinian Mao, Tongliang Liu:
LR-SVM+: Learning Using Privileged Information with Noisy Labels. 1080-1092
- Zeren Sun, Huafeng Liu, Qiong Wang, Tianfei Zhou, Qi Wu, Zhenmin Tang:
Co-LDL: A Co-Training-Based Label Distribution Learning Method for Tackling Label Noise. 1093-1104
- Huafeng Liu, Haofeng Zhang, Jianfeng Lu, Zhenmin Tang:
Exploiting Web Images for Fine-Grained Visual Recognition via Dynamic Loss Correction and Global Sample Selection. 1105-1115
- Xiaobo Shen, Guohua Dong, Yuhui Zheng, Long Lan, Ivor W. Tsang, Quan-Sen Sun:
Deep Co-Image-Label Hashing for Multi-Label Image Retrieval. 1116-1126
- Hao-Chiang Shao, Hsin-Chieh Wang, Weng-Tai Su, Chia-Wen Lin:
Ensemble Learning With Manifold-Based Data Splitting for Noisy Label Correction. 1127-1140
- Junya Teng, Xiankai Lu, Yongshun Gong, Xinfang Liu, Xiushan Nie, Yilong Yin:
Regularized Two Granularity Loss Function for Weakly Supervised Video Moment Retrieval. 1141-1151
- Sijie Song, Jiaying Liu, Lilang Lin, Zongming Guo:
Learning to Recognize Human Actions From Noisy Skeleton Data Via Noise Adaptation. 1152-1163
- Jingyu Hao, Chengjia Wang, Guang Yang, Zhifan Gao, Jinglin Zhang, Heye Zhang:
Annealing Genetic GAN for Imbalanced Web Data Learning. 1164-1174
- Bin Zhu, Chong-Wah Ngo, Wing Kwong Chan:
Learning From Web Recipe-Image Pairs for Food Recognition: Problem, Baselines and Performance. 1175-1185
- Yaoyao Zhong, Weihong Deng, Han Fang, Jiani Hu, Dongyue Zhao, Xian Li, Dongchao Wen:
Dynamic Training Data Dropout for Robust Deep Face Recognition. 1186-1197
- Chuanyi Zhang, Qiong Wang, Guo-Sen Xie, Qi Wu, Fumin Shen, Zhenmin Tang:
Robust Learning From Noisy Web Images Via Data Purification for Fine-Grained Recognition. 1198-1209
- Shiji Zhou, Lianzhe Wang, Shanghang Zhang, Zhi Wang, Wenwu Zhu:
Active Gradual Domain Adaptation: Dataset and Approach. 1210-1220
- Gongmian Wang, Xing Xu, Fumin Shen, Huimin Lu, Yanli Ji, Heng Tao Shen:
Cross-Modal Dynamic Networks for Video Moment Retrieval With Text Query. 1221-1232
- Bingwen Hu, Ping Liu, Zhedong Zheng, Mingwu Ren:
SPG-VTON: Semantic Prediction Guidance for Multi-Pose Virtual Try-on. 1233-1246
- Zhenfeng Xue, Weijie Mao, Liang Zheng:
Learning to Simulate Complex Scenes for Street Scene Segmentation. 1253-1265
- Mitra Tajrobehkar, Kaihua Tang, Hanwang Zhang, Joo-Hwee Lim:
Align R-CNN: A Pairwise Head Network for Visual Relationship Detection. 1266-1276
- Peiguang Jing, Jing Zhang, Liqiang Nie, Shu Ye, Jing Liu, Yuting Su:
Tripartite Graph Regularized Latent Low-Rank Representation for Fashion Compatibility Prediction. 1277-1287
- Yaomin Wang, Wenguang He:
High Capacity Reversible Data Hiding in Encrypted Image Based on Adaptive MSB Prediction. 1288-1298
- Shiguang Liu, Ting Zhu:
Structure-Guided Arbitrary Style Transfer for Artistic Image and Video. 1299-1312
- Dung Nguyen, Duc Thanh Nguyen, Rui Zeng, Thanh Thi Nguyen, Son N. Tran, Thin Nguyen, Sridha Sridharan, Clinton Fookes:
Deep Auto-Encoders With Sequential Learning for Multimodal Dimensional Emotion Recognition. 1313-1324
- Yalan Ye, Yukun He, Tongjie Pan, Jingjing Li, Heng Tao Shen:
Alleviating Domain Shift via Discriminative Learning for Generalized Zero-Shot Learning. 1325-1337
- Haoyu Tang, Jihua Zhu, Meng Liu, Zan Gao, Zhiyong Cheng:
Frame-Wise Cross-Modal Matching for Video Moment Retrieval. 1338-1349
- Tianchi Huang, Rui-Xiao Zhang, Lifeng Sun:
Zwei: A Self-Play Reinforcement Learning Framework for Video Transmission Services. 1350-1365
- Chong Mou, Jian Zhang, Xiaopeng Fan, Hangfan Liu, Ronggang Wang:
COLA-Net: Collaborative Attention Network for Image Restoration. 1366-1377
- Kaihao Zhang, Wenhan Luo, Lin Ma, Wenqi Ren, Hongdong Li:
Disentangled Feature Networks for Facial Portrait and Caricature Generation. 1378-1388
- Kyohoon Sim, Jiachen Yang, Wen Lu, Xinbo Gao:
Blind Stereoscopic Image Quality Evaluator Based on Binocular Semantic and Quality Channels. 1389-1398
- Giulia Slavic, Mohamad Baydoun, Damian Campo, Lucio Marcenaro, Carlo S. Regazzoni:
Multilevel Anomaly Detection Through Variational Autoencoders and Bayesian Models for Self-Aware Embodied Agents. 1399-1414
- Lu Zhang, Jingsong Xu, Yongshun Gong, Litao Yu, Jian Zhang, Jialie Shen:
Unsupervised Image and Text Fusion for Travel Information Enhancement. 1415-1425
- Amin Parvaneh, Ehsan Abbasnejad, Qi Wu, Qinfeng (Javen) Shi, Anton van den Hengel:
Show, Price and Negotiate: A Negotiator With Online Value Look-Ahead. 1426-1434