IEEE Journal of Selected Topics in Signal Processing, Volume 14
Volume 14, Number 1, January 2020
- Mai Xu, Ali Borji, Ce Zhu, Edward J. Delp, Marta Mrak, Patrick Le Callet:
Introduction to the Issue on Perception-Driven 360° Video Processing. 2-4
- Mai Xu, Chen Li, Shanyi Zhang, Patrick Le Callet:
State-of-the-Art in 360° Video/Image Processing: Perception, Assessment and Compression. 5-26
- Youqiang Zhang, Feng Dai, Yike Ma, Hongliang Li, Qiang Zhao, Yongdong Zhang:
Saliency Prediction Network for 360° Videos. 27-37
- Jia Li, Jinming Su, Changqun Xia, Yonghong Tian:
Distortion-Adaptive Salient Object Detection in 360° Omnidirectional Images. 38-48
- Falah Jabar, João Ascenso, Maria Paula Queluz:
Objective Assessment of Perceived Geometric Distortions in Viewport Rendering of 360° Images. 49-63
- Wei Sun, Xiongkuo Min, Guangtao Zhai, Ke Gu, Huiyu Duan, Siwei Ma:
MC360IQA: A Multi-channel CNN for Blind 360-Degree Image Quality Assessment. 64-77
- Zesong Fei, Fei Wang, Jing Wang, Xiang Xie:
QoE Evaluation Methods for 360-Degree VR Video Transmission. 78-88
- Meixu Chen, Yize Jin, Todd Goodall, Xiangxu Yu, Alan Conrad Bovik:
Study of 3D Virtual Reality Picture Quality. 89-102
- Zhibo Chen, Jiahua Xu, Chaoyi Lin, Wei Zhou:
Stereoscopic Omnidirectional Image Quality Assessment Based on Predictive Coding Theory. 103-117
- Yimin Zhou, Ling Tian, Ce Zhu, Xin Jin, Yu Sun:
Video Coding Optimization for Virtual Reality 360-Degree Source. 118-129
- Li Li, Ning Yan, Zhu Li, Shan Liu, Houqiang Li:
λ-Domain Perceptual Rate Control for 360-Degree Video Compression. 130-145
- Shaowei Xie, Yiling Xu, Yunqiao Li, Qiu Shen, Zhan Ma, Wenjun Zhang:
Perceptually Optimized Quality Adaptation of Viewport-Dependent Omnidirectional Video Streaming. 146-160
- Junni Zou, Chenglin Li, Chengming Liu, Qin Yang, Hongkai Xiong, Eckehard G. Steinbach:
Probabilistic Tile Visibility-Based Server-Side Rate Adaptation for Adaptive 360-Degree Video Streaming. 161-176
- Hui Yuan, Shiyun Zhao, Junhui Hou, Xuekai Wei, Sam Kwong:
Spatial and Temporal Consistency-Aware Dynamic Adaptive Streaming for 360-Degree Videos. 177-193
- Yiping Duan, Chaoyi Han, Xiaoming Tao, Bingrui Geng, Yunfei Du, Jianhua Lu:
Panoramic Image Generation: From 2-D Sketch to Spherical Image. 194-208
- Jia Li, Yifan Zhao, Weihua Ye, Kaiwen Yu, Shiming Ge:
Attentive Deep Stitching and Quality Assessment for 360° Omnidirectional Images. 209-221
- Kang Liao, Chunyu Lin, Yao Zhao, Moncef Gabbouj, Yang Zheng:
OIDC-Net: Omnidirectional Image Distortion Correction via Coarse-to-Fine Region Attention. 222-231
Volume 14, Number 2, February 2020
- Juan Ignacio Godino-Llorente, Douglas D. O'Shaughnessy, Tan Lee, Najim Dehak, Claudia Manfredi:
Introduction to the Issue on Automatic Assessment of Health Disorders Based on Voice, Speech, and Language Processing. 234-239
- Juan M. Perero-Codosero, Fernando Espinoza-Cuadros, Javier Antón-Martín, Miguel Antonio Barbero-Álvarez, Luis A. Hernández Gómez:
Modeling Obstructive Sleep Apnea Voices Using Deep Neural Network Embeddings and Domain-Adversarial Training. 240-250
- Ruby Melody Simply, Eliran Dafna, Yaniv Zigel:
Diagnosis of Obstructive Sleep Apnea Using Speech Signals From Awake Subjects. 251-260
- Anna Pompili, Alberto Abad, David Martins de Matos, Isabel Pavão Martins:
Pragmatic Aspects of Discourse Production for the Automatic Identification of Alzheimer's Disease. 261-271
- Fasih Haider, Sofia de la Fuente, Saturnino Luz:
An Assessment of Paralinguistic Acoustic Features for Detection of Alzheimer's Dementia in Spontaneous Speech. 272-281
- Rohit Voleti, Julie M. Liss, Visar Berisha:
A Review of Automated Speech and Language Features for Assessment of Cognitive and Thought Disorders. 282-298
- Yun-Shao Lin, Susan Shur-Fen Gau, Chi-Chun Lee:
A Multimodal Interlocutor-Modulated Attentional BLSTM for Classifying Autism Subgroups During Clinical Interviews. 299-311
- Andrew D. Back, Daniel Angus, Janet Wiles:
Transitive Entropy - A Rank Ordered Approach for Natural Sequences. 312-321
- Chitralekha Bhat, Helmer Strik:
Automatic Assessment of Sentence-Level Dysarthria Intelligibility Using BLSTM. 322-330
- Ying Qin, Tan Lee, Anthony Pak-Hin Kong:
Automatic Assessment of Speech Impairment in Cantonese-Speaking People with Aphasia. 331-345
- T. A. Mariya Celin, T. Nagarajan, P. Vijayalakshmi:
Data Augmentation Using Virtual Microphone Array Synthesis and Multi-Resolution Feature Extraction for Isolated Word Dysarthric Speech Recognition. 346-354
- Julián Villegas, Konstantin Markov, Jeremy Perkins, Seunghun J. Lee:
Prediction of Creaky Speech by Recurrent Neural Networks Using Psychoacoustic Roughness. 355-366
- Sudarsana Reddy Kadiri, Paavo Alku:
Analysis and Detection of Pathological Voice Using Glottal Source Features. 367-379
- László Czap:
Automated Speech Production Assessment of Hard of Hearing Children. 380-389
- H. M. Chandrashekar, Veena Karjigi, Narayan Sreedevi:
Spectro-Temporal Representation of Speech for Intelligibility Assessment of Dysarthria. 390-399
- Mostafa Ali Shahin, Usman Zafar, Beena Ahmed:
The Automatic Detection of Speech Disorders in Children: Challenges, Opportunities, and Preliminary Results. 400-412
- Julián D. Arias-Londoño, Jorge Andrés Gómez García, Juan Ignacio Godino-Llorente:
Multimodal and Multi-Output Deep Learning Architectures for the Automatic Assessment of Voice Quality Using the GRB Scale. 413-422
- Ziping Zhao, Zhongtian Bao, Zixing Zhang, Jun Deng, Nicholas Cummins, Haishuai Wang, Jianhua Tao, Björn W. Schuller:
Automatic Assessment of Depression From Speech via a Hierarchical Attention Transfer Network and Attention Autoencoders. 423-434
- Zhaocheng Huang, Julien Epps, Dale Joachim, Vidhyasaharan Sethu:
Natural Language Processing Methods for Acoustic and Landmark Event-Based Features in Speech-Based Depression Detection. 435-448
- Jon Z. Lin, Víctor M. Espinoza, Katherine L. Marks, Matías Zañartu, Daryush D. Mehta:
Improved Subglottal Pressure Estimation From Neck-Surface Vibration in Healthy Speakers Producing Non-Modal Phonation. 449-460
- Hirak Dasgupta, Prem C. Pandey, K. S. Nataraj:
Detection Using Hilbert Envelope for Glottal Excitation Enhancement and Maximum-Sum Subarray for Epoch Marking. 461-471
- Meixu Chen, Yize Jin, Todd Goodall, Xiangxu Yu, Alan Conrad Bovik:
Corrections to "Study of 3D Virtual Reality Picture Quality". 472
Volume 14, Number 3, March 2020
- Xiaodong He, Li Deng, Richard Rose, Minlie Huang, Isabel Trancoso, Chao Zhang:
Introduction to the Special Issue on Deep Learning for Multi-Modal Intelligence Across Speech, Language, Vision, and Heterogeneous Signals. 474-477
- Chao Zhang, Zichao Yang, Xiaodong He, Li Deng:
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications. 478-493
- Difei Gao, Ruiping Wang, Shiguang Shan, Xilin Chen:
Learning to Recognize Visual Concepts for Visual Question Answering With Structural Label Space. 494-505
- Wenjia Xu, Jiuniu Wang, Yang Wang, Guangluan Xu, Daoyu Lin, Wei Dai, Yirong Wu:
Where is the Model Looking At? - Concentrate and Explain the Network Attention. 506-516
- Jiguo Li, Xinfeng Zhang, Chuanmin Jia, Jizheng Xu, Li Zhang, Yue Wang, Siwei Ma, Wen Gao:
Direct Speech-to-Image Translation. 517-529
- Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Lianwu Chen, Yuexian Zou, Dong Yu:
Multi-Modal Multi-Channel Target Speech Separation. 530-541
- Ke Tan, Yong Xu, Shi-Xiong Zhang, Meng Yu, Dong Yu:
Audio-Visual Speech Separation and Dereverberation With a Two-Stage Multimodal Network. 542-553
- Lingyu Zhang, Richard J. Radke:
A Multi-Stream Recurrent Neural Network for Social Role Detection in Multiparty Interactions. 554-567
- Soo-Whan Chung, Joon Son Chung, Hong-Goo Kang:
Perfect Match: Self-Supervised Embeddings for Cross-Modal Retrieval. 568-576
- Lucia Specia, Loïc Barrault, Ozan Caglayan, Amanda Cardoso Duarte, Desmond Elliott, Spandana Gella, Nils Holzenberger, Chiraag Lala, Sun Jae Lee, Jindrich Libovický, Pranava Madhyastha, Florian Metze, Karl Mulligan, Alissa Ostapenko, Shruti Palaskar, Ramon Sanabria, Josiah Wang, Raman Arora:
Grounded Sequence to Sequence Transduction. 577-591
- Malu Zhang, Xiaoling Luo, Yi Chen, Jibin Wu, Ammar Belatreche, Zihan Pan, Hong Qu, Haizhou Li:
An Efficient Threshold-Driven Aggregate-Label Learning Algorithm for Multimodal Information Processing. 592-602
Volume 14, Number 4, May 2020
- Lixin Fan, Diana Marculescu, Werner Bailer, Yurong Chen:
Editorial: Special Issue on Compact Deep Neural Networks With Industrial Applications. 605-608
- Dimitrios Stamoulis, Ruizhou Ding, Di Wang, Dimitrios Lymberopoulos, Bodhi Priyantha, Jie Liu, Diana Marculescu:
Single-Path Mobile AutoML: Efficient ConvNet Design and NAS Hyperparameter Optimization. 609-622
- Yue Wang, Jianghao Shen, Ting-Kuei Hu, Pengfei Xu, Tan M. Nguyen, Richard G. Baraniuk, Zhangyang Wang, Yingyan Lin:
Dual Dynamic Inference: Enabling More Efficient, Adaptive, and Controllable Deep Inference. 623-633
- Reiya Kawamoto, Masakazu Taichi, Masaya Kabuto, Daisuke Watanabe, Shintaro Izumi, Masahiko Yoshimoto, Hiroshi Kawaguchi, Go Matsukawa, Toshio Goto, Motoshi Kojima:
A 1.15-TOPS 6.57-TOPS/W Neural Network Processor for Multi-Scale Object Detection With Reduced Convolutional Operations. 634-645
- Sayan Faraz, Idir Mellal, Milad Lankarany:
Impact of Synaptic Strength on Propagation of Asynchronous Spikes in Biologically Realistic Feed-Forward Neural Network. 646-653
- Gianmarco Cerutti, Rahul Prasad, Alessio Brutti, Elisabetta Farella:
Compact Recurrent Neural Networks for Acoustic Event Detection on Low-Energy Low-Complexity Platforms. 654-664
- Heng Zhao, Kim-Hui Yap, Alex ChiChung Kot, Lingyu Duan:
JDNet: A Joint-Learning Distilled Network for Mobile Visual Food Recognition. 665-675
- Diaa Badawi, Hongyi Pan, Sinan Cem Cetin, A. Enis Çetin:
Computationally Efficient Spatio-Temporal Dynamic Texture Recognition for Volatile Organic Compound (VOC) Leakage Detection in Industrial Plants. 676-687
- Jun Jia, Guangtao Zhai, Ping Ren, Jiahe Zhang, Zhongpai Gao, Xiongkuo Min, Xiaokang Yang:
Tiny-BDN: An Efficient and Compact Barcode Detection Network. 688-699
- Simon Wiedemann, Heiner Kirchhoffer, Stefan Matlage, Paul Haase, Arturo Marbán, Talmaj Marinc, David Neumann, Tung Nguyen, Heiko Schwarz, Thomas Wiegand, Detlev Marpe, Wojciech Samek:
DeepCABAC: A Universal Compression Algorithm for Deep Neural Networks. 700-714
- Yoojin Choi, Mostafa El-Khamy, Jungwon Lee:
Universal Deep Neural Network Compression. 715-726
- Jiaxing Wang, Haoli Bai, Jiaxiang Wu, Jian Cheng:
Bayesian Automatic Model Compression. 727-736
- Akshay Jain, Pulkit Goel, Shivam Aggarwal, Alexander Fell, Saket Anand:
Symmetric $k$-Means for Deep Neural Network Compression and Hardware Acceleration on FPGAs. 737-749
- Mojan Javaheripi, Mohammad Samragh, Tara Javidi, Farinaz Koushanfar:
AdaNS: Adaptive Non-Uniform Sampling for Automated Design of Compact DNNs. 750-764
- Jian Zhang, Chen Zhao, Wen Gao:
Optimization-Inspired Compact Deep Compressive Sensing. 765-774
- Huan Wang, Xinyi Hu, Qiming Zhang, Yuehai Wang, Lu Yu, Haoji Hu:
Structured Pruning for Efficient Convolutional Neural Networks via Incremental Regularization. 775-788
- Dor Livne, Kobi Cohen:
PoPS: Policy Pruning and Shrinking for Deep Reinforcement Learning. 789-801
- Chinthaka Gamanayake, Lahiru Jayasinghe, Benny Kai Kiat Ng, Chau Yuen:
Cluster Pruning: An Efficient Filter Pruning Method for Edge AI Vision Applications. 802-816
- Tao Huang, Weisheng Dong, Jinshan Liu, Fangfang Wu, Guangming Shi, Xin Li:
Accelerating Convolutional Neural Network via Structured Gaussian Scale Mixture Models: A Joint Grouping and Pruning Approach. 817-827
- Artur Jordão, Maiko Lie, William Robson Schwartz:
Discriminative Layer Pruning for Convolutional Neural Networks. 828-837
- Pravendra Singh, Vinay Kumar Verma, Piyush Rai, Vinay P. Namboodiri:
Acceleration of Deep Convolutional Neural Networks Using Adaptive Filter Pruning. 838-847
- Jun Chen, Yong Liu, Hao Zhang, Shengnan Hou, Jian Yang:
Propagating Asymptotic-Estimated Gradients for Low Bitwidth Quantized Neural Networks. 848-859
- Fabien Cardinaux, Stefan Uhlich, Kazuki Yoshiyama, Javier Alonso García, Lukas Mauch, Stephen Tiedemann, Thomas Kemp, Akira Nakamura:
Iteratively Training Look-Up Tables for Network Quantization. 860-870
- Zhaohui H. Sun:
Binary Outer Product Expansion of Convolutional Kernels. 871-883
- Chunlei Liu, Wenrui Ding, Yuan Hu, Xin Xia, Baochang Zhang, Jianzhuang Liu, David S. Doermann:
Circulant Binary Convolutional Networks for Object Recognition. 884-893
- Jonathan Ephrath, Moshe Eliasof, Lars Ruthotto, Eldad Haber, Eran Treister:
LeanConvNets: Low-Cost Yet Effective Convolutional Neural Networks. 894-904
Volume 14, Number 5, August 2020
- Edward J. Delp, Jiwu Huang, Nasir D. Memon, Anderson Rocha, Matt Turek, Luisa Verdoliva:
Editorial: Media Authentication and Forensics - New Solutions and Research Opportunities. 906-909
- Luisa Verdoliva:
Media Forensics and DeepFakes: An Overview. 910-932
- Haoliang Li, Shiqi Wang, Peisong He, Anderson Rocha:
Face Anti-Spoofing With Deep Neural Network Distillation. 933-946
- Pengpeng Yang, Daniele Baracchi, Massimo Iuliani, Dasara Shullani, Rongrong Ni, Yao Zhao, Alessandro Piva:
Efficient Video Integrity Analysis Through Container Characterization. 947-954
- Xin Liao, Kaide Li, Xinshan Zhu, K. J. Ray Liu:
Robust Detection of Image Operator Chain With Two-Stream Convolutional Neural Network. 955-968
- Zhongjie Mi, Xinghao Jiang, Tanfeng Sun, Ke Xu:
GAN-Generated Image Detection With Self-Attention Mechanism Against GAN Generator Defect. 969-981
- Khalid Mahmood Malik, Ali Javed, Hafiz Malik, Aun Irtaza:
A Light-Weight Replay Detection Framework For Voice Controlled IoT Devices. 982-996
- Yifang Chen, Zheng Wang, Z. Jane Wang, Xiangui Kang:
Automated Design of Neural Network Architectures With Reinforcement Learning for Detection of Global Manipulations. 997-1011
- Xu Zhang, Zhaohui H. Sun, Svebor Karaman, Shih-Fu Chang:
Discovering Image Manipulation History by Pairwise Relation and Forensics Tools. 1012-1023
- Akash Chintha, Bao Thai, Saniat Javid Sohrawardi, Kartavya Bhatt, Andrea Hickerson, Matthew Wright, Raymond W. Ptucha:
Recurrent Convolutional Structures for Audio Spoof and Video Deepfake Detection. 1024-1037
- João C. Neves, Ruben Tolosana, Rubén Vera-Rodríguez, Vasco Lopes, Hugo Proença, Julian Fiérrez:
GANprintR: Improved Fakes and Evaluation of the State of the Art in Face Manipulation Detection. 1038-1048
- Owen Mayer, Matthew C. Stamm:
Exposing Fake Images With Forensic Similarity Graphs. 1049-1064
- Hirak Dasgupta, Prem C. Pandey, K. S. Nataraj:
Corrections to "Detection Using Hilbert Envelope for Glottal Excitation Enhancement and Maximum-Sum Subarray for Epoch Marking". 1065
Volume 14, Number 6, October 2020
- Vishal Monga, Scott T. Acton, Abd-Krim Seghouane, Arrate Muñoz-Barrutia, Jong Chul Ye:
Editorial: Introduction to the Issue on Domain Enriched Learning for Medical Imaging. 1068-1071
- Salman Ul Hassan Dar, Mahmut Yurt, Mohammad Shahdloo, Muhammed Emrullah Ildiz, Berk Tinaz, Tolga Çukur:
Prior-Guided Image Reconstruction for Accelerated Multi-Contrast MRI via Generative Adversarial Networks. 1072-1087
- Jiaming Liu, Yu Sun, Cihat Eldeniz, Weijie Gan, Hongyu An, Ulugbek S. Kamilov:
RARE: Image Reconstruction Using Deep Priors Learned Without Groundtruth. 1088-1099
- Qiyang Zhang, Juan Gao, Yongshuai Ge, Na Zhang, Yongfeng Yang, Xin Liu, Hairong Zheng, Dong Liang, Zhanli Hu:
PET Image Reconstruction Using a Cascading Back-Projection Neural Network. 1100-1111
- Kwanyoung Kim, Shakarim Soltanayev, Se Young Chun:
Unsupervised Training of Denoisers for Low-Dose CT Reconstruction Without Full-Dose Ground Truth. 1112-1125
- Roberto Souza, Youssef Beauferris, Wallace Loos, Robert Marc Lebel, Richard Frayne:
Enhanced Deep-Learning-Based Magnetic Resonance Image Reconstruction by Leveraging Prior Subject-Specific Brain Imaging: Proof-of-Concept Using a Cohort of Presumed Normal Subjects. 1126-1136
- Kihwan Choi, Joon Seok Lim, Sungwon Kim:
StatNet: Statistical Image Restoration for Low-Dose CT using Deep Learning. 1137-1150
- Hemant Kumar Aggarwal, Mathews Jacob:
J-MoDL: Joint Model-Based Deep Learning for Optimized Sampling and Reconstruction. 1151-1162
- Zihui Wu, Yu Sun, Alex Matlock, Jiaming Liu, Lei Tian, Ulugbek S. Kamilov:
SIMBA: Scalable Inversion in Optical Tomography Using Deep Denoising Priors. 1163-1175
- Xiaoming Liu, Aihui Yu, Xiangkai Wei, Zhifang Pan, Jinshan Tang:
Multimodal MR Image Synthesis Using Gradient Prior and Adversarial Learning. 1176-1188
- Rosana El Jurdi, Caroline Petitjean, Paul Honeine, Fahed Abdallah:
BB-UNet: U-Net With Bounding Box Prior. 1189-1198
- Jialin Peng, Jiajin Yi, Zhimin Yuan:
Unsupervised Mitochondria Segmentation in EM Images via Domain Adaptive Multi-Task Learning. 1199-1209
- Max L. Olender, Lambros S. Athanasiou, Lampros K. Michalis, Dimitris I. Fotiadis, Elazer R. Edelman:
A Domain Enriched Deep Learning Approach to Classify Atherosclerosis Using Intravascular Ultrasound Imaging. 1210-1220
- Jeya Maria Jose Valanarasu, Rajeev Yasarla, Puyang Wang, Ilker Hacihaliloglu, Vishal M. Patel:
Learning to Segment Brain Anatomy From 2D Ultrasound With Less Data. 1221-1234
- Georgios Simantiris, Georgios Tziritas:
Cardiac MRI Segmentation With a Dilated CNN Incorporating Domain-Specific Constraints. 1235-1243
- Nora Ouzir, Esa Ollila, Sergiy A. Vorobyov:
Data-Adaptive Similarity Measures for B-mode Ultrasound Images Using Robust Noise Models. 1244-1254
- Suchita Bhinge, Qunfang Long, Vince D. Calhoun, Tülay Adali:
Adaptive Constrained Independent Vector Analysis: An Effective Solution for Analysis of Large-Scale Medical Imaging Data. 1255-1264
- Rui Jin, Krishna Dontaraju, Seung-Jun Kim, Mohammad Abu Baker Siddique Akhonda, Tülay Adali:
Dictionary Learning-Based fMRI Data Analysis for Capturing Common and Individual Neural Activation Maps. 1265-1279
- Seyed Amir Hossein Hosseini, Burhaneddin Yaman, Steen Moeller, Mingyi Hong, Mehmet Akçakaya:
Dense Recurrent Neural Networks for Accelerated MRI: History-Cognizant Unrolling of Optimization Algorithms. 1280-1291