


ICASSP 2021: Toronto, ON, Canada
- IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021. IEEE 2021, ISBN 978-1-7281-7606-2
- Yi Luo, Zhuo Chen, Cong Han, Chenda Li, Tianyan Zhou, Nima Mesgarani:
Rethinking The Separation Layers In Speech Separation Networks. 1-5 - Xiaoyu Liu, Jordi Pons:
On Permutation Invariant Training For Speech Source Separation. 6-10 - Zhong-Qiu Wang, DeLiang Wang:
Count And Separate: Incorporating Speaker Counting For Continuous Speaker Separation. 11-15 - Yi Luo, Cong Han, Nima Mesgarani:
Ultra-Lightweight Speech Separation Via Group Communication. 16-20 - Cem Subakan, Mirco Ravanelli, Samuele Cornell, Mirko Bronzi, Jianyuan Zhong:
Attention Is All You Need In Speech Separation. 21-25 - Aidan O. T. Hogg, Christine Evers, Patrick A. Naylor:
Multichannel Overlapping Speaker Segmentation Using Multiple Hypothesis Tracking Of Acoustic And Spatial Features. 26-30 - Zhepei Wang, Ritwik Giri, Umut Isik, Jean-Marc Valin, Arvindh Krishnaswamy:
Semi-Supervised Singing Voice Separation With Noisy Self-Training. 31-35 - Giorgia Cantisani, Slim Essid, Gaël Richard:
Neuro-Steered Music Source Separation With EEG-Based Auditory Attention Decoding And Contrastive-NMF. 36-40 - Yixuan Zhang, Yuzhou Liu, DeLiang Wang:
Complex Ratio Masking For Singing Voice Separation. 41-45 - Yun-Ning Hung, Gordon Wichern, Jonathan Le Roux:
Transcription Is All You Need: Learning To Separate Musical Mixtures With Score As Supervision. 46-50 - Ryosuke Sawata, Stefan Uhlich, Shusuke Takahashi, Yuki Mitsufuji:
All For One And One For All: Improving Music Separation By Bridging Networks. 51-55 - Yongwei Gao, Xingjian Du, Bilei Zhu, Xiaoheng Sun, Wei Li, Zejun Ma:
An HRNet-BLSTM Model With Two-Stage Training For Singing Melody Extraction. 56-60 - Satwinder Singh, Ruili Wang, Yuanhang Qiu:
DeepF0: End-To-End Fundamental Frequency Estimation for Music and Speech Signals. 61-65 - Marco A. Martínez Ramírez, Oliver Wang, Paris Smaragdis, Nicholas J. Bryan:
Differentiable Signal Processing With Black-Box Audio Effects. 66-70 - Christian J. Steinmetz, Jordi Pons, Santiago Pascual, Joan Serrà:
Automatic Multitrack Mixing With A Differentiable Mixing Console Of Neural Audio Effects. 71-75 - Jiatong Shi, Shuai Guo, Nan Huo, Yuekai Zhang, Qin Jin:
Sequence-To-Sequence Singing Voice Synthesis With Perceptual Entropy Loss. 76-80 - Junghyun Koo, Seungryeol Paik, Kyogu Lee:
Reverb Conversion Of Mixed Vocal Tracks Using An End-To-End Convolutional Deep Neural Network. 81-85 - Bo-Wei Tseng, Yih-Liang Shen, Tai-Shih Chi:
Extending Music Based On Emotion And Tonality Via Generative Adversarial Network. 86-90 - William Vickers, Ben Milner, Robert Lee:
Improving The Robustness Of Right Whale Detection In Noisy Conditions Using Denoising Autoencoders And Augmented Training. 91-95 - Ondrej Cífka, Alexey Ozerov, Umut Simsekli, Gaël Richard:
Self-Supervised VQ-VAE for One-Shot Music Style Transfer. 96-100 - Hongwei Song, Jiqing Han, Shiwen Deng, Zhihao Du:
Capturing Temporal Dependencies Through Future Prediction for CNN-Based Audio Classifiers. 101-105 - T. J. Tsai:
Segmental DTW: A Parallelizable Alternative to Dynamic Time Warping. 106-110 - Keitaro Tanaka, Ryo Nishikimi, Yoshiaki Bando, Kazuyoshi Yoshii, Shigeo Morishima:
Pitch-Timbre Disentanglement Of Musical Instrument Sounds Based On VAE-Based Metric Learning. 111-115 - Robert Ayrapetian, Philip Hilmes, Mohamed Mansour, Trausti Kristjansson, Carlo Murgia:
Asynchronous Acoustic Echo Cancellation Over Wireless Channels. 116-120 - Mhd Modar Halimeh, Thomas Haubner, Annika Briegleb, Alexander Schmidt, Walter Kellermann:
Combining Adaptive Filtering And Complex-Valued Deep Postfiltering For Acoustic Echo Cancellation. 121-125 - Amir Ivry, Israel Cohen, Baruch Berdugo:
Deep Residual Echo Suppression With A Tunable Tradeoff Between Signal Distortion And Echo Suppression. 126-130 - Saeed Bagheri, Daniele Giacobello:
Robust STFT Domain Multi-Channel Acoustic Echo Cancellation with Adaptive Decorrelation of the Reference Signals. 131-135 - Meng Guo:
A Method for Determining Periodically Time-Varying Bias and Its Applications in Acoustic Feedback Cancellation. 136-140 - Ziteng Wang, Yueyue Na, Zhang Liu, Biao Tian, Qiang Fu:
Weighted Recursive Least Square Filter and Neural Network Based Residual Echo Suppression for the AEC-Challenge. 141-145 - Renhua Peng, Linjuan Cheng, Chengshi Zheng, Xiaodong Li:
ICASSP 2021 Acoustic Echo Cancellation Challenge: Integrated Adaptive Echo Cancellation with Time Alignment and Deep Learning-Based Residual Echo Plus Noise Suppression. 146-150 - Kusha Sridhar, Ross Cutler, Ando Saabas, Tanel Pärnamaa, Markus Loide, Hannes Gamper, Sebastian Braun, Robert Aichner, Sriram Srinivasan:
ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets, Testing Framework, and Results. 151-155 - Jan Franzen, Ernst Seidel, Tim Fingscheidt:
AEC in A Netshell: on Target and Topology Choices for FCRN Acoustic Echo Cancellation. 156-160 - Jesper Brunnström, Shoichi Koyama:
Kernel-Interpolation-Based Filtered-X Least Mean Square for Spatial Active Noise Control In Time Domain. 161-165 - Jian Xu, Kean Chen, Yunhe Li:
Wave-Domain Optimization of Secondary Source Placement Free From Information of Error Sensor Positions. 166-170 - Woo-Sung Choi, Minseok Kim, Jaehwa Chung, Soonyoung Jung:
LaSAFT: Latent Source Attentive Frequency Transformation For Conditioned Source Separation. 171-175 - Robin Scheibler, Masahito Togami:
Surrogate Source Model Learning for Determined Source Separation. 176-180 - Han Li, Kean Chen, Bernhard U. Seeber:
Auditory Filterbanks Benefit Universal Sound Source Separation. 181-185 - Scott Wisdom, Hakan Erdogan, Daniel P. W. Ellis, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Justin Salamon, Prem Seetharaman, John R. Hershey:
What's all the Fuss about Free Universal Sound Separation Data? 186-190 - Shota Inoue, Hirokazu Kameoka, Li Li, Shoji Makino:
SepNet: A Deep Separation Matrix Prediction Network for Multichannel Audio Source Separation. 191-195 - Pranay Manocha, Zeyu Jin, Richard Zhang, Adam Finkelstein:
CDPAM: Contrastive Learning for Perceptual Audio Similarity. 196-200 - Soichiro Oyabu, Daichi Kitamura, Kohei Yatabe:
Linear Multichannel Blind Source Separation based on Time-Frequency Mask Obtained by Harmonic/Percussive Sound Separation. 201-205 - Daniel Arteaga, Jordi Pons:
Multichannel-based Learning for Audio Object Extraction. 206-210 - Ali Aroudi, Sebastian Braun:
DBnet: DOA-Driven Beamforming Network for end-to-end Reverberant Sound Source Separation. 211-215 - Taishi Nakashima, Robin Scheibler, Masahito Togami, Nobutaka Ono:
Joint Dereverberation and Separation With Iterative Source Steering. 216-220 - Ingvi Örnolfsson, Torsten Dau, Ning Ma, Tobias May:
Exploiting Non-Negative Matrix Factorization for Binaural Sound Localization in the Presence of Directional Interference. 221-225 - Jirí Málek, Jakub Janský, Tomás Kounovský, Zbynek Koldovský, Jindrich Zdánský:
Blind Extraction of Moving Audio Source in a Challenging Environment Supported by Speaker Identification Via X-Vectors. 226-230 - Ashvala Vinay, Alexander Lerch, Grace Leslie:
Mind the Beat: Detecting Audio Onsets from EEG Recordings of Music Listening. 231-235 - Mojtaba Heydari, Zhiyao Duan:
Don't Look Back: An Online Beat Tracking Method Using RNN and Enhanced Particle Filtering. 236-240 - Xingjian Du, Bilei Zhu, Qiuqiang Kong, Zejun Ma:
Singing Melody Extraction from Polyphonic Music based on Spectral Correlation Modeling. 241-245 - I-Chieh Wei, Chih-Wei Wu, Li Su:
Improving Automatic Drum Transcription Using Large-Scale Audio-to-MIDI Aligned Data. 246-250 - Shuai Yu, Xiaoheng Sun, Yi Yu, Wei Li:
Frequency-Temporal Attention Network for Singing Melody Extraction. 251-255 - Yuki Hiramatsu, Go Shibata, Ryo Nishikimi, Eita Nakamura, Kazuyoshi Yoshii:
Statistical Correction of Transcribed Melody Notes Based on Probabilistic Integration of a Music Language Model and a Transcription Error Model. 256-260 - Sebastian Rosenzweig, Frank Scherbaum, Meinard Müller:
Reliability Assessment of Singing Voice F0-Estimates Using Multiple Algorithms. 261-265 - Sakya Basak, Shrutina Agarwal, Sriram Ganapathy, Naoya Takahashi:
End-to-End Lyrics Recognition with Voice to Singing Style Transfer. 266-270 - Lenny Renault, Andrea Vaglio, Romain Hennequin:
Singing Language Identification Using a Deep Phonotactic Approach. 271-275 - Jun-You Wang, Jyh-Shing Roger Jang:
On the Preparation and Validation of a Large-Scale Dataset of Singing Transcription. 276-280 - Lele Liu, Veronica Morfi, Emmanouil Benetos:
Joint Multi-Pitch Detection and Score Transcription for Polyphonic Piano Music. 281-285 - Yuan Wang, Shigeki Tanaka, Keita Yokoyama, Hsin-Tai Wu, Yi Fang:
Karaoke Key Recommendation Via Personalized Competence-Based Rating Prediction. 286-290 - Afagh Farhadi, Skyler G. Jennings, Elizabeth A. Strickland, Laurel H. Carney:
A Closed-Loop Gain-Control Feedback Model for The Medial Efferent System of The Descending Auditory Pathway. 291-295 - Zehai Tu, Ning Ma, Jon Barker:
DHASP: Differentiable Hearing Aid Speech Processing. 296-300 - Anil M. Nagathil, Florian Göbel, Alexandru Nelus, Ian C. Bruce:
Computationally Efficient DNN-Based Approximation of an Auditory Model for Applications in Speech Processing. 301-305 - Hideki Kawahara, Kohei Yatabe:
Cascaded All-Pass Filters with Randomized Center Frequencies and Phase Polarity for Acoustic and Speech Measurement and Data Augmentation. 306-310 - Danni Ma, Neville Ryant, Mark Y. Liberman:
Probing Acoustic Representations for Phonetic Properties. 311-315 - Zhuohuang Zhang, Piyush Vyas, Xuan Dong, Donald S. Williamson:
An End-To-End Non-Intrusive Model for Subjective and Objective Real-World Speech Assessment Using a Multi-Task Framework. 316-320 - Yu Wang, Nicholas J. Bryan, Mark Cartwright, Juan Pablo Bello, Justin Salamon:
Few-Shot Continual Learning for Audio Classification. 321-325 - Huang Xie, Okko Räsänen, Tuomas Virtanen:
Zero-Shot Audio Classification with Factored Linear and Nonlinear Acoustic-Semantic Projections. 326-330 - Hsin-Ping Huang, Krishna C. Puvvada, Ming Sun, Chao Wang:
Unsupervised and Semi-Supervised Few-Shot Acoustic Event Classification. 331-335 - Kota Dohi, Takashi Endo, Harsh Purohit, Ryo Tanabe, Yohei Kawaguchi:
Flow-Based Self-Supervised Density Estimation for Anomalous Sound Detection. 336-340 - Sangwook Park, Ashwin Bellur, David K. Han, Mounya Elhilali:
Self-Training for Sound Event Detection in Audio Mixtures. 341-345 - Shubhr Singh, Helen L. Bear, Emmanouil Benetos:
Prototypical Networks for Domain Adaptation in Acoustic Scene Classification. 346-350 - Helin Wang, Yuexian Zou, Wenwu Wang:
A Global-Local Attention Framework for Weakly Labelled Audio Tagging. 351-355 - Xu Zheng, Yan Song, Ian McLoughlin, Lin Liu, Li-Rong Dai:
An Improved Mean Teacher Based Method for Large Scale Weakly Labeled Semi-Supervised Sound Event Detection. 356-360 - Léo Cances, Thomas Pellegrini:
Comparison of Deep Co-Training and Mean-Teacher Approaches for Semi-Supervised Audio Tagging. 361-365 - Shawn Hershey, Daniel P. W. Ellis, Eduardo Fonseca, Aren Jansen, Caroline Liu, R. Channing Moore, Manoj Plakal:
The Benefit of Temporally-Strong Labels in Audio Event Classification. 366-370 - Eduardo Fonseca, Diego Ortego, Kevin McGuinness, Noel E. O'Connor, Xavier Serra:
Unsupervised Contrastive Learning of Sound Event Representations. 371-375 - Chih-Yuan Koh, You-Siang Chen, Yi-Wen Liu, Mingsian R. Bai:
Sound Event Detection by Consistency Training and Pseudo-Labeling With Feature-Pyramid Convolutional Recurrent Neural Networks. 376-380 - Joan Serrà, Jordi Pons, Santiago Pascual:
SESQA: Semi-Supervised Learning for Speech Quality Assessment. 381-385 - Helmer Nylén, Saikat Chatterjee, Sten Ternström:
Detecting Signal Corruptions in Voice Recordings For Speech Therapy. 386-390 - Yichong Leng, Xu Tan, Sheng Zhao, Frank K. Soong, Xiang-Yang Li, Tao Qin:
MBNET: MOS Prediction for Synthesized Speech with Mean-Bias Network. 391-395 - Jana Roßbach, Saskia Röttges, Christopher F. Hauth, Thomas Brand, Bernd T. Meyer:
Non-Intrusive Binaural Prediction of Speech Intelligibility Based on Phoneme Classification. 396-400 - Wissam A. Jassim, Jan Skoglund, Michael Chinen, Andrew Hines:
Warp-Q: Quality Prediction for Generative Neural Speech Codecs. 401-405 - Ross Cutler, Babak Nadari, Markus Loide, Sten Sootla, Ando Saabas:
Crowdsourcing Approach for Subjective Evaluation of Echo Impairment. 406-410 - Shoichi Koyama, Takashi Amakasu, Natsuki Ueno, Hiroshi Saruwatari:
Amplitude Matching: Majorization-Minimization Algorithm for Sound Field Control Only with Amplitude Constraint. 411-415 - Huanyu Zuo, Thushara D. Abhayapala, Prasanga N. Samarasinghe:
3D Multizone Soundfield Reproduction in a Reverberant Environment Using Intensity Matching Method. 416-420 - Jens Ahrens, Hannes Helmholz, David Lou Alon, Sebastià Vicenc Amengual Garí:
The Far-Field Equatorial Array for Binaural Rendering. 421-425 - Fabrice Katzberg, Marco Maaß, Alfred Mertins:
Spherical Harmonic Representation for Dynamic Sound-Field Measurements. 426-430 - Adrian Herzog, Daniele Mirabilii, Emanuël A. P. Habets:
Direction Preserving Wind Noise Reduction Of B-Format Signals. 431-435 - Robin Scheibler, Masahito Togami:
Refinement of Direction of Arrival Estimators by Majorization-Minimization Optimization on the Array Manifold. 436-440 - Yaxuan Zhou, Hao Jiang, Vamsi Krishna Ithapu:
On the Predictability of HRTFs from Ear Shapes Using Deep Networks. 441-445 - Lior Arbel, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely:
Applied Methods for Sparse Sampling of Head-Related Transfer Functions. 446-450 - Mengfan Zhang, Jui-Hsien Wang, Doug L. James:
Personalized HRTF Modeling Using DNN-Augmented BEM. 451-455 - Fabian Hübner, Wolfgang Mack, Emanuël A. P. Habets:
Efficient Training Data Generation for Phase-Based DOA Estimation. 456-460 - Giovanni Bologni, Richard Heusdens, Jorge Martínez:
Acoustic Reflectors Localization from Stereo Recordings Using Neural Networks. 461-465 - Usama Saqib, Antoine Deleforge, Jesper Rindom Jensen:
Detecting Acoustic Reflectors Using A Robot's Ego-Noise. 466-470 - Ziqi Fan, Vibhav Vineet, Chenshen Lu, T. W. Wu, Kyla A. McMullen:
Prediction of Object Geometry from Acoustic Scattering Using Convolutional Neural Networks. 471-475 - Tom Shlomo, Boaz Rafaely:
Blind Amplitude Estimation of Early Room Reflections Using Alternating Least Squares. 476-480 - Thomas McKenzie, Sebastian J. Schlecht, Ville Pulkki:
Acoustic Analysis and Dataset of Transitions Between Coupled Rooms. 481-485 - Yuying Li, Yuchen Liu, Donald S. Williamson:
On Loss Functions for Deep-Learning Based T60 Estimation. 486-490 - Hideyuki Tachibana:
Towards Listening to 10 People Simultaneously: An Efficient Permutation Invariant Training of Audio Source Separation Using Sinkhorn's Algorithm. 491-495 - Andreas Brendel, Walter Kellermann:
Accelerating Auxiliary Function-Based Independent Vector Analysis. 496-500 - Beat Gfeller, Dominik Roblek, Marco Tagliasacchi:
One-Shot Conditional Audio Filtering of Arbitrary Sounds. 501-505 - Tetsuya Ueda, Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita, Shoko Araki, Shoji Makino:
Low Latency Online Blind Source Separation Based on Joint Optimization with Blind Dereverberation. 506-510 - Kouhei Sekiguchi, Yoshiaki Bando, Aditya Arie Nugraha, Mathieu Fontaine, Kazuyoshi Yoshii:
Autoregressive Fast Multichannel Nonnegative Matrix Factorization For Joint Blind Source Separation And Dereverberation. 511-515 - Paul Magron, Pierre-Hugo Vial, Thomas Oberlin, Cédric Févotte:
Phase Recovery with Bregman Divergences for Audio Source Separation. 516-520 - Naoya Takahashi, Shota Inoue, Yuki Mitsufuji:
Adversarial Attacks on Audio Source Separation. 521-525 - Mieszko Fras, Konrad Kowalczyk:
Maximum a Posteriori Estimator for Convolutive Sound Source Separation with Sub-Source Based NTF Model and the Localization Probabilistic Prior on the Mixing Matrix. 526-530 - Efthymios Tzinis, Dimitrios Bralios, Paris Smaragdis:
Unified Gradient Reweighting for Model Biasing with Applications to Source Separation. 531-535 - Andres Ferraro, Yuntae Kim, Soohyeon Lee, Biho Kim, Namjun Jo, Semi Lim, Suyon Lim, Jungtaek Jang, Sehwan Kim, Xavier Serra, Dmitry Bogdanov:
Melon Playlist Dataset: A Public Dataset for Audio-Based Playlist Generation and Music Tagging. 536-540 - Furkan Yesiler, Emilio Molina, Joan Serrà, Emilia Gómez:
Investigating the Efficacy of Music Version Retrieval Systems for Setlist Identification. 541-545 - Kevin Ji, Daniel Yang, T. J. Tsai:
Instrument Classification of Solo Sheet Music Images. 546-550 - Xingjian Du, Zhesong Yu, Bilei Zhu, Xiaoou Chen, Zejun Ma:
ByteCover: Cover Song Identification Via Multi-Loss Training. 551-555 - Ho-Hsiang Wu, Chieh-Chi Kao, Qingming Tang, Ming Sun, Brian McFee, Juan Pablo Bello, Chao Wang:
Multi-Task Self-Supervised Pre-Training for Music Classification. 556-560 - Shreyan Chowdhury, Gerhard Widmer:
Towards Explaining Expressive Qualities in Piano Recordings: Transfer of Explanatory Features Via Acoustic Domain Adaptation. 561-565 - Ju-Chiang Wang, Jordan B. L. Smith, Jitong Chen, Xuchen Song, Yuxuan Wang:
Supervised Chorus Detection for Popular Music Using Convolutional Neural Network and Multi-Task Learning. 566-570 - Ruchit Agrawal, Daniel Wolff, Simon Dixon:
Structure-Aware Audio-to-Score Alignment Using Progressively Dilated Convolutional Neural Networks. 571-575 - Juan Sebastián Gómez Cañón, Estefanía Cano, Ana Gabriela Pandrea, Perfecto Herrera, Emilia Gómez:
Language-Sensitive Music Emotion Recognition Models: are We Really There Yet? 576-580 - Paul Magron, Cédric Févotte:
Leveraging the Structure of Musical Preference in Content-Aware Music Recommendation. 581-585 - Emir Demirel, Sven Ahlbäck, Simon Dixon:
Low Resource Audio-To-Lyrics Alignment from Polyphonic Music Recordings. 586-590 - Minz Won, Sergio Oramas, Oriol Nieto, Fabien Gouyon, Xavier Serra:
Multimodal Metric Learning for Tag-Based Music Retrieval. 591-595 - Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra:
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags. 596-600 - Paulo Lopez-Meyer, Juan A. del Hoyo Ontiveros, Hong Lu, Georg Stemmer:
Efficient End-to-End Audio Embeddings Generation for Audio Classification on Target Applications. 601-605 - Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Text-to-Audio Grounding: Building Correspondence Between Captions and Sound Events. 606-610 - Huy Phan, Huy Le Nguyen, Oliver Y. Chén, Lam Dang Pham, Philipp Koch, Ian McLoughlin, Alfred Mertins:
Multi-View Audio And Music Classification. 611-615 - Juncheng B. Li, Kaixin Ma, Shuhui Qu, Po-Yao Huang, Florian Metze:
Audio-Visual Event Recognition Through the Lens of Adversary. 616-620 - Jee-weon Jung, Hye-jin Shim, Ju-ho Kim, Ha-Jin Yu:
DCASENET: An Integrated Pretrained Deep Neural Network for Detecting and Classifying Acoustic Scenes and Events. 621-625 - Shanshan Wang, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
A Curated Dataset of Urban Scenes for Audio-Visual Scene Analysis. 626-630 - Giacomo Ferroni, Nicolas Turpault, Juan Azcarreta, Francesco Tuveri, Romain Serizel, Çagdas Bilen, Sacha Krstulovic:
Improving Sound Event Detection Metrics: Insights from DCASE 2020. 631-635 - Satvik Venkatesh, David Moffat, Alexis Kirke, Gözel Shakeri, Stephen A. Brewster, Jörg Fachner, Helen Odell-Miller, Alex Street, Nicolas Farina, Sube Banerjee, Eduardo Reck Miranda:
Artificially Synthesising Data for Audio Classification and Segmentation to Improve Speech and Music Detection in Radio Broadcast. 636-640 - Weiquan Fan, Xiangmin Xu, Xiaofen Xing, Weidong Chen, Dongyan Huang:
LSSED: A Large-Scale Dataset and Benchmark for Speech Emotion Recognition. 641-645 - Turab Iqbal, Karim Helwani, Arvindh Krishnaswamy, Wenwu Wang:
Enhancing Audio Augmentation Methods with Consistency Learning. 646-650 - Thomas Pellegrini, Timothée Masquelier:
Fast Threshold Optimization for Multi-Label Audio Tagging Using Surrogate Gradient Learning. 651-655 - Sebastian Braun, Hannes Gamper, Chandan K. A. Reddy, Ivan Tashev:
Towards Efficient Models for Real-Time Deep Noise Suppression. 656-660 - Sotaro Nakaoka, Li Li, Shota Inoue, Shoji Makino:
Teacher-Student Learning for Low-Latency Online Speech Enhancement Using Wave-U-Net. 661-665 - Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Learning Disentangled Feature Representations for Speech Enhancement Via Adversarial Training. 666-670 - Koen Oostermeijer, Jun Du, Qing Wang, Chin-Hui Lee:
Speech Enhancement Autoencoder with Hierarchical Latent Structure. 671-675 - Huajian Fang, Guillaume Carbajal, Stefan Wermter, Timo Gerkmann:
Variational Autoencoder for Speech Enhancement with a Noise-Aware Encoder. 676-680 - Guillaume Carbajal, Julius Richter, Timo Gerkmann:
Guided Variational Autoencoder for Speech Enhancement with a Supervised Classifier. 681-685 - Satoru Emura, Noboru Harada:
An Extension of Sparse Audio Declipper to Multiple Measurement Vectors. 686-690 - Yunpeng Li, Marco Tagliasacchi, Oleg Rybakov, Victor Ungureanu, Dominik Roblek:
Real-Time Speech Frequency Bandwidth Extension. 691-695 - Jiaqi Su, Yunyun Wang, Adam Finkelstein, Zeyu Jin:
Bandwidth Extension is All You Need. 696-700 - Pavel Záviska, Pavel Rajmic, Ondrej Mokrý:
Audio Dequantization Using (Co)Sparse (Non)Convex Methods. 701-705 - Haici Yang, Kai Zhen, Seungkwon Beack, Minje Kim:
Source-Aware Neural Speech Coding for Noisy Speech Compression. 706-710 - Jonah Casebeer, Vinjai Vale, Umut Isik, Jean-Marc Valin, Ritwik Giri, Arvindh Krishnaswamy:
Enhancing into the Codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders. 711-715 - Shlomo E. Chazan, Jacob Goldberger, Sharon Gannot:
Speech Enhancement with Mixture of Deep Experts with Clean Clustering Pre-Training. 716-720 - Yang Xiang, Liming Shi, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen:
A Novel NMF-HMM Speech Enhancement Algorithm Based on Poisson Mixture Model. 721-725 - Yajing Liu, Xiulian Peng, Zhiwei Xiong, Yan Lu:
Phoneme-Based Distribution Regularization for Speech Enhancement. 726-730 - Carol Chermaz, Dario Leuchtmann, Simon Tanner, Roger Wattenhofer:
Compressed Representation of Cepstral Coefficients via Recurrent Neural Networks for Informed Speech Enhancement. 731-735 - An Zhao, Krishna Subramani, Paris Smaragdis:
Optimizing Short-Time Fourier Transform Parameters via Gradient Descent. 736-740 - Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks. 741-745 - Xudong Zhao, Gongping Huang, Jacob Benesty, Jingdong Chen, Israel Cohen:
On the Design of Square Differential Microphone Arrays with a Multistage Structure. 746-750 - Federico Borra, Alberto Bernardini, Ivan Bertuletti, Fabio Antonacci, Augusto Sarti:
Arrays of First-Order Steerable Differential Microphones. 751-755 - Xi Chen, Chao Pan, Jingdong Chen, Jacob Benesty:
Planar Array Geometry Optimization for Region Sound Acquisition. 756-760 - Alexandru Nelus, Rene Glitza, Rainer Martin:
Estimation of Microphone Clusters in Acoustic Sensor Networks Using Unsupervised Federated Learning. 761-765 - Gabriel F. Miller, Andreas Brendel, Walter Kellermann, Sharon Gannot:
Misalignment Recognition in Acoustic Sensor Networks Using a Semi-Supervised Source Estimation Method and Markov Random Fields. 766-770 - Yukoh Wakabayashi, Kouei Yamaoka, Nobutaka Ono:
Rotation-Robust Beamforming Based on Sound Field Interpolation with Regularly Circular Microphone Array. 771-775 - Shiduo Yu, Craig T. Jin, Fabio Antonacci, Augusto Sarti:
Sparse Recovery Beamforming and Upscaling in the Ray Space. 776-780 - Gongping Huang, Yuzhu Wang, Jacob Benesty, Israel Cohen, Jingdong Chen:
Combined Differential Beamforming With Uniform Linear Microphone Arrays. 781-785 - Vincent W. Neo, Christine Evers, Patrick A. Naylor:
Polynomial Matrix Eigenvalue Decomposition of Spherical Harmonics for Speech Enhancement. 786-790 - Jie Zhang:
A Parametric Unconstrained Binaural Beamformer Based Noise Reduction and Spatial Cue Preservation for Hearing-Assistive Devices. 791-795 - Fan Zhang, Chao Pan, Jacob Benesty, Jingdong Chen:
A Simplified Wiener Beamformer Based on Covariance Matrix Modelling. 796-800 - Aleksej Chinaev, Sven Wienand, Gerald Enzner:
Control Architecture of the Double-Cross-Correlation Processor for Sampling-Rate-Offset Estimation in Acoustic Sensor Networks. 801-805 - Yuto Kondo, Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari:
Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction. 806-810 - Noman Akbar, Glenn Dickins, Mark R. P. Thomas, Prasanga N. Samarasinghe, Thushara D. Abhayapala:
Reducing Modal Error Propagation through Correcting Mismatched Microphone Gains Using Rapid. 811-814 - Yonggang Hu, Prasanga N. Samarasinghe, Sharon Gannot, Thushara D. Abhayapala:
Evaluation and Comparison of Three Source Direction-of-Arrival Estimators Using Relative Harmonic Coefficients. 815-819 - Michael Günther, Haitham Afifi, Andreas Brendel, Holger Karl, Walter Kellermann:
Network-Aware Optimal Microphone Channel Selection in Wireless Acoustic Sensor Networks. 820-824 - Bing Yang, Xiaofei Li, Hong Liu:
Supervised Direct-Path Relative Transfer Function Learning for Binaural Sound Source Localization. 825-829 - Yang Liu, Alexandros Neophytou, Sunando Sengupta, Eric Sommerlade:
Cross-Modal Spectrum Transformation Network for Acoustic Scene Classification. 830-834 - Ziheng Lin, Yanxiong Li, Zhangjin Huang, Wenhao Zhang, Yufeng Tan, Yichun Chen, Qianhua He:
Domestic Activities Clustering From Audio Recordings Using Convolutional Capsule Autoencoder Network. 835-839 - Nicolas Turpault, Romain Serizel, Scott Wisdom, Hakan Erdogan, John R. Hershey, Eduardo Fonseca, Prem Seetharaman, Justin Salamon:
Sound Event Detection and Separation: A Benchmark on DESED Synthetic Soundscapes. 840-844 - Hu Hu, Chao-Han Huck Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee:
A Two-Stage Approach to Device-Robust Acoustic Scene Classification. 845-849 - Simyung Chang, Hyoungwoo Park, Janghoon Cho, Hyunsin Park, Sungrack Yun, Kyuwoong Hwang:
Subspectral Normalization for Neural Audio Data Processing. 850-854 - Evangelos Kazakos, Arsha Nagrani, Andrew Zisserman, Dima Damen:
Slow-Fast Auditory Streams for Audio Recognition. 855-859 - Keisuke Imoto, Sakiko Mishima, Yumi Arai, Reishi Kondo:
Impact of Sound Duration and Inactive Frames on Sound Event Detection Performance. 860-864 - Jan Baumann, Patrick Meyer, Timo Lohrenz, Alexander Roy, Michael Papendieck, Tim Fingscheidt:
A New DCASE 2017 Rare Sound Event Detection Benchmark Under Equal Training Data: CRNN With Multi-Width Kernels. 865-869 - Jaejun Lee, Donmoon Lee, Hyeong-Seok Choi, Kyogu Lee:
Room Adaptive Conditioning Method for Sound Event Classification in Reverberant Environments. 870-874 - Noriyuki Tonami, Keisuke Imoto, Yuki Okamoto, Takahiro Fukumori, Yoichi Yamashita:
Sound Event Detection Based on Curriculum Learning Considering Learning Difficulty of Events. 875-879 - Christopher Ick, Brian McFee:
Sound Event Detection in Urban Audio with Single and Multi-Rate PCEN. 880-884 - Yin Cao, Turab Iqbal, Qiuqiang Kong, Fengyan An, Wenwu Wang, Mark D. Plumbley:
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection. 885-889 - Shahan Nercessian, Andy M. Sarroff, Kurt James Werner:
Lightweight and Interpretable Neural Modeling of an Audio Distortion Effect Using Hyperconditioned Differentiable Biquads. 890-894 - Chih-Hsiang Huang, Po-Hao Wu, Yi-Wen Liu, Shan-Hung Wu:
Attacking and Defending Behind A Psychoacoustics-Based Captcha. 895-899 - JinHong Lu, Tianhang Liu, Shuzhuang Xu, Hiroshi Shimodaira:
Double-DCCCAE: Estimation of Body Gestures From Speech Waveform. 900-904 - Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Zeyu Xie, Kai Yu:
Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning. 905-909 - Jian Luo, Jianzong Wang, Ning Cheng, Jing Xiao:
Unidirectional Memory-Self-Attention Transducer for Online Speech Recognition. 910-914 - Kazuki Shimada, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji:
ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization And Detection. 915-919 - Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li:
Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset. 920-924 - Eesung Kim, Jae-Jin Jeon, Hyeji Seo:
U-Convolution Based Residual Echo Suppression with Multiple Encoders. 925-929 - You Wang, Chuyao Feng, David V. Anderson:
A Multi-Channel Temporal Attention Convolutional Neural Network Model for Environmental Sound Classification. 930-934 - Thi Ngoc Tho Nguyen, Ngoc Khanh Nguyen, Huy Phan, Lam Pham, Kenneth Ooi, Douglas L. Jones, Woon-Seng Gan:
A General Network Architecture for Sound Event Localization and Detection Using Transfer Learning and Recurrent Neural Network. 935-939 - Hongsen He, Jingdong Chen, Jacob Benesty, Yi Yu:
Robust Recursive Least M-Estimate Adaptive Filter for the Identification of Low-Rank Acoustic Systems. 940-944 - Thomas Haubner, Andreas Brendel, Mohamed Elminshawi, Walter Kellermann:
Noise-Robust Adaptation Control for Supervised Acoustic System Identification Exploiting a Noise Dictionary. 945-949 - Matteo Acerbi, Raffaele Malvermi, Mirco Pezzoli, Fabio Antonacci, Augusto Sarti, Roberto Corradi:
Interpolation of Irregularly Sampled Frequency Response Functions Using Convolutional Neural Networks. 950-954 - Heinrich W. Löllmann, Andreas Brendel, Walter Kellermann:
Effective Rank-Based Estimation of the Coherent-to-Diffuse Power Ratio. 955-959 - Orchisama Das, Paul Calamia, Sebastià Vicenc Amengual Garí:
Room Impulse Response Interpolation from a Sparse Set of Measurements Using a Modal Architecture. 960-964 - Alastair H. Moore, Rebecca R. Vos, Patrick A. Naylor, Mike Brookes:
Processing Pipelines for Efficient, Physically-Accurate Simulation of Microphone Array Signals in Dynamic Sound Scenes. 965-969 - Hadi Habibzadeh, Olivia Zhou, James J. S. Norton, Theresa M. Vaughan, Daphney-Stavroula Zois:
A Classifier for Improving Cause and Effect in SSVEP-based BCIs for Individuals with Complex Communication Disorders. 970-974 - Boyuan Feng, Yuke Wang, Yufei Ding:
SAGA: Sparse Adversarial Attack on EEG-Based Brain Computer Interface. 975-979 - Marie-Constance Corsi, Florian Yger, Sylvain Chevallier, Camille Noûs:
Riemannian Geometry on Connectivity for Clinical BCI. 980-984 - Winko W. An, Barbara G. Shinn-Cunningham, Hannes Gamper, Dimitra Emmanouilidou, David Johnston, Mihai Jalobeanu, Edward Cutrell, Andrew D. Wilson, Kuan-Jung Chiang, Ivan Tashev:
Decoding Music Attention from "EEG Headphones": A User-Friendly Auditory Brain-Computer Interface. 985-989 - Sunhee Hwang, Sungho Park, Dohyung Kim, Jewook Lee, Hyeran Byun:
Mitigating Inter-Subject Brain Signal Variability for EEG-Based Driver Fatigue State Classification. 990-994 - Pradeep Kumar, Erik J. Scheme:
A Deep Spatio-Temporal Model for EEG-Based Imagined Speech Recognition. 995-999 - Bahman Abdi-Sargezeh, Antonio Valentín, Gonzalo Alarcón, Saeid Sanei:
Incorporating Uncertainty In Data Labeling Into Detection of Brain Interictal Epileptiform Discharges From EEG Using Weighted Optimization. 1000-1004 - Mikko Impiö, Mehmet Yamaç, Jenni Raitoharju:
Multi-Level Reversible Encryption for ECG Signals Using Compressive Sensing. 1005-1009 - Minh C. Tran, Phi Anh Phan, Douglas C. Crockett, Federico Formenti, John N. Cronin, Stephen J. Payne, Andrew D. Farmery:
Validating the Inspired Sinewave Technique to Measure Lung Heterogeneity Compared to Atelectasis & Over-Distended Volume in Computed Tomography Images. 1010-1014 - Nasimuddin Ahmed, Shivam Singhal, Varsha Sharma, Sakyajit Bhattacharya, Aniruddha Sinha, Avik Ghose:
A Patient-Invariant Model for Freezing of Gait Detection Aided by Wavelet Decomposition. 1015-1019 - Liu Yang, Cassandra Heiselman, J. Gerald Quirk, Petar M. Djuric:
Identification of Uterine Contractions by An Ensemble of Gaussian Processes. 1020-1024 - Bin Wang, Chang Liu, Chuanyan Hu, Xudong Liu, Jun Cao:
Arrhythmia Classification with Heartbeat-Aware Transformer. 1025-1029 - Alejandro Cohen, Nir Shlezinger, Amit Solomon, Yonina C. Eldar, Muriel Médard:
Multi-Level Group Testing with Application to One-Shot Pooled COVID-19 Tests. 1030-1034 - Mahmoud Al Ismail, Soham Deshmukh, Rita Singh:
Detection of Covid-19 Through the Analysis of Vocal Fold Oscillations. 1035-1039 - Shahin Heidarian, Parnian Afshar, Arash Mohammadi, Moezedin Javad Rafiee, Anastasia Oikonomou, Konstantinos N. Plataniotis, Farnoosh Naderkhani:
Ct-Caps: Feature Extraction-Based Automated Framework for Covid-19 Disease Identification From Chest Ct Scans Using Capsule Networks. 1040-1044 - Yifan Jiang, Han Chen, Hanseok Ko, David K. Han:
Few-Shot Learning for Ct Scan Based Covid-19 Diagnosis. 1045-1049 - Huimin Huang, Ming Cai, Lanfen Lin, Jing Zheng, Xiongwei Mao, Xiaohan Qian, Zhiyi Peng, Jianying Zhou, Yutaro Iwamoto, Xian-Hua Han, Yen-Wei Chen, Ruofeng Tong:
Graph-Based Pyramid Global Context Reasoning With a Saliency-Aware Projection for Covid-19 Lung Infections Segmentation. 1050-1054 - Soham Deshmukh, Mahmoud Al Ismail, Rita Singh:
Interpreting Glottal Flow Dynamics for Detecting Covid-19 From Voice. 1055-1059 - Daniel I. Morís, Joaquim de Moura, Jorge Novo, Marcos Ortega:
Cycle Generative Adversarial Network Approaches to Produce Novel Portable Chest X-Rays Images for Covid-19 Diagnosis. 1060-1064 - Seyed Saman Saboksayr, Gonzalo Mateos, Müjdat Çetin:
EEG-Based Emotion Classification Using Graph Signal Processing. 1065-1069 - Tamanna T. K. Munia, Selin Aviyente:
Granger Causality Based Directional Phase-Amplitude Coupling Measure. 1070-1074 - Giulia Cisotto:
REPAC: Reliable Estimation of Phase-Amplitude Coupling in Brain Networks. 1075-1079 - Maria Sayu Yamamoto, Florian Yger, Sylvain Chevallier:
Subspace Oddity - Optimization on Product of Stiefel Manifolds for EEG Data. 1080-1084 - Erdem Varol, Julien Boussard, Nishchal Dethe, Olivier Winter, Anne E. Urai, International Brain Laboratory, Anne Churchland, Nick Steinmetz, Liam Paninski:
Decentralized Motion Inference and Registration of Neuropixel Data. 1085-1089 - Bo Jiang, Yiyi Yu, Hamid Krim, Spencer L. Smith:
Dynamic Graph Learning Based on Graph Laplacian. 1090-1094 - Syed Ahmed Pasha, Victor Solo:
Mutual Information Flows in a Bivariate Point Process. 1095-1099 - Karim Armanious, Sherif Abdulatif, Wenbin Shi, Tobias Hepp, Sergios Gatidis, Bin Yang:
Uncertainty-Based Biological Age Estimation of Brain MRI Scans. 1100-1104 - Jia-Yang Song, Miao-Ying Qi, Dun-Pei Lv, Chao-Ying Zhang, Qiu-Hua Lin, Vince D. Calhoun:
Sparse Representation of Complex-Valued fMRI Data Based on Hard Thresholding of Spatial Source Phase. 1105-1109 - Yue Han, Qiu-Hua Lin, Li-Dan Kuang, Xiao-Feng Gong, Fengyu Cong, Vince D. Calhoun:
Tucker Decomposition for Extracting Shared and Individual Spatial Maps from Multi-Subject Resting-State fMRI Data. 1110-1114 - Simon Geirnaert, Tom Francart, Alexander Bertrand:
Riemannian Geometry-Based Decoding of the Directional Focus of Auditory Attention Using EEG. 1115-1119 - Wei Chen, Qiuli Wang, Sheng Huang, Xiaohong Zhang, Yucong Li, Chen Liu:
DFDM: A Deep Feature Decoupling Module for Lung Nodule Segmentation. 1120-1124 - Jiawei Zhang, Yanchun Zhang, Xiaowei Xu:
Pyramid U-Net for Retinal Vessel Segmentation. 1125-1129 - Xiaojiang Long, Wei Chen, Qiuli Wang, Xiaohong Zhang, Chen Liu, Yucong Li, Jiuquan Zhang:
A Probabilistic Model for Segmentation of Ambiguous 3D Lung Nodule. 1130-1134 - Zhiqiang Xie, Enmei Tu, Hao Zheng, Yun Gu, Jie Yang:
Semi-Supervised Skin Lesion Segmentation with Learning Model Confidence. 1135-1139 - Xiangjiang Wu, Xuanya Li, Kai Hu, Zhineng Chen, Xieping Gao:
A Hybrid Feature Enhancement Method for Gl And Segmentation In Histopathology Images. 1140-1144 - Annika Liebgott, Charlotte Lorenz, Sergios Gatidis, Viet Chau Vu, Konstantin Nikolaou, Bin Yang:
Automated Multi-Organ Segmentation in Pet Images Using Cascaded Training of a 3d U-Net and Convolutional Autoencoder. 1145-1149 - Burhaneddin Yaman, Seyed Amir Hossein Hosseini, Steen Moeller, Mehmet Akçakaya:
Improved Supervised Training of Physics-Guided Deep Learning Image Reconstruction with Multi-Masking. 1150-1154 - Jingshuai Liu, Mehrdad Yaghoobi:
Fine-Grained Mri Reconstruction Using Attentive Selection Generative Adversarial Networks. 1155-1159 - Hemant Kumar Aggarwal, Aniket Pramanik, Mathews Jacob:
Ensure: Ensemble Stein's Unbiased Risk Estimator for Unsupervised Learning. 1160-1164 - Narges Mohammadi, Marvin M. Doyley, Müjdat Çetin:
Ultrasound Elasticity Imaging Using Physics-Based Models and Learning-Based Plug-and-Play Priors. 1165-1169 - Yinbing Tian, Shibiao Xu, Li Guo, Fu'ze Cong:
A Periodic Frame Learning Approach for Accurate Landmark Localization in M-Mode Echocardiography. 1170-1174 - Madhuri Nagare, Roman Melnyk, Obaidullah Rahman, Ken D. Sauer, Charles A. Bouman:
A Bias-Reducing Loss Function for CT Image Denoising. 1175-1179 - Xiao Kang, Xingbo Liu, Xiushan Nie, Yilong Yin:
Learning Binary Semantic Embedding for Breast Histology Image Classification and Retrieval. 1180-1184 - Changlu Guo, Márton Szemenyei, Yangtao Hu, Wenle Wang, Wei Zhou, Yugen Yi:
Channel Attention Residual U-Net for Retinal Vessel Segmentation. 1185-1189 - Tristan Sylvain, Francis Dutil, Tess Berthier, Lisa Di-Jorio, Margaux Luck, R. Devon Hjelm, Yoshua Bengio:
CMIM: Cross-Modal Information Maximization For Medical Imaging. 1190-1194 - Rui Zhao, Zixun Huang, Tianshan Liu, Frank H. F. Leung, Sai Ho Ling, De Yang, Timothy Tin-Yan Lee, Daniel Pak-Kong Lun, Yong-Ping Zheng, Kin-Man Lam:
Structure-Enhanced Attentive Learning For Spine Segmentation From Ultrasound Volume Projection Images. 1195-1199 - Zhijin Liang, Junkang Zhang, Cheolhong An:
Foveal Avascular Zone Segmentation of Octa Images Using Deep Learning Approach with Unsupervised Vessel Segmentation. 1200-1204 - Angelo Genovese, Mahdi S. Hosseini, Vincenzo Piuri, Konstantinos N. Plataniotis, Fabio Scotti:
Acute Lymphoblastic Leukemia Detection Based on Adaptive Unsharpening and Deep Learning. 1205-1209 - Yiming Lei, Hongming Shan, Junping Zhang:
Meta Ordinal Weighting Net For Improving Lung Nodule Classification. 1210-1214 - Jingqin Li, Kun Wang, Dan Yang, Xiaohong Zhang, Chen Liu:
Deepnodule: Multi-Task Learning of Segmentation Bootstrap for Pulmonary Nodule Detection. 1215-1219 - Jiannan Liu, Jie Li, Fanyong Xue, Chentao Wu:
Dense Attention Module for Accurate Pulmonary Nodule Detection. 1220-1224 - Zhe Xu, Jiangpeng Yan, Jie Luo, Xiu Li, Jayender Jagadeesan:
Unsupervised Multimodal Image Registration with Adaptative Gradient Guidance. 1225-1229 - Meng Jia, Matthew Kyan:
Improving Intraoperative Liver Registration in Image-Guided Surgery with Learning-Based Reconstruction. 1230-1234 - Xinxin Shan, Ying Wen:
A New Framework Based on Transfer Learning for Cross-Database Pneumonia Detection. 1235-1239 - Chao Li, Boyang Chen, Ziping Zhao, Nicholas Cummins, Björn W. Schuller:
Hierarchical Attention-Based Temporal Convolutional Networks for Eeg-Based Emotion Recognition. 1240-1244 - Jaswanth Reddy Katthi, Sriram Ganapathy:
Deep Multiway Canonical Correlation Analysis For Multi-Subject Eeg Normalization. 1245-1249 - Puneet Mathur, Trisha Mittal, Dinesh Manocha:
Dynamic Graph Modeling Of Simultaneous EEG And Eye-Tracking Data For Reading Task Identification. 1250-1254 - Aaqib Saeed, David Grangier, Olivier Pietquin, Neil Zeghidour:
Learning From Heterogeneous Eeg Signals with Differentiable Channel Reordering. 1255-1259 - Chi Nok Enoch Kan, Richard J. Povinelli, Dong Hye Ye:
Enhancing Multi-Channel Eeg Classification with Gramian Temporal Generative Adversarial Networks. 1260-1264 - Haoming Zhang, Chen Wei, Mingqi Zhao, Quanying Liu, Haiyan Wu:
A Novel Convolutional Neural Network Model to Remove Muscle Artifacts from EEG. 1265-1269 - Alexander William Wong, Amir Salimi, Abram Hindle, Sunil Vasu Kalmady, Padma Kaul:
Multilabel 12-Lead Electrocardiogram Classification Using Beat to Sequence Autoencoders. 1270-1274 - Wenjie Song, Jiqing Han, Hongwei Song:
Contrastive Embeddind Learning Method for Respiratory Sound Classification. 1275-1279 - Pei-Chun Chang, Jia-Ren Chang, Po-Yu Chen, Li-Kai Cheng, Jen-Chuen Hsieh, Hsin-Yen Yu, Li-Fen Chen, Yong-Sheng Chen:
Decoding Neural Representations of Rhythmic Sounds From Magnetoencephalography. 1280-1284 - Jian Guan, Wenbo Wang, Pengming Feng, Xinxin Wang, Wenwu Wang:
Low-Dimensional Denoising Embedding Transformer for ECG Classification. 1285-1289 - Qinfeng Xiao, Jing Wang, Jianan Ye, Hongjun Zhang, Yuyan Bu, Yiqiong Zhang, Hao Wu:
Self-Supervised Learning for Sleep Stage Classification with Predictive and Discriminative Contrastive Coding. 1290-1294 - Chuanqi Han, Fang Yu, Peng Wang, Ruoran Huang, Xi Huang, Li Cui:
Length No Longer Matters: A Real Length Adaptive Arrhythmia Classification Model with Multi-Scale Convolution. 1295-1299 - Elahe Rahimian, Soheil Zabihi, Amir Asif, Seyed Farokh Atashzar, Arash Mohammadi:
Few-Shot Learning for Decoding Surface Electromyography for Hand Gesture Recognition. 1300-1304 - Upasana Tiwari, Swapnil Bhosale, Rupayan Chakraborty, Sunil Kumar Kopparapu:
Deep Lung Auscultation Using Acoustic Biomarkers for Abnormal Respiratory Sound Event Detection. 1305-1309 - Maryam Hosseini, Luca Celotti, Eric Plourde:
Speaker-Independent Brain Enhanced Speech Denoising. 1310-1314 - Shreyasi Datta, Chandan K. Karmakar, Punit Rathore, Marimuthu Palaniswami:
Shapelet Based Visual Assessment of Cluster Tendency in Analyzing Complex Upper Limb Motion. 1315-1319 - Ryosuke Sawata, Takahiro Ogawa, Miki Haseyama:
Human-Centered Favorite Music Classification Using EEG-Based Individual Music Preference Via Deep Time-Series CCA. 1320-1324 - Mingyue Niu, Jianhua Tao, Bin Liu:
Multi-Scale and Multi-Region Facial Discriminative Representation for Automatic Depression Level Prediction. 1325-1329 - Zeeshan Ahmad, Anika Tabassum, Ling Guan, Naimul Mefraz Khan:
ECG Heart-Beat Classification Using Multimodal Image Fusion. 1330-1334 - Takaaki Higashi, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama:
Estimation of Visual Features of Viewed Image From Individual and Shared Brain Information Based on FMRI Data Using Probabilistic Generative Model. 1335-1339 - Jianxiong Zhou, Zhongyu Jiang, Jang-Hee Yoo, Jenq-Neng Hwang:
Hierarchical Pose Classification for Infant Action Analysis and Mental Development Assessment. 1340-1344 - Zohreh Mostaani, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik, Mathew Magimai-Doss:
On The Relationship Between Speech-Based Breathing Signal Prediction Evaluation Measures and Breathing Parameters Estimation. 1345-1349 - Jianhong Cheng, Jin Liu, Meilin Jiang, Hailin Yue, Lin Wu, Jianxin Wang:
Prediction of Egfr Mutation Status in Lung Adenocarcinoma Using Multi-Source Feature Representations. 1350-1354 - Taeheon Lee, Jeonghwan Hwang, Honggu Lee:
Training Neural Networks with Domain Pattern-Aware Auxiliary Task for Sleep Staging. 1355-1359 - Yusuke Akamatsu, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama:
Classification of Expert-Novice Level Using Eye Tracking And Motion Data via Conditional Multimodal Variational Autoencoder. 1360-1364 - Fang Yu, Chuanqi Han, Pengcheng Wang, Xi Huang, Li Cui:
Gate Trimming: One-Shot Channel Pruning for Efficient Convolutional Neural Networks. 1365-1369 - Christopher A. Metzler, Gordon Wetzstein:
Deep S3PR: Simultaneous Source Separation and Phase Retrieval Using Deep Generative Models. 1370-1374 - Zhenbo Shi, Wei Yang, Zhenbo Xu, Zhi Chen, Yingjie Li, Haoran Zhu, Liusheng Huang:
Adversarial Attacks on Object Detectors with Limited Perturbations. 1375-1379 - Rakib Hyder, Hassan Mansour, Yanting Ma, Petros T. Boufounos, Pu Wang:
A Consensus Equilibrium Solution For Deep Image Prior Powered By Red. 1380-1384 - Ruangrawee Kitichotkul, Christopher A. Metzler, Frank Ong, Gordon Wetzstein:
Suremap: Predicting Uncertainty in Cnn-Based Image Reconstructions Using Stein's Unbiased Risk Estimate. 1385-1389 - Zhengyu Chen, Donglin Wang:
Multi-Initialization Meta-Learning with Domain Adaptation. 1390-1394 - Jiaming Liu, Yu Sun, Weijie Gan, Xiaojian Xu, Brendt Wohlberg, Ulugbek S. Kamilov:
Stochastic Deep Unfolding for Imaging Inverse Problems. 1395-1399 - Laixi Shi, Dehong Liu, Masaki Umeda, Norihiko Hana:
Fusion-Based Digital Image Correlation Framework for Strain Measurement. 1400-1404 - Kaiyi Yang, Narong Borijindargoon, Boon Poh Ng, Saiprasad Ravishankar, Bihan Wen:
Learning Sparsifying Transforms for Image Reconstruction in Electrical Impedance Tomography. 1405-1409 - Christopher A. Metzler, Gordon Wetzstein:
D-VDAMP: Denoising-Based Approximate Message Passing for Compressive MRI. 1410-1414 - Byung Hyun Lee, Se Young Chun:
Empirically Accelerating Scaled Gradient Projection Using Deep Neural Network for Inverse Problems in Image Processing. 1415-1419 - Boqiang Fan, Samarjit Das:
Synthetic Aperture Acoustic Imaging with Deep Generative Model Based Source Distribution Prior. 1420-1424 - Chaobing Zheng, Zhengguo Li, Yuwen Li, Shiqian Wu:
Non-Local Single Image DE-Raining Without Decomposition. 1425-1429 - Takashi Isobe, Fang Zhu, Shengjin Wang:
Frame-Rate-Aware Aggregation for Efficient Video Super-Resolution. 1430-1434 - Rentao Wan, Jinjia Zhou, Bowen Huang, Hui Zeng, Yibo Fan:
Measurement Coding Framework with Adjacent Pixels Based Measurement Matrix for Compressively Sensed Images. 1435-1439 - Yanting Ma, Petros T. Boufounos, Hassan Mansour, Shuchin Aeron:
Multiview Sensing with Unknown Permutations: an Optimal Transport Approach. 1440-1444 - Yuhu Chang, Changyang He, Yingying Zhao, Tun Lu, Ning Gu:
A High-Frame-Rate Eye-Tracking Framework for Mobile Devices. 1445-1449 - Ali Ghofrani, Rahil Mahdian Toroghi, Seyed Mojtaba Tabatabaie:
Catiloc: Camera Image Transformer for Indoor Localization. 1450-1454 - Zi-Yao Zhang, Odysseas A. Pappas, Alin Achim:
Sar Image Autofocusing Using Wirtinger Calculus and Cauchy Regularization. 1455-1459 - Luciano C. Ayres, Sérgio J. M. de Almeida, José C. M. Bermudez, Ricardo Augusto Borsoi:
A Homogeneity-Based Multiscale Hyperspectral Image Representation for Sparse Spectral Unmixing. 1460-1464 - Jisheng Li, Qi Dai, Jiangtao Wen:
Learning to Estimate Kernel Scale and Orientation of Defocus Blur with Asymmetric Coded Aperture. 1465-1469 - Jorge Bacca, Tatiana Gelvez, Henry Arguello:
Transmittance Regularizer for Binary coded Aperture Design in a Computational Imaging end-to-end Approach. 1470-1474 - Demetris Lappas, Vasileios Argyriou, Dimitrios Makris:
Fourier Transformation Autoencoders for Anomaly Detection. 1475-1479 - Kazuki Naganuma, Saori Takeyama, Shunsuke Ono:
Zero-Gradient Constraints for Destriping of Remote-Sensing Data. 1480-1484 - Zhiguo Li, Yuan Yuan, Dandan Ma:
Selection Based on Statistical Characteristics for Object Detection. 1485-1489 - Tianyuan Wang, Can Ma, Haoshan Su, Weiping Wang:
CSPN: Multi-Scale Cascade Spatial Pyramid Network for Object Detection. 1490-1494 - Shuyong Gao, Qianyu Guo, Wei Zhang, Wenqiang Zhang, Zhongwei Ji:
Dual-Stream Network Based On Global Guidance for Salient Object Detection. 1495-1499 - Tianyuan Wang, Can Ma, Haoshan Su, Weiping Wang:
SSFENet: Spatial and Semantic Feature Enhancement Network for Object Detection. 1500-1504 - Kristian Fischer, Felix Fleckenstein, Christian Herglotz, André Kaup:
Saliency-Driven Versatile Video Coding for Neural Object Detection. 1505-1509 - Shuyu Miao, Rui Feng:
Object-Oriented Relational Distillation for Object Detection. 1510-1514 - Kateryna Chumachenko, Jenni Raitoharju, Alexandros Iosifidis, Moncef Gabbouj:
Ensembling Object Detectors for Image and Video Data Analysis. 1515-1519 - Qing-Yang Shen, Tian-Guo Huang, Peng-Xin Ding, Jia He:
Training Real-Time Panoramic Object Detectors with Virtual Dataset. 1520-1524 - Lv Tang, Bo Li, Yanliang Wu, Bo Xiao, Shouhong Ding:
Fast: Feature Aggregation for Detecting Salient Object in Real-Time. 1525-1529 - Wanli Ma, Alin Achim, Oktay Karakus:
Exploiting the Dual-Tree Complex Wavelet Transform for Ship Wake Detection in SAR Imagery. 1530-1534 - Zhinan Cai, Zhiyu Jiang, Yuan Yuan:
Task-Related Self-Supervised Learning For Remote Sensing Image Change Detection. 1535-1539 - Makoto Okuda, Shin'ichi Satoh, Yoichi Sato, Yutaka Kidawara:
Unsupervised Common Particular Object Discovery and Localization by Analyzing a Match Graph. 1540-1544 - Madeleine Barowsky, Alexander Mariona, Flávio P. Calmon:
Predictive Coding for Lossless Dataset Compression. 1545-1549 - Weijia Zhu, Jizheng Xu, Li Zhang, Yue Wang:
Adaptive Dual Tree Structure For Screen Content Coding. 1550-1554 - Mingze Ding, Jiahui Li, Mengyao Ma, Xiaopeng Fan:
SNR-Adaptive Deep Joint Source-Channel Coding for Wireless Image Transmission. 1555-1559 - Gabriel B. Sant'Anna, Luiz Henrique Cancellier, Ismael Seidel, Mateus Grellert, José Luís Güntzel:
Relying on a Rate Constraint to Reduce Motion Estimation Complexity. 1560-1564 - Andy Regensky, Christian Herglotz, André Kaup:
A Novel Viewport-Adaptive Motion Compensation Technique for Fisheye Video. 1565-1569 - Alban Marie, Navid Mahmoudian Bidgoli, Thomas Maugey, Aline Roumy:
Rate-Distortion Optimized Motion Estimation for on-the-Sphere Compression of 360 Videos. 1570-1574 - Bohan Li, Jingning Han, Yaowu Xu:
Adaptive GOP Size Decision for Multi-Pass Video Coding Based on Hidden Markov Model. 1575-1579 - Yize Jin, Liang Zhao, Xin Zhao, Shan Liu, Alan C. Bovik:
Improved Intra Mode Coding Beyond Av1. 1580-1584 - Xinyao Chen, Yiwei Zhang, Yanghao Li, Jiangtao Wen:
Decision Tree Based Inter Partition Termination For Av1 Encoding. 1585-1589 - Nam Le, Honglei Zhang, Francesco Cricri, Ramin Ghaznavi Youvalari, Esa Rahtu:
Image Coding For Machines: an End-To-End Learned Approach. 1590-1594 - Shihui Zhao, Shuyuan Yang, Zhi Liu, Zhixi Feng, Xu Liu:
Sparse Flow Adversarial Model For Robust Image Compression. 1595-1599 - Lee Prangnell, Victor Sanchez:
HVS-Based Perceptual Color Compression of Image Data. 1600-1604 - Yalei Lv, Tao Dai, Bin Chen, Jian Lu, Shu-Tao Xia, Jingchao Cao:
HOCA: Higher-Order Channel Attention for Single Image Super-Resolution. 1605-1609 - Anqi Liu, Sumei Li, Yongli Chang:
Image Super-Resolution Using Multi-Resolution Attention Network. 1610-1614 - Zhihong Pan, Baopu Li:
Real Image Super-Resolution Using Token Based Contextual Attention. 1615-1619 - Jun Xiao, Wenqi Jia, Kin-Man Lam:
Feature Redundancy Mining: Deep Light-Weight Image Super-Resolution Model. 1620-1624 - Risheng Wang, Tao Lei, Wenzheng Zhou, Qi Wang, Hongying Meng, Asoke K. Nandi:
Lightweight Non-Local Network for Image Super-Resolution. 1625-1629 - Zhonghan Niu, Xi-Peng Lin, An-Ni Yu, Yang-Hao Zhou, Yu-Bin Yang:
Lightweight and Accurate Single Image Super-Resolution with Channel Segregation Network. 1630-1634 - Angel Villar-Corrales, Franziska Schirrmacher, Christian Riess:
Deep Learning Architectural Designs for Super-Resolution Of Noisy Images. 1635-1639 - Andrew Gigie, Achanna Anil Kumar, Angshul Majumdar, Kriti Kumar, M. Girish Chandra:
Joint Coupled Transform Learning Framework for Multimodal Image Super-Resolution. 1640-1644 - Qiang Li, Qi Wang, Xuelong Li:
Hyperspectral Image Super-Resolution Via Adjacent Spectral Fusion Strategy. 1645-1649 - Miguel Heredia Conde:
Raw Data Processing for Practical Time-of-Flight Super-Resolution. 1650-1654 - Jun Xia, Guanghua Tan, Yi Xiao, Fangqiang Xu, Chi-Sing Leung:
Edge-Aware Multi-Scale Progressive Colorization. 1655-1659 - Kangbo Sun, Jie Zhu:
Learning Representation of Multi-Scale Object for Fine-Grained Image Retrieval. 1660-1664 - Yu Sang, Jinguang Sun, Si-Miao Wang, Heng Qi, Keqiu Li:
Super-Resolution and Infection Edge Detection Co-Guided Learning for Covid-19 Ct Segmentation. 1665-1669 - Weidong He, Yangjinan Hu, Lulu Wang, Zhongshi He, Jinglong Du:
Gating Feature Dense Network for Single Anisotropic Mr Image Super-Resolution. 1670-1674 - Yankai Wang, Dawei Yang, Wei Zhang, Zhe Jiang, Wenqiang Zhang:
Adaptable Ensemble Distillation. 1675-1679 - Akshay Rangamani, Nam H. Nguyen, Abhishek Kumar, Dzung T. Phan, Sang (Peter) Chin, Trac D. Tran:
A Scale Invariant Measure of Flatness for Deep Network Minima. 1680-1684 - Zhixiao Fu, Xinyuan Chen, Jianfeng Dong, Shouling Ji:
Multi-Order Adversarial Representation Learning for Composed Query Image Retrieval. 1685-1689 - Zhengbo Luo, Sei-ichiro Kamata, Zitang Sun, Weilian Zhou:
Deep Neural Networks with Flexible Complexity While Training Based on Neural Ordinary Differential Equations. 1690-1694 - Adrian Bulat, Enrique Sánchez-Lozano, Georgios Tzimiropoulos:
Improving Memory Banks for Unsupervised Learning with Large Mini-Batch, Consistency and Hard Negative Mining. 1695-1699 - Defu Liu, Guowu Yang, Jinzhao Wu, Jiayi Zhao, Fengmao Lv:
Robust Binary Loss for Multi-Category Classification with Label Noise. 1700-1704 - Zengsheng Kuang, Xian Fang, Ruixun Zhang, Xiuli Shao, Hongpeng Wang:
A Plug and Play Fast Intersection Over Union Loss for Boundary Box Regression. 1705-1709 - Sheng-Jhe Huang, Jen-Tzung Chien:
Attribute Decomposition for Flow-Based Domain Mapping. 1710-1714 - Mahesh Sudhakar, Sam Sattarzadeh, Konstantinos N. Plataniotis, Jongseong Jang, Yeonjeong Jeong, Hyunwoo Kim:
Ada-Sise: Adaptive Semantic Input Sampling for Efficient Explanation of Convolutional Neural Networks. 1715-1719 - Hao Pan, Zhongdi Chao, Jiang Qian, Bojin Zhuang, Shaojun Wang, Jing Xiao:
Network Pruning Using Linear Dependency Analysis on Feature Maps. 1720-1724 - Fangming Zhong, Guangze Wang, Zhikui Chen, Xu Yuan, Feng Xia:
Multiple-Input Multiple-Output Fusion Network for Generalized Zero-Shot Learning. 1725-1729 - Kun Yan, Lingbo Liu, Jun Hou, Ping Wang:
Representative Local Feature Mining for Few-Shot Learning. 1730-1734 - Zeyang Zhu, Xin Lin:
KAN: Knowledge-Augmented Networks for Few-Shot Learning. 1735-1739 - Kun Yan, Zied Bouraoui, Ping Wang, Shoaib Jameel, Steven Schockaert:
Few-Shot Image Classification with Multi-Facet Prototypes. 1740-1744 - Da Chen, Yuefeng Chen, Yuhong Li, Feng Mao, Yuan He, Hui Xue:
Self-Supervised Learning for Few-Shot Image Classification. 1745-1749 - Chun-Chih Teng, Pin-Yu Chen, Wei-Chen Chiu:
Domain Adaptation for Learning Generator From Paired Few-Shot Data. 1750-1754 - Furen Zhuang, Pierre Moulin:
Deep Semi-Supervised Metric Learning Via Identification of Manifold Memberships. 1755-1759 - Jian Wang, Zhichao Zhang, Dongmei Huang, Wei Song, Quanmiao Wei, XinYue Li:
A Ranked Similarity Loss Function with pair Weighting for Deep Metric Learning. 1760-1764 - Ting-Yao Hu, Alexander G. Hauptmann:
Statistical Distance Metric Learning for Image Set Retrieval. 1765-1769 - Yinong Zhu, Yong Feng, Mingliang Zhou, Baohua Qiang, Leong Hou U, Jiajie Zhu:
Distribution-Aware Hierarchical Weighting Method for Deep Metric Learning. 1770-1774 - Sam Sattarzadeh, Mahesh Sudhakar, Konstantinos N. Plataniotis, Jongseong Jang, Yeonjeong Jeong, Hyunwoo Kim:
Integrated Grad-Cam: Sensitivity-Aware Visual Explanation of Deep Convolutional Networks Via Integrated Gradient-Based Scoring. 1775-1779 - Taiga Kashima, Ryuichiro Hataya, Hideki Nakayama:
Visualizing Association in Exemplar-Based Classification. 1780-1784 - Zitang Sun, Ruojing Wang, Zhengbo Luo, Weili Chen:
HFGCNET: High-Frequency Graph Reasoning for Finer Semantic Image Segmentation. 1785-1789 - Hugo Gangloff, Jean-Baptiste Courbot, Emmanuel Monfrini, Christophe Collet:
Unsupervised Image Segmentation with Spatial Triplet Markov Trees. 1790-1794 - Dong Liang, Bin Kang, Xinyu Liu, Han Sun, Liyan Zhang, Ningzhong Liu:
Cross Scene Video Foreground Segmentation Via Co-Occurrence Probability Oriented Supervised and Unsupervised Model Interaction. 1795-1799 - Jianfeng Cao, Hong Yan:
Instance Segmentation with the Number of Clusters Incorporated in Embedding Learning. 1800-1804 - Lianlei Shan, Xiaobin Li, Weiqiang Wang:
Decouple the High-Frequency and Low-Frequency Information of Images for Semantic Segmentation. 1805-1809 - Zhaoxin Fan, Hongyan Liu, Jun He, Min Zhang, Xiaoyong Du:
MPDNet: A 3D Missing Part Detection Network Based on Point Cloud Segmentation. 1810-1814 - Nan Jiang, Xuehui Yu, Xiaoke Peng, Yuqi Gong, Zhenjun Han:
SM+: Refined Scale Match for Tiny Person Detection. 1815-1819 - Weilian Zhou, Sei-ichiro Kamata, Zhengbo Luo:
Sub-Band Grouping Spectral Feature-Attention Block for Hyperspectral Image Classification. 1820-1824 - Erting Pan, Yong Ma, Xiaoguang Mei, Fan Fan, Jiayi Ma:
Unsupervised Stacked Capsule Autoencoder for Hyperspectral Image Classification. 1825-1829 - Ganghui Fan, Yong Ma, Jun Huang, Xiaoguang Mei, Jiayi Ma:
Robust Graph Autoencoder for Hyperspectral Anomaly Detection. 1830-1834 - Xiaomeng Wu, Yongqing Sun, Akisato Kimura, Kunio Kashino:
Reflectance-Oriented Probabilistic Equalization for Image Enhancement. 1835-1839 - Yijun Liu, Zhengning Wang, Yi Zeng, Hao Zeng, Deming Zhao:
PD-GAN: Perceptual-Details GAN for Extremely Noisy Low Light Image Enhancement. 1840-1844 - Dong Wang, Yunpeng Bai, Bendu Bai, Chanyue Wu, Ying Li:
Heterogeneous two-Stream Network with Hierarchical Feature Prefusion for Multispectral Pan-Sharpening. 1845-1849 - Chong Mou, Jian Zhang:
Synergic Feature Attention for Image Restoration. 1850-1854 - Jingwen Su, Hujun Yin:
Efficient Multi-Objective GANs for Image Restoration. 1855-1859 - Lanqing Guo, Zhiyuan Zha, Saiprasad Ravishankar, Bihan Wen:
Self-Convolution: A Highly-Efficient Operator for Non-Local Image Restoration. 1860-1864 - Fengchao Xiong, Jun Zhou, Minchao Ye, Jianfeng Lu, Yuntao Qian:
NMF-SAE: An Interpretable Sparse Autoencoder for Hyperspectral Unmixing. 1865-1869 - Chao Zhou, Miguel R. D. Rodrigues:
An ADMM Based Network for Hyperspectral Unmixing Tasks. 1870-1874 - Shuaikai Shi, Min Zhao, Lijun Zhang, Jie Chen:
Variational Autoencoders for Hyperspectral Unmixing with Endmember Variability. 1875-1879 - Yaser Esmaeili Salehani, Ehsan Arabnejad, Saeed Gazor:
Augmented Gaussian Linear Mixture Model for Spectral Variability in Hyperspectral Unmixing. 1880-1884 - Qiwen Jin, Yong Ma, Xiaoguang Mei, Hao Li, Jiayi Ma:
UTDN: An Unsupervised Two-Stream Dirichlet-Net for Hyperspectral Unmixing. 1885-1889 - Yi Yang, Fei Jiang, Hongtao Lu:
Laplacian Regularized Tensor Low-Rank Minimization for Hyperspectral Snapshot Compressive Imaging. 1890-1894 - Roy Miles, Krystian Mikolajczyk:
Compressing Local Descriptor Models for Mobile Applications. 1895-1899 - Zhi Chen, Wei Yang, Zhenbo Xu, Zhenbo Shi, Liusheng Huang:
VK-Net: Category-Level Point Cloud Registration with Unsupervised Rotation Invariant Keypoints. 1900-1904 - Bhavesh Deshpande, Sourabh Hanamsheth, Yawen Lu, Guoyu Lu:
Matching as Color Images: Thermal Image Local Feature Detection and Description. 1905-1909 - Viktoria Heimann, Andreas Spruck, André Kaup:
Frame Rate Up-Conversion Using Key Point Agnostic Frequency-Selective Mesh-to-Grid Resampling. 1910-1914 - Jianwei Ke, Alex J. Watras, Jae-Jun Kim, Hewei Liu, Hongrui Jiang, Yu Hen Hu:
Efficient Real-Time Video Stabilization with a Novel Least Squares Formulation. 1915-1919 - Yuan Hou, Annie A. M. Cuyt, Wen-shin Lee, Deepayan Bhowmik:
Decomposing Textures using Exponential Analysis. 1920-1924 - Hoda Roodaki, Masoud Dehyadegari, Mahdi Nazm Bojnordi:
G-Arrays: Geometric Arrays for Efficient Point Cloud Processing. 1925-1929 - Lisha Wang, Chenglin Li, Wenrui Dai, Junni Zou, Hongkai Xiong:
QoE-Driven and Tile-Based Adaptive Streaming for Point Clouds. 1930-1934 - Ashek Ahmmed, Manoranjan Paul, M. Manzur Murshed, David Taubman:
Dynamic Point Cloud Compression Using A Cuboid Oriented Discrete Cosine Based Motion Model. 1935-1939 - Yangang Cai, Ronggang Wang, Song Gu, Jian Zhang, Wen Gao:
An Adaptive Pyramid Single-View Depth Lookup Table Coding Method. 1940-1944 - Marta Milovanovic, Félix Henry, Marco Cagnazzo, Joël Jung:
Patch Decoder-Side Depth Estimation In Mpeg Immersive Video. 1945-1949 - Hongyan Quan, Mingwei Yao, Xiaoxiao Qian:
Geometry Consistency Of Augmented Reality Based On Semantics. 1950-1954 - Tong Zhou, Kun Tian:
What And Where To Focus In Person Search. 1955-1959 - Ning Lv, Xuezhi Xiang, Xinyao Wang, Jie Yang, Rokia Abdein, Abdulmotaleb El-Saddik:
Stable and Effective One-Step Method for Person Search. 1960-1964 - Xi-Peng Lin, Yu-Bin Yang:
An Adaptive Part-Based Model For Person Re-Identification. 1965-1969 - Yukang Gao, Hua Yang:
Crowd Counting Via Multi-Level Regression With Latent Gaussian Maps. 1970-1974 - Ye Tian, Chengzhen Duan, Ruilin Zhang, Zhiwei Wei, Hongpeng Wang:
Lightweight Dual-Task Networks For Crowd Counting In Aerial Images. 1975-1979 - Siyang Pan, Yanyun Zhao, Fei Su, Zhicheng Zhao:
SANet++: Enhanced Scale Aggregation with Densely Connected Feature Fusion for Crowd Counting. 1980-1984 - Zehao Chen, Hua Yang:
Attentive Semantic Exploring for Manipulated Face Detection. 1985-1989 - Bin Cheng, Tao Dai, Bin Chen, Shutao Xia, Xiu Li:
Efficient Face Manipulation Via Deep Feature Disentanglement And Reintegration Net. 1990-1994 - Seogkyu Jeon, Pilhyeon Lee, Kibeom Hong, Hyeran Byun:
Continuous Face Aging Generative Adversarial Networks. 1995-1999 - Nicky Bayat, Vahid Reza Khazaie, Yalda Mohsenzadeh:
Fast Inverse Mapping of Face GANs. 2000-2004 - Jingwei Yan, Boyuan Jiang, Jingjing Wang, Qiang Li, Chunmao Wang, Shiliang Pu:
Multi-Level Adaptive Region of Interest and Graph Learning for Facial Action Unit Recognition. 2005-2009 - Meimei Shang, Fei Gao, Xiang Li, Jingjie Zhu, Lingna Dai:
Bridging Unpaired Facial Photos and Sketches by Line-Drawings. 2010-2014 - Xinwei Xue, Ying Ding, Long Ma, Yi Wang, Risheng Liu, Xin Fan:
Temporal Rain Decomposition with Spatial Structure Guidance for Video Deraining. 2015-2019 - Xinwei Xue, Xiangyu Meng, Long Ma, Risheng Liu, Xin Fan:
GTA-Net: Gradual Temporal Aggregation Network for Fast Video Deraining. 2020-2024 - Zhen Wang, Cong Wang, Zhixun Su, Junyang Chen:
Dense Feature Pyramid Grids Network for Single Image Deraining. 2025-2029 - Youzhao Yang, Hong Lu:
A Fast and Efficient Network for Single Image Deraining. 2030-2034 - Dongdong Ren, Jinbao Li, Meng Han, Minglei Shu:
DNANet: Dense Nested Attention Network for Single Image Dehazing. 2035-2039 - Cong Wang, Yan Huang, Yuexian Zou, Yong Xu:
FWB-Net: Front White Balance Network for Color Shift Correction in Single Image Dehazing Via Atmospheric Light Estimation. 2040-2044 - Tobias Alt, Joachim Weickert:
Learning Integrodifferential Models for Image Denoising. 2045-2049 - Huy Vu, Gene Cheung, Yonina C. Eldar:
Unrolling of Deep Graph Total Variation for Image Denoising. 2050-2054 - Yanghao Li, Bichuan Guo, Jiangtao Wen, Zhen Xia, Shan Liu, Yuxing Han:
Learning Model-Blind Temporal Denoisers without Ground Truths. 2055-2059 - Hangfan Liu, Jian Zhang, Chong Mou:
Image Denoising Based on Correlation Adaptive Sparse Modeling. 2060-2064 - Xiaokun Liu, Long Ma, Risheng Liu, Wei Zhong, Xin Fan, Zhongxuan Luo:
NASA: A Noise-Adaptive and Structure-Aware Learning Framework for Image Deblurring. 2065-2069 - Chen Li, Qi Wang, Shaoteng Liu, Xuelong Li:
Multiple Auxiliary Networks for Single Blind Image Deblurring. 2070-2074 - Xiangfei Liu, Xiushan Nie, Zhen Shen, Yilong Yin:
Joint Learning of Image Aesthetic Quality Assessment and Semantic Recognition Based on Feature Enhancement. 2075-2079 - Junming Chen, Haiqiang Wang, Ge Li, Shan Liu:
Nested Error Map Generation Network for No-Reference Image Quality Assessment. 2080-2084 - Zhengzhong Tu, Chia-Ju Chen, Li-Heng Chen, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik:
Regression or classification? New methods to evaluate no-reference picture and video quality models. 2085-2089 - Ci Wang, Mei Li:
Blind Image Quality Evaluator with Scale Robustness. 2090-2094 - Yingjie Feng, Sumei Li, Yongli Chang:
Multi-Scale Feature-Guided Stereoscopic Video Quality Assessment Based on 3D Convolutional Neural Network. 2095-2099 - Fan Meng, Sumei Li, Yongli Chang:
No-Reference Stereoscopic Image Quality Assessment Based on the Human Visual System. 2100-2104 - Yuxing Wang, Yawen Lu, Guoyu Lu:
Stereo Rectification Based on Epipolar Constrained Neural Network. 2105-2109 - Xiaogang Jia, Wei Chen, Zhengfa Liang, Xin Luo, Mingfei Wu, Yusong Tan, Libo Huang:
Multi-Scale Cascade Disparity Refinement Stereo Network. 2110-2114 - Jun Peng, Wangduo Xie, Zijing Huang, Wei Chen, Yong Zhao:
Hierarchical Context Guided Aggregation Network for Stereo Matching. 2115-2119 - Shenglun Chen, Baopu Li, Wei Wang, Hong Zhang, Haojie Li, Zhihui Wang:
Cost Affinity Learning Network for Stereo Matching. 2120-2124 - Naga Sailaja Mahankali, Sumohana S. Channappayya:
Video Quality Prediction Using Voxel-Wise fMRI Models of the Visual Cortex. 2125-2129 - Jianfu Zhang, Zerui Tao, Liqing Zhang, Qibin Zhao:
Tensor Decomposition Via Core Tensor Networks. 2130-2134 - Katrin Renz, Nicolaj C. Stache, Samuel Albanie, Gül Varol:
Sign Language Segmentation with Temporal Convolutional Networks. 2135-2139 - Shuyi Li, Bob Zhang:
An Adaptive Discriminant and Sparsity Feature Descriptor for Finger Vein Recognition. 2140-2144 - Zhizhong Huang, Junping Zhang, Hongming Shan:
Routinggan: Routing Age Progression and Regression with Disentangled Learning. 2145-2149 - Zongyao Li, Ren Togo, Takahiro Ogawa, Miki Haseyama:
Semantic-Aware Unpaired Image-to-Image Translation for Urban Scene Images. 2150-2154 - Rakshith S, Rishabh Khurana, Vibhav Agarwal, Jayesh Rajkumar Vachhani, Bhanodai Guggilla:
Fontnet: On-Device Font Understanding and Prediction Pipeline. 2155-2159 - Viet-Khoa Vo-Ho, Ngan Le, Kashu Yamazaki, Akihiro Sugimoto, Minh-Triet Tran:
Agent-Environment Network for Temporal Action Proposal Generation. 2160-2164 - Zhaoyang Gui, Shanshan Zhang, Kangkan Wang, Jian Yang, Pong Chi Yuen:
Adaptive Multi-Domain Learning for Outdoor 3D Human Pose and Shape Estimation. 2165-2169 - Zhe Zhang, Jie Tang, Gangshan Wu:
Lightweight Human Pose Estimation under Resource-Limited Scenes. 2170-2174 - Jie Mei, Jenq-Neng Hwang, Suzanne Romain, Craig S. Rose, Braden Moore, Kelsey Magrane:
Absolute 3D Pose Estimation and Length Measurement of Severely Deformed Fish from Monocular Videos in Longline Fishing. 2175-2179 - Yuzhuo Ren, Feng Hu:
Camera Calibration with Pose Guidance. 2180-2184 - Rishi Rajesh Shah, Vyas Anirudh Akundy, Zhou Wang:
Real Versus Fake 4K - Authentic Resolution Assessment. 2185-2189 - Wenhan Zhu, Guangtao Zhai, Xiongkuo Min, Xiaokang Yang, Xiao-Ping Zhang:
Perceptual Quality Assessment for Recognizing True and Pseudo 4K Content. 2190-2194 - Li Liu, Da Chen, Minglei Shu, Huazhong Shu, Laurent D. Cohen:
A New Tubular Structure Tracking Algorithm Based On Curvature-Penalized Perceptual Grouping. 2195-2199 - Sibo Wang, Ruize Han, Wei Feng, Song Wang:
Multiple Human Tracking in Non-Specific Coverage with Wearable Cameras. 2200-2204 - Chaoyi Wang, Yang Hua, Tao Song, Zhengui Xue, Ruhui Ma, Neil Robertson, Haibing Guan:
Fine-Grained Pose Temporal Memory Module for Video Pose Estimation and Tracking. 2205-2209 - Minghao Yang, Xukang Zhou, Yangchang Sun, Jinglong Chen, Baohua Qiang:
Drawing Order Recovery from Trajectory Components. 2210-2214 - Na Lv, Ying Wang, Zhiquan Feng, Jingliang Peng:
Deep Hashing for Motion Capture Data Retrieval. 2215-2219 - Liqi Yan, Yiming Cui, Yingjie Victor Chen, Dongfang Liu:
Hierarchical Attention Fusion for Geo-Localization. 2220-2224 - Souvik Kundu, Sairam Sundaresan:
AttentionLite: Towards Efficient Self-Attention Models for Vision. 2225-2229 - Shannan Chen, Qiule Sun, Cunhua Li, Jianxin Zhang, Qiang Zhang:
Attention-Guided Second-Order Pooling Convolutional Networks. 2230-2234 - Qing-Long Zhang, Yu-Bin Yang:
SA-Net: Shuffle Attention for Deep Convolutional Neural Networks. 2235-2239 - Reshmi S. Bhooshan, Suresh K:
An Attention Based Wavelet Convolutional Model for Visual Saliency Detection. 2240-2244 - Shuang Wang, Yun Meng, Yu Gu, Lei Zhang, Xiutiao Ye, Jingxian Tian, Licheng Jiao:
Cascade Attention Fusion for Fine-Grained Image Captioning Based on Multi-Layer LSTM. 2245-2249 - Jinpeng Wang, Bin Chen, Tao Dai, Shu-Tao Xia:
Webly Supervised Deep Attentive Quantization. 2250-2254 - Leena Mathur, Maja J. Mataric:
Unsupervised Audio-Visual Subspace Alignment for High-Stakes Deception Detection. 2255-2259 - Wen-Feng Pang, Qian-Hua He, Yongjian Hu, Yan-Xiong Li:
Violence Detection in Videos Based on Fusing Visual and Audio Information. 2260-2264 - Andreea-Maria Oncescu, João F. Henriques, Yang Liu, Andrew Zisserman, Samuel Albanie:
QUERYD: A Video Dataset with High-Quality Text and Audio Narrations. 2265-2269 - Alkesh Patel, Akanksha Bindal, Hadas Kotek, Christopher Klein, Jason D. Williams:
Generating Natural Questions from Images for Multimodal Assistants. 2270-2274 - Jialang Xu, Yang Luo, Xinyue Chen, Chunbo Luo:
An Adaptive Multi-Scale and Multi-Level Features Fusion Network with Perceptual Loss for Change Detection. 2275-2279 - Samuel Albanie, Gül Varol, Liliane Momeni, Triantafyllos Afouras, Andrew Brown, Chuhan Zhang, Ernesto Coto, Necati Cihan Camgöz, Ben Saunders, Abhishek Dutta, Neil Fox, Richard Bowden, Bencie Woll, Andrew Zisserman:
SeeHear: Signer Diarisation and a New Dataset. 2280-2284 - Kai Katsumata, Hideki Nakayama:
Semantic Image Synthesis from Inaccurate and Coarse Masks. 2285-2289 - Yuan Chang, Yisong Chen, Guoping Wang:
Range Guided Depth Refinement and Uncertainty-Aware Aggregation for View Synthesis. 2290-2294 - Yuan Chang, Tao Peng, Ruhan He, Xinrong Hu, Junping Liu, Zili Zhang, Minghua Jiang:
DP-VTON: Toward Detail-Preserving Image-Based Virtual Try-on Network. 2295-2299 - Dónal Egan, Martin Alain, Aljosa Smolic:
Light Field Style Transfer with Local Angular Consistency. 2300-2304 - Kai Deng, Kun Zhang, Ping Yao, Siyuan Cheng, Peng He:
Skip Attention GAN for Remote Sensing Image Synthesis. 2305-2309 - Libao Zhang, Yanan Liu:
Image Generation Based on Texture Guided VAE-AGAN for Regions of Interest Detection in Remote Sensing Images. 2310-2314 - Qihang Yang, Tao Chen, Jiayuan Fan, Ye Lu, Chongyan Zuo, Qinghua Chi:
EADNet: Efficient Asymmetric Dilated Network For Semantic Segmentation. 2315-2319 - Binjie Mao, Lingfeng Wang, Shiming Xiang, Chunhong Pan:
Ltaf-Net: Learning Task-Aware Adaptive Features and Refining Mask for Few-Shot Semantic Segmentation. 2320-2324 - Hanlin Chen, Qingyong Hu, Jungang Yang, Jing Wu, Yulan Guo:
Cgan-Net: Class-Guided Asymmetric Non-Local Network for Real-Time Semantic Segmentation. 2325-2329 - Kuntao Cao, Xi Huang, Jie Shao:
Aggregation Architecture and all-to-one Network for Real-Time Semantic Segmentation. 2330-2334 - Dong Liang, Yun Du, Han Sun, Liyan Zhang, Ningzhong Liu, Mingqiang Wei:
Nlkd: Using Coarse Annotations For Semantic Segmentation Based on Knowledge Distillation. 2335-2339 - Shengjia Chen, Zhixin Li, Xiwei Yang:
Knowledge Reasoning for Semantic Segmentation. 2340-2344 - Yaxi Yang, Hailin Wang, Haiquan Qiu, Jianjun Wang, Yao Wang:
Non-Convex Sparse Deviation Modeling Via Generative Models. 2345-2349 - Xin Yang, Chunling Yang:
Imrnet: An Iterative Motion Compensation and Residual Reconstruction Network for Video Compressed Sensing. 2350-2354 - Jeong-Won Ha, Jun-Sang Yoo, Jong-Ok Kim:
Deep Color Constancy Using Temporal Gradient Under AC Light Sources. 2355-2359 - Ronan Fablet, Lucas Drumetz, François Rousseau:
End-to-End Learning of Variational Models and Solvers for the Resolution of Interpolation Problems. 2360-2364 - Fengyin Cao, Ping An, Xinpeng Huang, Chao Yang, Qiang Wu:
Multi-Models Fusion for Light Field Angular Super-Resolution. 2365-2369 - Zhun Sun, Chao Li, Qibin Zhao:
Hide Chopin in the Music: Efficient Information Steganography Via Random Shuffling. 2370-2374 - Yi Zhang, Wei Yang, Zhenbo Xu, Yingjie Li, Zhi Chen, Liusheng Huang:
Pointer Networks for Arbitrary-Shaped Text Spotting. 2375-2379 - Longjiao Zhao, Yu Wang, Jien Kato:
Rotation Invariance Analysis of Local Convolutional Features in Image Retrieval. 2380-2384 - Atharva Kadethankar, Neelam Sinha, Vinayaka Hegde, Abhishek Burman:
Signature Feature Marking Enhanced IRM Framework for Drone Image Analysis in Precision Agriculture. 2385-2389 - Yanting Zhang, Aotian Zheng, Ke Han, Yizhou Wang, Jenq-Neng Hwang:
Vehicle 3D Localization in Road Scenes via a Monocular Moving Camera. 2390-2394 - Teresa White, Jesse Wheeler, Colton Lindstrom, Randall Christensen, Kevin R. Moon:
GPS-Denied Navigation Using SAR Images And Neural Networks. 2395-2399 - Binyu Zhao, Qianqian Ren, Jinbao Li, Yafeng Zhao:
Attention-Embedded Decomposed Network with Unpaired CT Images Prior for Metal Artifact Reduction. 2400-2404 - Houshun Yu, Li Zhang:
Partial Feature Aggregation Network for Real-Time Object Counting. 2405-2409 - David A. Maluf, Amr Elnakeeb, Matt Silverman:
A Bayesian Inference Approach for Location-Based Micro Motions using Radio Frequency Sensing. 2410-2414 - Yuheng Deng, Wenjun Zhou, Bo Peng, Dong Liang, Shun'ichi Kaneko:
Robust Spatial-Temporal Correlation Model for Background Initialization in Severe Scene. 2415-2419 - Lei Gao, Lin Qi, Ling Guan:
2D-FRFT Based Frequency Shift-Invariant Digital Image Encryption. 2420-2424 - Akshay Kapoor, Jatin Sapra, Zhou Wang:
Capturing Banding in Images: Database Construction and Objective Assessment. 2425-2429 - Qier An, Yuan Shen:
On The Camera Position Dithering In Visual 3D Reconstruction. 2430-2434 - Liyu Wu, Yuexian Zou, Can Zhang:
Long-Short Temporal Modeling for Efficient Action Recognition. 2435-2439 - Bohong Yang, Zijian Wang, Wu Ran, Hong Lu, Yi-Ping Phoebe Chen:
Multi-Directional Convolution Networks with Spatial-Temporal Feature Pyramid Module for Action Recognition. 2440-2444 - Xiaohang Yang, Lingtong Kong, Jie Yang:
Unsupervised Motion Representation Enhanced Network for Action Recognition. 2445-2449 - Wei Wu, Jiale Yu:
An Improved Deep Relation Network for Action Recognition in Still Images. 2450-2454 - Zichen Yang, Di Huang, Jie Qin, Yunhong Wang:
Human-Aware Coarse-to-Fine Online Action Detection. 2455-2459 - Ranyu Ning, Can Zhang, Yuexian Zou:
SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection. 2460-2464 - Zhilin Huang, Chujun Qin, Ruixin Liu, Zhenyu Weng, Yuesheng Zhu:
Semantic-Aware Context Aggregation for Image Inpainting. 2465-2469 - Xue Zhou, Tao Dai, Yong Jiang, Shu-Tao Xia:
Bishift-Net for Image Inpainting. 2470-2474 - Lingtong Kong, Xiaohang Yang, Jie Yang:
OAS-Net: Occlusion Aware Sampling Network for Accurate Optical Flow. 2475-2479 - Yingjie Li, Wei Yang, Zhenbo Xu, Zhi Chen, Zhenbo Shi, Yi Zhang, Liusheng Huang:
Mask4D: 4D Convolution Network for Light Field Occlusion Removal. 2480-2484 - Jianrong Wang, Ge Zhang, Zhenyu Wu, Xuewei Li, Li Liu:
Self-Supervised Depth Estimation Via Implicit Cues from Videos. 2485-2489 - Cho-Ying Wu, Ulrich Neumann:
Scene Completeness-Aware Lidar Depth Completion for Driving Scenario. 2490-2494 - Bahram Lavi, José Nascimento, Anderson Rocha:
Semi-Supervised Feature Embedding for Data Sanitization in Real-World Events. 2495-2499 - Shu Hu, Yuezun Li, Siwei Lyu:
Exposing GAN-Generated Faces Using Inconsistent Corneal Specular Highlights. 2500-2504 - Jiaxin Chen, Xin Liao, Wei Wang, Zheng Qin:
A Features Decoupling Method for Multiple Manipulations Identification in Image Operation Chains. 2505-2509 - Pavel Korshunov, Sébastien Marcel:
Subjective and Objective Evaluation of Deepfake Videos. 2510-2514 - Alexander Schlögl, Tobias Kupek, Rainer Böhme:
Forensicability of Deep Neural Network Inference Pipelines. 2515-2519 - Jianhui Xie, Song Liu, Ruixin Liu, Yinghong Zhang, Yuesheng Zhu:
SERN: Stance Extraction and Reasoning Network for Fake News Detection. 2520-2524 - Yuhao Sun, Xin Liao, Jianfeng Liu:
An Efficient Paper Anti-Counterfeiting Method Based on Microstructure Orientation Estimation. 2525-2529 - Irene Amerini, Aris Anagnostopoulos, Luca Maiano, Lorenzo Ricciardi Celsi:
Learning Double-Compression Video Fingerprints Left From Social-Media Platforms. 2530-2534 - Chiara Albisani, Massimo Iuliani, Alessandro Piva:
Checking PRNU Usability on Modern Devices. 2535-2539 - Thomas Thebaud, Gaël Le Lan, Anthony Larcher:
Handwritten Digits Reconstruction from Unlabelled Embeddings. 2540-2544 - Samet Taspinar, Manoranjan Mohanty, Nasir D. Memon:
Effect of Video Pixel-Binning on Source Attribution of Mixed Media. 2545-2549 - Lingling Lv, Youjun Xiang, Xianfeng Li, Hanye Huang, Rongju Ruan, Xiaoyan Xu, Yuli Fu:
Combining Dynamic Image and Prediction Ensemble for Cross-Domain Face Anti-Spoofing. 2550-2554 - Mingzhu Ma, Gongping Yang, Kuikui Wang, Yuwen Huang, Yilong Yin:
Label-Guided Dictionary Pair Learning for ECG Biometric Recognition. 2555-2559 - Tongqing Zhai, Yiming Li, Ziqi Zhang, Baoyuan Wu, Yong Jiang, Shu-Tao Xia:
Backdoor Attack Against Speaker Verification. 2560-2564 - Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich:
Class-Conditional Defense GAN Against End-To-End Speech Attacks. 2565-2569 - Yiqun Liu, Yi Zeng, Jian Pu, Hongming Shan, Peiyang He, Junping Zhang:
Selfgait: A Spatiotemporal Representation Learning Method for Self-Supervised Gait Recognition. 2570-2574 - Weiyi Zhang, Shuning Zhao, Le Liu, Jianmin Li, Xingliang Cheng, Thomas Fang Zheng, Xiaolin Hu:
Attack on Practical Speaker Verification System Using Universal Adversarial Perturbations. 2575-2579 - Heinz Hofbauer, Yoanna Martínez-Díaz, Simon Kirchgasser, Heydi Méndez-Vázquez, Andreas Uhl:
Highly Efficient Protection of Biometric Face Samples with Selective JPEG2000 Encryption. 2580-2584 - Hatef Otroshi-Shahreza, Sébastien Marcel:
Deep Auto-Encoding and Biohashing for Secure Finger Vein Recognition. 2585-2589 - Jinzhu Yang, Wei Zhou, Wanhui Qian, Jizhong Han, Songlin Hu:
Topic Sequence Embedding for User Identity Linkage from Heterogeneous Behavior Data. 2590-2594 - Daniele Mari, Samuele Giuliano Piazzetta, Sara Bordin, Luca Pajola, Sebastiano Verde, Simone Milani, Mauro Conti:
Looking Through Walls: Inferring Scenes from Video-Surveillance Encrypted Traffic. 2595-2599 - Zhanjiang Chen, H. Vicky Zhao:
Optimal Attacking Strategy Against Online Reputation Systems with Consideration of the Message-Based Persuasion Phenomenon. 2600-2604 - Mohammad Adiban, Arash Safari, Giampiero Salvi:
STEP-GAN: A One-Class Anomaly Detection Model with Applications to Power System Security. 2605-2609 - Michele Cirillo, Mario Di Mauro, Vincenzo Matta, Marco Tambasco:
Application-Layer DDOS Attacks with Multiple Emulation Dictionaries. 2610-2614 - Henri Hentilä, Yanina Y. Shkel, Visa Koivunen:
Secret Key Generation Over Wireless Channels using short Blocklength Multilevel Source Polar Coding. 2615-2619 - Zhifan Xu, Melike Baykal-Gürsoy:
Efficient Network Protection Games Against Multiple Types Of Strategic Attackers. 2620-2624 - Jinyuan Jia, Zheng Dong, Jie Li, Jack W. Stokes:
Detection Of Malicious DNS and Web Servers using Graph-Based Approaches. 2625-2629 - Mengdi Wang, Di Xiao, Jia Liang:
Low Complexity Secure P-Tensor Product Compressed Sensing Reconstruction Outsourcing and Identity Authentication in Cloud. 2630-2634 - Behrooz Razeghi, Sohrab Ferdowsi, Dimche Kostadinov, Flávio P. Calmon, Slava Voloshynovskiy:
Privacy-Preserving near Neighbor Search via Sparse Coding with Ambiguation. 2635-2639 - Zuobin Ying, Shuanglong Cao, Shengmin Xu, Ximeng Liu, Lingjuan Lyu, Cen Chen, Li Wang:
Privacy-Preserving Optimal Insulin Dosing Decision. 2640-2644 - Yulu Jin, Lifeng Lai:
Privacy-Accuracy Trade-Off of Inference as Service. 2645-2649 - Muah Kim, Onur Günlü, Rafael F. Schaefer:
Federated Learning with Local Differential Privacy: Trade-Offs Between Privacy, Utility, and Communication. 2650-2654 - Amin Aminifar, Fazle Rabbi, Yngve Lamo:
Scalable Privacy-Preserving Distributed Extremely Randomized Trees for Structured Data With Multiple Colluding Parties. 2655-2659 - Ecenaz Erdemir, Pier Luigi Dragotti, Deniz Gündüz:
Active Privacy-Utility Trade-Off Against A Hypothesis Testing Adversary. 2660-2664 - Bhanuka Gamage, Adnan Labib, Aisha Joomun, Chern Hong Lim, KokSheik Wong:
Baitradar: A Multi-Model Clickbait Detection Algorithm Using Deep Learning. 2665-2669 - Xiangyu Wang, Jianfeng Ma, Ximeng Liu:
Enabling Efficient and Expressive Spatial Keyword Queries On Encrypted Data. 2670-2674 - Shangyu Xie, Bingyu Liu, Yuan Hong:
Privacy-Preserving Cloud-Based DNN Inference. 2675-2679 - Avital Shafran, Gil Segev, Shmuel Peleg, Yedid Hoshen:
Crypto-Oriented Neural Architecture Design. 2680-2684 - Seok-Jun Bu, Sung-Bae Cho:
Integrating Deep Learning with First-Order Logic Programmed Constraints for Zero-Day Phishing Attack Detection. 2685-2689 - Haibo Cheng, Wenting Li, Ping Wang, Kaitai Liang:
Improved Probabilistic Context-Free Grammars for Passwords Using Word Extraction. 2690-2694 - Tingting Song, Minglin Liu, Weiqi Luo, Peijia Zheng:
Enhancing Image Steganography Via Stego Generation And Selection. 2695-2699 - Shengbei Wang, Weitao Yuan, Zhen Zhang, Jianming Wang, Masashi Unoki:
Synchronous Multi-Bit Audio Watermarking Based on Phase Shifting. 2700-2704 - Xinghong Qin, Shunquan Tan, Weixuan Tang, Bin Li, Jiwu Huang:
Image Steganography Based on Iterative Adversarial Perturbations Onto a Synchronized-Directions Sub-Image. 2705-2709 - Jan Butora, Jessica J. Fridrich:
Extending the Reverse JPEG Compatibility Attack to Double Compressed Images. 2710-2714 - Yuxuan Huang, Xin Cao, Hao-Tian Wu, Yiu-ming Cheung:
Reversible Data Hiding in JPEG Images for Privacy Protection. 2715-2719 - Xiaoqing Jia, Jie Wang, Yongliang Liu, Xiangui Kang, Yun-Qing Shi:
A Layered Embedding-Based Scheme to Cope with Intra-Frame Distortion Drift In IPM-Based HEVC Steganography. 2720-2724 - Zejiang Hou, Anwar Walid, Sun-Yuan Kung:
Meta-Learning with Attention for Improved Few-Shot Learning. 2725-2729 - Anish Madan, Ranjitha Prasad:
B-Small: A Bayesian Neural Network Approach to Sparse Model-Agnostic Meta-Learning. 2730-2734 - Wen Tang, Emilie Chouzenoux, Jean-Christophe Pesquet, Hamid Krim:
Deep Transform and Metric Learning Networks. 2735-2739 - Pengchao Han, Jihong Park, Shiqiang Wang, Yejun Liu:
Robustness and Diversity Seeking Data-Free Knowledge Distillation. 2740-2744 - Yassir Fathullah, Mark J. F. Gales, Andrey Malinin:
Ensemble Distillation Approaches for Grammatical Error Correction. 2745-2749 - Shucong Zhang, Cong-Thanh Do, Rama Doddipatla, Erfan Loweimi, Peter Bell, Steve Renals:
Train Your Classifier First: Cascade Neural Networks Training from Upper Layers to Lower Layers. 2750-2754 - Antônio H. Ribeiro, Thomas B. Schön:
How Convolutional Neural Networks Deal with Aliasing. 2755-2759 - Tianyou Chen, Xiaoguang Hu, Jin Xiao, Guofeng Zhang, Hui Ruan:
Canet: Context-Aware Loss for Descriptor Learning. 2760-2764 - Yan Zhang, Binyu He, Li Sun, Qingli Li:
Progressive Multi-Stage Feature Mix for Person Re-Identification. 2765-2769 - Vivek Sivaraman Narayanaswamy, Jayaraman J. Thiagarajan, Andreas Spanias:
Using Deep Image Priors to Generate Counterfactual Explanations. 2770-2774 - Hojatollah Zamani, Peyman Rostami, Arash Amini, Farokh Marvasti:
Elliptical Shape Recovery from Blurred Pixels Using Deep Learning. 2775-2779 - Eran Goldman, Jacob Goldberger:
Factorized CRF with Batch Normalization Based on the Entire Training Data. 2780-2784 - Zhenhua Liu, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao:
Evolutionary Quantization of Neural Networks with Mixed-Precision. 2785-2789 - Yong Wang, Xiaojing Wang, Xiaoyu He:
Evolving Quantized Neural Networks for Image Classification Using A Multi-Objective Genetic Algorithm. 2790-2794 - Bochen Guan, Jinnian Zhang, William A. Sethares, Richard Kijowski, Fang Liu:
Spectral Domain Convolutional Neural Network. 2795-2799 - Luke Wood, Eric C. Larson:
Parametric Spectral Filters for Fast Converging, Scalable Convolutional Neural Networks. 2800-2804 - Xinyue Liang, Mikael Skoglund, Saikat Chatterjee:
Feature Reuse for a Randomization Based Neural Network. 2805-2809 - Alireza M. Javid, Sandipan Das, Mikael Skoglund, Saikat Chatterjee:
A ReLU Dense Layer to Improve the Performance of Neural Networks. 2810-2814 - Raphaël Achddou, J. Matías Di Martino, Guillermo Sapiro:
Nested Learning for Multi-Level Classification. 2815-2819 - Yu Wang, Shenjie Zhao:
Cross-Modal Representation Reconstruction for Zero-Shot Classification. 2820-2824 - Jisheng Dang, Jun Yang:
HIGCNN: Hierarchical Interleaved Group Convolutional Neural Networks for Point Clouds Analysis. 2825-2829 - Bo Zhang, Wenfeng Li, Qingyuan Li, Weiji Zhuang, Xiangxiang Chu, Yujun Wang:
AutoKWS: Keyword Spotting with Differentiable Architecture Search. 2830-2834 - Yubin Ge, Site Li, Xuyang Li, Fangfang Fan, Wanqing Xie, Jane You, Xiaofeng Liu:
Embedding Semantic Hierarchy in Discrete Optimal Transport for Risk Minimization. 2835-2839 - Panagiotis A. Traganitis, Georgios B. Giannakis:
Identifying Spammers to Boost Crowdsourced Classification. 2840-2844 - Shenfei Pei, Feiping Nie, Rong Wang, Xuelong Li:
A Rank-Constrained Clustering Algorithm with Adaptive Embedding. 2845-2849 - Yulan Deng, Lunke Fei, Shaohua Teng, Wei Zhang, Dongning Liu, Yan Hou:
Towards Efficient Age Estimation by Embedding Potential Gender Features. 2850-2854 - Ismail R. Alkhouri, George K. Atia:
Adversarial Attacks on Coarse-to-Fine Classifiers. 2855-2859 - Xiang Liu, Naiqi Li, Shu-Tao Xia:
GDTW: A Novel Differentiable DTW Loss for Time Series Tasks. 2860-2864 - Illya Degtyarenko, Ivan Deriuga, Andrii Grygoriev, Serhii Polotskyi, Volodymyr Melnyk, Dmytro Zakharchuk, Olga Radyvonenko:
Hierarchical Recurrent Neural Network for Handwritten Strokes Classification. 2865-2869 - Wenyu Zhang, Mohamed Ragab, Ramón Sagarna:
Robust Domain-Free Domain Generalization with Class-Aware Alignment. 2870-2874 - Swatantra Kafle, Geethu Joseph, Pramod K. Varshney:
One-Bit Compressed Sensing Using Untrained Network Prior. 2875-2879 - Rong Fu, Vincent Monardo, Tianyao Huang, Yimin Liu:
Deep Unfolding Network for Block-Sparse Signal Recovery. 2880-2884 - Wei Pu, Chao Zhou, Yonina C. Eldar, Miguel R. D. Rodrigues:
REST: Robust lEarned Shrinkage-Thresholding Network Taming Inverse Problems with Model Mismatch. 2885-2889 - Bahareh Tolooshams, Satish Mulleti, Demba E. Ba, Yonina C. Eldar:
Unfolding Neural Networks for Compressive Multichannel Blind Deconvolution. 2890-2894 - Vinayak Killedar, Praveen Kumar Pokala, Chandra Sekhar Seelamantula:
Sparsity Driven Latent Space Sampling for Generative Prior Based Compressive Sensing. 2895-2899 - Anurag Das, Seyedhooman Sajjadi, Bobak Mortazavi, Theodora Chaspari, Projna Paromita, Laura Ruebush, Nicolaas E. P. Deutz, Ricardo Gutierrez-Osuna:
A Sparse Coding Approach to Automatic Diet Monitoring with Continuous Glucose Monitors. 2900-2904 - Ouafae Karmouda, Jérémie Boulanger, Rémy Boyer:
Speeding Up of Kernel-Based Learning for High-Order Tensors. 2905-2909 - Le Trung Thanh, Karim Abed-Meraim, Nguyen Linh-Trung, Adel Hafiane:
A Fast Randomized Adaptive CP Decomposition For Streaming Tensors. 2910-2914 - Athanasios A. Rontogiannis, Paris V. Giampouras, Eleftherios Kofidis:
Rank-Revealing Block-Term Decomposition for Tensor Completion. 2915-2919 - Kriton Konstantinidis, Shengxi Li, Danilo P. Mandic:
Kernel Learning with Tensor Networks. 2920-2924 - Wenqiang Pu, Shahana Ibrahim, Xiao Fu, Mingyi Hong:
Fiber-Sampled Stochastic Mirror Descent for Tensor Decomposition with β-Divergence. 2925-2929 - Ruyuan Qu, Jiaqi He, Hui Feng, Chongbin Xu, Bo Hu:
Regularized Recovery by Multi-Order Partial Hypergraph Total Variation. 2930-2934 - Zhe Feng, Jie Tang, Yishun Dou, Gangshan Wu:
Learning Discriminative Features for Semi-Supervised Anomaly Detection. 2935-2939 - Jiaxiang Tang, Xiang Gao, Wei Hu:
RGLN: Robust Residual Graph Learning Networks via Similarity-Preserving Mapping on Graphs. 2940-2944 - Eric Sun, Liang Lu, Zhong Meng, Yifan Gong:
Sequence-Level Self-Teaching Regularization. 2945-2949 - Sina Alemohammad, Hossein Babaei, Randall Balestriero, Matt Y. Cheung, Ahmed Imtiaz Humayun, Daniel LeJeune, Naiming Liu, Lorenzo Luzi, Jasper Tan, Zichao Wang, Richard G. Baraniuk:
Wearing A Mask: Compressed Representations of Variable-Length Sequences Using Recurrent Neural Tangent Kernels. 2950-2954 - Naiqi Li, Yinghua Gao, Wenjie Li, Yong Jiang, Shu-Tao Xia:
H-GPR: A Hybrid Strategy for Large-Scale Gaussian Process Regression. 2955-2959 - Laia Amorós, Mikko Pitkänen:
Learning Optimal Lattice Codes for MIMO Communications. 2960-2964 - Alexandre Bittar, Philip N. Garner:
A Bayesian Interpretation of the Light Gated Recurrent Unit. 2965-2969 - Charles Séjourné, Romain Couillet, Pierre Comon:
A Large-Dimensional Analysis of Symmetric SNE. 2970-2974 - Alec Koppel, Amrit S. Bedi, Vikram Krishnamurthy:
A Dynamical Systems Perspective on Online Bayesian Nonparametric Estimators with Adaptive Hyperparameters. 2975-2979 - Zixiao Zong, Yanning Shen:
Online Multi-Hop Information Based Kernel Learning Over Graphs. 2980-2984 - Nikos Tsilivis, Anastasios Tsiamis, Petros Maragos:
Sparsity in Max-Plus Algebra and Applications in Multivariate Convex Regression. 2985-2989 - Jose Agustin Barrachina, Chenfang Ren, Christèle Morisseau, Gilles Vieillard, Jean Philippe Ovarlez:
Complex-Valued Vs. Real-Valued Neural Networks for Classification Perspectives: An Example on Non-Circular Data. 2990-2994 - Raphaël Olivier, Bhiksha Raj, Muhammad A. Shah:
High-Frequency Adversarial Defense for Speech and Audio. 2995-2999 - Jie Pu, Yannis Panagakis, Maja Pantic:
Learning Separable Time-Frequency Filterbanks for Audio Classification. 3000-3004 - Jordi Pons, Santiago Pascual, Giulio Cengarle, Joan Serrà:
Upsampling Artifacts in Neural Audio Synthesis. 3005-3009 - Kleanthis Avramidis, Agelos Kratimenos, Christos Garoufis, Athanasia Zlatintsi, Petros Maragos:
Deep Convolutional and Recurrent Networks for Polyphonic Instrument Classification from Monophonic Raw Audio Waveforms. 3010-3014 - Ke Chen, Beici Liang, Xiaoshuan Ma, Minwei Gu:
Learning Audio Embeddings with User Listening Data for Content-Based Music Recommendation. 3015-3019 - Zixuan Peng, Yu Lu, Shengfeng Pan, Yunfeng Liu:
Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention. 3020-3024 - Sungkyun Chang, Donmoon Lee, Jeongsoo Park, Hyungui Lim, Kyogu Lee, Karam Ko, Yoonchang Han:
Neural Audio Fingerprint for High-Specific Audio Retrieval Based on Contrastive Learning. 3025-3029 - Qiantong Xu, Alexei Baevski, Tatiana Likhomanenko, Paden Tomasello, Alexis Conneau, Ronan Collobert, Gabriel Synnaeve, Michael Auli:
Self-Training and Pre-Training are Complementary for Speech Recognition. 3030-3034