


ICASSP 2023: Rhodes Island, Greece
- IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2023, Rhodes Island, Greece, June 4-10, 2023. IEEE 2023, ISBN 978-1-7281-6327-7
- Tongzhou Chen, Cyril Allauzen, Yinghui Huang, Daniel S. Park, David Rybach, W. Ronny Huang, Rodrigo Cabrera, Kartik Audhkhasi, Bhuvana Ramabhadran, Pedro J. Moreno, Michael Riley:
Large-Scale Language Model Rescoring on Long-Form Data. 1-5 - Haibo Ye, Fangyu Zhou, Xinjie Li, Qingheng Zhang:
Balanced Mixup Loss for Long-Tailed Visual Recognition. 1-5 - Hanbing Liu, Yanru Wu, Yang Liu, Ercan E. Kuruoglu, Xuan Zhang:
SDG-L: A Semiparametric Deep Gaussian Process based Framework for Battery Capacity Prediction. 1-5 - Harshat Kumar, Alejandro Parada-Mayorga, Alejandro Ribeiro:
Algebraic Convolutional Filters on Lie Group Algebras. 1-5 - Atsushi Miyashita, Tomoki Toda:
Representation of Vocal Tract Length Transformation Based on Group Theory. 1-5 - Aochuan Chen, Peter Lorenz, Yuguang Yao, Pin-Yu Chen, Sijia Liu:
Visual Prompting for Adversarial Robustness. 1-5 - Yuzhou Chen, Sotiris Batsakis, H. Vincent Poor:
Higher-Order Spatio-Temporal Neural Networks for Covid-19 Forecasting. 1-5 - Domenico Mattia Cinque, Claudio Battiloro, Paolo Di Lorenzo:
Pooling Strategies for Simplicial Convolutional Networks. 1-5 - Jerry Gu, Liam Collins, Debashri Roy, Aryan Mokhtari, Sanjay Shakkottai, Kaushik R. Chowdhury:
Meta-Learning for Image-Guided Millimeter-Wave Beam Selection in Unseen Environments. 1-5 - Amlu Anna Joshy, P. N. Parameswaran, Siddharth R. Nair, Rajeev Rajan:
Statistical Analysis of Speech Disorder Specific Features to Characterise Dysarthria Severity Level. 1-5 - Brian Yan, Matthew Wiesner, Ondrej Klejch, Preethi Jyothi, Shinji Watanabe:
Towards Zero-Shot Code-Switched Speech Recognition. 1-5 - Jian Chen, Wei Wang, Junxin Chen, Ming Cai:
Dynamic Vehicle Graph Interaction for Trajectory Prediction Based on Video Signals. 1-5 - Thien-Phuc Doan, Long Nguyen-Vu, Souhwan Jung, Kihun Hong:
BTS-E: Audio Deepfake Detection Using Breathing-Talking-Silence Encoder. 1-5 - Yahong Zhang, Sheng Shi, Chenchen Fan, Yixin Wang, Wenli Ouyang, WeiFan, Jianping Fan:
Long-Tailed Recognition with Causal Invariant Transformation. 1-5 - Xiu Zheng, Yuan Huang, Jie Tang:
Reliable Cluster-Based Framework for Open Set Domain Adaptation. 1-5 - Jing-Xuan Zhang, Genshun Wan, Zhen-Hua Ling, Jia Pan, Jianqing Gao, Cong Liu:
Self-Supervised Audio-Visual Speech Representations Learning by Multimodal Self-Distillation. 1-5 - Weiquan Huang, Fu Zhang:
Semi-Supervised Semantic Segmentation with Structured Output Space Adaption. 1-5 - Gaopeng Xu, Xianliang Wang, Sang Wang, Junfeng Yuan, Wei Guo, Wei Li, Jie Gao:
The NIO System for Audio-Visual Diarization and Recognition in MISP Challenge 2022. 1-2 - Chenghu Du, Shengwu Xiong:
CF-VTON: Multi-Pose Virtual Try-on with Cross-Domain Fusion. 1-5 - Subhashini Venugopalan, Jimmy Tobin, Samuel J. Yang, Katie Seaver, Richard J. N. Cave, Pan-Pan Jiang, Neil Zeghidour, Rus Heywood, Jordan R. Green, Michael P. Brenner:
Speech Intelligibility Classifiers from 550k Disordered Speech Samples. 1-5 - Kassem Kallas, Teddy Furon:
Mixer: DNN Watermarking using Image Mixup. 1-5 - Kaushani Majumder, Sibi Raj B. Pillai, Satish Mulleti:
Clustered Greedy Algorithm For Large-Scale Sensor Selection. 1-5 - Ke Hu, Tara N. Sainath, Bo Li, Nan Du, Yanping Huang, Andrew M. Dai, Yu Zhang, Rodrigo Cabrera, Zhifeng Chen, Trevor Strohman:
Massively Multilingual Shallow Fusion with Large Language Models. 1-5 - Dazhao Du, Bing Su, Zhewei Wei:
Preformer: Predictive Transformer with Multi-Scale Segment-Wise Correlations for Long-Term Time Series Forecasting. 1-5 - Ziyue Wang, Ya-Feng Liu, Zhaorui Wang, Wei Yu:
Scaling Law Analysis for Covariance Based Activity Detection in Cooperative Multi-Cell Massive Mimo. 1-5 - Michael Chan, Li Zhu, Korosh Vatanparvar, Hewon Jung, Jilong Kuang, Jun Alex Gao:
Improving Heart Rate and Heart Rate Variability Estimation from Video Through a HR-RR-Tuned Filter. 1-5 - Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi, Shinji Watanabe:
Intermpl: Momentum Pseudo-Labeling With Intermediate CTC Loss. 1-5 - Jiewen Zhu, Shengjia Chen, Lexiao Li, Luping Ji:
Sanet: Spatial Attention Network with Global Average Contrast Learning for Infrared Small Target Detection. 1-5 - Manila Kodali, Sudarsana Reddy Kadiri, Laura Laaksonen, Paavo Alku:
Automatic Classification of Vocal Intensity Category from Speech. 1-5 - Xingming Wang, Hao Wu, Chen Ding, Chuanzeng Huang, Ming Li:
Exploring Universal Singing Speech Language Identification Using Self-Supervised Learning Based Front-End Features. 1-5 - Jochen Fink, Renato L. G. Cavalcante, Zoran Utkovski, Slawomir Stanczak:
Deep-Unfolded Adaptive Projected Subgradient Method For Mimo Detection. 1-5 - Sofia Suvorova, Ali Pezeshki, Ross Kyprianou, Bill Moran:
A Radar-Jammer Zero-Sum Repeated Bayesian Game. 1-5 - Shuo Feng, Piji Li:
Ancient Chinese Word Segmentation and Part-of-Speech Tagging Using Distant Supervision. 1-5 - Yao Lu, Zhiyi Chen, Zehui Chen, Jie Hu, Liujuan Cao, Shengchuan Zhang:
CANDY: Category-Kernelized Dynamic Convolution for Instance Segmentation. 1-5 - Liuyin Wang, Mingchao Li, Hai-Tao Zheng:
High-Level Feature Fusion Network for Session-Based Social Recommendation. 1-5 - Mingliang Dai, Zhizhong Huang, Jiaqi Gao, Hongming Shan, Junping Zhang:
Cross-Head Supervision for Crowd Counting with Noisy Annotations. 1-5 - Liana Khamidullina, André L. F. de Almeida, Martin Haardt:
Rate Splitting and Precoding Strategies for Multi-User MIMO Broadcast Channels with Common and Private Streams. 1-5 - Lei Zhang, Jie Liu, Yanqi Bao, Jie Wang:
Region-Awared Transformer with Asymmetric Loss in Multi-Label Classification. 1-5 - Mehul Kumar, Jiyeon Kim, Dhananjaya Gowda, Abhinav Garg, Chanwoo Kim:
Self-Supervised Accent Learning for Under-Resourced Accents Using Native Language Data. 1-5 - Jun Wang, Peng Yao, Feng Deng, Jianchao Tan, Chengru Song, Xiaorui Wang:
NAS-DYMC: NAS-Based Dynamic Multi-Scale Convolutional Neural Network for Sound Event Detection. 1-5 - Xianyu Wang, Yuhan Zhang, Weihua He, Yaoyuan Wang, Minglei Li, Yuchen Wang, Jingyi Zhang, Shunbo Zhou, Ziyang Zhang:
Audio-Driven High Definetion and Lip-Synchronized Talking Face Generation Based on Face Reenactment. 1-5 - Han Ding, Wenjing Song, Cui Zhao, Fei Wang, Ge Wang, Wei Xi, Jizhong Zhao:
Knowledge-Graph Augmented Music Representation for Genre Classification. 1-5 - Da Li, Bo Tang, Lei Xue:
Co-Design for Mimo Radar and Mimo Communication Aided by Reconfigurable Intelligent Surface. 1-5 - Daizong Liu, Pan Zhou:
Jointly Visual- and Semantic-Aware Graph Memory Networks for Temporal Sentence Localization in Videos. 1-5 - Yudong Zhang, Wei Lu, Xu Wang, Pengkun Wang, Yang Wang:
Pondering About Task Spatial Misalignment: Classification-Localization Equilibrated Object Detection. 1-5 - Andrea Marinoni, Marine Mercier, Qian Shi, Sivasakthy Selvakumaran, Mark Girolami:
Incorporating Reliability in Graph Information Propagation by Fluid Dynamics Diffusion: A case of Multimodal Semisupervised Deep Learning. 1-5 - Zhao Ren, Thanh Tam Nguyen, Yi Chang, Björn W. Schuller:
Fast Yet Effective Speech Emotion Recognition with Self-Distillation. 1-5 - Marco A. Oliveira, Vitor Almeida, João Silva, Aníbal J. S. Ferreira:
Analysis and Re-Synthesis of Natural Cricket Sounds Assessing the Perceptual Relevance of Idiosyncratic Parameters. 1-5 - Yikang Wei, Yahong Han:
Exploring Instance Relation for Decentralized Multi-Source Domain Adaptation. 1-5 - Yihong Wu, Yuwen Heng, Mahesan Niranjan, Hansung Kim:
Depth Estimation for a Single Omnidirectional Image with Reversed-Gradient Warming-up Thresholds Discriminator. 1-5 - Ysobel Sims, Alexandre Mendes, Stephan K. Chalup:
Enhanced Embeddings in Zero-Shot Learning for Environmental Audio. 1-5 - Youngki Kwon, Hee-Soo Heo, Bong-Jin Lee, You Jin Kim, Jee-Weon Jung:
Absolute Decision Corrupts Absolutely: Conservative Online Speaker Diarisation. 1-5 - Paul-Gauthier Noé, Xiaoxiao Miao, Xin Wang, Junichi Yamagishi, Jean-François Bonastre, Driss Matrouf:
Hiding Speaker's Sex in Speech Using Zero-Evidence Speaker Representation in an Analysis/Synthesis Pipeline. 1-5 - Seyed Saman Saboksayr, Gonzalo Mateos:
Dual-Based Online Learning of Dynamic Network Topologies. 1-5 - Benjamin Z. Reichman, Anirudh Sundar, Christopher Richardson, Tamara Zubatiy, Prithwijit Chowdhury, Aaryan Shah, Jack Truxal, Micah Grimes, Dristi Shah, Woo Ju Chee, Saif Punjwani, Atishay Jain, Larry Heck:
Outside Knowledge Visual Question Answering Version 2.0. 1-5 - Zihui Cai, Hongwei Ding, Xuemeng Wu, Mohan Xu, Xiaohui Cui:
Hierarchical Transformer for Multi-Label Trailer Genre Classification. 1-5 - Georgios Rizos, Rafael A. Calvo, Björn W. Schuller:
Positive-Pair Redundancy Reduction Regularisation for Speech-Based Asthma Diagnosis Prediction. 1-5 - Xunmeng Wu, Zai Yang, Jian-Feng Cai, Zongben Xu:
Spectral Super-Resolution on the Unit Circle Via Gradient Descent. 1-5 - Seongyeon Park, Myungseo Song, Bohyung Kim, Tae-Hyun Oh:
Unsupervised Pre-Training for Data-Efficient Text-to-Speech on Low Resource Languages. 1-5 - Fengming Liang, Changlin Fan, Bo Xiao, Kongming Liang:
Semantic Centralized Contrastive Learning for Unsupervised Hashing. 1-5 - Chia-Sheng Liu, Jia-Fong Yeh, Hao Hsu, Hung-Ting Su, Ming-Sui Lee, Winston H. Hsu:
BIRD-PCC: Bi-Directional Range Image-Based Deep Lidar Point Cloud Compression. 1-5 - Zengrui Jin, Xurong Xie, Mengzhe Geng, Tianzi Wang, Shujie Hu, Jiajun Deng, Guinan Li, Xunying Liu:
Adversarial Data Augmentation Using VAE-GAN for Disordered Speech Recognition. 1-5 - Guanjun Li, Wei Xue, Wenju Liu, Jiangyan Yi, Jianhua Tao:
GCC-Speaker: Target Speaker Localization with Optimal Speaker-Dependent Weighting in Multi-Speaker Scenarios. 1-5 - Yihe Wang, Yitong Li, Yasheng Wang, Fei Mi, Pingyi Zhou, Jin Liu, Xin Jiang, Qun Liu:
History, Present and Future: Enhancing Dialogue Generation with Few-Shot History-Future Prompt. 1-5 - Yang Zhang, Krishna C. Puvvada, Vitaly Lavrukhin, Boris Ginsburg:
Conformer-Based Target-Speaker Automatic Speech Recognition For Single-Channel Audio. 1-5 - Dan Berrebbi, Brian Yan, Shinji Watanabe:
Avoid Overthinking in Self-Supervised Models for Speech Recognition. 1-5 - Sarah Miller, Christina Karam, Achour Idoughi, Kodai Kikuchi, Keigo Hirakawa:
A Bayesian Perspective on Noise2Noise: Theory and Extensions. 1-5 - Yuhongze Zhou, Liguang Zhou, Issam Hadj Laradji, Tin Lun Lam, Yangsheng Xu:
Affinity Learning With Blind-Spot Self-Supervision for Image Denoising. 1-5 - Tzeviya Sylvia Fuchs, Yedid Hoshen:
Unsupervised Word Segmentation Using Temporal Gradient Pseudo-Labels. 1-5 - Nauman Dawalatabad, Sameer Khurana, Antoine Laurent, James R. Glass:
On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration. 1-5 - Kisoo Kwon, Kuhwan Jeong, Junghyun Park, Hwidong Na, Jinwoo Shin:
String-Based Molecule Generation Via Multi-Decoder VAE. 1-5 - Xinzhou Xu, Jun Deng, Zixing Zhang, Zhen Yang, Björn W. Schuller:
Zero-Shot Speech Emotion Recognition Using Generative Learning with Reconstructed Prototypes. 1-5 - Roberto Pereira, Xavier Mestre, David Gregoratti:
Consistent Estimators of a New Class of Covariance Matrix Distances in the Large Dimensional Regime. 1-5 - Yu Bai, Ruian He, Weimin Tan, Bo Yan, Yangle Lin:
Fine-Grained Blind Face Inpainting with 3D Face Component Disentanglement. 1-5 - Yibin Tang, Ying Chen, Yuan Gao, Aimin Jiang, Lin Zhou:
ADHD Classification with Biomarker Identification Using a Triplet Loss Attention Auto-Encoding Network. 1-5 - Rakib Hyder, M. Salman Asif:
Compressive Sensing with Tensorized Autoencoder. 1-5 - Zhengzhuo Xu, Shuo Yang, Xingjun Wang, Chun Yuan:
Rethink Long-Tailed Recognition with Vision Transforms. 1-5 - Ruoyu Wang, Jun Du, Tian Gao:
Quantum Transfer Learning Using the Large-Scale Unsupervised Pre-Trained Model Wavlm-Large for Synthetic Speech Detection. 1-5 - Daeun Kyung, Kyungmin Jo, Jaegul Choo, Joonseok Lee, Edward Choi:
Perspective Projection-Based 3d CT Reconstruction from Biplanar X-Rays. 1-5 - Yanan Lin, Keyu Chen, Shihao Zhou, Yunan Huang, Yunqi Lei:
CO-NET: Classification-Oriented Point Cloud Sampling via Informative Feature Learning and Non-Overlapped Local Adjustment. 1-5 - Rémi Delogne, Vincent Schellekens, Laurent Daudet, Laurent Jacques:
Signal Processing with Optical Quadratic Random Sketches. 1-5 - Ferdinand Jost, Vassillen Chizhov, Joachim Weickert:
Optimising Different Feature Types for Inpainting-Based Image Representations. 1-5 - Fuyan Ma, Bin Sun, Shutao Li:
Logo-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition. 1-5 - Yun-Ning Hung, Chao-Han Huck Yang, Pin-Yu Chen, Alexander Lerch:
Low-Resource Music Genre Classification with Cross-Modal Neural Model Reprogramming. 1-5 - Jiukai Sun, Ganchao Liu, Xuelong Li, Yuan Yuan:
Difference Guided VHR Remote Sensing Image Change Detection. 1-5 - Ryuichi Yamamoto, Reo Yoneyama, Tomoki Toda:
NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit. 1-5 - Yuya Nishi, Takumi Takahashi, Hiroki Iimori, Giuseppe Abreu, Shinsuke Ibi, Seiichi Sampei:
Wireless Location Tracking via Complex-Domain Super MDS with Time Series Self-Localization Information. 1-5 - Adarsh M. Subramaniam, Akshayaa Magesh, Venugopal V. Veeravalli:
Adaptive Step-Size Methods for Compressed SGD. 1-5 - Khoa Anh Ngo, Kyuhong Shim, Byonghyo Shim:
Spatial Cross-Attention for Transformer-Based Image Captioning. 1-5 - Tong Lei, Zhongshu Hou, Yuxiang Hu, Wanyu Yang, Tianchi Sun, Xiaobin Rong, Dahan Wang, Kai Chen, Jing Lu:
A Low-Latency Hybrid Multi-Channel Speech Enhancement System For Hearing Aids. 1-2 - Guangzhi Sun, Chao Zhang, Philip C. Woodland:
End-to-End Spoken Language Understanding with Tree-Constrained Pointer Generator. 1-5 - Anastasia Kuznetsova, Aswin Sivaraman, Minje Kim:
The Potential of Neural Speech Synthesis-Based Data Augmentation for Personalized Speech Enhancement. 1-5 - Sarbani Ghose, Deepak Mishra, Santi P. Maity, George C. Alexandropoulos:
RIS Reflection and Placement Optimisation for Underlay D2D Communications in Cognitive Cellular Networks. 1-5 - Tianyu Geng, Feng Ji, Pratibha, Wee Peng Tay:
Modulo EEG Signal Recovery Using Transformer. 1-5 - Bach-Tung Pham, Ting-Yu Wang, Phuong Le Thi, Khai-Thinh Nguyen, Yuan-Shan Lee, Tzu-Chiang Tai, Jia-Ching Wang:
Dense Adversarial Transfer Learning Based On Class-Invariance. 1-5 - Yuang Li, Xianrui Zheng, Philip C. Woodland:
Self-Supervised Learning-Based Source Separation for Meeting Data. 1-5 - Gerrit Maus, Dieter Brückmann:
Joint Angle and Respiration Estimation for Passive and Device-Free Respiration Monitoring. 1-5 - Yingting Li, Ambuj Mehrish, Rishabh Bhardwaj, Navonil Majumder, Bo Cheng, Shuai Zhao, Amir Zadeh, Rada Mihalcea, Soujanya Poria:
Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding. 1-5 - Kartik Audhkhasi, Brian Farris, Bhuvana Ramabhadran, Pedro J. Moreno:
Modular Conformer Training for Flexible End-to-End ASR. 1-5 - Zihan Zhang, Shimin Zhang, Mingshuai Liu, Yanhong Leng, Zhe Han, Li Chen, Lei Xie:
Two-Step Band-Split Neural Network Approach For Full-Band Residual Echo Suppression. 1-2 - Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Learning Speech Representations with Flexible Hidden Feature Dimensions. 1-5 - Vikram Krishnamurthy:
Adaptive Filtering Algorithms For Set-Valued Observations-Symmetric Measurement Approach To Unlabeled And Anonymized Data. 1-5 - Dianlong You, Houlin Wang, Bingxin Liu, Yang Yu, Zhiming Li:
DL-NET: Dilation Location Network for Temporal Action Detection. 1-5 - Vanya Bannihatti Kumar, Shanbo Cheng, Ningxin Peng, Yuchen Zhang:
Visual Information Matters for ASR Error Correction. 1-5 - Xiangping Zheng, Xun Liang, Bo Wu, Junlan Feng, Yuhui Guo, Sensen Zhang:
Intent Does Matter! Propagating High-Order Relations for Exploring Interest Preferences. 1-5 - Tom O'Malley, Shaojin Ding, Arun Narayanan, Quan Wang, Rajeev Rikhye, Qiao Liang, Yanzhang He, Ian McGraw:
Conditional Conformer: Improving Speaker Modulation For Single And Multi-User Speech Enhancement. 1-5 - Qin Lu, Konstantinos D. Polyzos:
Gaussian Process Dynamical Modeling for Adaptive Inference Over Graphs. 1-5 - Sakila S. Jayaweera, Beibei Wang, Xiaolu Zeng, Wei-Hsiang Wang, K. J. Ray Liu:
WIFI-Based Robust Child Presence Detection for Smart Cars. 1-5 - Hayato Futami, Jessica Huynh, Siddhant Arora, Shih-Lun Wu, Yosuke Kashiwagi, Yifan Peng, Brian Yan, Emiru Tsunoo, Shinji Watanabe:
The Pipeline System of ASR and NLU with MLM-based data Augmentation Toward Stop Low-Resource Challenge. 1-2 - Steven Vander Eeckt, Hugo Van hamme:
Weight Averaging: A Simple Yet Effective Method to Overcome Catastrophic Forgetting in Automatic Speech Recognition. 1-5 - Yan Zhao, Jincen Wang, Yuan Zong, Wenming Zheng, Hailun Lian, Li Zhao:
Deep Implicit Distribution Alignment Networks for cross-Corpus Speech Emotion Recognition. 1-5 - Byeonggeun Kim, Jun-Tae Lee, Seunghan Yang, Simyung Chang:
Scalable Weight Reparametrization for Efficient Transfer Learning. 1-5 - Mohammad Reza Hasanabadi, Majid Behdad, Davood Gharavian:
MFCCGAN: A Novel MFCC-Based Speech Synthesizer Using Adversarial Learning. 1-5 - Mingming Zhang, Ye Du, Zhenghui Hu, Qingjie Liu, Yunhong Wang:
BISVP: Building Footprint Extraction Via Bidirectional Serialized Vertex Prediction. 1-5 - Naman Khetan, Tushar Arora, Samee Ur Rehman, Deepak K. Gupta:
Implicitly Rotation Equivariant Neural Networks. 1-5 - Nirmesh Shah, Mayank Kumar Singh, Naoya Takahashi, Naoyuki Onoe:
Nonparallel Emotional Voice Conversion for Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing. 1-5 - Yukun Zhang, Chuan Wang, Sanyi Zhang, Xiaochun Cao:
A Database for Multi-Modal Short Video Quality Assessment. 1-5 - Chakka Sai Pradeep, Neelam Sinha, Banibrata Mukhopadhyay:
Measuring Deviation from Stochasticity in Time-Series Using Autoencoder Based Time-Invariant Representation: Application to Black Hole Data. 1-5 - Rahul Pandey, Roger Ren, Qi Luo, Jing Liu, Ariya Rastrow, Ankur Gandhe, Denis Filimonov, Grant P. Strimel, Andreas Stolcke, Ivan Bulyko:
Procter: Pronunciation-Aware Contextual Adapter For Personalized Speech Recognition In Neural Transducers. 1-5 - Jitendra K. Tugnait:
Estimation of High-Dimensional Differential Graphs from Multi-Attribute Data. 1-5 - Li Huang, Hongmei Wu, Qiang Gao, Guisong Liu:
Attention Localness in Shared Encoder-Decoder Model For Text Summarization. 1-5 - Puja Trivedi, Danai Koutra, Jayaraman J. Thiagarajan:
A Closer Look At Scoring Functions And Generalization Prediction. 1-5 - Ke Liu, Jingzhao Hu, Jun Feng:
Speech Emotion Recognition Based on Low-Level Auto-Extracted Time-Frequency Features. 1-5 - M. Amin Manouchehrpour, Harvinder Lehal, Mahsa Salmani, Timothy N. Davidson:
TDMA-Based Multi-User Binary Computation Offloading in the Finite-Block-Length Regime. 1-5 - Zheng Tan, Longxiu Huang, HanQin Cai, Yifei Lou:
Non-Convex Approaches for Low-Rank Tensor Completion under Tubal Sampling. 1-5 - Costas A. Kokke, Mario Coutino, Laura Anitori, Richard Heusdens, Geert Leus:
Sensor Selection for Angle of Arrival Estimation Based on the Two-Target Cramér-Rao Bound. 1-5 - Yannan Chen, Licheng Zhao, Yaowen Zhang, Kaiming Shen:
Inverse Quadratic Transform for Minimizing A Sum of Ratios. 1-5 - Chen Chen, Dong Wang, Thomas Fang Zheng:
CN-CVS: A Mandarin Audio-Visual Dataset for Large Vocabulary Continuous Visual to Speech Synthesis. 1-5 - W. Bastiaan Kleijn, Michael Chinen, Felicia S. C. Lim, Jan Skoglund:
Multi-Channel Audio Signal Generation. 1-5 - Kyungsu Kim, Minju Park, Haesun Joung, Yunkee Chae, Yeongbeom Hong, Seonghyeon Go, Kyogu Lee:
Show Me the Instruments: Musical Instrument Retrieval From Mixture Audio. 1-5 - Ben Hayes, Charalampos Saitis, György Fazekas:
Sinusoidal Frequency Estimation by Gradient Descent. 1-5 - David Ramírez, Ignacio Santamaría, Louis L. Scharf:
Passive Detection of Rank-One Gaussian Signals for Known Channel Subspaces and Arbitrary Noise. 1-5 - Yufeng Wu, Baowei Wang, Changyu Dai, Yi Yuan, Bin Li, Weiqian Zheng, Hao Wu:
Enhancing Robustness and Imperceptibility of Blind Watermarking with Improved Message Processor. 1-5 - Chenyu Huang, Weimin Tan, Jiaxing Shi, Zhen Xing, Bo Yan:
Uncer2Natural: Uncertainty-Aware Unsupervised Image Denoising. 1-5 - Ruolin Su, Jingfeng Yang, Ting-Wei Wu, Biing-Hwang Juang:
Choice Fusion As Knowledge For Zero-Shot Dialogue State Tracking. 1-5 - Sixiang Chen, Tian Ye, Jun Shi, Yun Liu, Jingxia Jiang, Erkang Chen, Peng Chen:
DEHRFormer: Real-Time Transformer for Depth Estimation and Haze Removal from Varicolored Haze Scenes. 1-5 - Dongseong Hwang, Khe Chai Sim, Yu Zhang, Trevor Strohman:
Comparison of Soft and Hard Target RNN-T Distillation for Large-Scale ASR. 1-5 - Hongyi Pan, Xin Zhu, Zhilu Ye, Pai-Yen Chen, Ahmet Enis Çetin:
Real-Time Wireless ECG-Derived Respiration Rate Estimation using an Autoencoder with a DCT Layer. 1-5 - Gary Wang, Kyle Kastner, Ankur Bapna, Zhehuai Chen, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang:
Understanding Shared Speech-Text Representations. 1-5 - Jinghan Jia, Yihua Zhang, Dogyoon Song, Sijia Liu, Alfred O. Hero III:
Robustness-Preserving Lifelong Learning Via Dataset Condensation. 1-5 - Yibo Zhang, Ping Gong, Zelin Wang, Zhe Li, Xuanyuan Yang:
DialogMI: A Dialogue Model Based on Enhancing Dialogue Mutual Information. 1-5 - Yi-Chiao Wu, Israel D. Gebru, Dejan Markovic, Alexander Richard:
Audiodec: An Open-Source Streaming High-Fidelity Neural Audio Codec. 1-5 - Yongxiang Feng, Weihua He, Kaichao You, Bing Liu, Ziyang Zhang, Yaoyuan Wang, Minglei Li, Yihang Lou, Jiawei Li, Guoqi Li, Jianxing Liao:
Test-Time Training-Free Domain Adaptation. 1-5 - Haoyu Lu, Nan Li, Tongtong Song, Longbiao Wang, Jianwu Dang, Xiaobao Wang, Shiliang Zhang:
Speech and Noise Dual-Stream Spectrogram Refine Network With Speech Distortion Loss For Robust Speech Recognition. 1-5 - Jinshuai Yang, Zhongliang Yang, Xinrui Ge, Jiajun Zou, Yue Gao, Yongfeng Huang:
LINK: Linguistic Steganalysis Framework with External Knowledge. 1-5 - Jinting Wu, Mei Tu:
A Person Identification System for the ICASSP 2023 e-Prevention Challenge. 1-2 - Xuandi Fu, Kanthashree Mysore Sathyendra, Ankur Gandhe, Jing Liu, Grant P. Strimel, Ross McGowan, Athanasios Mouchtaris:
Robust Acoustic And Semantic Contextual Biasing In Neural Transducers For Speech Recognition. 1-5 - Navneet Agrawal, Renato L. G. Cavalcante, Slawomir Stanczak:
Dynamic Distributed Convex Optimization "Over-The-Air" In Decentralized Wireless Networks. 1-5 - Weidong Chen, Xiaofen Xing, Xiangmin Xu, Jianxin Pang, Lan Du:
DST: Deformable Speech Transformer for Emotion Recognition. 1-5 - Matthias Blochberger, Filip Elvander, Randall Ali, Jan Østergaard, Jesper Jensen, Marc Moonen, Toon van Waterschoot:
Distributed Adaptive Norm Estimation for Blind System Identification in Wireless Sensor Networks. 1-5 - Hans Van Gorp, Merel M. van Gilst, Pedro Fonseca, Sebastiaan Overeem, Ruud J. G. van Sloun:
Aleatoric Uncertainty Estimation of Overnight Sleep Statistics Through Posterior Sampling Using Conditional Normalizing Flows. 1-5 - Tongzi Wu, Yuhao Zhou, Wang Ling, Hojin Yang, Joana Veloso, Lin Sun, Ruixin Huang, Norberto Guimaraes, Scott Sanner:
Towards Dialogue Modeling Beyond Text. 1-5 - Xiangui Kang, Pengcheng Su, Zisheng Huang, Yifang Chen, Jie Wang:
Double Compression Detection Based on the De-Blocking Filtering of HEVC Videos. 1-5 - Valentin Debarnot, Sidharth Gupta, Konik Kothari, Ivan Dokmanic:
Joint Cryo-ET Alignment and Reconstruction with Neural Deformation Fields. 1-5 - Shinta Otake, Rei Kawakami, Nakamasa Inoue:
Parameter Efficient Transfer Learning for Various Speech Processing Tasks. 1-5 - Esaú Villatoro-Tello, Srikanth R. Madikeri, Juan Zuluaga-Gomez, Bidisha Sharma, Seyyed Saeed Sarfjoo, Iuliia Nigmatulina, Petr Motlícek, Alexei V. Ivanov, Aravind Ganapathiraju:
Effectiveness of Text, Acoustic, and Lattice-Based Representations in Spoken Language Understanding Tasks. 1-5 - Georgi Tinchev, Marta Czarnowska, Kamil Deja, Kayoko Yanagisawa, Marius Cotescu:
Modelling Low-Resource Accents Without Accent-Specific TTS Frontend. 1-5 - Arghya Pal, Sailaja Rajanala, Raphaël C.-W. Phan, KokSheik Wong:
Self Supervised Bert for Legal Text Classification. 1-5 - Xiongbiao Luo:
A New Personalized Efficacy Atlas for Pallidal Deep Brain Stimulation. 1-5 - Harry Dong, Megna Shah, Sean Donegan, Yuejie Chi:
Deep Unfolded Tensor Robust PCA With Self-Supervised Learning. 1-5 - Ryota Komatsu, Yusuke Kimura, Takuma Okamoto, Takahiro Shinozaki:
Continuous Action Space-Based Spoken Language Acquisition Agent Using Residual Sentence Embedding and Transformer Decoder. 1-5 - Junhao Wang, Li Lu, Zhongjie Ba, Feng Lin, Kui Ren:
Shift to Your Device: Data Augmentation for Device-Independent Speaker Verification Anti-Spoofing. 1-5 - Chun-Yi Li, Yen-Yu Lin, Wei-Chen Chiu:
Decontamination Transformer For Blind Image Inpainting. 1-5 - Ruixia Zhang, Zhiqiong Wang, Zhongyang Wang, Junchang Xin:
A Dynamic Cross-Scale Transformer with Dual-Compound Representation for 3D Medical Image Segmentation. 1-5 - Yuzheng Wang, Zhaoyu Chen, Dingkang Yang, Yang Liu, Siao Liu, Wenqiang Zhang, Lizhe Qi:
Adversarial Contrastive Distillation with Adaptive Denoising. 1-5 - Jiawei Chen, Peijie Huang, Guotai Huang, Qianer Li, Yuhong Xu:
SDTN: Speaker Dynamics Tracking Network for Emotion Recognition in Conversation. 1-5 - Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà:
Efficient Speech Translation with Dynamic Latent Perceivers. 1-5 - Chaoran Yang, Qing Ling, Xueli Sheng, Mengfei Mu, Andreas Jakobsson:
Sparse and Structured Modelling of Underwater Acoustic Channel Impulse Responses. 1-5 - Ran Ji, Jiarui Li, Wentao He, Jianfeng Ren, Xudong Jiang:
Dual-Stream Siamese Vision Transformer With Mutual Attention For Radar Gait Verification. 1-5 - Camilo Aguilar, Mathias Ortner, Josiane Zerubia:
Enhanced GM-PHD Filter for Real Time Satellite Multi-Target Tracking. 1-5 - Yikemaiti Sataer, Chuanqi Shi, Miao Gao, Yunlong Fan, Bin Li, Zhiqiang Gao:
Integrating Syntactic and Semantic Knowledge in AMR Parsing with Heterogeneous Graph Attention Network. 1-5 - E. Kobayashi, Hiroyasu Yasuda, Kiyoshi Hayasaka, Yu Otake, Shunsuke Ono, Shogo Muramatsu:
Multi-Resolution Convolutional Dictionary Learning for Riverbed Dynamics Modeling. 1-5 - Priyesh Shukla, Sureshkumar S., Alex C. Stutts, Sathya Ravi, Theja Tulabandhula, Amit Ranjan Trivedi:
Robust Monocular Localization of Drones by Adapting Domain Maps to Depth Prediction Inaccuracies. 1-5 - Kaiwen Zhou, Zhilin Chen, Guochen Liu, Zhitang Chen:
A Novel Extrapolation Technique to Accelerate WMMSE. 1-5 - Takuya Fujihashi, Toshiaki Koike-Akino, Takashi Watanabe:
Soft 2D-to-3D Delivery Using Deep Graph Neural Networks for Holographic-Type Communication. 1-5 - Jie Chen, Xingchen Song, Zhendong Peng, Binbin Zhang, Fuping Pan, Zhiyong Wu:
LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech. 1-5 - Anselm Lohmann, Toon van Waterschoot, Jörg Bitzer, Simon Doclo:
Dereverberation in Acoustic Sensor Networks Using weighted Prediction Error with Microphone-Dependent Prediction Delays. 1-5 - Ke Yang, Sixian Wang, Jincheng Dai, Kailin Tan, Kai Niu, Ping Zhang:
WITT: A Wireless Image Transmission Transformer for Semantic Communications. 1-5 - Iván López-Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen:
Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting. 1-5 - Vandad Davoodnia, Ali Etemad:
Human Pose Estimation from Ambiguous Pressure Recordings with Spatio-Temporal Masked Transformers. 1-5 - Zekai Li, Wei Peng:
Self-Adaptive Reasoning on Sub-Questions for Multi-Hop Question Answering. 1-5 - Hugo Jaquard, Michaël Fanuel, Pierre-Olivier Amblard, Rémi Bardenet, Simon Barthelmé, Nicolas Tremblay:
Smoothing Complex-Valued Signals on Graphs with Monte-Carlo. 1-5 - Xian Zhong, Shuaipeng Su, Wenxuan Liu, Xuemei Jia, Wenxin Huang, Mengdie Wang:
Neighborhood Information-Based Label Refinement for Person Re-Identification with Label Noise. 1-5 - Oliver Watts, Lovisa Wihlborg, Cassia Valentini-Botinhao:
PUFFIN: Pitch-Synchronous Neural Waveform Generation for Fullband Speech on Modest Devices. 1-5 - Wilmer Lobato, Felipe Farias, William Cruz, Marcellus Amadeus:
Performance Comparison of TTS Models for Brazilian Portuguese to Establish a Baseline. 1-5 - Aaron Geldert, Nils Meyer-Kahlen, Sebastian J. Schlecht:
Interpolation of Spatial Room Impulse Responses Using Partial Optimal Transport. 1-5 - Marzieh Ajirak, Petar M. Djuric:
A Gaussian Latent Variable Model for Incomplete Mixed Type Data. 1-5 - Kang Li, Yan Song, Li-Rong Dai, Ian McLoughlin, Xin Fang, Lin Liu:
AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer. 1-5 - Mariam Saeed, Marwan Torki:
Lit the Darkness: Three-Stage Zero-Shot Learning for Low-Light Enhancement with Multi-Neighbor Enhancement Factors. 1-2 - Jie Liu, Yixuan Liu, Xue Han, Chao Deng, Junlan Feng:
ESCL: Equivariant Self-Contrastive Learning for Sentence Representations. 1-5 - Shangeth Rajaa, Kriti Anandan, Swaraj Dalmia, Tarun Gupta, Eng Siong Chng:
Improving Spoken Language Identification with Map-Mix. 1-5 - Huy Phan, Elisabeth R. M. Heremans, Oliver Y. Chén, Philipp Koch, Alfred Mertins, Maarten De Vos:
Improving Automatic Sleep Staging Via Temporal Smoothness Regularization. 1-5 - Ori Kenig, Koby Todros, Tülay Adali:
Robust GMM Parameter Estimation via the K-BM Algorithm. 1-5 - Nayeon Kim, Moonsub Byeon, Daehyun Ji, Dokwan Oh:
D-3DLD: Depth-Aware Voxel Space Mapping for Monocular 3D Lane Detection with Uncertainty. 1-5 - Wei Huang, Yixin Zhao, Xuechao Wu, Le Yin:
Improved Indoor Localization With NLOS Signal Propagations. 1-5 - Chan-Jan Hsu, Ho-Lam Chung, Hung-Yi Lee, Yu Tsao:
T5lephone: Bridging Speech and Text Self-Supervised Models for Spoken Language Understanding Via Phoneme Level T5. 1-5 - Federico Baldassarre, Alaaeldin El-Nouby, Hervé Jégou:
Variable Rate Allocation for Vector-Quantized Autoencoders. 1-5 - Chao Liao, Jinwen Huang, Huan Yuan, Peng Yao, Jianchao Tan, Dawei Zhang, Feng Deng, Xiaorui Wang, Chengru Song:
Dynamic TF-TDNN: Dynamic Time Delay Neural Network Based on Temporal-Frequency Attention for Dialect Recognition. 1-5 - Toshiki Orihara, Kazi Mahmudul Hassan, Toshihisa Tanaka:
Active Selection of Source Patients in Transfer Learning for Epileptic Seizure Detection Using Riemannian Manifold. 1-5 - Alexandra Vioni, Georgia Maniati, Nikolaos Ellinas, June Sig Sung, Inchul Hwang, Aimilios Chalamandaris, Pirros Tsiakoulis:
Investigating Content-Aware Neural Text-to-Speech MOS Prediction Using Prosodic and Linguistic Features. 1-5 - Vincent P. Martin, Aymeric Ferron, Jean-Luc Rouas, Pierre Philip:
"Prediction of Sleepiness Ratings from Voice by Man and Machine": A Perceptual Experiment Replication Study. 1-5 - Dong Wu, Bin Liang, Xiangjun Liu, Xuan Zang, Mingmin Chi:
Bipartite Graph Convolutional Networks with Adversarial Domain Transfer. 1-5 - Dongmin Huang, Lingwei Wang, Hongzhou Lu, Wenjin Wang:
A Contrastive Embedding-Based Domain Adaptation Method for Lung Sound Recognition in Children Community-Acquired Pneumonia. 1-5 - Jie Wei, Guanyu Hu, Luu Anh Tuan, Xinyu Yang, Wenjing Zhu:
Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations. 1-5 - Pascal A. Schirmer, Iosif Mporas:
A Wavelet Scattering Approach for Load Identification with Limited Amount of Training Data. 1-5 - Xiaoliang Wu, Peter Bell, Ajitha Rajan:
Explanations for Automatic Speech Recognition. 1-5 - Jiaxin Ye, Xin-Cheng Wen, Yujie Wei, Yong Xu, Kunhong Liu, Hongming Shan:
Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition. 1-5 - Fangyuan Chi, Yixiao Wang
, Panos Nasiopoulos, Victor C. M. Leung, Mahsa T. Pourazad:
Federated Semi-Supervised Learning for Object Detection in Autonomous Driving. 1-5 - Shuai Tao, Himavanth Reddy, Jesper Rindom Jensen, Mads Græsbøll Christensen:
Frequency Bin-Wise Single Channel Speech Presence Probability Estimation Using Multiple DNNS. 1-5 - Yunzuo Zhang, Weili Kang, Yameng Liu, Pengfei Zhu:
Joint Multi-Level Feature Network for Lightweight Person Re-Identification. 1-5 - Yan Wang, Xin Luo, Zhen-Duo Chen, Peng-Fei Zhang, Meng Liu, Xin-Shun Xu:
FedVMR: A New Federated Learning Method for Video Moment Retrieval. 1-5 - Aleksej Chinaev, Niklas Knaepper, Gerald Enzner:
Long-Term Synchronization of Wireless Acoustic Sensor Networks with Nonpersistent Acoustic Activity Using Coherence State. 1-5 - Jiawei Liu, Hao Wang, Weining Wang, Xingjian He, Jing Liu:
WL-MSR: Watch and Listen for Multimodal Subtitle Recognition. 1-5 - Sándor Plósz, István Gyöngy, Jonathan Leach, Steve McLaughlin, Gerald S. Buller, Abderrahim Halimi:
Fast Multiscale 3D Reconstruction Using Single-Photon Lidar Data. 1-5 - Michael Nigro, Sridhar Krishnan:
SARdBScene: Dataset and Resnet Baseline for Audio Scene Source Counting and Analysis. 1-5 - Enes Krijestorac, Hazem Sallouha, Shamik Sarkar, Danijela Cabric:
Agile Radio Map Prediction Using Deep Learning. 1-2 - Lang Wang, Juan Liu, Peng Jiang, Dehua Cao, Baochuan Pang:
DDN: Dynamic Aggregation Enhanced Dual-Stream Network for Medical Image Classification. 1-5 - Han-Sol Lee, Moonkyu Song, Junseo Lee, Yeol-Min Seong, Ducksoo Kim, Kwanghyuk Bae, Seongwook Song:
An Antispoofing Approach in Biometric Authentication System for a Smartcard. 1-5 - Pierre Houdouin, Esa Ollila, Frédéric Pascal:
Regularized EM Algorithm. 1-5 - Shreyas Jaiswal, Ruchi Pandey, Santosh Nannuru:
Deep Architecture for DOA Trajectory Localization. 1-5 - Rumeysa Bodur, Binod Bhattarai, Tae-Kyun Kim:
Joint Training of Hierarchical GANs and Semantic Segmentation for Expression Translation. 1-5 - Ya Tang, Xiongjun Ye, Xuanya Li, Zhineng Chen:
Multi-Object Localization and Irrelevant-Semantic Separation for Nuclei Segmentation in Histopathology Images. 1-5 - Bangjian Zhou, Jieming Pan, Maheswari Sivan, Aaron Voon-Yew Thean, J. Senthilnath:
Quantile Online Learning for Semiconductor Failure Analysis. 1-5 - Qi Zhang, Zhongchang Sun, Luis C. Herrera, Shaofeng Zou:
Data-Driven Quickest Change Detection in Markov Models. 1-5 - Florian Hilgemann, Peter Jax:
Order Reduction of Multi-Channel FIR Filters by Balanced Truncation. 1-5 - Chengyou Jia, Minnan Luo, Zhuohang Dang, Xiaojun Chang, Qinghua Zheng:
Towards Real-Time Person Search with Invariant Feature Learning. 1-5 - Jinchao Li, Xixin Wu, Kaitao Song, Dongsheng Li, Xunying Liu, Helen Meng:
A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition. 1-5 - Yicheng Xiao, Yue Ma, Shuyan Li, Hantao Zhou, Ran Liao, Xiu Li:
SemanticAC: Semantics-Assisted Framework for Audio Classification. 1-5 - Xingrong Dong, Zhaoxian Wu, Qing Ling, Zhi Tian:
Distributed Online Learning With Adversarial Participants In An Adversarial Environment. 1-5 - Haitao Xu, Liangfa Wei, Jie Zhang, Jianming Yang, Yannan Wang, Tian Gao, Xin Fang, Li-Rong Dai:
A Multi-Scale Feature Aggregation Based Lightweight Network for Audio-Visual Speech Enhancement. 1-5 - Mingliang Zhai, Kang Ni, Jiucheng Xie, Hao Gao:
Spike-Based Optical Flow Estimation Via Contrastive Learning. 1-5 - Georgios Chochlakis, Gireesh Mahajan, Sabyasachee Baruah, Keith Burghardt, Kristina Lerman, Shrikanth Narayanan:
Leveraging Label Correlations in a Multi-Label Setting: a Case Study in Emotion. 1-5 - Shammur Absar Chowdhury, Ahmed Ali:
Multilingual Word Error Rate Estimation: E-Wer3. 1-5 - Jiexin Wang, Jiahao Chen, Bing Su:
Toward Auto-Evaluation With Confidence-Based Category Relation-Aware Regression. 1-5 - Jizhou Li, Bin Chen, Guibin Zan, Guannan Qian, Piero Pianetta, Yijin Liu:
Subspace Modeling Enabled High-Sensitivity X-Ray Chemical Imaging. 1-5 - Zhizheng Yang, Xun Wang, Dongyu Xia, Wei Wang, Haipeng Dai:
Sequence-Based Device-Free Gesture Recognition Framework for Multi-Channel Acoustic Signals. 1-5 - Sergey Novoselov, Vladimir Volokhov, Galina Lavrentyeva:
Universal Speaker Recognition Encoders for Different Speech Segments Duration. 1-5 - Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee:
Leveraging Phone-Level Linguistic-Acoustic Similarity For Utterance-Level Pronunciation Scoring. 1-5 - Wang Chen, Peizhen Chen, Weijie Chen, Luojun Lin:
Customized Automatic Face Beautification. 1-5 - Caitlin Richter, Jón Guðnason:
Relative Dynamic Time Warping Comparison for Pronunciation Errors. 1-5 - Tian Feng, Qiming Chen, Yao Shi, Xun Lang, Lei Xie, Hongye Su:
A Hybrid Deep Neural Network for Nonlinear Causality Analysis in Complex Industrial Control System. 1-5 - Haoran Zhao, Nan Li, Runqiang Han, Xiguang Zheng, Chen Zhang, Liang Guo, Bing Yu:
A Low-Latency Deep Hierarchical Fusion Network for Fullband Acoustic Echo Cancellation. 1-2 - Seong-Gyun Leem, Daniel Fulford, Jukka-Pekka Onnela, David Gard, Carlos Busso:
Adapting a Self-Supervised Speech Representation for Noisy Speech Emotion Recognition by Using Contrastive Teacher-Student Learning. 1-5 - Nicolas Horst, Priyanka Das, Mathias Wien:
A Template Matching Approach for Reference Picture Padding in Video Coding. 1-5 - Masanori Tsujikawa, Akihiko Sugiyama, Ken Hanazawa, Yoshinobu Kajikawa:
Linear Microphone Array Parallel to the Driving Direction for in-Car Speech Enhancement. 1-5 - Zhenxiao Cheng, Jie Zhou, Wen Wu, Qin Chen, Liang He:
Tell Model Where to Attend: Improving Interpretability of Aspect-Based Sentiment Classification via Small Explanation Annotations. 1-5 - Zhenzhen You, Yan Yan, Zhenghao Shi, Minghua Zhao, Jing Yan, Haiqin Liu, Xinhong Hei, Xiaoyong Ren:
Laryngeal Leukoplakia Classification Via Dense Multiscale Feature Extraction in White Light Endoscopy Images. 1-5 - Zihan Zhao, Yu Wang, Yanfeng Wang:
Knowledge-Aware Bayesian Co-Attention for Multimodal Emotion Recognition. 1-5 - Ting-Wei Lin, Chao-Lin Liu, Li Su:
Audio-Driven Facial Landmark Generation in Violin Performance using 3DCNN Network with Self Attention Model. 1-5 - Chin-Yun Yu, Sung-Lin Yeh, György Fazekas, Hao Tang:
Conditioning and Sampling in Variational Diffusion Models for Speech Super-Resolution. 1-5 - Xingke Song, Xiaoying Yang, Jianfeng Ren, Ruibin Bai, Xudong Jiang:
Solving Jigsaw Puzzle of Large Eroded Gaps Using Puzzlet Discriminant Network. 1-5 - Jianing Long, Qingmeng Zhu, Hao He, Zhipeng Yu, Qilin Zhang, Zhihong Zhang:
3D Point Cloud Completion Based on Multi-Scale Degradation. 1-5 - Bin Yang, Jun Chen, Mang Ye:
Top-K Visual Tokens Transformer: Selecting Tokens for Visible-Infrared Person Re-Identification. 1-5 - Yoav Noah, Nir Shlezinger:
Distributed Admm with Limited Communications Via Deep Unfolding. 1-5 - Stijn Kindt, Jenthe Thienpondt, Nilesh Madhu:
Exploiting Speaker Embeddings for Improved Microphone Clustering and Speech Separation in ad-hoc Microphone Arrays. 1-5 - Djallel Bouneffouf, Oznur Alkan, Raphaël Féraud, Baihan Lin:
Question Answering System with Sparse and Noisy Feedback. 1-5 - Zhezheng Hao, Zhoumin Lu, Feiping Nie, Rong Wang, Xuelong Li:
Multi-View K-Means with Laplacian Embedding. 1-5 - Matthew J. Goupell, Marjan Davoodian, Sarah Weinstein, David Gadzinski, Dmitry N. Zotkin, Kaushik Sethunath, Ramani Duraiswami:
Rapid Audiometric Evaluation for Personalized Headphone Listening. 1-5 - Prateek Verma, Chris Chafe:
A Content Adaptive Learnable "Time-Frequency" Representation for audio Signal Processing. 1-5 - Salamata Konate, Léo Lebrat, Rodrigo Santa Cruz, Clinton Fookes, Andrew P. Bradley, Olivier Salvado:
Bias Identification with RankPix Saliency. 1-5 - Othman Istaiteh, Yasmeen Kussad, Yahya Daqour, Maria Habib, Mohammad Habash, Dhananjaya Gowda:
A Transformer-Based E2E SLU Model for Improved Semantic Parsing. 1-2 - Arian Bakhtiarnia, Nemanja Milosevic, Qi Zhang, Dragana Bajovic, Alexandros Iosifidis:
Dynamic Split Computing for Efficient Deep EDGE Intelligence. 1-5 - Ju-Hyung Lee, Joohan Lee, Seon-Ho Lee, Andreas F. Molisch:
PMNet: Large-Scale Channel Prediction System for ICASSP 2023 First Pathloss Radio Map Prediction Challenge. 1-2 - Omid Rezaei, Mohammad Mahdi Naghsh, Seyed Mohammad Karbasi, Mohammad Mahdi Nayebi:
Resource Allocation for UAV-Enabled Integrated Sensing and Communication (ISAC) via Multi-Objective Optimization. 1-5 - Sizhe Chen, Qinghua Tao, Zhixing Ye, Xiaolin Huang:
Measuring the Transferability of ℓ∞ Attacks by the ℓ2 Norm. 1-5 - Zhenyao He, Wei Xu, Hong Shen, Derrick Wing Kwan Ng, Yonina C. Eldar, Xiaohu You:
Integrated Sensing and Full-Duplex Communication: Joint Transceiver Beamforming and Power Allocation. 1-5 - Zeyu Wang, Haibin Shen, Changyou Men, Quan Sun, Kejie Huang:
Thermal Infrared Image Inpainting Via Edge-Aware Guidance. 1-5 - Haole Ke, Lin Li, Peipei Wang, Jingling Yuan, Xiaohui Tao:
Tree-Like Interaction Learning for Bundle Recommendation. 1-5 - Martin Gölz, Abdelhak M. Zoubir, Visa Koivunen:
Spatial Inference Using Censored Multiple Testing with Fdr Control. 1-5 - Alexandros Gkillas, Dimitris Ampeliotis, Kostas Berberidis:
A Highly Interpretable Deep Equilibrium Network for Hyperspectral Image Deconvolution. 1-5 - Jingzhou Hu, Kejun Huang:
Identifiable Bounded Component Analysis Via Minimum Volume Enclosing Parallelotope. 1-5 - Weiji Zhao, Kefeng Huang, Chongyang Zhang:
Modulation-Based Center Alignment and Motion Mining for Spatial Temporal Action Detection. 1-5 - Yao Wei, Haoxiang Wang, Mingze Sun, Jiawang Liu:
Attention Based Relation Network for Facial Action Units Recognition. 1-5 - Yunpeng Bai, Yayuan Xiao, Xuan Hou, Ying Li, Changjing Shang, Qiang Shen:
SAR Image Despeckling with Residual-in-Residual Dense Generative Adversarial Network. 1-5 - Youngjun Kwak, Minyoung Jung, Hunjae Yoo, Jinho Shin, Changick Kim:
Liveness Score-Based Regression Neural Networks for Face Anti-Spoofing. 1-5 - Gaosheng Zhang, Shilei Miao, Linghui Tang, Peijia Qian:
A Two-Stage System for Spoken Language Understanding. 1-2 - Emilie Chouzenoux, Víctor Elvira:
Graphit: Iterative Reweighted ℓ1 Algorithm for Sparse Graph Inference in State-Space Models. 1-5 - Cyprien Gille, Frédéric Guyard, Michel Barlaud:
A New Semi-Supervised Classification Method Using a Supervised Autoencoder for Biomedical Applications. 1-5 - Elsa Rizk, Stefan Vlaski, Ali H. Sayed:
Local Graph-Homomorphic Processing for Privatized Distributed Systems. 1-5 - Zhi Zhou, Xianjin Li, Jia He, Xiaoyan Bi, Yan Chen, Guangjian Wang, Peiying Zhu:
6G Integrated Sensing and Communication - Sensing Assisted Environmental Reconstruction and Communication. 1-5 - Payal Mohapatra, Bashima Islam, Md Tamzeed Islam, Ruochen Jiao, Qi Zhu:
Efficient Stuttering Event Detection Using Siamese Networks. 1-5 - Peiyu Zhang, Ayush Bhandari:
Unlimited Sampling in Phase Space. 1-5 - Yu Wu, Dongfang Shen, Jiabao Jin, Guanping Xu, Yinran Chen, Xiongbiao Luo:
Local-Global Progressive U-Transformers for Accurate Hepatic and Portal Veins Segmentation in Abdominal MR Images. 1-5 - Zijian Gao, Kele Xu, Hongda Jia, Tianjiao Wan, Bo Ding, Dawei Feng, Xinjun Mao, Huaimin Wang:
Complementary Learning System Based Intrinsic Reward in Reinforcement Learning. 1-5 - Roy Sheffer, Yossi Adi:
I Hear Your True Colors: Image Guided Audio Generation. 1-5 - Bamelak Tadele, Volodymyr Shyianov, Faouzi Bellili, Amine Mezghani:
Channel Estimation with Tightly-Coupled Antenna Arrays. 1-5 - Fernando Pedraza, Giuseppe Caire:
Neurally Augmented State Space Model for Simultaneous Communication and Tracking with Low Complexity Receivers. 1-5 - Yujie Zheng, Chong Wang, Yi Chen, Jiangbo Qian, Jun Wang, Jiafei Wu:
Enlightening the Student in Knowledge Distillation. 1-5 - Guanghao Meng, Tao Dai, Bin Chen, Naiqi Li, Yong Jiang, Shu-Tao Xia:
Difficulty-Aware Data Augmentor for Scene Text Recognition. 1-5 - Anna Lopatnikova, Minh-Ngoc Tran:
Quantum Variational Bayes on Manifolds. 1-5 - Junghyun Koo, Marco A. Martínez Ramírez, Wei-Hsiang Liao, Stefan Uhlich, Kyogu Lee, Yuki Mitsufuji:
Music Mixing Style Transfer: A Contrastive Learning Approach to Disentangle Audio Effects. 1-5 - Hyemi Kim, Jiyun Park, Taegyun Kwon, Dasaem Jeong, Juhan Nam:
A Study of Audio Mixing Methods for Piano Transcription in Violin-Piano Ensembles. 1-5 - Han Han, Tao Jiang, Wei Yu:
Active Beam Tracking with Reconfigurable Intelligent Surface. 1-5 - Kevin Scheck, Tanja Schultz:
Multi-Speaker Speech Synthesis from Electromyographic Signals by Soft Speech Unit Prediction. 1-5 - George Retsinas, Giorgos Sfikas, Panagiotis Paraskevas Filntisis, Petros Maragos:
Newton-Based Trainable Learning Rate. 1-5 - Junyu Liu, Jianfeng Ren, Hongliang Sun, Xudong Jiang:
Face Recognition on Point Cloud with Cgan-Top for Denoising. 1-5 - Amir Weiss, Andrew C. Singer, Gregory W. Wornell:
Towards Robust Data-Driven Underwater Acoustic Localization: A Deep CNN Solution with Performance Guarantees for Model Mismatch. 1-5 - Jiahao Xu, Xufeng Yan, Cui Peng, Xinquan Wu, Lipeng Gu, Yanbiao Niu:
UAV Local Path Planning Based on Improved Proximal Policy Optimization Algorithm. 1-5 - Ju-Seok Seong, Jeong-Hwan Choi, Jehyun Kyung, Ye-Rin Jeoung, Joon-Hyuk Chang:
Noise-Aware Target Extension with Self-Distillation for Robust Speech Recognition. 1-5 - Hassan Taherian, DeLiang Wang:
Multi-Resolution Location-Based Training for Multi-Channel Continuous Speech Separation. 1-5 - Sanglee Park, Seung-won Hwang, Jungmin So:
SMCL: Saliency Masked Contrastive Learning for Long-Tailed Visual Recognition. 1-5 - Hessa Alfalahi, Ahsan Khandoker, Ghada Alhussein, Leontios J. Hadjileontiadis:
Cochlear Decomposition: A Novel Bio-Inspired Multiscale Analysis Framework. 1-5 - Xudong Pan, Mi Zhang, Duocai Wu:
RØROS: Building a Responsive Online Recommender System via Meta-Gradients Updating. 1-5 - Zhongweiyang Xu, Xulin Fan, Mark Hasegawa-Johnson:
Dual-Path Cross-Modal Attention for Better Audio-Visual Speech Extraction. 1-5 - Yifei Shen, Yuqing Ren, Andreas Toftegaard Kristensen, Xiaohu You, Chuan Zhang, Andreas Burg:
Improved Belief Propagation Decoding of Turbo Codes. 1-5 - Yadong Guan, Guibin Zheng, Jiqing Han, Huanliang Wang:
Subband Dependency Modeling for Sound Event Detection. 1-5 - Jiajiong Cao, Yufan Liu, Weiming Bai, Jingting Ding, Liang Li:
Nasty-SFDA: Source Free Domain Adaptation from a Nasty Model. 1-5 - Kehai Qiu, Stefanos Bakirtzis, Hui Song, Ian J. Wassell, Jie Zhang:
Deep Learning-Based Path Loss Prediction for Outdoor Wireless Communication Systems. 1-2 - Mohamed Elminshawi, Srikanth Raj Chetupalli, Emanuël A. P. Habets:
Beamformer-Guided Target Speaker Extraction. 1-5 - Mufan Sang, Yong Zhao, Gang Liu, John H. L. Hansen, Jian Wu:
Improving Transformer-Based Networks with Locality for Automatic Speaker Verification. 1-5 - Paula Andrea Pérez-Toro, Dalia Rodríguez-Salas, Tomás Arias-Vergara, Sebastian P. Bayerl, Philipp Klumpp, Korbinian Riedhammer, Maria Schuster, Elmar Nöth, Andreas K. Maier, Juan Rafael Orozco-Arroyave:
Transferring Quantified Emotion Knowledge for the Detection of Depression in Alzheimer's Disease Using Forestnets. 1-5 - Han-Mo Ou, Naresh R. Shanbhag:
Enhancing the Accuracy of Resistive In-Memory Architectures using Adaptive Signal Processing. 1-5 - Gaku Narita, Junichi Shimizu, Taketo Akama:
GANStrument: Adversarial Instrument Sound Synthesis with Pitch-Invariant Instance Conditioning. 1-5 - Valentin Bolz, Johannes Rueß, Andreas Zell:
Data-Driven Graph Convolutional Neural Networks for Power System Contingency Analysis. 1-5 - Randall Balestriero, Yann LeCun:
Fast and Exact Enumeration of Deep Networks Partitions Regions. 1-5 - Neel Bhandari, Pin-Yu Chen:
Lost In Translation: Generating Adversarial Examples Robust to Round-Trip Translation. 1-5 - Giovana Morais, Matthew E. P. Davies, Marcelo Queiroz, Magdalena Fuentes:
Tempo vs. Pitch: Understanding Self-Supervised Tempo Estimation. 1-5 - Xian Zhong, Wei Li, Liang Liao, Jing Xiao, Wenxuan Liu, Wenxin Huang, Zheng Wang:
Bat: Bi-Alignment Based On Transformation in Multi-Target Domain Adaptation for Semantic Segmentation. 1-5 - Yonghao Liu, Di Liang, Fang Fang, Sirui Wang, Wei Wu, Rui Jiang:
Time-Aware Multiway Adaptive Fusion Network for Temporal Knowledge Graph Question Answering. 1-5 - Zexin Fan, Kejiang Chen, Chuan Qin, Kai Zeng, Weiming Zhang, Nenghai Yu:
Image Adversarial Steganography Based on Joint Distortion. 1-5 - Abinay Reddy Naini, Mary A. Kohler, Carlos Busso:
Unsupervised Domain Adaptation for Preference Learning Based Speech Emotion Recognition. 1-5 - Frederik Bous, Axel Roebel:
Analysis and Transformation of Voice Level in Singing Voice. 1-5 - Rehana Mahfuz, Yinyi Guo, Erik Visser:
Improving Audio Captioning Using Semantic Similarity Metrics. 1-5 - Layne Berry, Yi-Jen Shih, Hsuan-Fu Wang, Heng-Jui Chang, Hung-Yi Lee, David Harwath:
M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval. 1-5 - Ya Jiang, Hang Chen, Jun Du, Qing Wang, Chin-Hui Lee:
Incorporating Lip Features into Audio-Visual Multi-Speaker DOA Estimation by Gated Fusion. 1-5 - Dongyue Li, Yaping Yan, Dong Liang, Songlin Du:
MSFORMER: Multi-Scale Transformer with Neighborhood Consensus for Feature Matching. 1-5 - Yakun Ju, Kin-Man Lam, Jun Xiao, Cong Zhang, Cuixin Yang, Junyu Dong:
Efficient Feature Fusion for Learning-Based Photometric Stereo. 1-5 - Bo-Wen Zhang, Yan Yan, Jiapei Yu:
Contrastive Learning of Sentence Embeddings in Product Search. 1-5 - Leonardo Spampinato, Alessia Tarozzi, Chiara Buratti, Riccardo Marini:
DRL Path Planning for UAV-Aided V2X Networks: Comparing Discrete to Continuous Action Spaces. 1-5 - Jianrong Wang, Yaxin Zhao, Hongkai Fan, Tianyi Xu, Qi Li, Sen Li, Li Liu:
Memory-Augmented Contrastive Learning for Talking Head Generation. 1-5 - Zhiyuan Peng, Mingjie Shao, Xuanji He, Xu Li, Tan Lee, Ke Ding, Guanglu Wan:
Covariance Regularization for Probabilistic Linear Discriminant Analysis. 1-5 - Koyo Sato, Shunsuke Ono:
Robust Hyperspectral Anomaly Detection with Simultaneous Mixed Noise Removal via Constrained Convex Optimization. 1-5 - Onkar Susladkar, Prajwal Gatti, Santosh Kumar Yadav:
SLBERT: A Novel Pre-Training Framework for Joint Speech and Language Modeling. 1-5 - Wuti Xiong:
CD-FSOD: A Benchmark For Cross-Domain Few-Shot Object Detection. 1-5 - Huaying Xue, Xiulian Peng, Yan Lu:
Contrast-PLC: Contrastive Learning for Packet Loss Concealment. 1-5 - Zhi Zhong, Masato Hirano, Kazuki Shimada, Kazuya Tateishi, Shusuke Takahashi, Yuki Mitsufuji:
An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification. 1-5 - Zhengding Luo, Dongyuan Shi, Xiaoyi Shen, Junwei Ji, Woon-Seng Gan:
Deep Generative Fixed-Filter Active Noise Control. 1-5 - Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger:
Image Completion Via Dual-Path Cooperative Filtering. 1-5 - Kyusung Seo, Joonhyung Park, Jaeyun Song, Eunho Yang:
Weavspeech: Data Augmentation Strategy For Automatic Speech Recognition Via Semantic-Aware Weaving. 1-5 - Yifan Peng, Jaesong Lee, Shinji Watanabe:
I3D: Transformer Architectures with Input-Dependent Dynamic Depth for Speech Recognition. 1-5 - Heejin Do, Yunsu Kim, Gary Geunbae Lee:
Hierarchical Pronunciation Assessment with Multi-Aspect Attention. 1-5 - Zilong Li, Qianqian Ren, Long Chen, Jianguo Sun:
Dual-Stage Graph Convolution Network With Graph Learning For Traffic Prediction. 1-5 - Ziyang Luo, Zhipeng Hu, Yadong Xi, Rongsheng Zhang, Jing Ma:
I-Tuning: Tuning Frozen Language Models with Image for Lightweight Image Captioning. 1-5 - Shun Lei, Yixuan Zhou, Liyang Chen, Zhiyong Wu, Shiyin Kang, Helen Meng:
Context-Aware Coherent Speaking Style Prediction with Hierarchical Transformers for Audiobook Speech Synthesis. 1-5 - Shuaitao Zhang, Yuan Zhang, Zheng Zhao, Di Xie, Shiliang Pu:
HPFTN: Hierarchical Progressive Fusion Transformer Network for Video Denoising. 1-5 - Xiangyu Yang, Boris Joukovsky, Nikos Deligiannis:
Relevance Propagation through Deep Conditional Random Fields. 1-5 - François Grondin, Marc-Antoine Maheux, Jean-Samuel Lauzon, Jonathan Vincent, François Michaud:
Fast Cross-Correlation for TDoA Estimation on Small Aperture Microphone Arrays. 1-5 - Xueqi Gao, Chao Xu, Yihang Song, Jing Hu, Jian Xiao, Zhaopeng Meng:
Node-Wise Domain Adaptation Based on Transferable Attention for Recognizing Road Rage via EEG. 1-5 - Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li:
The DKU Post-Challenge Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge: Deep Analysis. 1-5 - Linlin Yang, Hongying Liu, Fanhua Shang, Yuanyuan Liu:
Adaptive Non-Local Generative Adversarial Networks for Low-Dose CT Image Denoising. 1-5 - Yulu Jin, Lifeng Lai:
Adversarially Robust Fairness-Aware Regression. 1-5 - Chen Wang, Jiang Zhong, Qizhu Dai, Yafei Qi, Rongzhen Li, Qin Lei, Bin Fang, Xue Li:
PRRD: Pixel-Region Relation Distillation For Efficient Semantic Segmentation. 1-5 - Shuting Dong, Feng Lu, Chun Yuan:
Frequency Reciprocal Action and Fusion for Single Image Super-Resolution. 1-5 - Eun Som Jeon, Suhas Lohit, Rushil Anirudh, Pavan K. Turaga:
Robust Time Series Recovery and Classification Using Test-Time Noise Simulator Networks. 1-5 - Jiayi Tian, Chao Fang, Haonan Wang, Zhongfeng Wang:
Bebert: Efficient And Robust Binary Ensemble Bert. 1-5 - Junlin Liu, Xinchen Lyu:
Distance-Based Online Label Inference Attacks Against Split Learning. 1-5 - Steven M. Hernandez, Ding Zhao, Shaojin Ding, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He, Ian McGraw:
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models. 1-5 - Yuan Cao, Danchen Zhang, Xin Zheng, Hongming Shan, Junping Zhang:
Mutual Information Based Reweighting for Precipitation Nowcasting. 1-5 - Weijun Huang, Jia Huang, Guowei Wang, Hongzhou Lu, Min He, Wenjin Wang:
Exploiting CCTV Cameras for Hand Hygiene Recognition in ICU. 1-5 - Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. 1-5 - Ti Wang, Hong Liu, Runwei Ding, Wenhao Li, Yingxuan You, Xia Li:
Interweaved Graph and Attention Network for 3D Human Pose Estimation. 1-5 - Nilaksh Das, Monica Sunkara, Sravan Bodapati, Jinglun Cai, Devang Kulshreshtha, Jeff Farris, Katrin Kirchhoff:
Mask the Bias: Improving Domain-Adaptive Generalization of CTC-Based ASR with Internal Language Model Estimation. 1-5 - Nafiul Rashid, Md Mahbubur Rahman, Tousif Ahmed, Jilong Kuang, Jun Alex Gao:
BreathIE: Estimating Breathing Inhale Exhale Ratio Using Motion Sensor Data from Consumer Earbuds. 1-5 - Linfeng Feng, Yijun Gong, Xiao-Lei Zhang:
Soft Label Coding for end-to-end Sound Source Localization with ad-hoc Microphone Arrays. 1-5 - Huixiang Wen, Shan Chang, Luo Zhou:
Light Projection-Based Physical-World Vanishing Attack Against Car Detection. 1-5 - Hui Chen, Hanyi Zhang, Longbiao Wang, Kong Aik Lee, Meng Liu, Jianwu Dang:
Self-Supervised Audio-Visual Speaker Representation with Co-Meta Learning. 1-5 - Arshdeep Singh, Mark D. Plumbley:
Efficient Similarity-Based Passive Filter Pruning for Compressing CNNS. 1-5 - Gongping Huang, Jacob Benesty, Israel Cohen, Emil Winebrand, Jingdong Chen, Walter Kellermann:
Switching Kronecker Product Linear Filtering for Multispeaker Adaptive Speech Dereverberation. 1-5 - Disheng Li, Wei Liu, Yuriy V. Zakharov, Paul D. Mitchell:
Graph Signal Processing for Narrowband Direction of Arrival Estimation. 1-5 - Shuo Zhang, Jing Liu:
Anomalous Signal Detection for Cyber-Physical Systems Using Interpretable Causal Neural Network. 1-5 - George Dimas, Anastasios Koulaouzidis, Dimitris K. Iakovidis:
Co-Operative CNN for Visual Saliency Prediction on WCE Images. 1-5 - Shengdi Qin, Shunli Zhang, Yu Zhang, Haoyu Gao:
CAENet: Using Collaborative Attention Transformer and Add-Boost Strategy for Single Image Deraining. 1-5 - Yuhe Ding, Jian Liang, Jie Cao, Aihua Zheng, Ran He:
Modify: Model-Driven Face Stylization Without Style Images. 1-5 - Taihui Li, Zhong Zhuang, Hengkang Wang, Ju Sun:
Random Projector: Efficient Deep Image Prior. 1-5 - Chunyu Qiang, Peng Yang, Hao Che, Ying Zhang, Xiaorui Wang, Zhongyuan Wang:
Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis. 1-5 - Xian Zhong, Aoyu Yi, Wenxuan Liu, Wenxin Huang, Chengming Zou, Zheng Wang:
Background-Weakening Consistency Regularization for Semi-Supervised Video Action Detection. 1-5 - Yuan-Pei Lin, Ting-Ming Yang:
Robust Angle Estimation for Hybrid mmWave Systems. 1-5 - Atsunori Ogawa, Takafumi Moriya, Naoyuki Kamo, Naohiro Tawara, Marc Delcroix:
Iterative Shallow Fusion of Backward Language Model for End-To-End Speech Recognition. 1-5 - Carlos Alejandro López, Jaume Riba:
Data Driven Joint Sensor Fusion and Regression Based on Geometric Mean Squared Error. 1-5 - Songpei Xu, Chaitanya Kaul, Xuri Ge, Roderick Murray-Smith:
Continuous Interaction with A Smart Speaker via Low-Dimensional Embeddings of Dynamic Hand Pose. 1-5 - Yanxing Wang, Shengqi Zhu, Guisheng Liao, Lan Lan, Zhuochen Chen, Feilong Liu:
Resolving Doppler Ambiguity Via Spread Phase Alignment in FDA-MIMO Radar. 1-5 - Verena Lachner, Katharina Schaar, Ralf Zimmermann:
CSM In Motion Vector Steganalysis: The Effect of Coders on Motion Vectors in H.264 Video Encoding. 1-5 - Yu Zheng, David C. Zhu, Jian Ren, Taosheng Liu, Karl J. Friston, Tongtong Li:
A Mathematical Model for Neuronal Activity and Brain Information Processing Capacity. 1-5 - Hsuan-Jui Chen, Yen Meng, Hung-yi Lee:
Once-for-All Sequence Compression for Self-Supervised Speech Models. 1-5 - Taylan Kargin, Fariborz Salehi, Babak Hassibi:
Asymptotic Distribution of Stochastic Mirror Descent Iterates in Average Ensemble Models. 1-5 - Aymane Abdali, Vincent Gripon, Lucas Drumetz, Bartosz Boguslawski:
Active Learning for Efficient Few-Shot Classification. 1-5 - Zhenyu Piao, Miseul Kim, Hyungchan Yoon, Hong-Goo Kang:
HappyQuokka System for ICASSP 2023 Auditory EEG Challenge. 1-2 - Plácido L. Vidal, Joaquim de Moura, Jorge Novo, Marcos Ortega, Jaime S. Cardoso:
Transformer-Based Multi-Prototype Approach for Diabetic Macular Edema Analysis in OCT Images. 1-5 - Can Han, Suncheng Xiang, Dahong Qian:
MTDL-NET: Morphological and Temporal Discriminative Learning for Heartbeat Classification. 1-5 - Kuan-Lin Chen, Daniel D. E. Wong, Ke Tan, Buye Xu, Anurag Kumar, Vamsi Krishna Ithapu:
Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-Channel Speech Enhancement. 1-5 - Petteri Pulkkinen, Visa Koivunen:
Model-Free Online Learning for Waveform Optimization In Integrated Sensing And Communications. 1-5 - Ye-Rin Jeoung, Joon-Young Yang, Jeong-Hwan Choi, Joon-Hyuk Chang:
Improving Transformer-Based End-to-End Speaker Diarization by Assigning Auxiliary Losses to Attention Heads. 1-5 - Jie Tan, Hengyi Cai, Hongshen Chen, Hong Cheng, Helen Meng, Zhuoye Ding:
Contrastive Learning with Dialogue Attributes for Neural Dialogue Generation. 1-5 - Boyu Hou, Chengyu Wang, Xiaoqing Chen, Minghui Qiu, Liang Feng, Jun Huang:
Prompt-Distiller: Few-Shot Knowledge Distillation for Prompt-Based Language Learners with Dual Contrastive Learning. 1-5 - Dat Thanh Nguyen, Kamal Gopikrishnan Nambiar, André Kaup:
Deep Probabilistic Model for Lossless Scalable Point Cloud Attribute Compression. 1-5 - Qiquan Xiao, Yuan Zhang, Xuanya Li, Kai Hu:
Boundary Cue Guidance and Contextual Feature Mining for Glass Segmentation. 1-5 - William Chettleburgh, Zhishen Huang, Ming Yang:
Fast Robust Principle Component Analysis Using Gauss-Newton Iterations. 1-5 - Yuntao Li, Zhenpeng Su, Yutian Li, Hanchu Zhang, Sirui Wang, Wei Wu, Yan Zhang:
T5-SR: A Unified Seq-to-Seq Decoding Strategy for Semantic Parsing. 1-5 - Luca Barbieri, Bernardo Camajori Tedeschini, Mattia Brambilla, Monica Nicoli:
Implicit Vehicle Positioning with Cooperative Lidar Sensing. 1-5 - Zehua Zhang, Shiyun Xu, Xuyi Zhuang, Lianyu Zhou, Heng Li, Mingjiang Wang:
Two-Stage UNet with Multi-Axis Gated Multilayer Perceptron for Monaural Noisy-Reverberant Speech Enhancement. 1-5 - Syed A. Hamza, Kyle Juretus, Moeness G. Amin, Fauzia Ahmad:
Deep Learning Sparse Array Design Using Binary Switching Configurations. 1-5 - Sung-Feng Huang, Chia-Ping Chen, Zhi-Sheng Chen, Yu-Pao Tsai, Hung-Yi Lee:
Personalized Lightweight Text-to-Speech: Voice Cloning with Adaptive Structured Pruning. 1-5 - Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana:
Period VITS: Variational Inference with Explicit Pitch Modeling for End-To-End Emotional Speech Synthesis. 1-5 - Grant A. Davidson, Mark Vinton, Per Ekstrand, Cong Zhou, Lars F. Villemoes, Lie Lu:
High Quality Audio Coding with Mdctnet. 1-5 - Zhenghao Guo, Verity M. McClelland, Wei Dai, Zoran Cvetkovic:
Structured Errors-in-Variables Modelling for Cortico-Muscular Coherence Enhancement. 1-5 - Torsten Schlett, Sebastian Schachner, Christian Rathgeb, Juan E. Tapia, Christoph Busch:
Effect of Lossy Compression Algorithms on Face Image Quality and Recognition. 1-5 - Tal Peer, Simon Welker, Timo Gerkmann:
DiffPhase: Generative Diffusion-Based STFT Phase Retrieval. 1-5 - Dawei Dai, Yutang Li, Liang Wang, Shiyu Fu, Shuyin Xia, Guoyin Wang:
Sketch Less Face Image Retrieval: A New Challenge. 1-5 - Wen Cheng, Shichen Dong, Wei Wang:
W2KPE: Keyphrase Extraction with Word-Word Relation. 1-2 - Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Muhammad Zaigham Zaheer, Karthik Nandakumar, Muhammad Haroon Yousaf, Arif Mahmood:
Single-branch Network for Multimodal Training. 1-5 - Changzeng Fu, Zhenghan Chen, Jiaqi Shi, Bowen Wu, Chaoran Liu, Carlos Toshinori Ishi, Hiroshi Ishiguro:
HAG: Hierarchical Attention with Graph Network for Dialogue Act Classification in Conversation. 1-5 - Dror Jacoby, Jonatan Ostrometzky, Hagit Messer:
Model-based vs. Data-driven Approaches for Predicting Rain-induced Attenuation in Commercial Microwave Links: A Comparative Empirical Study. 1-5 - Muzhou Yu, Sia Huat Tan, Kailu Wu, Runpei Dong, Linfeng Zhang, Karsheng Ma:
CORSD: Class-Oriented Relational Self Distillation. 1-5 - Jun Xue, Cunhang Fan, Jiangyan Yi, Chenglong Wang, Zhengqi Wen, Dan Zhang, Zhao Lv:
Learning From Yourself: A Self-Distillation Method For Fake Speech Detection. 1-5 - Zhangying Weng, Peng Li, Xin Zhuang, Xuefeng Yan, Lina Gong, Haoran Xie, Mingqiang Wei:
ifUNet++: Iterative Feedback UNet++ for Infrared Small Target Detection. 1-5 - Weihang Ding, Mohammad Shikh-Bahaei:
HARQ Delay Minimization of 5G Wireless Network with Imperfect Feedback. 1-5 - Yuchen Wong, Qingni Shen, Cong Li, Cunzhan Liu, Tianxiang Ai:
Detecting Malicious Migration on Edge to Prevent Running Data Leakage. 1-5 - Chunyang Fu, Xiang Zhang, Thuong Nguyen-Canh, Xiaozhong Xu, Ge Li, Shan Liu:
Surface-Sampling Based Objective Quality Assessment Metrics for Meshes. 1-5 - Annika Briegleb, Mhd Modar Halimeh, Walter Kellermann:
Exploiting Spatial Information with the Informed Complex-Valued Spatial Autoencoder for Target Speaker Extraction. 1-5 - Ryouichi Nishimura, Kenichi Takizawa:
Simultaneous Estimation of Direction of Arrival and Sound Speed Using a Non-Uniform Sensor Array. 1-5 - Rossen Nenov, Dang-Khoa Nguyen, Peter Balazs:
Faster Than Fast: Accelerating the Griffin-Lim Algorithm. 1-5 - Shengfang Zhai, Qingni Shen, Xiaoyi Chen, Weilong Wang, Cong Li, Yuejian Fang, Zhonghai Wu:
NCL: Textual Backdoor Defense Using Noise-Augmented Contrastive Learning. 1-5 - Yongzi Yu, Wanyong Qiu, Chen Quan, Kun Qian, Zhihua Wang, Yu Ma, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto:
Federated Intelligent Terminals Facilitate Stuttering Monitoring. 1-5 - Farwa Abbas, Verity M. McClelland, Zoran Cvetkovic, Wei Dai:
SS-ADMM: Stationary and Sparse Granger Causal Discovery for Cortico-Muscular Coupling. 1-5 - Feihu Jin, Jinliang Lu, Jiajun Zhang:
Unified Prompt Learning Makes Pre-Trained Language Models Better Few-Shot Learners. 1-5 - Charalampos Symeonidis, Ioannis Mademlis, Ioannis Pitas, Nikos Nikolaidis:
Efficient Feature Extraction for Non-Maximum Suppression in Visual Person Detection. 1-5 - Akshayaa Magesh, Zhongchang Sun, Venugopal V. Veeravalli, Shaofeng Zou:
Robust Hypothesis Testing With Moment Constrained Uncertainty Sets. 1-5 - Rohan R. Pote, Bhaskar D. Rao:
Light-Weight Sequential SBL Algorithm: An Alternative to OMP. 1-5 - Yunyang Zeng, Joseph Konan, Shuo Han, David Bick, Muqiao Yang, Anurag Kumar, Shinji Watanabe, Bhiksha Raj:
TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement. 1-5 - Fang-Qi Li, Shi-Lin Wang, Yun Zhu:
Measure and Countermeasure of the Capsulation Attack Against Backdoor-Based Deep Neural Network Watermarks. 1-5 - Zijian Yang, Wei Zhou, Ralf Schlüter, Hermann Ney:
Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers. 1-5 - Yajie Liu, Xinmeng Xu, Weiping Tu, Yuhong Yang, Li Xiao:
Improving Acoustic Echo Cancellation by Mixing Speech Local and Global Features with Transformer. 1-5 - Tanoj Langore, Te-Cheng Hsu, Yi-Hsien Hsieh, Che Lin:
LE-DTA: Local Extrema Convolution for Drug Target Affinity Prediction. 1-5 - Antonio Agudo:
Detail-Aware Uncalibrated Photometric Stereo. 1-5 - Timur Locher, Guy Revach, Nir Shlezinger, Ruud J. G. van Sloun, Rik Vullings:
Hierarchical Filtering With Online Learned Priors for ECG Denoising. 1-5 - Ioannis C. Tsaknakis, Prashant Khanduri, Mingyi Hong:
An Implicit Gradient Method for Constrained Bilevel Problems Using Barrier Approximation. 1-5 - Tanvir Mahmud, Feng Liang, Yaling Qing, Diana Marculescu:
CLIP4VideoCap: Rethinking Clip for Video Captioning with Multiscale Temporal Fusion and Commonsense Knowledge. 1-5 - Rowel Atienza:
EfficientSpeech: An On-Device Text to Speech Model. 1-5 - Chang-Sung Sung, Jun-Cheng Chen, Chu-Song Chen:
Hearing and Seeing Abnormality: Self-Supervised Audio-Visual Mutual Learning for Deepfake Detection. 1-5 - Zhifang Guo, Yichong Leng, Yihan Wu, Sheng Zhao, Xu Tan:
PromptTTS: Controllable Text-To-Speech With Text Descriptions. 1-5 - Sébastien Journé, Nicolas Le Bihan, Florent Chatelain, Julien Flamant:
Polarized Signal Singular Spectrum Analysis with Complex SSA. 1-5 - Lequan Lin, Junbin Gao:
A Magnetic Framelet-Based Convolutional Neural Network for Directed Graphs. 1-5 - Shivani Gowda, Yifan Hu, Mandy Korpusik:
Multi-Modal Food Classification in a Diet Tracking System with Spoken and Visual Inputs. 1-5 - Yuhang Yang, Haihua Xu, Hao Huang, Eng Siong Chng, Sheng Li:
Speech-Text Based Multi-Modal Training with Bidirectional Attention for Improved Speech Recognition. 1-5 - Bin Xie, Hao Tang, Bin Duan, Dawen Cai, Yan Yan:
MLP-GAN for Brain Vessel Image Segmentation. 1-5 - Ruixiang Chen, Sheng Liu, Junhao Chen, Bingnan Guo, Feng Zhang:
VLKP: Video Instance Segmentation with Visual-Linguistic Knowledge Prompts. 1-5 - Guanlong Zhao, Quan Wang, Han Lu, Yiling Huang, Ignacio López-Moreno:
Augmenting Transformer-Transducer Based Speaker Change Detection with Token-Level Training Loss. 1-5 - Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso:
Privacy-Preserving Automatic Speaker Diarization. 1-5 - Haniyeh Ehsani Oskouie, Farzan Farnia:
Interpretation of Neural Networks is Susceptible to Universal Adversarial Perturbations. 1-5 - Xovee Xu, Yutao Wei, Pengyu Wang, Xucheng Luo, Fan Zhou, Goce Trajcevski:
Diffusion Probabilistic Modeling for Fine-Grained Urban Traffic Flow Inference with Relaxed Structural Constraint. 1-5 - Weiquan Fan, Xiaofen Xing, Bolun Cai, Xiangmin Xu:
MGAT: Multi-Granularity Attention Based Transformers for Multi-Modal Emotion Recognition. 1-5 - Jacob J. Webber, Cassia Valentini-Botinhao, Evelyn Williams, Gustav Eje Henter, Simon King:
Autovocoder: Fast Waveform Generation from a Learned Speech Representation Using Differentiable Digital Signal Processing. 1-5 - Alan Yang, Tara Yasmin Mina, Grace Xingxin Gao:
Binary Sequence Set Optimization for CDMA Applications via Mixed-Integer Quadratic Programming. 1-5 - Yang Zhou, Hongxia Wang, Qiang Zeng, Rui Zhang, Sijiang Meng:
A Discriminative Multi-Channel Noise Feature Representation Model for Image Manipulation Localization. 1-5 - Xueyan Zhou, Jiacen Guo, Hao Liu, Chao Wang:
A Fusion-Based and Multi-Layer Method for Low Light Image Enhancement. 1-5 - Puneesh Deora, Christos Thrampoulidis:
On Weighted Cross-Entropy for Label-Imbalanced Separable Data: An Algorithmic-Stability Study. 1-5 - Ansel MacLaughlin, Anna Rumshisky, Rinat Khaziev, Anil Ramakrishna, Yuval Merhav, Rahul Gupta:
Self-Healing Through Error Detection, Attribution, and Retraining. 1-5 - Yuxin Yang, Xia Sun, Qiang Lu, Richard F. E. Sutcliffe, Jun Feng:
A Sentiment and Syntactic-Aware Graph Convolutional Network for Aspect-Level Sentiment Classification. 1-5 - Magda Amiridi, Cheng Qian, Nicholas D. Sidiropoulos, Lucas M. Glass:
Enrollment Rate Prediction in Clinical Trials based on CDF Sketching and Tensor Factorization tools. 1-5 - Zhongling Liu, Rujie Liu, Ziqiang Shi, Liu Liu, Xiaoyu Mi, Kentaro Murase:
Semi-Supervised Contrastive Learning with Soft Mask Attention for Facial Action Unit Detection. 1-5 - Sankha Subhra Bhattacharjee, Liming Shi, Guoli Ping, Xiaoxiang Shen, Mads Græsbøll Christensen:
Study And Design Of Robust Personal Sound Zones With VAST Using Low Rank RIRs. 1-5 - Junyi He, Meimei Wu, Meng Li, Xiaobo Zhu, Feng Ye:
Multilevel Transformer for Multimodal Emotion Recognition. 1-5 - Jie Qin, Peng Zheng, Yichao Yan, Rong Quan, Xiaogang Cheng, Bingbing Ni:
Movienet-PS: A Large-Scale Person Search Dataset in the Wild. 1-5 - Mohammad Amin Omidi, Babak Seyfe, Shahrokh Valaee:
Reducing the Computational Complexity of Learning with Random Convolutional Features. 1-5 - Steven Vander Eeckt, Hugo Van hamme:
Using Adapters to Overcome Catastrophic Forgetting in End-to-End Automatic Speech Recognition. 1-5 - Linhan Zhang, Qian Chen, Wen Wang, Chong Deng, Xin Cao, Kongzhang Hao, Yuxin Jiang, Wei Wang:
Weighted Sampling for Masked Language Modeling. 1-5 - Chenyang Gao, Yue Gu, Francesco Calivá, Yuzong Liu:
Self-Supervised Speech Representation Learning for Keyword-Spotting With Light-Weight Transformers. 1-5 - Robert Kuku Fotock, Alessio Zappone, Marco Di Renzo:
Energy Efficiency Maximization in RIS-aided Networks with Global Reflection Constraints. 1-5 - Hongjia Zhai, Hai Li, Hanzhi Zhang, Hujun Bao, Guofeng Zhang:
Self-Distillation Hashing for Efficient Hamming Space Retrieval. 1-5 - Rokia Abdein, Xuezhi Xiang, Ning Lv, Abdulmotaleb El-Saddik:
Deformable Cross Attention for Learning Optical Flow. 1-5 - Lihua Zhang, Quan Liu, Zhigang Huang, Lan Wu:
Learning Unbiased Rewards with Mutual Information in Adversarial Imitation Learning. 1-5 - Shiyu Chen, Wenxin Yu, Qi Wang, Jun Gong, Peng Chen:
Image Inpainting with Semantic-Aware Transformer. 1-5 - Woan-Shiuan Chien, Chi-Chun Lee:
Achieving Fair Speech Emotion Recognition via Perceptual Fairness. 1-5 - Fan Cui, Liyong Guo, Lang He, Jiyao Liu, Ercheng Pei, Yujun Wang, Dongmei Jiang:
Relate Auditory Speech To EEG By Shallow-Deep Attention-Based Network. 1-2 - Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu:
EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance. 1-5 - Qin Shi, Liang Liu, Shuowen Zhang:
Joint Data Association, NLOS Mitigation, and Clutter Suppression for Networked Device-Free Sensing in 6G Cellular Network. 1-5 - Kazuyuki Arikawa, Shoichi Koyama, Hiroshi Saruwatari:
Spatial Active Noise Control Method Based on Sound Field Interpolation from Reference Microphone Signals. 1-5 - Cal Peyser, Michael Picheny, Kyunghyun Cho, Rohit Prabhavalkar, W. Ronny Huang, Tara N. Sainath:
A Comparison of Semi-Supervised Learning Techniques for Streaming ASR at Scale. 1-5 - Zehua Zhang, Shiyun Xu, Xuyi Zhuang, Yukun Qian, Lianyu Zhou, Mingjiang Wang:
Half-Temporal and Half-Frequency Attention U2Net for Speech Signal Improvement. 1-2 - Salvatore Calcagno, Raffaele Mineo, Daniela Giordano, Concetto Spampinato:
Ensemble and Personalized Transformer Models for Subject Identification and Relapse Detection in E-Prevention Challenge. 1-2 - Daniel Fejgin, Simon Doclo:
Assisted RTF-Vector-Based Binaural Direction of Arrival Estimation Exploiting A Calibrated External Microphone Array. 1-5 - Akshay S. Bondre, Christ D. Richmond, Ahmed Alkhateeb, Nicolò Michelusi:
Sparse Delay-Doppler Channel Estimation for OTFS Modulation Using 2D-MUSIC. 1-5 - Jiancai Zhu, Jiabao Zhao, Jiayi Zhou, Liang He, Jing Yang, Zhi Zhang:
Uncertainty-Aware Few-Shot Class-Incremental Learning. 1-5 - Jiayan Guo, Meiqi Chen, Yan Zhang, Jianqiang Huang, Zhiwei Liu:
Hierarchical Hypergraph Recurrent Attention Network for Temporal Knowledge Graph Reasoning. 1-5 - Shana Moothedath, Namrata Vaswani:
Comparing Decentralized Gradient Descent Approaches and Guarantees. 1-5 - Hao-Wen Dong, Ke Chen, Shlomo Dubnov, Julian J. McAuley, Taylor Berg-Kirkpatrick:
Multitrack Music Transformer. 1-5 - Mingrui He, Tianyu Chen, Haoyi Zhou, Shanghang Zhang, Jianxin Li:
BadRes: Reveal the Backdoors Through Residual Connection. 1-5 - Mohamed Gueye, Yazid Attabi, Maxime Dumas:
Row Conditional-TGAN for Generating Synthetic Relational Databases. 1-5 - Ziya Gülgün, Erik G. Larsson:
Channel Estimation in Massive MIMO with Heavy-Tailed Noise: Gaussian-Mixture Versus Cauchy Models. 1-4 - Serge Kas Hanna, Zhiyuan Tan, Wen Xu, Antonia Wachter-Zeh:
Codes Correcting Burst and Arbitrary Erasures for Reliable and Low-Latency Communication. 1-5 - Daniel Tompkins, Dimitra Emmanouilidou, Soham Deshmukh, Benjamin Elizalde:
Multi-View Learning for Speech Emotion Recognition with Categorical Emotion, Categorical Sentiment, and Dimensional Scores. 1-5 - Teerapat Jenrungrot, Michael Chinen, W. Bastiaan Kleijn, Jan Skoglund, Zalán Borsos, Neil Zeghidour, Marco Tagliasacchi:
LMCodec: A Low Bitrate Speech Codec with Causal Transformer Models. 1-5 - Prasenjit Mondal, Ayush Pant, Sachin Soni:
Dewarping Documents Using C2 Continuous Boundary Estimation. 1-5 - Michele Cirillo, Vincenzo Matta, Ali H. Sayed:
Learning Dynamic Graphs under Partial Observability. 1-5 - Jie Wang, Zhicong Chen, Haodong Zhou, Lin Li, Qingyang Hong:
Community Detection Graph Convolutional Network for Overlap-Aware Speaker Diarization. 1-5 - Simon Vary, Hazan Daglayan, Laurent Jacques, Pierre-Antoine Absil:
Low-Rank Plus Sparse Trajectory Decomposition for Direct Exoplanet Imaging. 1-5 - Zhicong Chen, Jie Wang, Wenxuan Hu, Lin Li, Qingyang Hong:
Unsupervised Speaker Verification Using Pre-Trained Model and Label Correction. 1-5 - Tengtao Song, Nuo Chen, Ji Jiang, Zhihong Zhu, Yuexian Zou:
Improving Retrieval-Based Dialogue System Via Syntax-Informed Attention. 1-5 - Qianshuo Hu, Hong Liu, Huaqiu Wang, Mengyuan Liu:
Body Prior Guided Graph Convolutional Neural Network for Skeleton-Based Action Recognition. 1-5 - Haibin Yu, Yuxuan Hu, Yao Qian, Ma Jin, Linquan Liu, Shujie Liu, Yu Shi, Yanmin Qian, Edward Lin, Michael Zeng:
Code-Switching Text Generation and Injection in Mandarin-English ASR. 1-5 - Binglin Li, Jie Liang, Haisheng Fu, Jingning Han:
ROI-Based Deep Image Compression with Swin Transformers. 1-5 - Baichuan Huang, Azra Abtahi, Amir Aminifar:
Lightweight Machine Learning for Seizure Detection on Wearable Devices. 1-2 - Jiayu Li, Tianyun Zhang, Shengmin Jin, Reza Zafarani:
Semi-Supervised Graph Ultra-Sparsifier Using Reweighted ℓ1 Optimization. 1-5 - Junxiang Ruan, Xiangtao Kong, Wenqi Huang, Wenming Yang:
Retiformer: Retinex-Based Enhancement In Transformer For Low-Light Image. 1-5 - Daniel Nicholls, Jack Wells, Alex W. Robinson, Amirafshar Moshtaghpour, Maryna Kobylynska, Roland A. Fleck, Angus I. Kirkland, Nigel D. Browning:
A Targeted Sampling Strategy for Compressive Cryo Focused Ion Beam Scanning Electron Microscopy. 1-5 - Bin Ren, Hao Tang, Yiming Wang, Xia Li, Wei Wang, Nicu Sebe:
PI-Trans: Parallel-ConvMLP and Implicit-Transformation Based GAN for Cross-View Image Translation. 1-5 - Ahmed M. A. Shaalan, Jun Du:
Super Dilated Nested Arrays with Ideal Critical Weights and Increased Degrees of Freedom. 1-5 - Jan Dorazil, Bernard H. Fleury, Franz Hlawatsch:
Bayesian Methods for Optical Flow Estimation Using a Variational Approximation, with Applications to Ultrasound. 1-5 - Shuo-Yiin Chang, Chao Zhang, Tara N. Sainath, Bo Li, Trevor Strohman:
Context-Aware end-to-end ASR Using Self-Attentive Embedding and Tensor Fusion. 1-5 - Ying Zhou, Xuefeng Liang, Shiquan Zheng, Huijun Xuan, Takatsune Kumada:
Adaptive Mask Co-Optimization for Modal Dependence in Multimodal Learning. 1-5 - Mocho Go, Hideyuki Tachibana:
GSWIN: Gated MLP Vision Model with Hierarchical Structure of Shifted Window. 1-5 - Xiang Gao, Honghui Lin, Yu Li, Ruiyan Fang, Xin Zhang:
Look and Think: Intrinsic Unification of Self-Attention and Convolution for Spatial-Channel Specificity. 1-5 - Leheng Sheng, Wenhan Wang, Zhiyi Shi, Jichao Zhan, Youyong Kong:
Brainnetformer: Decoding Brain Cognitive States with Spatial-Temporal Cross Attention. 1-5 - Binh P. Nguyen, Michael Nigro, Alice Rueda, Venkat Bhat, Sridhar Krishnan:
Digital Phenotype Representation by Statistical, Information Theory, Data-Driven Approach with Digital Health Data. 1-5 - Ankita Pasad, Bowen Shi, Karen Livescu:
Comparative Layer-Wise Analysis of Self-Supervised Speech Models. 1-5 - Yong-Yeon Jo, Young Sang Choi, Jong-Hwan Jang, Joon-Myoung Kwon:
ECGT2T: Towards Synthesizing Twelve-Lead Electrocardiograms from Two Asynchronous Leads. 1-5 - Aleksandr Laptev, Vladimir Bataev, Igor Gitman, Boris Ginsburg:
Powerful and Extensible WFST Framework for RNN-Transducer Losses. 1-5 - Georgios Vasileios Karanikolas, Alba Pagès-Zamora, Georgios B. Giannakis:
Higher-Order Link Prediction Via Learnable Maximum Mean Discrepancy. 1-5 - Zhixuan Li, Ruohua Shi, Tiejun Huang, Tingting Jiang:
OAFormer: Learning Occlusion Distinguishable Feature for Amodal Instance Segmentation. 1-5 - Supritha M. Shetty, Shraddha Revankar, Nalini C. Iyer, K. T. Deepak:
F0 Estimation From Telephone Speech Using Deep Feature Loss. 1-5 - Xujiang Zhao, Xuchao Zhang, Chen Zhao, Jin-Hee Cho, Lance M. Kaplan, Dong Hyun Jeong, Audun Jøsang, Haifeng Chen, Feng Chen:
Multi-Label Temporal Evidential Neural Networks for Early Event Detection. 1-5 - Md. Ershadul Haque, Manoranjan Paul, Anwar Ulhaq, Tanmoy Debnath:
A Novel State Connection Strategy for Quantum Computing to Represent and Compress Digital Images. 1-5 - Ben Gabrielson, Mingyu Sun, Mohammad A. B. S. Akhonda, Vince D. Calhoun, Tülay Adali:
Independent Vector Analysis with Multivariate Gaussian Model: a Scalable Method by Multilinear Regression. 1-5 - Mohsen Abdoli, Gordon Clare, Félix Henry:
GOP-Based Latent Refinement for Learned Video Coding. 1-5 - Yongqiang Wang, Zhehuai Chen, Chengjian Zheng, Yu Zhang, Wei Han, Parisa Haghani:
Accelerating RNN-T Training and Inference Using CTC Guidance. 1-5 - Haoxiang Zhang, He Jiang, Ziqiang Wang, Deqiang Cheng:
Ontology-Aware Network for Zero-Shot Sketch-Based Image Retrieval. 1-5 - Rajat Hebbar, Digbalay Bose, Krishna Somandepalli, Veena Vijai, Shrikanth Narayanan:
A Dataset for Audio-Visual Sound Event Detection in Movies. 1-5 - William Chen, Brian Yan, Jiatong Shi, Yifan Peng, Soumi Maiti, Shinji Watanabe:
Improving Massively Multilingual ASR with Auxiliary CTC Objectives. 1-5 - Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
MUG: A General Meeting Understanding and Generation Benchmark. 1-5 - Rémi Piau, Thomas Maugey, Aline Roumy:
Learning on Entropy Coded Images with CNN. 1-5 - Bo Dekker, Alfred C. Schouten, Odette Scharenborg:
DAIS: The Delft Database of EEG Recordings of Dutch Articulated and Imagined Speech. 1-5 - Chengyu Zheng, Yuan Zhou, Xiulian Peng, Yuan Zhang, Yan Lu:
Real-Time Speech Enhancement with Dynamic Attention Span. 1-5 - Bohan Tang, Siheng Chen, Xiaowen Dong:
Learning Hypergraphs From Signals With Dual Smoothness Prior. 1-5 - Kazuhiro Kobayashi, Tomoki Hayashi, Tomoki Toda:
Low-Latency Electrolaryngeal Speech Enhancement Based on Fastspeech2-Based Voice Conversion and Self-Supervised Speech Representation. 1-5 - Yuan Huang, Yuting Tang, Xiu Zheng, Jie Tang:
CPD-GAN: Cascaded Pyramid Deformation GAN for Pose Transfer. 1-5 - Jenthe Thienpondt, Nilesh Madhu, Kris Demuynck:
Margin-Mixup: A Method for Robust Speaker Verification In Multi-Speaker Audio. 1-5 - Peiying Wang, Chaoqun Duan, Meng Chen, Xiaodong He:
Improving Disfluency Detection with Multi-Scale Self Attention and Contrastive Learning. 1-5 - Zhongyu Yang, Chen Shen, Wei Shao, Tengfei Xing, Runbo Hu, Pengfei Xu, Hua Chai, Ruini Xue:
CANet: Curved Guide Line Network with Adaptive Decoder for Lane Detection. 1-5 - Jae-Heung Cho, Joon-Hyuk Chang:
CAN2V: Can-Bus Data-Based Seq2seq Model for Vehicle Velocity Prediction. 1-5 - Mahdi Namazifar, Devamanyu Hazarika, Dilek Hakkani-Tür:
Role of Bias Terms in Dot-Product Attention. 1-5 - Angélica S. Z. Suárez, Clément Laroche, Line H. Clemmensen, Sneha Das:
On Crowdsourcing-Design with Comparison Category Rating for Evaluating Speech Enhancement Algorithms. 1-5 - Yuning Wu, Jiatong Shi, Tao Qian, Dongji Gao, Qin Jin:
Phoneix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation With Phoneme Distribution Predictor. 1-5 - Saidur R. Pavel, Yimin D. Zhang, Maria S. Greco, Fulvio Gini:
Deep Learning-Based Compressive Sampling Optimization in Massive MIMO Systems. 1-5 - Tao Li, Huayu Shou, Yuchen Deng, Yu Zhou, Chenqi Shi, Pengpeng Chen:
A Novel Heart Rate Estimation Method Exploiting Heartbeat Second Harmonic Reconstruction Via Millimeter Wave Radar. 1-5 - Daniel Mas Montserrat, Alexander G. Ioannidis:
Adversarial Attacks on Genotype Sequences. 1-5 - Francesco Binucci, Paolo Banelli:
BER-Aware Dynamic Resource Management for Edge-Assisted Goal-Oriented Communications. 1-5 - Chenxu Niu, Yue Hu, Wei Peng, Yuqiang Xie:
Learning to Balance the Global Coherence and Informativeness in Knowledge-Grounded Dialogue Generation. 1-5 - Wenbo Shi, Wenming Yang, Qingmin Liao:
Robust Content-Variant Reference Image Quality Assessment Via Similar Patch Matching. 1-5 - Yanjia Li, Lahiru Samarakoon, Ivan Fung:
Improving Non-Autoregressive Speech Recognition with Autoregressive Pretraining. 1-5 - Niladri Halder, K. P. Arunkumar, Chandra R. Murthy:
Variational Bayesian Channel Estimation in Wideband Multi-Scale Multi-Lag Channels. 1-5 - Jian Xiong, Sifan Wu, Wang Luo, Jinli Suo, Hao Gao:
ψ-Net: Point Structural Information Network for No-Reference Point Cloud Quality Assessment. 1-5 - Shuxin Qin, Yongcan Luo, Gaofeng Tao:
Memory-Augmented U-Transformer For Multivariate Time Series Anomaly Detection. 1-5 - Jonah Anton, Harry Coppock, Pancham Shukla, Björn W. Schuller:
Audio Barlow Twins: Self-Supervised Audio Representation Learning. 1-5 - Boyang Zhang, Suping Wu, Meining Jia:
Time-Frequency Awareness Network For Human Mesh Recovery From Videos. 1-5 - Jiseob Kim, Kyuhong Shim, Junhan Kim, Byonghyo Shim:
Vision Transformer-Based Feature Extraction for Generalized Zero-Shot Learning. 1-5 - Na Jiang, Wei Quan, Qichuan Geng, Zhi-Ping Shi, Peng Xu:
Exploiting 3D Human Recovery for Action Recognition with Spatio-Temporal Bifurcation Fusion. 1-5 - Chao Liu, Ruipeng Ma, Zheng Si, Mingmin Chi:
A Method of Constructing and Automatically Labeling Radio Frequency Signal Training Dataset for UAV. 1-5 - Zixuan Xiao, Shengshi Yao, Jincheng Dai, Sixian Wang, Kai Niu, Ping Zhang:
Wireless Deep Speech Semantic Transmission. 1-5 - Zihao Guo, Shilin Wang:
Content-Insensitive Dynamic Lip Feature Extraction for Visual Speaker Authentication Against Deepfake Attacks. 1-5 - Felix Wu, Kwangyoun Kim, Shinji Watanabe, Kyu Jeong Han, Ryan McDonald, Kilian Q. Weinberger, Yoav Artzi:
Wav2Seq: Pre-Training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages. 1-5 - Xiaoyu Lin, Xiaoyu Bie, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda:
Speech Modeling with a Hierarchical Transformer Dynamical VAE. 1-5 - Ibrahim Alkanhal, Abdullah Almansour, Lamia Alsalloom, Raied Aljadaany, Marios Savvides:
Cov Loss: Covariance-Based Loss for Deep Face Recognition. 1-5 - Xue Yao, Guolong Cui, Xianxiang Yu:
Dual-Use Signal Design for MIMO Radcom with Inter-Pulse Index Modulation. 1-5 - Victor Solo:
Asymptotic Bias and Variance of Kernel Ridge Regression. 1-5 - Atli Þór Sigurgeirsson, Simon King:
Do Prosody Transfer Models Transfer Prosody? 1-5 - Shanshan Wang, Soumya Tripathy, Annamaria Mesaros:
Self-Supervised Learning of Audio Representations using Angular Contrastive Loss. 1-5 - Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei:
Visual-Aware Text-to-Speech. 1-5 - Liwen Peng, Songlei Jian, Dongsheng Li, Siqi Shen:
MRML: Multimodal Rumor Detection by Deep Metric Learning. 1-5 - Zhimin He, Jiangbo Qian, Diqun Yan, Chong Wang, Yu Xin:
Animal Re-Identification Algorithm for Posture Diversity. 1-5 - Daichi Guo, Guanting Dong, Dayuan Fu, Yuxiang Wu, Chen Zeng, Tingfeng Hui, Liwen Wang, Xuefeng Li, Zechen Wang, Keqing He, Xinyue Cui, Weiran Xu:
Revisit Out-Of-Vocabulary Problem For Slot Filling: A Unified Contrastive Framework With Multi-Level Data Augmentations. 1-5 - Farshad G. Veshki, Sergiy A. Vorobyov:
Efficient Online Convolutional Dictionary Learning Using Approximate Sparse Components. 1-5 - Yidi Zhang, Wenqi Huang, Wenming Yang:
Global Matching-Optimization Network for Stereo Depth Estimation. 1-5 - Xuechao He, Jiaojiao Zhang, Qing Ling:
Byzantine-Robust and Communication-Efficient Personalized Federated Learning. 1-5 - Jin Zeng, Yang Liu, Gene Cheung, Wei Hu:
Sparse Graph Learning with Spectrum Prior for Deep Graph Convolutional Networks. 1-5 - Li Fu, Siqi Li, Qingtao Li, Liping Deng, Fangzhu Li, Fan Lu, Meng Chen, Xiaodong He:
UFO2: A Unified Pre-Training Framework for Online and Offline Speech Recognition. 1-5 - Haohan Luo, Feng Wang:
A Simulation-Based Framework for Urban Traffic Accident Detection. 1-5 - Chris Henry, Rijun Liao, Ruiyuan Lin, Zhebin Zhang, Hongyu Sun, Zhu Li:
Lightweight Fisher Vector Transfer Learning for Video Deduplication. 1-5 - Ofer Schwartz, Ayal Schwartz:
RNN-Based Step-Size Estimation for the RLS Algorithm with Application to Acoustic Echo Cancellation. 1-5 - Wei Huang, Haiyang Zhang, Nir Shlezinger, Yonina C. Eldar:
Joint Microstrip Selection and Beamforming Design for MmWave Systems with Dynamic Metasurface Antennas. 1-5 - Hexiang Zhang, Zhenghua Xu, Dan Yao, Shuo Zhang, Junyang Chen, Thomas Lukasiewicz:
Multi-Head Feature Pyramid Networks for Breast Mass Detection. 1-5 - Zishuo Zhao, Yuexiang Xie, Jingyou Xie, Zhenzhou Lin, Yaliang Li, Ying Shen:
Source-Free Unsupervised Domain Adaptation for Question Answering. 1-5 - Qingqing Zhao, Yanting Ma, Petros Boufounos, Saleh Nabi, Hassan Mansour:
Deep Born Operator Learning for Reflection Tomographic Imaging. 1-5 - Kangdi Mei, Xinyun Ding, Yinlong Liu, Zhiqiang Guo, Feiyang Xu, Xin Li, Tuya Naren, Jiahong Yuan, Zhenhua Ling:
The USTC System for ADReSS-M Challenge. 1-2 - Hai Victor Habi, Hagit Messer, Yoram Bresler:
Learned Generative Misspecified Lower Bound. 1-5 - Tzu-Ting Chuang, Ting-Yun Wei, Yu-Hsing Hsieh, Chu-Song Chen, Huei-Fang Yang:
Continual Cell Instance Segmentation of Microscopy Images. 1-5 - Pengcheng Guo, He Wang, Bingshen Mu, Ao Zhang, Peikun Chen:
The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge. 1-2 - Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Eng Siong Chng, Bin Ma:
Contrastive Speech Mixup for Low-Resource Keyword Spotting. 1-5 - Zechao Hu, Adrian G. Bors:
Few but Informative Local Hash Code Matching for Image Retrieval. 1-5 - Samik Sadhu, Hynek Hermansky:
Importance of Different Temporal Modulations of Speech: a Tale of two Perspectives. 1-5 - Shehzeen Hussain, Paarth Neekhara, Jocelyn Huang, Jason Li, Boris Ginsburg:
ACE-VC: Adaptive and Controllable Voice Conversion Using Explicitly Disentangled Self-Supervised Speech Representations. 1-5 - Charles Hovine, Alexander Bertrand:
A Distributed Adaptive Algorithm for Non-Smooth Spatial Filtering Problems. 1-5 - Xiaotong Zhang, Peng He, Han Liu, Zhengxi Yin, Xinyue Liu, Xianchao Zhang:
Knowledge-Aware Graph Convolutional Network with Utterance-Specific Window Search for Emotion Recognition In Conversations. 1-5 - Leonardo Fierro, Alec Wright, Vesa Välimäki, Matti S. Hämäläinen:
Extreme Audio Time Stretching Using Neural Synthesis. 1-5 - Arian Eamaz, Farhang Yeganegi, Mojtaba Soltanalian:
CyPMLI: WISL-Minimized Unimodular Sequence Design via Power Method-Like Iterations. 1-5 - Anup Singh, Kris Demuynck, Vipul Arora:
Simultaneously Learning Robust Audio Embeddings and Balanced Hash Codes for Query-by-Example. 1-5 - Fan Hu, Aozhu Chen, Xirong Li:
Towards Making a Trojan-Horse Attack on Text-to-Image Retrieval. 1-5 - Solomon Goldgraber Casspi, Oliver Hüsser, Guy Revach, Nir Shlezinger:
LQGNET: Hybrid Model-Based and Data-Driven Linear Quadratic Stochastic Control. 1-5 - Efthymios Tzinis, Gordon Wichern, Paris Smaragdis, Jonathan Le Roux:
Optimal Condition Training for Target Source Separation. 1-5 - Tianxiao Han, Jiancheng Tang, Qianqian Yang, Yiping Duan, Zhaoyang Zhang, Zhiguo Shi:
Generative Model based Highly Efficient Semantic Communication Approach for Image Transmission. 1-5 - Jian Ni, Yong Liao:
Semantics-Disentangled Contrastive Embedding for Generalized Zero-Shot Learning. 1-5 - Guillaume Le Guludec, Christine Guillemot:
Joint Neural Representation for Multiple Light Fields. 1-5 - Bin Liu, Fengfu Li, Xiaoxing Wang, Bo Zhang, Junchi Yan:
Ternary Weight Networks. 1-5