


default search action
ICASSP 2025: Hyderabad, India - Workshops
- IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Workshops, Hyderabad, India, April 6-11, 2025. IEEE 2025, ISBN 979-8-3315-1931-5

- Zihan Deng, Zhisheng Wang, Shanyuan Lin, Guohang He, Demin Jiang, Shunli Wang:

Computation Overhead Optimization Dual-Domain Network with Multi-Level Cross-Domain Connections for Sparse-view CT Reconstruction. 1-5 - Yihong Gao, Luteng Zhu, Zhuoyang An, Mingjie Shao:

Robust Phase Retrieval from Quantized and Noisy Measurements. 1-5 - Joanna Reszka, Parvaneh Janbakhshi, Tilak Purohit, Sadegh Mohammadi:

Investigating the Effects of Diffusion-based Conditional Generative Speech Models Used for Speech Enhancement on Dysarthric Speech. 1-5 - Meenakshi Krishnan, Liam Fowl, Ramani Duraiswami:

3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering. 1-5 - Yangbin Chen, Chenyang Xu, Chunfeng Liang, Yanbao Tao, Chuan Shi:

Speech-based Clinical Depression Detection: An Empirical Study. 1-5 - Arathi K, Hotha Durga Swetha, V. G. Vaishali, Kalyan Munukutla, Abhijith V, Gurram Shalini, Joe Cheri Ross:

On Investigating a Better Audio Representation for Mood Classification in Indian Popular Music. 1-5 - Samuel Yen-Chi Chen, Huan-Hsin Tseng, Hsin-Yi Lin, Shinjae Yoo:

Learning to Measure Quantum Neural Networks. 1-5 - Jackie Lin, Georg Götz, Hermes Sampedro Llopis, Haukur Hafsteinsson, Steinar Guðjónsson, Daniel Gert Nielsen, Finnur Pind, Paris Smaragdis, Dinesh Manocha, John R. Hershey, Trausti T. Kristjansson, Minje Kim:

Generative Data Augmentation Challenge: Synthesis of Room Acoustics for Speaker Distance Estimation. 1-5 - Shivam Saini

, Jürgen Peissig:
HARP: A Large-Scale Higher-Order Ambisonic Room Impulse Response Dataset. 1-5 - Barbara Ruvolo, Tilak Purohit, Bogdan Vlasenko, Juan Rafael Orozco-Arroyave, Mathew Magimai-Doss:

Exploring the Complexity of Parkinson's Patient Speech for Depression Detection task: A Qualitative Analysis. 1-5 - Francesca Ronchini, Ho-Hsiang Wu, Wei-Cheng Lin, Fabio Antonacci:

Mind the Prompt: Prompting Strategies in Audio Generations for Improving Sound Classification. 1-5 - Tianming Yin, Yiyang Zhou, Xuzhou Ye, Qiuqiang Kong:

PAWS: A Physical Acoustic Wave Simulation Dataset for Sound Modeling and Rendering. 1-5 - Soham Korade, Suswara Pochampally:

A Notation Dataset for Indian Raga Music. 1-5 - Hamed Farhadi, Prayag Gowgi, David Sandberg:

AI/ML-driven beamforming with imperfect channel state information for multi-user MIMO transmission. 1-5 - Han Yin, Yang Xiao

, Jisheng Bai, Rohan Kumar Das:
Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection. 1-5 - Nishit Anand, Ashish Seth, Ramani Duraiswami, Dinesh Manocha:

TSPE: Task-Specific Prompt Ensemble for Improved Zero-Shot Audio Classification. 1-5 - Eleonora Mancini, Francesco Paissan, Paolo Torroni

, Mirco Ravanelli, Cem Subakan:
Investigating the Effectiveness of Explainability Methods in Parkinson's Detection from Speech. 1-5 - Bagus Tris Atmaja, Akira Sasou:

Pathological Voice Detection From Sustained Vowels: Handcrafted vs. Self-supervised Learning. 1-5 - Leonardo Spampinato, Enrico Testi, Chiara Buratti, Riccardo Marini:

Deep Meta Advisor-aided Exploration for UAV Trajectory Design in Vehicular Networks. 1-5 - Ayush Kumar Dwivedi, Taneli Riihonen:

Near-Field WPT Using Multisine Phase Alignment and Massive Antenna Array at Extreme Frequencies. 1-4 - Shashi Kumar, Iuliia Thorbecke, Sergio Burdisso, Esaú Villatoro-Tello, Manjunath K. E, Kadri Hacioglu, Pradeep Rangappa, Petr Motlícek, Aravind Ganapathiraju, Andreas Stolcke:

Performance Evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward. 1-5 - Kuan-Cheng Chen, Yi-Tien Li

, Tai-Yu Li, Chen-Yu Liu, Po-Heng Henry Lee, Cheng-Yu Chen:
CompressedMediQ: Hybrid Quantum Machine Learning Pipeline for High-Dimensional Neuroimaging Data. 1-5 - Bubai Maji, Rajlakshmi Guha, Aurobinda Routray, Shazia Nasreen, Debabrata Majumdar:

Prosody Disentanglement with Self-Supervised Speech Representation for Detecting Depression. 1-5 - Gopika Krishnan, Akshay Anantapadmanabhan, Kaustuv Kanti Ganguli, Carlos Guedes:

Investigating Temporal Convolutional Networks for Automated Stroke Transcription in the Mridangam. 1-5 - Pandit Vivek Kumar Pandey, Sitanshu Sekhar Sahu, Biswajit Karan, Juan Rafael Orozco-Arroyave:

Parkinson's Disease Detection Using Wavelet Packet Absolute Amplitude Deviation (WPAAD) from voice signals. 1-5 - Satvik Dixit, Soham Deshmukh, Bhiksha Raj:

MACE: Leveraging Audio for Evaluating Audio Captioning Systems. 1-5 - D. Patel, Joseph Tabrikian, Igal Bilik:

Adaptive Waveform Design for Cognitive MIMO ISAC. 1-5 - Sanket Shah, Kavya Ranjan Saxena, Kancharana Manideep Bharadwaj, Sharath Adavanne, Nagaraj Adiga:

IndicST: Indian Multilingual Translation Corpus For Evaluating Speech Large Language Models. 1-5 - V. T. Balamurugan, J. Hrithick Sundar, V. V. Meenakshi:

Computational Analysis and Classification of the Pannisai system of Indian Classical Music. 1-4 - Miguel Perez, Holger Kirchhoff, Peter Grosche, Xavier Serra:

Singing Voice Accompaniment Data Augmentation with Generative Models. 1-5 - Sascha Grollmisch, Thomas Köllmer

, Artem Yaroshchuk, Hanna M. Lukashevich:
Federated Semi-supervised Learning for Industrial Sound Analysis and Keyword Spotting. 1-5 - Karl El Hajal, Enno Hermann, Ajinkya Kulkarni, Mathew Magimai-Doss:

Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR. 1-5 - Lokes S, Suresh Reddy, Samuel Yen-Chi Chen:

Hybrid Encoding-based Quantum Self Attention for Financial Time Series Analysis. 1-5 - Raghavasimhan Sankaranarayanan

, Larry Heck, Gil Weinberg:
Gamaka Synthesis for Kalpitha Swaras in Carnatic Music. 1-5 - Yu-Chao Hsu, Tai-Yu Li, Kuan-Cheng Chen:

Quantum Kernel-Based Long Short-term Memory. 1-5 - Tomohiko Nakamura

, Kwanghee Choi, Keigo Hojo, Yoshiaki Bando, Satoru Fukayama, Shinji Watanabe:
Discrete Speech Unit Extraction via Independent Component Analysis. 1-5 - Adithi Shankar, Serafin Schweinitz, Genís Plaja-Roglans, Xavier Serra, Martín Rocamora

:
Disentangling Overlapping Sources: Improving Vocal and Violin Source Separation in Carnatic Music. 1-5 - Muhammad Ahmed Mohsin, Syed Muhammad Jameel, Hassan Rizwan, Muhammad Iqbal

, Tabinda Ashraf
, Jen-Yi Pan:
Transformer-based Distributed Machine Learning for Downlink Channel Estimation in RIS-Aided Networks. 1-5 - Raghavasimhan Sankaranarayanan

, Gil Weinberg:
Agnostic Automatic Melodic Accompaniment for Alapana in Carnatic Music. 1-5 - Srikar Chaganti, Francesco Fioranelli, Raj Thilak Rajan

:
Distributed ADMM for Target Localization using Radar Networks. 1-5 - Gopika Krishnan, Julia Drabek, Akshay Anantapadmanabhan, Kaustuv Kanti Ganguli, Carlos Guedes:

Closing the Loop on Speech to Music Translation: Automatically Generating Synthetic Percussive Sequences on the Mridangam from Konnakol. 1-5 - Damir Cavar, Koushik Reddy Parukola:

Word and Text Similarity Using Classical Word Embeddings in Quantum NLP Systems. 1-5 - Ninad Puranik, Travis J. West, Marcelo M. Wanderley, Gary P. Scavone:

Thoughts on Mapping and Interface Design of a Keyboard to Perform Continuous Pitch Ornamentations in Hindustani Music. 1-5 - Zhenxiao Fu, Fan Chen:

Quantum Neural Network Extraction Attack via Split Co-Teaching. 1-5 - Djallel Bouneffouf, Raphaël Féraud, Baihan Lin:

Multi-Armed Bandit with Sparse and Noisy Feedback. 1-5 - Xunmeng Wu, Zai Yang, Zongben Xu:

Low-Complexity Algorithms for Multichannel Spectral Super-Resolution. 1-5 - Kuan-Cheng Chen, Wenxuan Ma, Xiaotian Xu:

Consensus-based Distributed Quantum Kernel Learning for Speech Recognition. 1-5 - Thomas Nuttall, Xavier Serra, Lara Pearson:

Svara-Forms in Carnatic Music: Contextual Influences on the Performance of Svara. 1-5 - Huan-Hsin Tseng, Hsin-Yi Lin, Samuel Yen-Chi Chen, Shinjae Yoo:

Transfer Learning Analysis of Variational Quantum Circuits. 1-5 - Gowtham Premananth, Carol Y. Espy-Wilson:

Speech-Based Estimation of Schizophrenia Severity Using Feature Fusion. 1-5 - Chu-Hsuan Abraham Lin, Chen-Yu Liu, Samuel Yen-Chi Chen, Kuan-Cheng Chen:

Quantum-Trained Convolutional Neural Network for Deepfake Audio Detection. 1-5 - Anna Guerra, Francesco Guidi, Davide Dardari, Petar M. Djuric:

Assessing Model Proficiency in Autonomous Agents: A Signal Processing Perspective. 1-5 - Pranav Kulkarni, P. P. Vaidyanathan:

Generalized Constructions of Weight-Constrained Sparse Arrays. 1-5 - Xiaokun Zhao, Marija Iloska, Mónica F. Bugallo:

Fusion Strategies in Multiple Particle Filtering in the Presence of Shared Unknown Static Parameters. 1-5 - Zhiyong Chen, Xinnuo Li, Shuhang Wu, Zhi Yang, Zhiqi Ai

, Shugong Xu:
StableTTS: Towards Efficient Denoising Acoustic Decoder for Text to Speech Synthesis with Consistency Flow Matching. 1-5 - Ruoxi Cheng, Yizhong Ding, Shuirong Cao, Shitong Shao, Zhiqiang Wang:

USMID: A Unimodal Speaker-Level Membership Inference Detector for Contrastive Pretraining. 1-5 - Marco Piavanini, Simone Specchia, Mattia Brambilla

, Sergio Matteo Savaresi, Monica Nicoli:
Non-linear Variational Bayes Multiple Model for Positioning in Highway Tunnel. 1-5 - Yash Bhake, Preeti Rao:

Expressive Timing in Hindustani Vocal Music. 1-5 - Sumit Kumar, Parampreet Singh, Vipul Arora:

Confidence-Enhanced Models for Indian Art Music Analysis. 1-5 - Yueguan Wang, Tatsunari Matsushima, Soichiro Matsushima, Toshimitsu Sakai:

Enhancing and Exploring Mild Cognitive Impairment Detection with W2V-BERT-2.0. 1-5 - Zheqi Dai, Haolin He, Qiuqiang Kong:

Musimple: A Simplified Music Generation System with Diffusion Transformer. 1-5 - Jae-Sung Bae, Anastasia Kuznetsova, Dinesh Manocha, John R. Hershey, Trausti T. Kristjansson, Minje Kim:

Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement. 1-5 - Ruchi Pandey, Manjunath Mulimani, Archontis Politis

, Annamaria Mesaros
:
Class-Incremental Learning for Sound Event Localization and Detection. 1-5 - Sparsh Mittal, Yash Chand, Mintu Kumar, Neel Kanth Kundu

:
Hybrid Quantum Machine Learning based Human Speech Emotion Recognition. 1-5

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














