


Tuomas Virtanen
Person information

- affiliation: Tampere University of Technology, Finland
2020 – today
- 2023
- [i73]Paul Magron, Tuomas Virtanen:
Spectrogram Inversion for Audio Source Separation via Consistency, Mixing, and Magnitude Constraints. CoRR abs/2303.01864 (2023) - [i72]Wang Dai, Archontis Politis, Tuomas Virtanen:
Multi-Channel Masking with Learnable Filterbank for Sound Source Separation. CoRR abs/2303.07816 (2023) - [i71]Shayan Gharib, Minh Tran, Diep Luong, Konstantinos Drossos, Tuomas Virtanen:
Adversarial Representation Learning for Robust Privacy Preservation in Audio. CoRR abs/2305.00011 (2023) - [i70]Wei Xie, Yanxiong Li, Qianhua He, Wenchang Cao, Tuomas Virtanen:
Few-shot Class-incremental Audio Classification Using Adaptively-refined Prototypes. CoRR abs/2305.18045 (2023) - [i69]Parthasaarathy Sudarsanam, Tuomas Virtanen:
Attention-Based Methods For Audio Question Answering. CoRR abs/2305.19769 (2023) - [i68]Khazar Khorrami, María Andrea Cruz Blandón, Tuomas Virtanen, Okko Räsänen:
Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System. CoRR abs/2306.02972 (2023) - [i67]David Diaz-Guerra, Archontis Politis, Antonio Miguel, José Ramón Beltrán, Tuomas Virtanen:
Permutation Invariant Recurrent Neural Networks for Sound Source Tracking Applications. CoRR abs/2306.08510 (2023) - [i66]Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam, Daniel Krause, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji:
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. CoRR abs/2306.09126 (2023) - [i65]Huang Xie, Khazar Khorrami, Okko Räsänen, Tuomas Virtanen:
Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances. CoRR abs/2306.09820 (2023) - [i64]Diep Luong, Minh Tran, Shayan Gharib, Konstantinos Drossos, Tuomas Virtanen:
Representation Learning for Audio Privacy Preservation using Source Separation and Robust Adversarial Learning. CoRR abs/2308.04960 (2023) - 2022
- [j43]Björn W. Schuller, Yonina C. Eldar, Maja Pantic, Shrikanth Narayanan, Tuomas Virtanen, Jianhua Tao:
Editorial: Intelligent Signal Analysis for Contagious Virus Diseases. IEEE J. Sel. Top. Signal Process. 16(2): 159-163 (2022) - [j42]Shanshan Wang, Archontis Politis, Annamaria Mesaros, Tuomas Virtanen:
Self-Supervised Learning of Audio Representations From Audio-Visual Data Using Spatial Alignment. IEEE J. Sel. Top. Signal Process. 16(6): 1467-1479 (2022) - [c166]Irene Martín-Morató, Francesco Paissan, Alberto Ancilotto, Toni Heittola, Annamaria Mesaros, Elisabetta Farella, Alessio Brutti, Tuomas Virtanen:
Low-Complexity Acoustic Scene Classification in DCASE 2022 Challenge. DCASE 2022 - [c165]Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji, Tuomas Virtanen:
STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. DCASE 2022 - [c164]Huang Xie, Samuel Lipping, Tuomas Virtanen:
Language-Based Audio Retrieval Task in DCASE 2022 Challenge. DCASE 2022 - [c163]Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen:
Zero-Shot Audio Classification using Image Embeddings. EUSIPCO 2022: 1-5 - [c162]Ville-Veikko Eklund, Aleksandr Diment, Tuomas Virtanen:
Noise, Device and Room Robustness Methods for Pronunciation Error Detection. EUSIPCO 2022: 140-144 - [c161]Samuel Lipping, Parthasaarathy Sudarsanam, Konstantinos Drossos, Tuomas Virtanen:
Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering. EUSIPCO 2022: 1140-1144 - [c160]Huang Xie, Okko Räsänen, Konstantinos Drossos, Tuomas Virtanen:
Unsupervised Audio-Caption Aligning Learns Correspondences Between Individual Sound Events and Textual Phrases. ICASSP 2022: 8867-8871 - [c159]Yanxiong Li, Wenchang Cao, Konstantinos Drossos, Tuomas Virtanen:
Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network. MMSP 2022: 1-6 - [c158]Gaurav Naithani, Kirsi Pietilä, Riitta Niemistö, Erkki Paajanen, Tero Takala, Tuomas Virtanen:
Subjective Evaluation of Deep Neural Network Based Speech Enhancement Systems in Real-World Conditions. MMSP 2022: 1-6 - [i63]Samuel Lipping, Parthasaarathy Sudarsanam, Konstantinos Drossos, Tuomas Virtanen:
Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering. CoRR abs/2204.09634 (2022) - [i62]Shanshan Wang, Archontis Politis, Annamaria Mesaros, Tuomas Virtanen:
Self-supervised Learning of Audio Representations from Audio-Visual Data using Spatial Alignment. CoRR abs/2206.00970 (2022) - [i61]Archontis Politis, Kazuki Shimada, Parthasaarathy Sudarsanam, Sharath Adavanne, Daniel Krause, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji, Tuomas Virtanen:
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events. CoRR abs/2206.01948 (2022) - [i60]Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen:
Zero-Shot Audio Classification using Image Embeddings. CoRR abs/2206.04984 (2022) - [i59]Yanxiong Li, Wenchang Cao, Konstantinos Drossos, Tuomas Virtanen:
Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network. CoRR abs/2208.02406 (2022) - [i58]Gaurav Naithani, Kirsi Pietilä, Riitta Niemistö, Erkki Paajanen, Tero Takala, Tuomas Virtanen:
Subjective Evaluation of Deep Neural Network Based Speech Enhancement Systems in Real-World Conditions. CoRR abs/2208.05057 (2022) - [i57]David Diaz-Guerra, Archontis Politis, Tuomas Virtanen:
Position tracking of a varying number of sound sources with sliding permutation invariant training. CoRR abs/2210.14536 (2022) - [i56]Huang Xie, Okko Räsänen, Tuomas Virtanen:
On Negative Sampling for Contrastive Audio-Text Retrieval. CoRR abs/2211.04070 (2022) - 2021
- [j41]Szymon Drgas, Tuomas Virtanen:
Joint speaker separation and recognition using non-negative matrix deconvolution with adaptive dictionary. Comput. Speech Lang. 70: 101223 (2021) - [j40]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen, Mark D. Plumbley:
Sound Event Detection: A tutorial. IEEE Signal Process. Mag. 38(5): 67-83 (2021) - [j39]Archontis Politis, Annamaria Mesaros, Sharath Adavanne, Toni Heittola, Tuomas Virtanen:
Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019. IEEE ACM Trans. Audio Speech Lang. Process. 29: 684-698 (2021) - [j38]Huang Xie, Tuomas Virtanen:
Zero-Shot Audio Classification Via Semantic Embeddings. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1233-1242 (2021) - [c157]Shanshan Wang, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Audio-Visual Scene Classification: Analysis of DCASE 2021 Challenge Submissions. DCASE 2021: 45-49 - [c156]Irene Martín-Morató, Toni Heittola, Annamaria Mesaros, Tuomas Virtanen:
Low-Complexity Acoustic Scene Classification for Multi-Device Audio: Analysis of DCASE 2021 Challenge Systems. DCASE 2021: 85-89 - [c155]Archontis Politis, Sharath Adavanne, Daniel Krause, Antoine Deleforge, Prerak Srivastava, Tuomas Virtanen:
A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection. DCASE 2021: 125-129 - [c154]Shanshan Wang, Gaurav Naithani, Archontis Politis, Tuomas Virtanen:
Deep Neural Network Based Low-Latency Speech Separation with Asymmetric Analysis-Synthesis Window Pair. EUSIPCO 2021: 301-305 - [c153]Pasi Pertilä, Emre Cakir, Aapo Hakala, Eemi Fagerlund, Tuomas Virtanen, Archontis Politis, Antti J. Eronen:
Mobile Microphone Array Speech Detection and Localization in Diverse Everyday Environments. EUSIPCO 2021: 406-410 - [c152]Slobodan Djukanovic, Yash Patel, Jirí Matas, Tuomas Virtanen:
Neural network-based acoustic vehicle counting. EUSIPCO 2021: 561-565 - [c151]An Tran, Konstantinos Drossos, Tuomas Virtanen:
WaveTransformer: An Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information. EUSIPCO 2021: 576-580 - [c150]Huang Xie, Okko Räsänen, Tuomas Virtanen:
Zero-Shot Audio Classification with Factored Linear and Nonlinear Acoustic-Semantic Projections. ICASSP 2021: 326-330 - [c149]Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra:
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags. ICASSP 2021: 596-600 - [c148]Shanshan Wang, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
A Curated Dataset of Urban Scenes for Audio-Visual Scene Analysis. ICASSP 2021: 626-630 - [c147]Björn W. Schuller, Tuomas Virtanen, Maria Riveiro, Georgios Rizos, Jing Han, Annamaria Mesaros, Konstantinos Drossos:
Towards Sonification in Multimodal and User-friendly Explainable Artificial Intelligence. ICMI 2021: 788-792 - [c146]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers. WASPAA 2021: 211-215 - [i55]Shanshan Wang, Toni Heittola, Annamaria Mesaros, Tuomas Virtanen:
Audio-visual scene classification: analysis of DCASE 2021 Challenge submissions. CoRR abs/2105.13675 (2021) - [i54]Archontis Politis, Sharath Adavanne, Daniel Krause, Antoine Deleforge, Prerak Srivastava, Tuomas Virtanen:
A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection. CoRR abs/2106.06999 (2021) - [i53]Shanshan Wang, Gaurav Naithani, Archontis Politis, Tuomas Virtanen:
Deep neural network Based Low-latency Speech Separation with Asymmetric analysis-Synthesis Window Pair. CoRR abs/2106.11794 (2021) - [i52]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers. CoRR abs/2111.00030 (2021) - 2020
- [j37]Paul Magron, Tuomas Virtanen:
Online Spectrogram Inversion for Low-Latency Audio Source Separation. IEEE Signal Process. Lett. 27: 306-310 (2020) - [j36]Shuyang Zhao, Toni Heittola, Tuomas Virtanen:
Active Learning for Sound Event Detection. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2895-2905 (2020) - [c145]Emre Çakir, Konstantinos Drossos, Tuomas Virtanen:
Multi-Task Regularization Based on Infrequent Classes for Audio Captioning. DCASE 2020: 6-10 - [c144]Toni Heittola, Annamaria Mesaros, Tuomas Virtanen:
Acoustic Scene Classification in DCASE 2020 Challenge: Generalization Across Devices and Low Complexity Solutions. DCASE 2020: 56-60 - [c143]Khoa Nguyen, Konstantinos Drossos, Tuomas Virtanen:
Temporal Sub-Sampling of Audio Feature Sequences for Automated Audio Captioning. DCASE 2020: 110-114 - [c142]Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection. DCASE 2020: 165-169 - [c141]Niccolò Nicodemo, Gaurav Naithani, Konstantinos Drossos, Tuomas Virtanen, Roberto Saletti:
Memory Requirement Reduction of Deep Neural Networks for Field Programmable Gate Arrays Using Low-Bit Quantization of Parameters. EUSIPCO 2020: 466-470 - [c140]Yanxiong Li, Mingle Liu, Konstantinos Drossos, Tuomas Virtanen:
Sound Event Detection Via Dilated Convolutional Recurrent Neural Networks. ICASSP 2020: 286-290 - [c139]Konstantinos Drossos, Samuel Lipping, Tuomas Virtanen:
Clotho: an Audio Captioning Dataset. ICASSP 2020: 736-740 - [c138]Konstantinos Drossos, Stylianos I. Mimilakis, Shayan Gharib, Yanxiong Li, Tuomas Virtanen:
Sound Event Detection with Depthwise Separable and Dilated Convolutions. IJCNN 2020: 1-7 - [c137]Slobodan Djukanovic, Jiri Matas, Tuomas Virtanen:
Robust Audio-Based Vehicle Counting in Low-to-Moderate Traffic Flow. IV 2020: 1608-1614 - [c136]Pyry Pyykkönen, Stylianos I. Mimilakis, Konstantinos Drossos, Tuomas Virtanen:
Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation. MMSP 2020: 1-6 - [i51]Konstantinos Drossos, Stylianos Ioannis Mimilakis, Shayan Gharib, Yanxiong Li, Tuomas Virtanen:
Sound Event Detection with Depthwise Separable and Dilated Convolutions. CoRR abs/2002.00476 (2020) - [i50]Shuyang Zhao, Toni Heittola, Tuomas Virtanen:
Active Learning for Sound Event Detection. CoRR abs/2002.05033 (2020) - [i49]Archontis Politis, Sharath Adavanne, Tuomas Virtanen:
A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection. CoRR abs/2006.01919 (2020) - [i48]Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra:
COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations. CoRR abs/2006.08386 (2020) - [i47]Khoa Nguyen, Konstantinos Drossos, Tuomas Virtanen:
Temporal Sub-sampling of Audio Feature Sequences for Automated Audio Captioning. CoRR abs/2007.02676 (2020) - [i46]Pyry Pyykkönen, Stylianos Ioannis Mimilakis, Konstantinos Drossos, Tuomas Virtanen:
Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation. CoRR abs/2007.02683 (2020) - [i45]Emre Çakir, Konstantinos Drossos, Tuomas Virtanen:
Multi-task Regularization Based on Infrequent Classes for Audio Captioning. CoRR abs/2007.04660 (2020) - [i44]Konstantinos Drossos, Stylianos Ioannis Mimilakis, Tuomas Virtanen:
Conditioned Time-Dilated Convolutions for Sound Event Detection. CoRR abs/2007.05183 (2020) - [i43]Archontis Politis, Annamaria Mesaros, Sharath Adavanne, Toni Heittola, Tuomas Virtanen:
Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019. CoRR abs/2009.02792 (2020) - [i42]An Tran, Konstantinos Drossos, Tuomas Virtanen:
WaveTransformer: A Novel Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information. CoRR abs/2010.11098 (2020) - [i41]Slobodan Djukanovic, Yash Patel, Jiri Matas, Tuomas Virtanen:
Neural Network-based Acoustic Vehicle Counting. CoRR abs/2010.11659 (2020) - [i40]Slobodan Djukanovic, Jiri Matas, Tuomas Virtanen:
Robust Audio-Based Vehicle Counting in Low-to-Moderate Traffic Flow. CoRR abs/2010.11716 (2020) - [i39]Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra:
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags. CoRR abs/2010.14171 (2020)
2010 – 2019
- 2019
- [j35]Víctor M. García-Molla, Pablo San Juan Sebastián, Tuomas Virtanen, Antonio M. Vidal, Pedro Alonso:
Generalization of the K-SVD algorithm for minimization of β-divergence. Digit. Signal Process. 92: 47-53 (2019) - [j34]Sharath Adavanne, Archontis Politis, Joonas Nikunen, Tuomas Virtanen:
Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks. IEEE J. Sel. Top. Signal Process. 13(1): 34-48 (2019) - [j33]Hendrik Purwins, Bo Li, Tuomas Virtanen, Jan Schlüter, Shuo-Yiin Chang, Tara N. Sainath:
Deep Learning for Audio Signal Processing. IEEE J. Sel. Top. Signal Process. 13(2): 206-219 (2019) - [j32]Paul Magron, Tuomas Virtanen:
Complex ISNMF: A Phase-Aware Model for Monaural Audio Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 27(1): 20-31 (2019) - [j31]Annamaria Mesaros, Aleksandr Diment, Benjamin Elizalde, Toni Heittola, Emmanuel Vincent, Bhiksha Raj, Tuomas Virtanen:
Sound Event Detection in the DCASE 2017 Challenge. IEEE ACM Trans. Audio Speech Lang. Process. 27(6): 992-1006 (2019) - [j30]Pablo San Juan Sebastián, Tuomas Virtanen, Víctor M. García-Molla, Antonio M. Vidal:
Analysis of an efficient parallel implementation of active-set Newton algorithm. J. Supercomput. 75(3): 1298-1309 (2019) - [c135]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
A Multi-room Reverberant Dataset for Sound Event Localization and Detection. DCASE 2019: 10-14 - [c134]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network. DCASE 2019: 20-24 - [c133]Konstantinos Drossos, Shayan Gharib, Paul Magron, Tuomas Virtanen:
Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling. DCASE 2019: 59-63 - [c132]Samuel Lipping, Konstantinos Drossos, Tuomas Virtanen:
Crowdsourcing a Dataset of Audio Captions. DCASE 2019: 139-143 - [c131]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
Acoustic Scene Classification in DCASE 2019 Challenge: Closed and Open Set Classification and Data Mismatch Setups. DCASE 2019: 164-168 - [c130]M. N. Istiaq Ahsan, Csaba Kertész, Annamaria Mesaros, Toni Heittola, Andrew Knight, Tuomas Virtanen:
Audio-Based Epileptic Seizure Detection. EUSIPCO 2019: 1-5 - [c129]Shanshan Wang, Gaurav Naithani, Tuomas Virtanen:
Low-latency Deep Clustering for Speech Separation. ICASSP 2019: 76-80 - [c128]Irene Martín-Morató, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen, Maximo Cobos, Francesc J. Ferri:
Sound Event Envelope Estimation in Polyphonic Mixtures. ICASSP 2019: 935-939 - [c127]Aleksandr Diment, Eemi Fagerlund, Adrian Benfield, Tuomas Virtanen:
Detection of Typical Pronunciation Errors in Non-native English Speech Using Convolutional Recurrent Neural Networks. IJCNN 2019: 1-8 - [c126]Helen L. Bear, Toni Heittola, Annamaria Mesaros, Emmanouil Benetos, Tuomas Virtanen:
City Classification from Multiple Real-World Sound Scenes. WASPAA 2019: 11-15 - [c125]Konstantinos Drossos, Paul Magron, Tuomas Virtanen:
Unsupervised Adversarial Domain Adaptation Based on The Wasserstein Distance For Acoustic Scene Classification. WASPAA 2019: 259-263 - [c124]Huang Xie, Tuomas Virtanen:
Zero-Shot Audio Classification Based On Class Label Embeddings. WASPAA 2019: 264-267 - [c123]Marc C. Green, Sharath Adavanne, Damian T. Murphy, Tuomas Virtanen:
Acoustic Scene Classification Using Higher-Order Ambisonic Features. WASPAA 2019: 328-332 - [c122]Annamaria Mesaros, Sharath Adavanne, Archontis Politis, Toni Heittola, Tuomas Virtanen:
Joint Measurement of Localization and Detection of Sound Events. WASPAA 2019: 333-337 - [i38]Shanshan Wang, Gaurav Naithani, Tuomas Virtanen:
Low-Latency Deep Clustering For Speech Separation. CoRR abs/1902.07033 (2019) - [i37]Konstantinos Drossos, Paul Magron, Tuomas Virtanen:
Unsupervised Adversarial Domain Adaptation Based On The Wasserstein Distance For Acoustic Scene Classification. CoRR abs/1904.10678 (2019) - [i36]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network. CoRR abs/1904.12769 (2019) - [i35]Hendrik Purwins, Bo Li, Tuomas Virtanen, Jan Schlüter, Shuo-Yiin Chang, Tara N. Sainath:
Deep Learning for Audio Signal Processing. CoRR abs/1905.00078 (2019) - [i34]Helen L. Bear, Toni Heittola, Annamaria Mesaros, Emmanouil Benetos, Tuomas Virtanen:
City classification from multiple real-world sound scenes. CoRR abs/1905.00979 (2019) - [i33]Huang Xie, Tuomas Virtanen:
Zero-Shot Audio Classification Based on Class Label Embeddings. CoRR abs/1905.01926 (2019) - [i32]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
A multi-room reverberant dataset for sound event localization and detection. CoRR abs/1905.08546 (2019) - [i31]Konstantinos Drossos, Shayan Gharib, Paul Magron, Tuomas Virtanen:
Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling. CoRR abs/1907.08506 (2019) - [i30]Samuel Lipping, Konstantinos Drossos, Tuomas Virtanen:
Crowdsourcing a Dataset of Audio Captions. CoRR abs/1907.09238 (2019) - [i29]Konstantinos Drossos, Samuel Lipping, Tuomas Virtanen:
Clotho: An Audio Captioning Dataset. CoRR abs/1910.09387 (2019) - [i28]Niccolò Nicodemo, Gaurav Naithani, Konstantinos Drossos, Tuomas Virtanen, Roberto Saletti:
Memory Requirement Reduction of Deep Neural Networks Using Low-bit Quantization of Parameters. CoRR abs/1911.00527 (2019) - [i27]Paul Magron, Tuomas Virtanen:
Online Spectrogram Inversion for Low-Latency Audio Source Separation. CoRR abs/1911.03128 (2019) - [i26]Shayan Gharib, Konstantinos Drossos, Eemi Fagerlund, Tuomas Virtanen:
VOICe: A Sound Event Detection Dataset For Generalizable Domain Adaptation. CoRR abs/1911.07098 (2019) - 2018
- [j29]Gaurav Naithani, Jaana Kivinummi, Tuomas Virtanen, Outi Tammela, Mikko J. Peltola, Jukka M. Leppänen:
Automatic segmentation of infant cry signals using hidden Markov models. EURASIP J. Audio Speech Music. Process. 2018: 1 (2018) - [j28]Katariina Mahkonen, Tuomas Virtanen, Joni-Kristian Kämäräinen:
Cascade of Boolean detector combinations. EURASIP J. Image Video Process. 2018: 61 (2018) - [j27]Joonas Nikunen, Aleksandr Diment, Tuomas Virtanen:
Separation of Moving Sound Sources Using Multichannel NMF and Acoustic Tracking. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 281-295 (2018) - [j26]Annamaria Mesaros, Toni Heittola, Emmanouil Benetos, Peter Foster, Mathieu Lagrange, Tuomas Virtanen, Mark D. Plumbley:
Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 379-393 (2018) - [j25]Julio J. Carabias-Orti, Joonas Nikunen, Tuomas Virtanen, Pedro Vera-Candeas:
Multichannel Blind Sound Source Separation Using Spatial Covariance Model With Level and Time Differences and Nonnegative Matrix Factorization. IEEE ACM Trans. Audio Speech Lang. Process. 26(9): 1512-1527 (2018) - [c121]Annamaria Mesaros, Toni Heittola, Tuomas Virtanen:
A multi-device dataset for urban acoustic scene classification. DCASE 2018: 9-13 - [c120]Shayan Gharib, Konstantinos Drossos, Emre Cakir, Dmitriy Serdyuk, Tuomas Virtanen:
Unsupervised adversarial domain adaptation for acoustic scene classification. DCASE 2018: 138-142 - [c119]Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Direction of Arrival Estimation for Multiple Sound Sources Using Convolutional Recurrent Neural Network. EUSIPCO 2018: 1462-1466 - [c118]