Zalán Borsos
2020 – today
- 2024
- [j4] Zalán Borsos, Mojmír Mutný, Marco Tagliasacchi, Andreas Krause:
Data Summarization via Bilevel Optimization. J. Mach. Learn. Res. 25: 73:1-73:53 (2024)
- [c11] Geoffrey Cideron, Sertan Girgin, Mauro Verzetti, Damien Vincent, Matej Kastelic, Zalán Borsos, Brian McWilliams, Victor Ungureanu, Olivier Bachem, Olivier Pietquin, Matthieu Geist, Léonard Hussenot, Neil Zeghidour, Andrea Agostinelli:
MusicRL: Aligning Music Generation to Human Preferences. ICML 2024
- [i18] Geoffrey Cideron, Sertan Girgin, Mauro Verzetti, Damien Vincent, Matej Kastelic, Zalán Borsos, Brian McWilliams, Victor Ungureanu, Olivier Bachem, Olivier Pietquin, Matthieu Geist, Léonard Hussenot, Neil Zeghidour, Andrea Agostinelli:
MusicRL: Aligning Music Generation to Human Preferences. CoRR abs/2402.04229 (2024)
- 2023
- [j3] Eugene Kharitonov, Damien Vincent, Zalán Borsos, Raphaël Marinier, Sertan Girgin, Olivier Pietquin, Matt Sharifi, Marco Tagliasacchi, Neil Zeghidour:
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision. Trans. Assoc. Comput. Linguistics 11: 1703-1718 (2023)
- [j2] Zalán Borsos, Raphaël Marinier, Damien Vincent, Eugene Kharitonov, Olivier Pietquin, Matthew Sharifi, Dominik Roblek, Olivier Teboul, David Grangier, Marco Tagliasacchi, Neil Zeghidour:
AudioLM: A Language Modeling Approach to Audio Generation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2523-2533 (2023)
- [c10] Teerapat Jenrungrot, Michael Chinen, W. Bastiaan Kleijn, Jan Skoglund, Zalán Borsos, Neil Zeghidour, Marco Tagliasacchi:
LMCodec: A Low Bitrate Speech Codec with Causal Transformer Models. ICASSP 2023: 1-5
- [c9] Ahmed Omran, Neil Zeghidour, Zalán Borsos, Félix de Chaumont Quitry, Malcolm Slaney, Marco Tagliasacchi:
Disentangling Speech from Surroundings with Neural Embeddings. ICASSP 2023: 1-5
- [c8] Hakan Erdogan, Scott Wisdom, Xuankai Chang, Zalán Borsos, Marco Tagliasacchi, Neil Zeghidour, John R. Hershey:
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition. INTERSPEECH 2023: 3462-3466
- [i17] Andrea Agostinelli, Timo I. Denk, Zalán Borsos, Jesse H. Engel, Mauro Verzetti, Antoine Caillon, Qingqing Huang, Aren Jansen, Adam Roberts, Marco Tagliasacchi, Matthew Sharifi, Neil Zeghidour, Christian Havnø Frank:
MusicLM: Generating Music From Text. CoRR abs/2301.11325 (2023)
- [i16] Eugene Kharitonov, Damien Vincent, Zalán Borsos, Raphaël Marinier, Sertan Girgin, Olivier Pietquin, Matthew Sharifi, Marco Tagliasacchi, Neil Zeghidour:
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision. CoRR abs/2302.03540 (2023)
- [i15] Teerapat Jenrungrot, Michael Chinen, W. Bastiaan Kleijn, Jan Skoglund, Zalán Borsos, Neil Zeghidour, Marco Tagliasacchi:
LMCodec: A Low Bitrate Speech Codec With Causal Transformer Models. CoRR abs/2303.12984 (2023)
- [i14] Zalán Borsos, Matthew Sharifi, Damien Vincent, Eugene Kharitonov, Neil Zeghidour, Marco Tagliasacchi:
SoundStorm: Efficient Parallel Audio Generation. CoRR abs/2305.09636 (2023)
- [i13] Paul K. Rubenstein, Chulayuth Asawaroengchai, Duc Dung Nguyen, Ankur Bapna, Zalán Borsos, Félix de Chaumont Quitry, Peter Chen, Dalia El Badawy, Wei Han, Eugene Kharitonov, Hannah Muckenhirn, Dirk Padfield, James Qin, Danny Rozenberg, Tara N. Sainath, Johan Schalkwyk, Matthew Sharifi, Michelle Tadmor Ramanovich, Marco Tagliasacchi, Alexandru Tudor, Mihajlo Velimirovic, Damien Vincent, Jiahui Yu, Yongqiang Wang, Vicky Zayats, Neil Zeghidour, Yu Zhang, Zhishuai Zhang, Lukas Zilka, Christian Havnø Frank:
AudioPaLM: A Large Language Model That Can Speak and Listen. CoRR abs/2306.12925 (2023)
- [i12] Hakan Erdogan, Scott Wisdom, Xuankai Chang, Zalán Borsos, Marco Tagliasacchi, Neil Zeghidour, John R. Hershey:
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition. CoRR abs/2308.10415 (2023)
- 2022
- [c7] Zalan Borsos, Matthew Sharifi, Marco Tagliasacchi:
SpeechPainter: Text-conditioned Speech Inpainting. INTERSPEECH 2022: 431-435
- [i11] Zalán Borsos, Matthew Sharifi, Marco Tagliasacchi:
SpeechPainter: Text-conditioned Speech Inpainting. CoRR abs/2202.07273 (2022)
- [i10] Ahmed Omran, Neil Zeghidour, Zalán Borsos, Félix de Chaumont Quitry, Malcolm Slaney, Marco Tagliasacchi:
Disentangling speech from surroundings in a neural audio codec. CoRR abs/2203.15578 (2022)
- [i9] Zalán Borsos, Raphaël Marinier, Damien Vincent, Eugene Kharitonov, Olivier Pietquin, Matthew Sharifi, Olivier Teboul, David Grangier, Marco Tagliasacchi, Neil Zeghidour:
AudioLM: a Language Modeling Approach to Audio Generation. CoRR abs/2209.03143 (2022)
- 2021
- [b1] Zalán Borsos:
Data Summarization in Modern Machine Learning. ETH Zurich, Zürich, Switzerland, 2021
- [c6] Zalán Borsos, Yunpeng Li, Beat Gfeller, Marco Tagliasacchi:
Micaugment: One-Shot Microphone Style Transfer. ICASSP 2021: 3400-3404
- [c5] Zalán Borsos, Marco Tagliasacchi, Andreas Krause:
Semi-Supervised Batch Active Learning Via Bilevel Optimization. ICASSP 2021: 3495-3499
- [i8] Zalán Borsos, Mojmír Mutný, Marco Tagliasacchi, Andreas Krause:
Data Summarization via Bilevel Optimization. CoRR abs/2109.12534 (2021)
- 2020
- [c4] Zalán Borsos, Mojmir Mutny, Andreas Krause:
Coresets via Bilevel Optimization for Continual Learning and Streaming. NeurIPS 2020
- [i7] Zalán Borsos, Mojmír Mutný, Andreas Krause:
Coresets via Bilevel Optimization for Continual Learning and Streaming. CoRR abs/2006.03875 (2020)
- [i6] Zalán Borsos, Marco Tagliasacchi, Andreas Krause:
Semi-supervised Batch Active Learning via Bilevel Optimization. CoRR abs/2010.09654 (2020)
- [i5] Zalán Borsos, Yunpeng Li, Beat Gfeller, Marco Tagliasacchi:
MicAugment: One-shot Microphone Style Transfer. CoRR abs/2010.09658 (2020)
2010 – 2019
- 2019
- [c3] Zalán Borsos, Sebastian Curi, Kfir Yehuda Levy, Andreas Krause:
Online Variance Reduction with Mixtures. ICML 2019: 705-714
- [i4] Zalán Borsos, Sebastian Curi, Kfir Y. Levy, Andreas Krause:
Online Variance Reduction with Mixtures. CoRR abs/1903.12416 (2019)
- [i3] Zalán Borsos, Andrey Khorlin, Andrea Gesmundo:
Transfer NAS: Knowledge Transfer between Search Spaces with Transformer Agents. CoRR abs/1906.08102 (2019)
- 2018
- [j1] Zalán Borsos, Camelia Lemnaru, Rodica Potolea:
Dealing with overlap and imbalance: a new metric and approach. Pattern Anal. Appl. 21(2): 381-395 (2018)
- [c2] Zalan Borsos, Andreas Krause, Kfir Y. Levy:
Online Variance Reduction for Stochastic Optimization. COLT 2018: 324-357
- [i2] Zalán Borsos, Andreas Krause, Kfir Y. Levy:
Online Variance Reduction for Stochastic Optimization. CoRR abs/1802.04715 (2018)
- [i1] Bianca-Cristina Cristescu, Zalán Borsos, John Lygeros, María Rodríguez Martínez, Maria Anna Rapsomaniki:
Inference of the three-dimensional chromatin structure and its temporal behavior. CoRR abs/1811.09619 (2018)
- 2013
- [c1] Tamas Györfi, Octavian Cret, Zalan Borsos:
Implementing Modular FFTs in FPGAs - A Basic Block for Lattice-Based Cryptography. DSD 2013: 305-308
last updated on 2024-09-18 01:13 CEST by the dblp team
all metadata released as open data under CC0 1.0 license