Volodymyr Mnih

Vlad Mnih

Books and Theses

2013
[b1]
  persistent URL:
  - https://dblp.org/rec/phd/ca/Mnih13
Volodymyr Mnih:
Machine Learning for Aerial Image Labeling. University of Toronto, Canada, 2013

Journal Articles

2015
[j3]
  persistent URL:
  - https://dblp.org/rec/journals/nature/MnihKSRVBGRFOPB15
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin A. Riedmiller, Andreas Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, Demis Hassabis:
Human-level control through deep reinforcement learning. Nat. 518(7540): 529-533 (2015)
2013
[j2]
  persistent URL:
  - https://dblp.org/rec/journals/pami/RanzatoMSH13
Marc'Aurelio Ranzato, Volodymyr Mnih, Joshua M. Susskind, Geoffrey E. Hinton:
Modeling Natural Images Using Gated MRFs. IEEE Trans. Pattern Anal. Mach. Intell. 35(9): 2206-2222 (2013)
2006
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/jfr/HeZM06
Xuming He, Richard S. Zemel, Volodymyr Mnih:
Topological map learning from outdoor image sequences. J. Field Robotics 23(11-12): 1091-1104 (2006)

Conference and Workshop Papers

2023
[c28]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LaskinWOPSSSHFB23
Michael Laskin, Luyu Wang, Junhyuk Oh, Emilio Parisotto, Stephen Spencer, Richie Steigerwald, DJ Strouse, Steven Stenberg Hansen, Angelos Filos, Ethan A. Brooks, Maxime Gazeau, Himanshu Sahni, Satinder Singh, Volodymyr Mnih:
In-context Reinforcement Learning with Algorithm Distillation. ICLR 2023
2022
[c27]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/StrouseBWMH22
DJ Strouse, Kate Baumli, David Warde-Farley, Volodymyr Mnih, Steven Stenberg Hansen:
Learning more skills through optimistic exploration. ICLR 2022
[c26]
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuZM022
Hao Liu, Tom Zahavy, Volodymyr Mnih, Satinder Singh:
Palm up: Playing in the Latent Manifold for Unsupervised Pretraining. NeurIPS 2022
2021
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/BaumliWHM21
Kate Baumli, David Warde-Farley, Steven Hansen, Volodymyr Mnih:
Relative Variational Intrinsic Control. AAAI 2021: 6732-6740
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/nips/HansenDBWHOM21
Steven Hansen, Guillaume Desjardins, Kate Baumli, David Warde-Farley, Nicolas Heess, Simon Osindero, Volodymyr Mnih:
Entropic Desired Dynamics for Intrinsic Control. NeurIPS 2021: 11436-11448
2020
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/HansenDBWWM20
Steven Hansen, Will Dabney, André Barreto, David Warde-Farley, Tom Van de Wiele, Volodymyr Mnih:
Fast Task Inference with Variational Intrinsic Successor Features. ICLR 2020
2019
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Warde-FarleyWKI19
David Warde-Farley, Tom Van de Wiele, Tejas D. Kulkarni, Catalin Ionescu, Steven Hansen, Volodymyr Mnih:
Unsupervised Control Through Non-Parametric Discriminative Rewards. ICLR (Poster) 2019
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/nips/KulkarniGIBRZM19
Tejas D. Kulkarni, Ankush Gupta, Catalin Ionescu, Sebastian Borgeaud, Malcolm Reynolds, Andrew Zisserman, Volodymyr Mnih:
Unsupervised Learning of Object Keypoints for Perception and Control. NeurIPS 2019: 10723-10733
2018
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/FortunatoAPMHOG18
Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Matteo Hessel, Ian Osband, Alex Graves, Volodymyr Mnih, Rémi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg:
Noisy Networks For Exploration. ICLR (Poster) 2018
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/icml/EspeholtSMSMWDF18
Lasse Espeholt, Hubert Soyer, Rémi Munos, Karen Simonyan, Volodymyr Mnih, Tom Ward, Yotam Doron, Vlad Firoiu, Tim Harley, Iain Dunning, Shane Legg, Koray Kavukcuoglu:
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures. ICML 2018: 1406-1415
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/icml/ODonoghueOMM18
Brendan O'Donoghue, Ian Osband, Rémi Munos, Volodymyr Mnih:
The Uncertainty Bellman Equation and Exploration. ICML 2018: 3836-3845
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/icml/RiedmillerHLNDW18
Martin A. Riedmiller, Roland Hafner, Thomas Lampe, Michael Neunert, Jonas Degrave, Tom Van de Wiele, Vlad Mnih, Nicolas Heess, Jost Tobias Springenberg:
Learning by Playing Solving Sparse Reward Tasks from Scratch. ICML 2018: 4341-4350
2017
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0001BHMMKF17
Ziyu Wang, Victor Bapst, Nicolas Heess, Volodymyr Mnih, Rémi Munos, Koray Kavukcuoglu, Nando de Freitas:
Sample Efficient Actor-Critic with Experience Replay. ICLR (Poster) 2017
[c15]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/JaderbergMCSLSK17
Max Jaderberg, Volodymyr Mnih, Wojciech Marian Czarnecki, Tom Schaul, Joel Z. Leibo, David Silver, Koray Kavukcuoglu:
Reinforcement Learning with Unsupervised Auxiliary Tasks. ICLR 2017
[c14]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ODonoghueMKM17
Brendan O'Donoghue, Rémi Munos, Koray Kavukcuoglu, Volodymyr Mnih:
Combining policy gradient and Q-learning. ICLR (Poster) 2017
2016
[c13]
  persistent URL:
  - https://dblp.org/rec/conf/icml/MnihBMGLHSK16
Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu:
Asynchronous Methods for Deep Reinforcement Learning. ICML 2016: 1928-1937
[c12]
  persistent URL:
  - https://dblp.org/rec/conf/nips/VezhnevetsMOGVA16
Alexander Vezhnevets, Volodymyr Mnih, Simon Osindero, Alex Graves, Oriol Vinyals, John P. Agapiou, Koray Kavukcuoglu:
Strategic Attentive Writer for Learning Macro-Actions. NIPS 2016: 3486-3494
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/nips/HasseltGHMS16
Hado van Hasselt, Arthur Guez, Matteo Hessel, Volodymyr Mnih, David Silver:
Learning values across many orders of magnitude. NIPS 2016: 4287-4295
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/nips/BaHMLI16
Jimmy Ba, Geoffrey E. Hinton, Volodymyr Mnih, Joel Z. Leibo, Catalin Ionescu:
Using Fast Weights to Attend to the Recent Past. NIPS 2016: 4331-4339
[c9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/RusuCGDKPMKH15
Andrei A. Rusu, Sergio Gomez Colmenarejo, Çaglar Gülçehre, Guillaume Desjardins, James Kirkpatrick, Razvan Pascanu, Volodymyr Mnih, Koray Kavukcuoglu, Raia Hadsell:
Policy Distillation. ICLR (Poster) 2016
2015
[c8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/BaMK14
Jimmy Ba, Volodymyr Mnih, Koray Kavukcuoglu:
Multiple Object Recognition with Visual Attention. ICLR (Poster) 2015
2014
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/nips/MnihHGK14
Volodymyr Mnih, Nicolas Heess, Alex Graves, Koray Kavukcuoglu:
Recurrent Models of Visual Attention. NIPS 2014: 2204-2212
2012
[c6]
  persistent URL:
  - https://dblp.org/rec/conf/icml/MnihH12
Volodymyr Mnih, Geoffrey E. Hinton:
Learning to Label Aerial Images from Noisy Data. ICML 2012
2011
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/RanzatoSMH11
Marc'Aurelio Ranzato, Joshua M. Susskind, Volodymyr Mnih, Geoffrey E. Hinton:
On deep generative models with applications to recognition. CVPR 2011: 2857-2864
[c4]
  persistent URL:
  - https://dblp.org/rec/conf/uai/MnihLH11
Volodymyr Mnih, Hugo Larochelle, Geoffrey E. Hinton:
Conditional Restricted Boltzmann Machines for Structured Output Prediction. UAI 2011: 514-522
2010
[c3]
  persistent URL:
  - https://dblp.org/rec/conf/eccv/MnihH10
Volodymyr Mnih, Geoffrey E. Hinton:
Learning to Detect Roads in High-Resolution Aerial Images. ECCV (6) 2010: 210-223
[c2]
  persistent URL:
  - https://dblp.org/rec/conf/nips/RanzatoMH10
Marc'Aurelio Ranzato, Volodymyr Mnih, Geoffrey E. Hinton:
Generating more realistic images using gated MRF's. NIPS 2010: 2002-2010
2008
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/icml/MnihSA08
Volodymyr Mnih, Csaba Szepesvári, Jean-Yves Audibert:
Empirical Bernstein stopping. ICML 2008: 672-679

Informal and Other Publications

2023
[i25]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-09187
Kate Baumli, Satinder Baveja, Feryal M. P. Behbahani, Harris Chan, Gheorghe Comanici, Sebastian Flennerhag, Maxime Gazeau, Kristian Holsheimer, Dan Horgan, Michael Laskin, Clare Lyle, Hussain Masoom, Kay McKinney, Volodymyr Mnih, Alexander Neitz, Fabio Pardo, Jack Parker-Holder, John Quan, Tim Rocktäschel, Himanshu Sahni, Tom Schaul, Yannick Schroecker, Stephen Spencer, Richie Steigerwald, Luyu Wang, Lei Zhang:
Vision-Language Models as a Source of Rewards. CoRR abs/2312.09187 (2023)
2022
[i24]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-10913
Hao Liu, Tom Zahavy, Volodymyr Mnih, Satinder Singh:
Palm up: Playing in the Latent Manifold for Unsupervised Pretraining. CoRR abs/2210.10913 (2022)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-14215
Michael Laskin, Luyu Wang, Junhyuk Oh, Emilio Parisotto, Stephen Spencer, Richie Steigerwald, DJ Strouse, Steven Hansen, Angelos Filos, Ethan A. Brooks, Maxime Gazeau, Himanshu Sahni, Satinder Singh, Volodymyr Mnih:
In-context Reinforcement Learning with Algorithm Distillation. CoRR abs/2210.14215 (2022)
2021
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-00669
Tom Zahavy, Brendan O'Donoghue, André Barreto, Volodymyr Mnih, Sebastian Flennerhag, Satinder Singh:
Discovering Diverse Nearly Optimal Policies with Successor Features. CoRR abs/2106.00669 (2021)
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-14226
DJ Strouse, Kate Baumli, David Warde-Farley, Vlad Mnih, Steven Hansen:
Learning more skills through optimistic exploration. CoRR abs/2107.14226 (2021)
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-15331
Ishan Durugkar, Steven Hansen, Stephen Spencer, Volodymyr Mnih:
Wasserstein Distance Maximizing Intrinsic Control. CoRR abs/2110.15331 (2021)
2020
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-08116
Tom Van de Wiele, David Warde-Farley, Andriy Mnih, Volodymyr Mnih:
Q-Learning in enormous action spaces via amortized approximate maximization. CoRR abs/2001.08116 (2020)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-07827
Kate Baumli, David Warde-Farley, Steven Hansen, Volodymyr Mnih:
Relative Variational Intrinsic Control. CoRR abs/2012.07827 (2020)
2019
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-05030
Steven Hansen, Will Dabney, André Barreto, Tom Van de Wiele, David Warde-Farley, Volodymyr Mnih:
Fast Task Inference with Variational Intrinsic Successor Features. CoRR abs/1906.05030 (2019)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-11883
Tejas D. Kulkarni, Ankush Gupta, Catalin Ionescu, Sebastian Borgeaud, Malcolm Reynolds, Andrew Zisserman, Volodymyr Mnih:
Unsupervised Learning of Object Keypoints for Perception and Control. CoRR abs/1906.11883 (2019)
2018
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-01561
Lasse Espeholt, Hubert Soyer, Rémi Munos, Karen Simonyan, Volodymyr Mnih, Tom Ward, Yotam Doron, Vlad Firoiu, Tim Harley, Iain Dunning, Shane Legg, Koray Kavukcuoglu:
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures. CoRR abs/1802.01561 (2018)
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-10567
Martin A. Riedmiller, Roland Hafner, Thomas Lampe, Michael Neunert, Jonas Degrave, Tom Van de Wiele, Volodymyr Mnih, Nicolas Heess, Jost Tobias Springenberg:
Learning by Playing - Solving Sparse Reward Tasks from Scratch. CoRR abs/1802.10567 (2018)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-11359
David Warde-Farley, Tom Van de Wiele, Tejas D. Kulkarni, Catalin Ionescu, Steven Hansen, Volodymyr Mnih:
Unsupervised Control Through Non-Parametric Discriminative Rewards. CoRR abs/1811.11359 (2018)
2017
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/FortunatoAPMOGM17
Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Rémi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg:
Noisy Networks for Exploration. CoRR abs/1706.10295 (2017)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1709-05380
Brendan O'Donoghue, Ian Osband, Rémi Munos, Volodymyr Mnih:
The Uncertainty Bellman Equation and Exploration. CoRR abs/1709.05380 (2017)
2016
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/MnihBMGLHSK16
Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu:
Asynchronous Methods for Deep Reinforcement Learning. CoRR abs/1602.01783 (2016)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/VezhnevetsMAOGV16
Alexander Vezhnevets, Volodymyr Mnih, John P. Agapiou, Simon Osindero, Alex Graves, Oriol Vinyals, Koray Kavukcuoglu:
Strategic Attentive Writer for Learning Macro-Actions. CoRR abs/1606.04695 (2016)
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/BaHMLI16
Jimmy Ba, Geoffrey E. Hinton, Volodymyr Mnih, Joel Z. Leibo, Catalin Ionescu:
Using Fast Weights to Attend to the Recent Past. CoRR abs/1610.06258 (2016)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/WangBHMMKF16
Ziyu Wang, Victor Bapst, Nicolas Heess, Volodymyr Mnih, Rémi Munos, Koray Kavukcuoglu, Nando de Freitas:
Sample Efficient Actor-Critic with Experience Replay. CoRR abs/1611.01224 (2016)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/ODonoghueMKM16
Brendan O'Donoghue, Rémi Munos, Koray Kavukcuoglu, Volodymyr Mnih:
PGQ: Combining policy gradient and Q-learning. CoRR abs/1611.01626 (2016)
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/JaderbergMCSLSK16
Max Jaderberg, Volodymyr Mnih, Wojciech Marian Czarnecki, Tom Schaul, Joel Z. Leibo, David Silver, Koray Kavukcuoglu:
Reinforcement Learning with Unsupervised Auxiliary Tasks. CoRR abs/1611.05397 (2016)
2015
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/NairSBAFMPSBPLM15
Arun Nair, Praveen Srinivasan, Sam Blackwell, Cagdas Alcicek, Rory Fearon, Alessandro De Maria, Vedavyas Panneershelvam, Mustafa Suleyman, Charles Beattie, Stig Petersen, Shane Legg, Volodymyr Mnih, Koray Kavukcuoglu, David Silver:
Massively Parallel Methods for Deep Reinforcement Learning. CoRR abs/1507.04296 (2015)
2014
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/MnihHGK14
Volodymyr Mnih, Nicolas Heess, Alex Graves, Koray Kavukcuoglu:
Recurrent Models of Visual Attention. CoRR abs/1406.6247 (2014)
2013
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/MnihKSGAWR13
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin A. Riedmiller:
Playing Atari with Deep Reinforcement Learning. CoRR abs/1312.5602 (2013)
2012
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1202-3748
Volodymyr Mnih, Hugo Larochelle, Geoffrey E. Hinton:
Conditional Restricted Boltzmann Machines for Structured Output Prediction. CoRR abs/1202.3748 (2012)
