default search action
Victoria Krakovna
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [i13]Evan Ryan Gunter, Yevgeny Liokumovich, Victoria Krakovna:
Quantifying stability of non-power-seeking in artificial agents. CoRR abs/2401.03529 (2024) - [i12]Raymond Douglas, Jacek Karwowski, Chan Bae, Andis Draguns, Victoria Krakovna:
Limitations of Agents Simulated by Predictive Models. CoRR abs/2402.05829 (2024) - [i11]Mary Phuong, Matthew Aitchison, Elliot Catt, Sarah Cogan, Alexandre Kaskasoli, Victoria Krakovna, David Lindner, Matthew Rahtz, Yannis Assael, Sarah Hodkinson, Heidi Howard, Tom Lieberum, Ramana Kumar, Maria Abi Raad, Albert Webson, Lewis Ho, Sharon Lin, Sebastian Farquhar, Marcus Hutter, Grégoire Delétang, Anian Ruoss, Seliem El-Sayed, Sasha Brown, Anca D. Dragan, Rohin Shah, Allan Dafoe, Toby Shevlane:
Evaluating Frontier Models for Dangerous Capabilities. CoRR abs/2403.13793 (2024) - [i10]Iason Gabriel, Arianna Manzini, Geoff Keeling, Lisa Anne Hendricks, Verena Rieser, Hasan Iqbal, Nenad Tomasev, Ira Ktena, Zachary Kenton, Mikel Rodriguez, Seliem El-Sayed, Sasha Brown, Canfer Akbulut, Andrew Trask, Edward Hughes, A. Stevie Bergman, Renee Shelby, Nahema Marchal, Conor Griffin, Juan Mateos-Garcia, Laura Weidinger, Winnie Street, Benjamin Lange, Alex Ingerman, Alison Lentz, Reed Enger, Andrew Barakat, Victoria Krakovna, John Oliver Siy, Zeb Kurth-Nelson, Amanda McCroskery, Vijay Bolina, Harry Law, Murray Shanahan, Lize Alberts, Borja Balle, Sarah de Haas, Yetunde Ibitoye, Allan Dafoe, Beth Goldberg, Sébastien Krier, Alexander Reese, Sims Witherspoon, Will Hawkins, Maribeth Rauh, Don Wallace, Matija Franklin, Josh A. Goldstein, Joel Lehman, Michael Klenk, Shannon Vallor, Courtney Biles, Meredith Ringel Morris, Helen King, Blaise Agüera y Arcas, William Isaac, James Manyika:
The Ethics of Advanced AI Assistants. CoRR abs/2404.16244 (2024) - 2023
- [i9]Victoria Krakovna, János Kramár:
Power-seeking can be probable and predictive for trained agents. CoRR abs/2304.06528 (2023) - 2022
- [i8]Rohin Shah, Vikrant Varma, Ramana Kumar, Mary Phuong, Victoria Krakovna, Jonathan Uesato, Zac Kenton:
Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals. CoRR abs/2210.01790 (2022) - 2021
- [j1]Tom Everitt, Marcus Hutter, Ramana Kumar, Victoria Krakovna:
Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective. Synth. 198(27): 6435-6467 (2021) - 2020
- [c6]Victoria Krakovna, Laurent Orseau, Richard Ngo, Miljan Martic, Shane Legg:
Avoiding Side Effects By Considering Future Tasks. NeurIPS 2020 - [i7]Victoria Krakovna, Laurent Orseau, Richard Ngo, Miljan Martic, Shane Legg:
Avoiding Side Effects By Considering Future Tasks. CoRR abs/2010.07877 (2020) - [i6]Ramana Kumar, Jonathan Uesato, Richard Ngo, Tom Everitt, Victoria Krakovna, Shane Legg:
REALab: An Embedded Perspective on Tampering. CoRR abs/2011.08820 (2020) - [i5]Jonathan Uesato, Ramana Kumar, Victoria Krakovna, Tom Everitt, Richard Ngo, Shane Legg:
Avoiding Tampering Incentives in Deep RL via Decoupled Approval. CoRR abs/2011.08827 (2020)
2010 – 2019
- 2019
- [c5]Tom Everitt, Ramana Kumar, Victoria Krakovna, Shane Legg:
Modeling AGI Safety Frameworks with Causal Influence Diagrams. AISafety@IJCAI 2019 - [c4]Victoria Krakovna, Laurent Orseau, Miljan Martic, Shane Legg:
Penalizing Side Effects using Stepwise Relative Reachability. AISafety@IJCAI 2019 - [i4]Tom Everitt, Ramana Kumar, Victoria Krakovna, Shane Legg:
Modeling AGI Safety Frameworks with Causal Influence Diagrams. CoRR abs/1906.08663 (2019) - 2018
- [i3]Victoria Krakovna, Laurent Orseau, Miljan Martic, Shane Legg:
Measuring and avoiding side effects using relative reachability. CoRR abs/1806.01186 (2018) - 2017
- [c3]Tom Everitt, Victoria Krakovna, Laurent Orseau, Shane Legg:
Reinforcement Learning with a Corrupted Reward Channel. IJCAI 2017: 4705-4713 - [i2]Tom Everitt, Victoria Krakovna, Laurent Orseau, Marcus Hutter, Shane Legg:
Reinforcement Learning with a Corrupted Reward Channel. CoRR abs/1705.08417 (2017) - [i1]Jan Leike, Miljan Martic, Victoria Krakovna, Pedro A. Ortega, Tom Everitt, Andrew Lefrancq, Laurent Orseau, Shane Legg:
AI Safety Gridworlds. CoRR abs/1711.09883 (2017) - 2016
- [c2]Cory Shain, William Bryce, Lifeng Jin, Victoria Krakovna, Finale Doshi-Velez, Timothy A. Miller, William Schuler, Lane Schwartz:
Memory-Bounded Left-Corner Unsupervised Grammar Induction on Child-Directed Input. COLING 2016: 964-975 - 2010
- [c1]Matthew Skala, Victoria Krakovna, János Kramár, Gerald Penn:
A Generalized-Zero-Preserving Method for Compact Encoding of Concept Lattices. ACL 2010: 1512-1521
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-06-06 23:03 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint