default search action
Olivier Delalleau
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c11]Rohan Chitnis, Yingchen Xu, Bobak Hashemi, Lucas Lehnert, Ürün Dogan, Zheqing Zhu, Olivier Delalleau:
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control. ICRA 2024: 9154-9160 - [c10]Zhilin Wang, Yi Dong, Jiaqi Zeng, Virginia Adams, Makesh Narsimhan Sreedhar, Daniel Egert, Olivier Delalleau, Jane Polak Scowcroft, Neel Kant, Aidan Swope, Oleksii Kuchaiev:
HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM. NAACL-HLT 2024: 3371-3384 - [i11]Gerald Shen, Zhilin Wang, Olivier Delalleau, Jiaqi Zeng, Yi Dong, Daniel Egert, Shengyang Sun, Jimmy J. Zhang, Sahil Jain, Ali Taghibakhshi, Markel Sanz Ausin, Ashwath Aithal, Oleksii Kuchaiev:
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment. CoRR abs/2405.01481 (2024) - [i10]Zhilin Wang, Yi Dong, Olivier Delalleau, Jiaqi Zeng, Gerald Shen, Daniel Egert, Jimmy J. Zhang, Makesh Narsimhan Sreedhar, Oleksii Kuchaiev:
HelpSteer2: Open-source dataset for training top-performing reward models. CoRR abs/2406.08673 (2024) - [i9]Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan M. Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek, Robert Hero, Jining Huang, Vibhu Jawa, Joseph Jennings, Aastha Jhunjhunwala, John Kamalu, Sadaf Khan, Oleksii Kuchaiev, Patrick LeGresley, Hui Li, Jiwei Liu, Zihan Liu, Eileen Long, Ameya Sunil Mahabaleshwarkar, Somshubra Majumdar, James Maki, Miguel Martinez, Maer Rodrigues de Melo, Ivan Moshkov, Deepak Narayanan, Sean Narenthiran, Jesus Navarro, Phong Nguyen, Osvald Nitski, Vahid Noroozi, Guruprasad Nutheti, Christopher Parisien, Jupinder Parmar, Mostofa Patwary, Krzysztof Pawelec, Wei Ping, Shrimai Prabhumoye, Rajarshi Roy, Trisha Saar, Vasanth Rao Naik Sabavat, Sanjeev Satheesh, Jane Polak Scowcroft, Jason Sewall, Pavel Shamis, Gerald Shen, Mohammad Shoeybi, Dave Sizer, Misha Smelyanskiy, Felipe Soares, Makesh Narsimhan Sreedhar, Dan Su, Sandeep Subramanian, Shengyang Sun, Shubham Toshniwal, Hao Wang, Zhilin Wang, Jiaxuan You, Jiaqi Zeng, Jimmy Zhang, Jing Zhang, Vivienne Zhang, Yian Zhang, Chen Zhu:
Nemotron-4 340B Technical Report. CoRR abs/2406.11704 (2024) - [i8]Zhilin Wang, Alexander Bukharin, Olivier Delalleau, Daniel Egert, Gerald Shen, Jiaqi Zeng, Oleksii Kuchaiev, Yi Dong:
HelpSteer2-Preference: Complementing Ratings with Preferences. CoRR abs/2410.01257 (2024) - [i7]Michael J. Q. Zhang, Zhilin Wang, Jena D. Hwang, Yi Dong, Olivier Delalleau, Yejin Choi, Eunsol Choi, Xiang Ren, Valentina Pyatkin:
Diverging Preferences: When do Annotators Disagree and do Models Know? CoRR abs/2410.14632 (2024) - 2023
- [i6]Rohan Chitnis, Yingchen Xu, Bobak Hashemi, Lucas Lehnert, Ürün Dogan, Zheqing Zhu, Olivier Delalleau:
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control. CoRR abs/2306.00867 (2023) - [i5]Zhilin Wang, Yi Dong, Jiaqi Zeng, Virginia Adams, Makesh Narsimhan Sreedhar, Daniel Egert, Olivier Delalleau, Jane Polak Scowcroft, Neel Kant, Aidan Swope, Oleksii Kuchaiev:
HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM. CoRR abs/2311.09528 (2023) - 2020
- [i4]Shagun Sodhani, Olivier Delalleau, Mahmoud Assran, Koustuv Sinha, Nicolas Ballas, Michael G. Rabbat:
A Closer Look at Codistillation for Distributed Training. CoRR abs/2010.02838 (2020)
2010 – 2019
- 2019
- [i3]Olivier Delalleau, Maxim Peter, Eloi Alonso, Adrien Logut:
Discrete and Continuous Action Representation for Practical RL in Video Games. CoRR abs/1912.11077 (2019) - 2016
- [i2]Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermüller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul F. Christiano, Tim Cooijmans, Marc-Alexandre Côté, Myriam Côté, Aaron C. Courville, Yann N. Dauphin, Olivier Delalleau, Julien Demouth, Guillaume Desjardins, Sander Dieleman, Laurent Dinh, Melanie Ducoffe, Vincent Dumoulin, Samira Ebrahimi Kahou, Dumitru Erhan, Ziye Fan, Orhan Firat, Mathieu Germain, Xavier Glorot, Ian J. Goodfellow, Matthew Graham, Çaglar Gülçehre, Philippe Hamel, Iban Harlouchet, Jean-Philippe Heng, Balázs Hidasi, Sina Honari, Arjun Jain, Sébastien Jean, Kai Jia, Mikhail Korobov, Vivek Kulkarni, Alex Lamb, Pascal Lamblin, Eric Larsen, César Laurent, Sean Lee, Simon Lefrançois, Simon Lemieux, Nicholas Léonard, Zhouhan Lin, Jesse A. Livezey, Cory Lorenz, Jeremiah Lowin, Qianli Ma, Pierre-Antoine Manzagol, Olivier Mastropietro, Robert McGibbon, Roland Memisevic, Bart van Merriënboer, Vincent Michalski, Mehdi Mirza, Alberto Orlandi, Christopher Joseph Pal, Razvan Pascanu, Mohammad Pezeshki, Colin Raffel, Daniel Renshaw, Matthew Rocklin, Adriana Romero, Markus Roth, Peter Sadowski, John Salvatier, François Savard, Jan Schlüter, John Schulman, Gabriel Schwartz, Iulian Vlad Serban, Dmitriy Serdyuk, Samira Shabanian, Étienne Simon, Sigurd Spieckermann, S. Ramana Subramanyam, Jakub Sygnowski, Jérémie Tanguay, Gijs van Tulder, Joseph P. Turian, Sebastian Urban, Pascal Vincent, Francesco Visin, Harm de Vries, David Warde-Farley, Dustin J. Webb, Matthew Willson, Kelvin Xu, Lijun Xue, Li Yao, Saizheng Zhang, Ying Zhang:
Theano: A Python framework for fast computation of mathematical expressions. CoRR abs/1605.02688 (2016) - 2013
- [c9]Eric Thibodeau-Laufer, Raul Chandias Ferrari, Li Yao, Olivier Delalleau, Yoshua Bengio:
Stacked calibration of off-policy policy evaluation for video game matchmaking. CIG 2013: 1-8 - 2012
- [j6]Yoshua Bengio, Nicolas Chapados, Olivier Delalleau, Hugo Larochelle, Xavier Saint-Mleux, Christian Hudon, Jérôme Louradour:
Detonation Classification from acoustic Signature with the Restricted Boltzmann Machine. Comput. Intell. 28(2): 261-288 (2012) - [j5]Olivier Delalleau, Emile Contal, Eric Thibodeau-Laufer, Raul Chandias Ferrari, Yoshua Bengio, Frank Zhang:
Beyond Skill Rating: Advanced Matchmaking in Ghost Recon Online. IEEE Trans. Comput. Intell. AI Games 4(3): 167-177 (2012) - [i1]Olivier Delalleau, Aaron C. Courville, Yoshua Bengio:
Efficient EM Training of Gaussian Mixtures with Missing Data. CoRR abs/1209.0521 (2012) - 2011
- [c8]Yoshua Bengio, Olivier Delalleau:
On the Expressive Power of Deep Architectures. ALT 2011: 18-36 - [c7]Yoshua Bengio, Olivier Delalleau:
On the Expressive Power of Deep Architectures. Discovery Science 2011: 1 - [c6]Olivier Delalleau, Yoshua Bengio:
Shallow vs. Deep Sum-Product Networks. NIPS 2011: 666-674 - 2010
- [j4]Yoshua Bengio, Olivier Delalleau, Clarence Simard:
Decision trees do not generalize to new variations. Comput. Intell. 26(4): 449-467 (2010) - [c5]Guillaume Desjardins, Aaron C. Courville, Yoshua Bengio, Pascal Vincent, Olivier Delalleau:
Tempered Markov Chain Monte Carlo for training of Restricted Boltzmann Machines. AISTATS 2010: 145-152
2000 – 2009
- 2009
- [j3]Yoshua Bengio, Olivier Delalleau:
Justifying and Generalizing Contrastive Divergence. Neural Comput. 21(6): 1601-1621 (2009) - 2006
- [p3]Yoshua Bengio, Olivier Delalleau, Nicolas Le Roux:
Label Propagation and Quadratic Criterion. Semi-Supervised Learning 2006: 192-216 - [p2]Olivier Delalleau, Yoshua Bengio, Nicolas Le Roux:
Large-Scale Algorithms. Semi-Supervised Learning 2006: 332-341 - [p1]Yoshua Bengio, Olivier Delalleau, Nicolas Le Roux, Jean-François Paiement, Pascal Vincent, Marie Ouimet:
Spectral Dimensionality Reduction. Feature Extraction 2006: 519-550 - 2005
- [c4]Olivier Delalleau, Yoshua Bengio, Nicolas Le Roux:
Efficient Non-Parametric Function Induction in Semi-Supervised Learning. AISTATS 2005: 96-103 - [c3]Yoshua Bengio, Olivier Delalleau, Nicolas Le Roux:
The Curse of Highly Variable Functions for Local Kernel Machines. NIPS 2005: 107-114 - [c2]Yoshua Bengio, Nicolas Le Roux, Pascal Vincent, Olivier Delalleau, Patrice Marcotte:
Convex Neural Networks. NIPS 2005: 123-130 - 2004
- [j2]Pierre-Jean L'Heureux, Julie Carreau, Yoshua Bengio, Olivier Delalleau, Shi Yi Yue:
Locally Linear Embedding for dimensionality reduction in QSAR. J. Comput. Aided Mol. Des. 18(7): 475-482 (2004) - [j1]Yoshua Bengio, Olivier Delalleau, Nicolas Le Roux, Jean-François Paiement, Pascal Vincent, Marie Ouimet:
Learning Eigenfunctions Links Spectral Embedding and Kernel PCA. Neural Comput. 16(10): 2197-2219 (2004) - 2003
- [c1]Yoshua Bengio, Jean-François Paiement, Pascal Vincent, Olivier Delalleau, Nicolas Le Roux, Marie Ouimet:
Out-of-Sample Extensions for LLE, Isomap, MDS, Eigenmaps, and Spectral Clustering. NIPS 2003: 177-184
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-28 21:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint