EBeM@IJCAI 2022: Vienna, Austria
- José Hernández-Orallo, Lucy Cheke, Joshua B. Tenenbaum, Tomer D. Ullman, Fernando Martínez-Plumed, Danaja Rutar, John Burden, Ryan Burnell, Wout Schellaert:
Proceedings of the Workshop on AI Evaluation Beyond Metrics co-located with the 31st International Joint Conference on Artificial Intelligence (IJCAI-ECAI 2022), Vienna, Austria, July 25th, 2022. CEUR Workshop Proceedings 3169, CEUR-WS.org 2022
- Jesse Davis, Lotte Bransen, Laurens Devos, Wannes Meert, Pieter Robberechts, Jan Van Haaren, Maaike Van Roy:
Evaluating Sports Analytics Models: Challenges, Approaches, and Lessons Learned.
- Konstantinos Voudouris, Niall Donnelly, Danaja Rutar, Ryan Burnell, John Burden, José Hernández-Orallo, Lucy Cheke:
Evaluating Object Permanence in Embodied Agents using the Animal-AI Environment.
- Anthony G. Cohn, José Hernández-Orallo, Julius Sechang Mboli, Yael Moros-Daval, Zhiliang Xiang, Lexin Zhou:
A Framework for Categorising AI Evaluation Instruments.
- Lexin Zhou, Fernando Martínez-Plumed, José Hernández-Orallo, Cèsar Ferri, Wout Schellaert:
Reject Before You Run: Small Assessors Anticipate Big Language Models.
- Ricardo Baeza-Yates, Marina Estévez-Almenzar:
The Relevance of Non-Human Errors in Machine Learning.
- Raül Fabra-Boluda, Cèsar Ferri, Fernando Martínez-Plumed, María José Ramírez-Quintana:
Robustness Testing of Machine Learning Families using Instance-Level IRT-Difficulty.
- Chaina Oliveira, Ricardo B. C. Prudêncio:
Item Response Theory to Evaluate Speech Synthesis: Beyond Synthetic Speech Difficulty.
- Victor Vikram Odouard, Melanie Mitchell:
Evaluating Understanding on Conceptual Abstraction Benchmarks.
- Vicky Charisi, Natalia Díaz Rodríguez, Barbara Mawhin, Luis Merino:
On Young Children's Exploration, Aha! Moments and Explanations in Model Building for Self-Regulated Problem-Solving.
- Yeu-Shin Fu, Wenbo Ge, Jo Plested:
FERM: A FEature-space Representation Measure for Improved Model Evaluation.