default search action
"Functional Benchmarks for Robust Evaluation of Reasoning Performance, and ..."
Saurabh Srivastava et al. (2024)
- Saurabh Srivastava, Annarose M. B, Anto P. V, Shashank Menon, Ajay Sukumar, Adwaith Samod T, Alan Philipose, Stevin Prince, Sooraj Thomas:
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap. CoRR abs/2402.19450 (2024)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.