Tag: benchmark (10 references)
Can We Trust AI Benchmarks? An Interdisciplinary Review of Current Issues in AI Evaluation
The Leaderboard Illusion
DeepCore: A Comprehensive Library for Coreset Selection in Deep Learning
Comprehensive library and empirical study of coreset selection methods for deep learning. The study finds that random selection remains a strong baseline across many settings.