MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering (2024-10-09T00:00:00.000000Z)