3260 papers • 126 benchmarks • 313 datasets
This task has no description! Would you like to contribute one?
(Image credit: Papersgraph)
These leaderboards are used to track progress in auto-debugging-9
Use these libraries to find auto-debugging-9 models and implementations
No subtasks available.
A 540-billion parameter, densely activated, Transformer language model, which is called PaLM achieves breakthrough performance, outperforming the finetuned state-of-the-art on a suite of multi-step reasoning tasks, and outperforming average human performance on the recently released BIG-bench benchmark.
Multi-Granularity Debugger is introduced, a hierarchical code debugger by isolating, identifying, and resolving bugs at various levels of granularity by isolating, identifying, and resolving bugs at various levels of granularity.
Adding a benchmark result helps the community track progress.