Home Research Papers Datasets State of the Art Pricing

Discover, visualize, and connect AI research papers. Explore the latest trends and insights in artificial intelligence research.

Product

Home
Research Papers
About

Support

Contact
Terms of Service
Privacy Policy

miscellaneous-9

Auto Debugging

3260 papers • 126 benchmarks • 313 datasets

This task has no description! Would you like to contribute one?

(Image credit: Papersgraph)

Benchmarks

These leaderboards are used to track progress in auto-debugging-9

Trend

Dataset

Best Model

Actions

Big-bench Lite

Libraries

Use these libraries to find auto-debugging-9 models and implementations

Datasets

BIG-bench

Subtasks

No subtasks available.

Most implemented papers

PaLM: Scaling Language Modeling with Pathways

Michele Catasta, J. Dean, Xavier García, Orhan Firat, Noam Shazeer, James Bradbury, Andrew M. Dai, Sharan Narang, Anselm Levskaya, S. Ghemawat, M. Isard, Barret Zoph, Daphne Ippolito, A. Chowdhery, Emily Reif, Adam Roberts, D. Eck, Jacob Devlin, Slav Petrov, Zongwei Zhou, Katherine Lee, Kensen Shi, Pengcheng Yin, Oleksandr Polozov, Ryan Sepassi, H. Michalewski, Jacob Austin, Maarten Bosma, David Dohan, Charles Sutton, Sebastian Gehrmann, Yi Tay, Hyung Won Chung, Denny Zhou, Jason Wei, Ben Hutchinson, Vedant Misra, Xuezhi Wang, R. Child, Gaurav Mishra, L. Fedus, Nan Du, P. Barham, Parker Schuh, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Vinodkumar Prabhakaran, Reiner Pope, Guy Gur-Ari, Toju Duke, Sunipa Dev, Kevin Robinson, D. Luan, Hyeontaek Lim, A. Spiridonov, Shivani Agrawal, Mark Omernick, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Brennan Saeta, Mark Díaz, K. Meier-Hellstern, Noah Fiedel

Content

Introduction Benchmarks Datasets Subtasks Libraries Papers

•

Mon Apr 04 2022

A 540-billion parameter, densely activated, Transformer language model, which is called PaLM achieves breakthrough performance, outperforming the finetuned state-of-the-art on a suite of multi-step reasoning tasks, and outperforming average human performance on the recently released BIG-bench benchmark.

7593 0

Paper Graph

From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging

Yuling Shi, Songsong Wang, Chengcheng Wan, Xiaodong Gu•Tue Oct 01 2024

Multi-Granularity Debugger is introduced, a hierarchical code debugger by isolating, identifying, and resolving bugs at various levels of granularity by isolating, identifying, and resolving bugs at various levels of granularity.

37 0

Paper Graph

Adding a benchmark result helps the community track progress.

Auto Debugging | State-of-the-Art