Learning and Evaluating Contextual Embedding of Source Code

Published in

International Conference on Machine Learning (2019)

TL;DR

This paper curates a massive, deduplicated corpus of 7.4M Python files from GitHub and creates an open-sourced benchmark comprising five classification tasks and one program-repair task, akin to code-understanding tasks proposed in earlier literature. It reports that CuBERT, the paper's contextual embedding model for source code, outperforms all the baseline models it is compared against, even with shorter training and fewer labeled examples.

Authors

Aditya Kanade

4 papers

Petros Maniatis

2 papers

Gogul Balakrishnan

1 paper

Kensen Shi

2 papers
