Introduced in CodeSearchNet Challenge: Evaluating the State of Semantic Code Search
The CodeSearchNet Corpus is a large dataset of functions with associated documentation written in Go, Java, JavaScript, PHP, Python, and Ruby from open source projects on GitHub. The CodeSearchNet Corpus includes:
Source: https://github.blog/2019-09-26-introducing-the-codesearchnet-challenge/