From 97612ae589cc877fd7a44376115623dd82d039d8 Mon Sep 17 00:00:00 2001
From: Andrew Head
-This dataset includes about 14'000 Java files from GitHub, split into training and test set.
+This dataset includes about 14'000 Java projects from GitHub, split into training and test set.
The files are from open source projects that have been forked at least once.
[download dataset]