Edinburgh Research Explorer

ManySStuBs4J Dataset


  • Rafael Karampatsis (Creator)

Publisher: Edinburgh DataShare
Date made available: 4 Apr 2019


The ManySStuBs4J corpus contains simple statement bugs mined from open-source Java projects hosted on GitHub. There are two variations of the dataset: one mined from 100 Java Maven projects and one mined from the top 1000 Java projects.
A project's popularity is determined by computing the sum of z-scores of its forks and watchers.
See "README.txt" for further details.
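
The popularity score described above can be illustrated with a minimal sketch. It is not the dataset author's code: the function names and the sample project figures are hypothetical, and the use of the population standard deviation is an assumption, since this description does not say which variant was used.

```python
from statistics import mean, pstdev


def z_scores(values):
    """Return the z-score of each value relative to the whole list."""
    mu = mean(values)
    sigma = pstdev(values)  # assumption: population standard deviation
    if sigma == 0:
        # All values identical: no project stands out on this metric.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def popularity(projects):
    """Map each project name to the sum of its fork and watcher z-scores.

    `projects` is a list of (name, forks, watchers) tuples.
    """
    fork_z = z_scores([forks for _, forks, _ in projects])
    watcher_z = z_scores([watchers for _, _, watchers in projects])
    return {
        name: fz + wz
        for (name, _, _), fz, wz in zip(projects, fork_z, watcher_z)
    }


def top_n(projects, n):
    """Return the names of the n most popular projects, most popular first."""
    scores = popularity(projects)
    return sorted(scores, key=scores.get, reverse=True)[:n]


if __name__ == "__main__":
    # Hypothetical figures, for illustration only.
    sample = [
        ("project-a", 120, 900),
        ("project-b", 4500, 21000),
        ("project-c", 800, 3100),
    ]
    for name in top_n(sample, 2):
        print(name)
```

Summing z-scores rather than raw counts puts forks and watchers on a common scale, so neither metric dominates the ranking simply because its values are larger.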

Data Citation

Karampatsis, Rafael-Michael. (2019). ManySStuBs4J Dataset, [dataset]. University of Edinburgh. College of Science & Engineering. School of Informatics. Institute for Language, Cognition and Computation (ILCC). https://doi.org/10.7488/ds/2528.

ID: 86963026