Data sets

The following data sets are available as Prolog files and can be directly used in kLog using attach/1.

WebKB   Download
Originally developed by M. Craven et al. (1998). The version available here is a direct conversion to Prolog of the data available at the Alchemy website.
Internet Movie Database   Download
Data extracted from this database has been used in a number of relational learning papers. The version available here was downloaded from the IMDb website, converted into SQL using the prodecure described in http://imdbpy.sourceforge.net/docs/README.sqldb.txt and finally a subset of the tuples was converted into a Prolog file.
UW-CSE   Download
The data set originally developed at University of Washington for demonstrating the capabilities of Markov logic networks. The version available here is a direct conversion to Prolog of the data available at theAlchemy website.
Bursi   Download
This data set contains 4,337 molecules labeled according to mutagenicity (2,401 mutagens and 1,936 nonmutagens). Originally developed by Kazius et al (2005) it has been used in a number of machine learning papers, especially those studying graph kernels.
Biodegradability   Download
This is an older data set of chemical structures containing 328 compounds labeled by their half-life for aerobic aqueous biodegradation (a regression task).