In Spring 2010, the Association for Computing Machinery (ACM) Special Interest Group on Knowledge Discovery and Data-mining (KDD) selected an educational technology dataset for its annual competition. The competition, titled “Educational Data Mining Challenge,” tasked participants with predicting the correctness of student answers to questions within an Intelligent Tutoring System (ITS) from The Cognitive Tutors suite. PSLC DataShop hosted this challenge and included data provided by Carnegie Learning Inc., producers of The Cognitive Tutors. Consisting of over 9GB of student data, this was the largest KDD Cup dataset up to that time. The competition brought in 655 competitors submitting 3,400 solutions. Five years later, the competition dataset has been the most often cited from an educational technology platform.
Additional information can be found in the paper
Stamper, J., & Pardos, Z. A. (2016). The 2010 KDD Cup competition dataset: Engaging the machine learning community in predictive learning analytics. Journal of Learning Analytics, 3(2), 295–299. DOI: https://doi.org/10.18608/jla.2016.32.16