This page keeps a list of the most requested datasets and code.

Datasets

  • N24News: A News Dataset for Multimodal Classification.
  • LearningQ: A Large-scale Dataset for Educational Question Generation.
  • Foursquare: A Global-scale Check-in Dataset with User Social Networks.

Code

  • MARTA: Explainable Text Classification Integrating Human Rationales.
  • OpenCrowd: A Human-AI Collaborative Approach for Finding Open-Ended Answers.
  • daisyRec and daisyRec-v2.0: A Python toolkit for benchmarking top-N recommendation.