Copyright 2009 - Olena Medelyan

Directory documents contains 20 computer science articles in text format.
The same documents were indexed by 15 teams of graduate and undergraduate computer science students in competitive environment.

Each team's terms are stored in text format, one term per line, in files with
 the extension *.key, in directory teams. Note that the team numbers do not correspond to teams' performance.

When using this data set please cite:

O. Medelyan. 2009. Human-competitive automatic topic indexing. PhD thesis. Department of Computer Science, University of Waikato, New Zealand. 

O. Medelyan, I. H. Witten, D. Milne. 2008. Topic indexing with Wikipedia. In Proc. of Wikipedia and AI workshop at the AAAI-2008 Conference. Chicago, US. 
