Please cite this paper when using any of the material on this page.
- Tokenized |   (.tok extension) |
- Tokenized & Stemmed |   (.stm extension) |
- Tokenized & Stemmed + Bigrams |   (.bgm extension) |
- spec_seeds / nspec_seeds |    Seed data for spec and nspec classes |
- spec_test / nspec_test |    Test data for spec and nspec classes |
- pool |    Unlabeled pool |
- spec_train / nspec_train |    Training data sets automatically induced using the probabilistic acquisition model of Medlock and Briscoe (2007) |
- Tokenized: tok.tar.gz |
- Tokenized & Stemmed: stm.tar.gz |
- Tokenized & Stemmed + Bigrams: bgm.tar.gz |