Home >

Resources

  1. Saffron ACL data: includes top 15 topics for each publication (based on the Saffron score).
    If you want to use this dataset, please cite the following publication:

    Domain-independent term extraction through domain modelling, Georgeta Bordea, Paul Buitelaar and Tamara Polajnar (2013), 10th International Conference on Terminology and Artificial Intelligence (TIA 2013), Paris, France
  2. Sample domain models: contains 3 domain models for Computer Science, Food and Agriculture, and the Biomedical domain.
    If you want to use this dataset, please cite the following publication:

    Domain-independent term extraction through domain modelling, Georgeta Bordea, Paul Buitelaar and Tamara Polajnar (2013), 10th International Conference on Terminology and Artificial Intelligence (TIA 2013), Paris, France
  3. Sample topical hierarchies: includes 3 topical hierarchies automatically constructed for Computational Linguistics, Finance, and Semantic Web.
    If you want to use this dataset, please cite the following publication:

    Domain adaptive extraction of topical hierarchies for Expertise Mining, Georgeta Bordea (2013), PhD Thesis, National University of Ireland, Galway
  4. Expert search evaluation: evaluation dataset for domain-specific expert search based on workshop program committees.
    If you want to use this dataset, please cite the following publication:

    Benchmarking domain-specific expert search using workshop program committees, Georgeta Bordea, Toine Bogers and Paul Buitelaar (2013), CIKM 2013 Workshop on Computational Scientometrics: Theory & Applications, San Francisco, CA, USA