Disambiguation corpus
=====================

This corpus was taken by Francis Tyers from the repository of the Apertium project:
https://svn.code.sf.net/p/apertium/svn/

The original purpose of the corpus was to develop a new module for lexical disambiguation.

More information about Apertium:
www.apertium.org

Apertium is licensed under GPL v.2 (can be relicensed under later licences).

The corpus is part of the Europarl corpus, made available by the EU for free use and assembled by Philipp Koehn (Europarl: A Parallel Corpus for Statistical Machine Translation, Philipp Koehn, MT Summit 2005):
http://www.statmt.org/europarl/

The corpus consists of two files for each language: one for development (e.g. europarl.dev.en) and one for testing (e.g. europarl.tst.en).

More languages can be found at:
https://svn.code.sf.net/p/apertium/svn/trunk/apertium-lex-tools/corpora

Per Tunedal