This CD-ROM package is a direct product of the EU-funded TELRI (Trans-European Language Resources Infrastructure) Concerted Action. TELRI, currently in its third successful year, brings together researchers from over twenty sites in Central and Eastern Europe and the Newly Independent States and provides opportunities for the production and exchange of standardized corpora, tools and resources.
This double CD-ROM contains extensive corpora, both spoken and written, in more than 21 languages of Western, Central and Eastern Europe, for instance Lithuanian, Polish, Hungarian, and Slovene. The corpora are available in plain text and SGML encoding, and have been successfully aligned. Also available are various tools including a corpus query language, concordancer, alignment tools, software, POS taggers, lexica in 6 languages and samples of research work involving the data.
The CD-ROM package is usable on UNIX, PC and Apple Macintosh platforms.
The CD-ROM is available in a limited edition for academic purposes from mid-December 1997.
PLEASE ORDER YOUR CD-ROM (ECU 25,- incl. p&p) BY SENDING US AN E-MAIL. THANK YOU!
Contents - CD 1
Electronic resources for 17 languages:
Ancient Greek, Bulgarian, Czech, English, French, German, Hungarian, Latvian, Lithuanian, Polish, Romanian, Russian, Serbian, Slovene, Slovak, Swedish, plus Chinese
Contents - CD 2
Electronic resources for 6 languages:
Bulgarian, Czech, Estonian, Hungarian, Romanian, Slovene