Last modified 30.04.1997
Mannheimer Korpus I
The corpus contains 293 texts of various sources: fiction (all genres),
poetry, scientific literature (academic and popular), newspapers and magazins
in the period from 1950-1967.
Conditions on demand
Provider: Institut für deutsche Sprache
Mannheimer Korpus II
The corpus contains 48 texts of various sources: non-fiction (operating
instructions, resolutions, textbooks, leaflets), light fiction, scientific
literature (academic and popular), newspapers, magazins of the period from
1949-1973.
Conditions on demand
Provider: Institut für deutsche Sprache
Mannheimer Morgen Today (MMT)
This corpus contains articles from the daily Mannheim newspaper "Mannheimer
Morgen". Every night the newspaper articels which are to be published
in the next issues are transfered via modem line to the IDS pool and are
automatically processed (conserving bibliographic information and the like)
and added to the corpus. The date given in the reference of an article
is the anticipated publication date. Therefore, it is either today's date
or a future date; however, a few articles remain unpublished.
Conditions on demand
Provider: Institut für deutsche Sprache
Mannheimer Morgen Monitor Corpus (MMM)
It is a cumulative corpus, i.e., it contains all Mannheimer Morgen articles
for the current year and is expanding daily through the addition of the
MMT.
Conditions on demand.
Provider: Institut für deutsche Sprache
Bonner Zeitungskorpus
The corpus is composed of 10 840 newspaper texts from the time in which
Eastern Germany was the GDR and Western Germany was the FRG. The corpus
contains a sample of texts from a big newspaper in Western Germany ("Die
Welt") and a big newspaper from Eastern Germany ("Neues Deutschland")
covering the years 1949, 1954, 1959, 1964, 1969 and 1974.
Conditions on demand
Provider: Institut für deutsche Sprache
Grammatik-Korpus
The corpus contains fiction and trivial literature from the years 1971,
1977 and 1990.
Conditions on demand
Provider: Institut für deutsche Sprache
Handbuchkorpus
The corpus contains 17330 texts from German magazins and newspapers, covering
the years 1985-1988.
Conditions on demand
Provider: Institut für deutsche Sprache
Wendekorpus
The corpus contains different sorts of text from the former GDR and the
former FRG. It is composed of 3387 texts from the years 1989-1990 such
as leaflets, pamphlets, protocols of the East German "Volkskammer"
and the West German "Bundestag", speeches, proclamations and
programmes which document the changes that led to the political turning-point
in East Germany and in 1990 to the re-unification of Germany.
Conditions on demand
Provider: Institut für deutsche Sprache
LIMAS Korpus
The corpus contains 500 excerpts from various kinds of texts, each with
a length of 2000 words. Based on the "Nationalbiographie" of
1971/72 this corpus is composed of textual excerpts from 33 different topics.
Conditions on demand
Provider: Institut für deutsche Sprache
Goethekorpus
This corpus contains works of Johann Wolfgang von Goethe. It is composed
of 1411 texts, reaching from Goethe`s novels like "Die Wahlverwandtschaften"
to theoretical and autobiographical texts.
Conditions on demand
Provider: Institut für deutsche Sprache
Marx-Engels-Schriften
The corpus contains at present 535 texts of Karl Marx and Friedrich Engels.
The coprus is not completed at present.
Conditions on demand
Provider: Institut für deutsche Sprache
Freiburger Korpus
The corpus contains 224 spoken texts from 1968 to 1974. It is composed
of discussions, interviews, speeches, reports and narratives.
Conditions on demand
Provider: Institut für deutsche Sprache
Dialogstrukturenkorpus
The corpus contains 72 texts from the years 1966-1977. It is composed of
interviews and consultations.
Conditions on demand
Provider: Institut für deutsche Sprache
PFEFFER-Korpus
The corpus contains supra-regional, colloquial language from West Germany
(FRG), East Germany (GDR), Austria and Switzerland since the beginning
of the 60s.
The corpus is based on the texts of the Stanford University and published
in book form.
Conditions on demand
Provider: Institut für deutsche Sprache
For the retrieval of corpus information, the IDS provides its own online
system:
COSMAS [http://www.ids-mannheim.de/kt/service-eng.html]
(Corpus Storage, Maintenance and Access System), developed jointly with
Makrolog GmbH. In addition to its standard retrieval facilities, COSMAS
allows its users to design their own research corpus by selecting any number
of texts from the corpora available within the system, according to individual
specifications and demands. Click here to test the COSMAS WWW demo
version.
Conditions on demand
Provider: Institut für deutsche Sprache
COSMAS-III is an enhanced version of the COSMAS system for the storage and maintenance of as well as the access to (annotated) language corpora. It has been developed within the MECOLB project and will be further improved and expanded, including statistical tools,speech recognition for computer supported transliteration and transcription of speech.
For further information on COSMAS and the IDS Corpora, please contact Dr. Irmtraud Jüttner, <juettner@ids-mannheim.de>
Reciprocal Parallel Corpus for German and French
Together with the Laboratoire de Linguistique Informatique of the University
of Paris XIII the IDS is compiling a corpus of appr. 20 million words in
each of the two languages with their relevant translations.
Available: End of 1997 (envisaged).
Conditions on demand
Provider: Institut für deutsche Sprache / LLI