• Skip to Content
  • Skip to Main Navigation
  • Skip to Search

Indiana University Bloomington Indiana University Bloomington IU Bloomington

Open Search
  • Corpus
    • Structure of corpus
    • Version numbers
    • Source corpora and licenses
  • Annotation guidlines
    • Sentence tokenization
    • Splitting & joining words; lemmatization
    • Tagset
    • Phrase types
    • Annotation of IP and other clause types
    • Annotation of CP types
    • Coordination
    • Comparison
    • Treatment of certain words and phrases
  • Query

Indiana Parsed Corpus of Historical High German

  • Home
  • Corpus
    • Structure of corpus
    • Version numbers
    • Source corpora and licenses
  • Annotation guidlines
    • Sentence tokenization
    • Splitting & joining words; lemmatization
    • Tagset
    • Phrase types
    • Annotation of IP and other clause types
    • Annotation of CP types
    • Coordination
    • Comparison
    • Treatment of certain words and phrases
  • Query
  • Search

Indiana Parsed Corpus of Historical (High) German

Project Description

The IPCHG is currently under development. Ultimately, it will be a syntactically parsed corpus of approx. 200 High German texts from the 11th through 20th centuries. As we complete annotations of the texts, you will find them on this site.

Note that the site is currently under construction. If the bullet points below are not hyperlinked, the content has not been created yet...

Corpus information

  • Corpus overview
  • Structure (time periods, dialects, and genres)
  • Titles of texts

Annotation Guidelines

  • General principles and contents
  • Division into sentence tokens
  • Splitting and joining words, and lemmatization
  • Tagset: parts of speech (heads), phrases, extended tags, and empty categories
  • Structure of phrases
  • Structure of clauses: IPs and CPs
  • Difficult structures: coordination and comparisons
  • Treatment of individual words and idiomatic phrases
  • Issues to be resolved

Research Team

Christopher D. Sapp, Ph.D., primary investigator

Rex A. Sprouse, Ph.D., primary investigator

Elliott Evans, Ph.D., postdoctoral researcher

Danny Dakota, Ph.D., computational consultant

David Bolter, Ph.D., graduate assistant

Mary Gilbert, M.A., graduate assistant

Tyler Kniess, M.A. graduate assistant

Links

Related parsed historical corpora
  • The Penn Parsed Historical Corpora of English (PPHCE)
  • The Corpus of Historical Low German (CHLG)
  • The Heliand Parsed Database (HeliPaD)
  • The Icelandic Parsed Historical Corpus (IcePaHC)
German language resources
  • Searchable historical dictionaries of German: Wörterbuchnetz
  • A family of historical corpora of German: Deutsch Diachron Digital

Publications

to appear ...

Acknowledgments

This project is possible thanks to a Faculty Research Support Funding Seed grant from the IU OVPR, supported by the Department of Germanic Studies and the Department of Second Language Studies.

Indiana University

Accessibility | Privacy Notice | Copyright © 2023 The Trustees of Indiana University