The IPCH(H)G, like other corpora in the Penn family, is queried using CorpusSearch 2.
You may use the corpus in two ways. You can download the parsed files from our GitHub repo and run CorpusSearch 2 on your own machine. This will allow the full range of functions that CorpusSearch 2 offers, such as creating your own definitions, revising the corpus, etc.
For simple searches, you may use our search form. (Note that this queries older versions of some texts, which may contain errors or outdated annotations.) Instructions for the search form:
- Select files: Select files to be queried, either en masse under the drop-down menu, or by ctrl-clicking individual text namess.
- Query: Type your query here. An example CorpusSearch query is
(IP-SUB* idoms BEDS*) AND (IP-SUB* idoms VBN*)
, which will search for subordinate clauses that contain the past subj. of the verb sein and a participle. Note that CorpusSearch 2 assumes that the repetition of an argument refers to the same argument; in plain language, the query above means 'subordinate IP immediately dominates be.pres.subj and the same subordinate IP also immediately dominates the past participle of a lexical verb'. - Node: You must specify the node; $ROOT will search the entire tree, or you can search within smaller constituents (see the CorpusSearch documentation for the consequences of searching within different nodes).
- Options: This field is optional. Options are described in the CorpusSearch documentation.
- Click submit, wait up to 60 seconds for the 'show results' and 'download results' buttons to become available, and then click to either display your results in the browser or download them as text.
- Note: search results can be easily color coded by adding indices like {1} before arguments, and pre- and appending color tags: