Sampler of the Christine Corpus

(200 corpus graphs)

1. General information

Name: Sampler of the Christine Corpus
ID: SWITCHBOARDSampler
Format: extended bracketing format
Author: University of Pennsylvania
Date: 1999
Description: A sampler of 200 sentences from the Switchboard Corpus (Penn Treebank, Version 3). With kind permission of LDC.

2. Corpus details

Features (T): word, pos
Features (NT): cat
Labelled edges: yes
Crossing edges: no
Secondary edges: yes

3. Statistical information

Number of corpus graphs: 200
Number of tokens: 2772
Average number of tokens: 13.9
Number of inner nodes: 2350
Number of edges: 4922

4. Feature documentation

Feature values: pos

Feature values: cat

Edge labels

Secondary edge labels