
A Corpus of Formal British English Speech
The Lancaster/IBM Spoken English Corpus
Book Description
This work provides 50,000 words of prosodically transcribed text from a variety of sources. The introduction fully explains the transcription conventions, the structure of the corpus, and its relationship to other computer corpora, and provides examples of different versions of texts.
Table of Contents
Introduction.
Prosodic characters.
The composition of the corpus.
Breakdown into categories. Speakers.
Dates of composition and recording.
The duration of text extracts.
SEC text details.
Versions of SEC material.
Spoken recording.
Unpunctuated transcriptions.
Orthographic transcriptions.
Samples of different versions.
Unpunctuated transcription.
Orthographic transcription.
Grammatically tagged versions.
Texts.
Appendix 1: The CLAWS1 tagset.
Appendix 2: Complete version of Through the Tunnel.
References and bibliography.