SLABank ESF Corpus

The ESF (European Science Foundation Second Language) Database is a computerized archive of data collected by research groups of the ESF project in five European countries: France, Germany, Great Britain, The Netherlands and Sweden. The project concentrates on the spontaneous second language acquisition of forty adult immigrant workers living in Western Europe, and their communication with native speakers in the respective host countries. The target languages are Dutch, English, German and Swedish. The French data are not included, because they could not be brought into CHAT format. For each target language, two source languages were selected. The corpora are:

The Dutch and English transcripts have accompanying audio. The German and Swedish L2 transcripts do not. A filename like lsfbe24a.1.cha indicates:

Publications that use this corpus should cite:
Perdue, C. (ed.) (1993). Adult Language Acquisition. Vol 1: Field Methods. Cambridge University Press

In the Corpus Column there are three links.

You can also browse the entire SLABank database online from this link.

Corpus Description Contributor
Dutch L2 / Arabic L1 Wolfgang Klein and Clive Perdue
Dutch L2 / Turkish L1 Wolfgang Klein and Clive Perdue
English L2 / Italian L1 Wolfgang Klein and Clive Perdue
English L2 / Punjabi L1 Wolfgang Klein and Clive Perdue
German L2 / Italian L1 Wolfgang Klein and Clive Perdue
German L2 / Turkish L1 Wolfgang Klein and Clive Perdue
Swedish L2 / Finnish L1 Wolfgang Klein and Clive Perdue
Swedish L2 / Spanish L1 Wolfgang Klein and Clive Perdue