Re: Child language corpora

E S Atwell (eric@scs.leeds.ac.uk)
Wed, 14 Feb 1996 17:47:55 GMT

Actually the Polytechnic of Wales Corpus is available direct from ICAME,
see http://www.hd.uib.no/corpora.html

Clive Souter wrote a manual to accompany the corpus, his email has changed
from the one cited by Marie Helt to: cs@scs.leeds.ac.uk
(note that UK internet users no longer drive on the left, ie back-to-front!)

The ICAME distributed version is briefly described on WWW:
Orthographic transcriptions of some 61,000 words of child language data.
The corpus is parsed according to Hallidayan systemic-functional grammar.
There is no prosodic information.

Clive Souter also has the original sound recordings (acoustic and digital
cassettes) and I understand would be interested to hear if anyone
has a practical use for these!

I hope this clears up this misunderstanding...

____________________________________________________________

Eric Steven Atwell,
Centre for Computer Analysis of Language And Speech (CCALAS)
Artificial Intelligence Division, School of Computer Studies
The University of Leeds, LEEDS LS2 9JT, Yorkshire, England
TEL:0113-2335761 FAX:0113-2335468 EMAIL:eric@scs.leeds.ac.uk
WWW: http://agora.leeds.ac.uk/nti-kbs/ccalas.html

FROM SEPTEMBER 1996:
Research Professor, School of Computing and Information Systems,
University of SUNDERLAND, Sunderland, Tyne and Wear, England
WWW: http://osiris.sunderland.ac.uk/
____________________________________________________________