Hi Vladimir,
You can find a good introduction to lexical acquisition methods based on
co-occurrence statistics in Manning and Schuetze's "Foundations of
Statistical Natural Language Processing". You can find an overview of work
on semantic clustering of words based on their spelling in M.Oakes'
"Statistics for Corpus Linguistics".
Best wishes,
Viktor.
----- Original Message -----
From: "P bI K O B___ B.B. (MOCKBA)" <rykov@narod.ru>
To: <corpora@uib.no>
Sent: Tuesday, November 09, 2004 7:50 AM
Subject: [Corpora-List] corpus ------>>>>> thesaurus
>
> I would be very grateful to anyone for any info concerning compiling
thesaurus from corpus (esp. from corpus of specific domain documents).
>
> As example - thesaurus of financial terms compiled from financial
documents corpus.
>
> Best wishes to all our corpus society !
>
> --
> Regards Vladimir Rykov
>
> PhD in Computational Linguistics
> Personal web-site: rykov.narod.ru
> mailto: rykov2000@mail.ru
> Si etiam omnes - ego non
> English version: www.blkbox.com/~gigawatt/rykov.html
>
> --
> Яндекс.Игрушки - яркий перерыв в серых трудовых буднях.
http://play.yandex.ru/
>
>
This archive was generated by hypermail 2b29 : Tue Nov 09 2004 - 12:25:19 MET