I should just publicly apologize to Lou since he only mentions European languages and not _major_ European languages in his posting.
And should also mention the invaluable help for the Brazilian data, of NILC (Núcleo Interinstitucional de Lingüística Computacional), who compiled the original corpus (NILC/São Carlos corpus) of which a part was then rebuilt by Linguateca as CETENFolha.
Diana
This archive was generated by hypermail 2b29 : Sun Jan 23 2005 - 19:21:39 MET