[Corpora-List] Re: Looking for super large Russian corpus

From: Sergey Protasov (svp@zuzino.net.ru)
Date: Thu Nov 04 2004 - 07:03:22 MET

  • Next message: Francesca Bianchi: "[Corpora-List] on-line bibliographical resources"

    Dear, All!

    Thank you very much for your links!

    I found exactly that I looking for...

    Special thanks to
    Philip Resnik (USA) for 2Gbyte url list of Russian sites
    http://umiacs.umd.edu/~resnik/strand/,
    Jonathan Young (USA) for good collection at http://www.wordtheque.com,
    Roman Yangarber (USA) for link to more than 3Gbyte Moshkov library at
    http://lib.ru,
    Viktor Zaharov (Russia) for link to the special search engine in Moshkov
    library http://www.aot.ru/search1.html,
    Stefan Bordag for the idea to buy some special CDs in Russia (This
    method works!),

    Vladimir Rykov (Russia, I am postgraduate of) for posting my question to
    the Corpora List.

    Now I am trying to extract sentences from html/plain texts.

    I need a big text splited by sentences to train my Russian Link Grammar
    http://sz.ru/parser/,

    I will tell you about my results.. Later.

    --
    Sergey Protasov
    PhD student in Computational Linguistics,
    Moscow Institute of Physics and Technology
    



    This archive was generated by hypermail 2b29 : Thu Nov 04 2004 - 08:19:59 MET