RE: [Corpora-List] Chinese language corpus

From: Serge Sharoff (S.Sharoff@leeds.ac.uk)
Date: Mon Oct 11 2004 - 14:08:23 MET DST

  • Next message: hajmohammadi: "[Corpora-List] [corpora-list] Persian language corpus"

    I don't think that there is a Chinese corpus with explicitly marked emotions.

    As far I know there is even no BNC-like Chinese corpus, i.e. a large corpus of modern texts in variety of genres. Several news corpora are available for Chinese, including Guo Jin's Chinese PH corpus, which is based on XINHUA news from 1990, downloadable from ftp://ftp.cogsci.ed.ac.uk/pub/chinese/
    and Chinese Gigaword corpus available from LDC, http://www.ldc.upenn.edu/

    We have an interface for searching through their subset:
    http://corpus.leeds.ac.uk/query-zh.html

    The problem for your research is that emotions are not frequently expressed or discussed in news texts, so you'll have relatively few examples of emotion words in those corpora. I had similar problems in studying emotion-related German words in the IDS corpus, which has a very large proportion of newspaper texts. They exhibit very few patterns that are quite different from uses of emotion words in other genres, in particular in spoken language.

    Best wishes,
    Serge

    > -----Original Message-----
    > From: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no] On
    > Behalf Of Katarzyna Horszowska
    > Sent: Sunday, October 10, 2004 10:46 PM
    > To: corpora@uib.no
    > Subject: [Corpora-List] Chinese language corpus
    >
    > Dear colleagues,
    >
    > I'm looking for a Chinese language corpus. I'm particulary interested
    > in words expressing emotions. nevertheless, if you could recommend me
    > any Chinese langugage reliable corpus, I would appreciate.
    >
    > best wishes,
    >
    > Katarzyna Horszowska.
    > --
    > Katarzyna Horszowska
    > www.chinski.ti.pl



    This archive was generated by hypermail 2b29 : Mon Oct 11 2004 - 14:25:29 MET DST