Corpus Tools

1. http://lwc.daanvanesch.nl/    (Leiden Weibo Corpus, free to search, based on “5,103,566 messages posted on Sina Weibo in January 2012″)

see its frequency list http://lwc.daanvanesch.nl/frequentwords.php

2. http://lingua.mtsu.edu/chinese-computing/statistics/bigram/form.php   (Jun Da, Chinese Text Computing)

3. http://www.jukuu.com/  (句酷

4. Taiwan Mandarin Spoken Wordlist http://mmc.sinica.edu.tw/resources_e_01.htm

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: