Can you recommend tool where I can import text and get result list with words according their frequency in text. Aim to import Chinese subtitles learn most frequent words and then watch movie.
Asked
Active
Viewed 1,731 times
3
2 Answers
3
Boris, try this one: http://www.zhongguosou.com/zonghe/cipintongji.aspx
The UI is in Chinese, you may have to figure out how to use it. I tried it myself. It is not user friendly. But this is the only one I found so far.
EQG
- 199
- 1
- 4
-
-
Its hangs with error and not ordering (even with ordering option ON though) – Boris Ivanov Dec 09 '14 at 13:22
0
Python package "jieba" (结巴) should be a good alternative if you want to process a lot of text. This package can split sentences into words and then what you need to do is only counting.
Aria Ax
- 555
- 2
- 6
xunsearchprovides a good engine for Chinese analyses, but it seems they don't provide a formal website to do that, you need to call their api by yourself. Anyway, they have a demo that may meet your requirement -- just paste your text in the large text box, tick the check box before 只看统计, modify the number 10 after it to list more results (it even provides a text box for you to input what word class you want to exclude,~vmeans "exclude verbs"), and then click "submit". You may have a try. – Stan Dec 08 '14 at 19:01