Background: I'm gonna attempt to make a list of the most used words/kanji on different message boards on 2ch.net so that japanese learners quckly can participate in online discussion and thus become motivated to continue.
I'm looking for a way to separate words, but it's not as simple as in english. Words can either be one kanji or consist of multiple, like "巨人" (giant) or "人" (human), and there are no spaces either.
So I probably need to have some japanese word processing library, and I only know python, javascript and java. (I prefer python)