2

I'm developing a set of tools meant to aid in learning Chinese characters and I realized that I don't know a good dataset containing character decomposition. Ideally it should be comprehensive (or have as good coverage as possible), machine-readable and available for free (creative commons?). So far I know about Wiktionary and Make Me a Hanzi. What are the other ones?

d33tah
  • 555
  • 3
  • 14

3 Answers3

3

you may try the 拆字字典:

http://www.kaifangcidian.com/han/chaizi

eg:

【彎】 wan1 〖絲 言 弓 彎〗〖糹 言 糹 弓 彎〗

http://www.kaifangcidian.com/han/chaizi/彎

have fun :)

水巷孑蠻
  • 15,695
  • 2
  • 16
  • 35
2

CHISE (http://www.kanji.zinbun.kyoto-u.ac.jp/projects/chise/ids/index.html) is one of the most comprehensive ones available, containing decompositons for almost all characters in the CJK unicode block.

dROOOze
  • 22,662
  • 2
  • 42
  • 65
0

https://github.com/amake/cjk-decomp or https://github.com/cjkvi/cjkvi-ids is what you're looking for. Licenses are included in the repos. The IDS decomposition is newer but licensing seems a bit unclear. Then again, Chinese characters haven't changed since the first repo, cjk-decomp, was created, use that perhaps.

Vitaly Osipov
  • 628
  • 3
  • 10