I'm developing a set of tools to help with learning Chinese characters, and I realized that I don't know of a good dataset of character decompositions. Ideally it should be comprehensive (or have the best coverage possible), machine-readable, and freely available (Creative Commons?). So far I know about Wiktionary and Make Me a Hanzi. What other ones are there?
Please note that this kind of decomposition is merely graphical. A lot of the time, such decompositions don't reveal how the character functions at all. See this related question. – dROOOze Aug 22 '18 at 18:01
3 Answers
3
You may try the 拆字字典 (a character decomposition dictionary):
http://www.kaifangcidian.com/han/chaizi
For example:
【彎】 wan1 〖絲 言 弓 彎〗〖糹 言 糹 弓 彎〗
http://www.kaifangcidian.com/han/chaizi/彎
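If you want to consume entries like the one above programmatically, here is a minimal Python sketch. The line shape it expects (`【char】 reading 〖part part ... char〗...`) is only inferred from this single example, not from any documentation of the dictionary, and the function name `parse_chaizi_line` is made up for illustration.

```python
import re

# Assumed entry shape, inferred from the one example above:
#   【char】 reading 〖part part ... char〗〖part ... char〗
ENTRY_RE = re.compile(r"【(.+?)】\s*(\S+)\s*((?:〖[^〗]*〗)+)")
GROUP_RE = re.compile(r"〖([^〗]*)〗")


def parse_chaizi_line(line):
    """Return a dict for one entry, or None if the line does not match."""
    m = ENTRY_RE.search(line)
    if m is None:
        return None
    char, reading, groups = m.groups()
    decompositions = []
    for body in GROUP_RE.findall(groups):
        parts = body.split()
        # In the example, each group ends with the character itself; drop it.
        if parts and parts[-1] == char:
            parts = parts[:-1]
        decompositions.append(parts)
    return {"char": char, "reading": reading, "decompositions": decompositions}


entry = parse_chaizi_line("【彎】 wan1 〖絲 言 弓 彎〗〖糹 言 糹 弓 彎〗")
print(entry["char"], entry["reading"], entry["decompositions"])
```

Each bracketed group becomes one alternative decomposition, so 彎 ends up with two lists of components.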
Have fun :)
水巷孑蠻
2
CHISE (http://www.kanji.zinbun.kyoto-u.ac.jp/projects/chise/ids/index.html) is one of the most comprehensive ones available, containing decompositions for almost all characters in the CJK Unicode block.
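The decompositions there are written as Ideographic Description Sequences (IDS): prefix notation in which a layout operator is followed by its components. As a rough sketch of how such a string could be turned into a tree in Python, assuming only the twelve classic operators U+2FF0 to U+2FFB (of which ⿲ and ⿳ take three operands and the rest take two) and assuming every component is a single Unicode character: real data files may contain other component references that this sketch does not handle. The names `parse_ids` and `leaves` are made up, and the sample string is a hand-written illustration, not copied from the CHISE data.

```python
# Arity of the twelve classic Ideographic Description Characters.
IDC_ARITY = {chr(cp): 2 for cp in range(0x2FF0, 0x2FFC)}
IDC_ARITY["\u2FF2"] = 3  # ⿲ left, middle, right
IDC_ARITY["\u2FF3"] = 3  # ⿳ top, middle, bottom


def parse_ids(ids):
    """Parse a prefix-notation IDS string into a nested tuple tree."""

    def walk(pos):
        if pos >= len(ids):
            raise ValueError("IDS ended before all operands were read")
        ch = ids[pos]
        pos += 1
        arity = IDC_ARITY.get(ch)
        if arity is None:
            return ch, pos  # a plain component: leaf node
        children = []
        for _ in range(arity):
            child, pos = walk(pos)
            children.append(child)
        return (ch, *children), pos

    tree, end = walk(0)
    if end != len(ids):
        raise ValueError("trailing characters after a complete IDS")
    return tree


def leaves(tree):
    """Flatten a parsed tree into its components, in reading order."""
    if isinstance(tree, str):
        return [tree]
    out = []
    for child in tree[1:]:
        out.extend(leaves(child))
    return out


tree = parse_ids("⿱⿲糹言糹弓")  # hand-written illustrative IDS for 彎
print(tree)
print(leaves(tree))
```

Keeping the operators in the tree preserves the layout information (top/bottom, left/right), while `leaves` gives the flat component list that a learning tool would usually show.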
dROOOze
@d33tah the only thing I can find is that it's open source and copyrighted... maybe I didn't look in the right places. – dROOOze Aug 21 '18 at 16:17
0
https://github.com/amake/cjk-decomp or https://github.com/cjkvi/cjkvi-ids are what you're looking for. Licenses are included in the repos. The IDS decomposition (cjkvi-ids) is newer, but its licensing seems a bit unclear. Then again, Chinese characters haven't changed since the first repo, cjk-decomp, was created, so perhaps use that one.
Vitaly Osipov