Fresh Hacker News | An Introduction to Writing Systems and Unicode

▲An Introduction to Writing Systems and Unicode(r12a.github.io)

38 points by mariuz 3 days ago | 3 comments

▲ks2048 2 hours ago

This site has been a gem for a long time for Unicode and language-related topics. Just as good to link to the top-level,

https://r12a.github.io/

▲mostafah 2 hours ago

Richard is amazing. I briefly worked with him while volunteering on a W3C text layout requirements document. He cares deeply about writing systems, and he has been doing so much valuable work in this space.

▲vishnuharidas 32 minutes ago

Also: UTF-8 Playground: https://utf8-playground.netlify.app

▲ovciokko 2 hours ago

The texts in the images claimed to be Simplified Chinese are not really conforming the standard glyph shapes of hanzi as defined by the government of China; they look more like the Japanese standard shapes of kanji.

▲mbrubeck 1 hour ago

Can you clarify which characters you're talking about? I don't see any examples of Japanese-specific kanji in the simplified Chinese examples.

For example, the first image uses 沟 and 时 forms that are found only in simplified Chinese. In both Japanese and traditional Chinese, these are written 溝 and 時.

The images also correctly use the Chinese forms of 統/统. The Japanese form [0] differs from both and does not appear in these images.

请 as shown in the image is similarly used only in simplified Chinese, not Japanese. In Japanese, the traditional Chinese form is normally used in handwriting, and an alternate form of the 訁 radical (different from either of the Chinese forms) is often used in printed text.

[0]: https://en.wiktionary.org/wiki/%E7%B5%B1#Japanese

▲dhosek 1 hour ago

One of the big complaints about Han-unification in Unicode is that simplified and traditional forms share the same code points so display of simplified vs traditional is up to the font to manage.

▲renhanxue 19 minutes ago

That's not really accurate. An overwhelming majority of the simplified characters have had their own code points in Unicode ever since 1.0. Some more details here: https://r12a.github.io/scripts/chinese/