Oct
31
(via Silver Oliver who works at the BBC): Do tag clouds work in Chinese? (Chinese translation of this article). Rex Wong talks about the alphabetical ordering problems of non-alphabetical languages, problems sizing up Chinese fonts, and the mixing of Chinese characters and English words.
At the end of the day, though, tag clouds do seem to work fine in Chinese, as they do in Japanese. The alphabetical ordering algorythm is kind of irrelevant because tag clouds guide the eye mostly by size (a tag cloud is not used for known-item searching, where alphabetical ordering matters, but for serendipitous discovery).
Also, collation algorythms that order these languages in a “good enough” manner already exist and are built into the infrastructure we use (programming languages and databases), so you can probably safely forget about this.
I’ve been told that in Japan, tag clouds are pretty popular and work well.
Chinese tag cloud:
Japanese tag cloud:
How about tag clouds in other languages? Please point me to examples of tag clouds in as many languages as you can find, I want to try to make a collection.
Another issue of course is if you aggregate lots of content and tags, how do you deal with mixed-language tag clouds? Here’s an (older) example of mixed languages in a tagcloud:
The best approach seems to be to separate out tagclouds in different languages. There are automatic ways to do that (I have to research some more what the best ways are), and then present a tagcloud to the user in their language. (Some more thoughts on this.)
A note about alphabetical ordering of none alphabetical languages (like Chinese or Japanese): it often is possible to order these languages, using an alternative collation system called radical and stroke ordering. Below is a screenshot of an “alphabetical” index in Japanese.
And a question to finish this off: for our Japanese speakers: what is Taggy?

October 31st, 2007 at 4:23 am
[…] Related: I wrote a post on 290s.com about tagclouds in Chinese and Japanese. […]
October 31st, 2007 at 4:47 am
Also, I’d love to hear about more examples of using tags in languages different than English, especially non-alphabetical and non-Latin alphabet languages. Leave them in the comments!
October 31st, 2007 at 5:07 am
Here’s a Portuguese tagcloud:
http://www.overmundo.com.br/home/nuvem_tags.php (via Gustavo from http://gawry.com)
October 31st, 2007 at 6:35 am
Hi, Peter
Just wanted to mention that the Chinese tag cloud above does not seem to be in any predictable order. There are ways to organize words/phrases in Chinese based on pronunciation or number of strokes. It would be more useful if they were in a particular order.
October 31st, 2007 at 7:19 am
“It would be more useful if they were in a particular order.” Why? The utility of a tag cloud is not to show order but to show frequency, is it not?
October 31st, 2007 at 7:31 am
My written Japanese is frankly awful, but Taggy.jp appears to be a “search engine that you don’t have to search” http://taggy.jp/help/whatis.html - primarily for multi-media, it seems. It lets you set up search profiles that act as “agents” to pull in content against tags you (and others) set: wanna see lots of funny videos? Just set the “funny” tag up and let the videos come to you!
November 1st, 2007 at 12:34 pm
Here’s a tagcloud in lolcatz: http://icanhascheezburger.com/all-tags/
November 8th, 2007 at 5:58 am
Jonathan, The utility of a tag cloud is to show frequency, displayed in alphabetical order ..