---

Mastering Characters Sets in Linux (Weird Characters, part 2)

“gucharmap

“First, when you’re testing anything involving character
encoding, gucharmap is invaluable (Figure 1).

“Every Unicode character is in some category, shown in the list
on the left — in addition to Basic Latin, Latin-1 Supplement
(accented characters), Greek, Cyrillic, Katakana etc. there are
categories for Braille, Cuneiform, punctuation, mathematics, music
and so forth.

“The Character Details tab tells you the Unicode, UTF-8, UTF-16
and XML/HTML codes for the character.

“If you have a character from a web page or email and don’t know
what it is, just paste it into gucharmap’s Search->Find field
(Figure 2).”

Complete
Story

Get the Free Newsletter!

Subscribe to Developer Insider for top news, trends, & analysis