This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
UCS data encoding in localedata
- From: Petr Baudis <pasky at ucw dot cz>
- To: libc-alpha at sourceware dot org, libc-announce at sourceware dot org
- Date: Fri, 6 Apr 2012 10:18:27 +0200
- Subject: UCS data encoding in localedata
Hi!
Does anyone know the technical reason for using the explicit <U0000>
UCS encoding in localedata instead of some sane approach like UTF-8
encoded data? I can think of only historical reasons, namely the lack
of support in tools (OS, editors, VCS, ...) in the past; by now,
though, I believe that using UTF-8 should be fairly safe.
For me, even with the show-ucs-data tool, dealing with localedata
files is quite onerous. Can anyone share any other tricks they use
when dealing with localedata? Would there be any resistance to moving
to UTF-8?
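To illustrate what such a conversion involves, here is a minimal Python
sketch that turns <UXXXX> tokens into the characters they name. It is not
the show-ucs-data tool and not anything shipped with glibc; the function
name decode_ucs_tokens and the sample line are made up for illustration,
and I am assuming tokens are written with either 4 or 8 hex digits.

```python
import re

# Assumption: tokens look like <UXXXX> (4 hex digits) or <UXXXXXXXX> (8 hex digits).
_UCS_TOKEN = re.compile(r"<U([0-9A-Fa-f]{8}|[0-9A-Fa-f]{4})>")


def decode_ucs_tokens(text: str) -> str:
    """Replace every <UXXXX> token in text with the character it names."""
    return _UCS_TOKEN.sub(lambda m: chr(int(m.group(1), 16)), text)


if __name__ == "__main__":
    # Hypothetical localedata-style line, not copied from a real locale file.
    sample = 'abday "<U006E><U0065>";"<U0070><U006F>"'
    print(decode_ucs_tokens(sample))
    # Encoding the result as UTF-8 is then an ordinary str.encode() call.
    print(decode_ucs_tokens("<U010D>").encode("utf-8"))
```

Text outside the tokens passes through unchanged, so the surrounding
keywords and quoting survive the conversion.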
P.S.: It is not like I would start working on this tomorrow. However,
I have wondered about this long enough to ask. :-)
--
Petr "Pasky" Baudis
Smart data structures and dumb code works a lot better
than the other way around. -- Eric S. Raymond