This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: UCS data encoding in localedata
- From: "Carlos O'Donell" <carlos at systemhalted dot org>
- To: Petr Baudis <pasky at ucw dot cz>
- Cc: libc-alpha at sourceware dot org, libc-announce at sourceware dot org
- Date: Fri, 6 Apr 2012 05:37:22 -0400
- Subject: Re: UCS data encoding in localedata
- References: <20120406081827.GH31914@machine.or.cz>
On Fri, Apr 6, 2012 at 4:18 AM, Petr Baudis <pasky@ucw.cz> wrote:
> ?Does anyone know the technical reason for using the explicit <U0000>
> UCS encoding in localedata instead of some sane approach like UTF8
> encoded data? I can think of only historical reasons due to the lack
> of support in tools (OS, editors, VCS, ...) in the past, however I
> believe that by now, using UTF8 should be fairly safe.
Yes, you are probably correct.
The only other problem I can think of is that you'd be adding a
circular dependency between a tool that uses this data and at the same
time edits the data.
> ?For me, even with the show-ucs-data tool, deadling with localedata
> files is quite onerous. ?Can anyone share any other tricks they use
> when dealing with localedata? ?Would there be any resistance to moving
> to UTF8?
I'm not opposed to it. I want to make editing these files as easy as
possible for you :-)
> ?P.S.: It is not like I would start working on this tomorrow. However,
> I have wondered about this long enough to ask. :-)