This is the mail archive of the
mailing list for the Cygwin project.
Re: Locales with wrong umlauts
- From: Igor Peshansky <pechtcha at cs dot nyu dot edu>
- To: Lapo Luchini <lapo at lapo dot it>
- Cc: cygwin at cygwin dot com
- Date: Mon, 27 Mar 2006 23:54:15 -0500 (EST)
- Subject: Re: Locales with wrong umlauts
- References: <loom.20060326T135539firstname.lastname@example.org>
- Reply-to: cygwin at cygwin dot com
On Sun, 26 Mar 2006, Lapo Luchini wrote:
> Igor Pechtchanski <pechtcha <at> cs.nyu.edu> writes:
> > On Sun, 27 Jun 2004, A. Alper Atici wrote:
> > > try the following:
> > > set OUTPUT_CHARSET=iso-8859-1
> > Wow. Thanks, this was *extremely* useful. Interestingly enough, the
> > OUTPUT_CHARSET option was not mentioned anywhere in the gettext/libintl
> > documentation, but a search for it unearthed another couple of messages on
> > this list from earlier this year with the same info[*] (one was from you).
> Extremely useful to me too, I was quite fed up to see "`a" instead of "à" =)
> I also noticed that OUTPUT_CHARSET=CP1252 *may* be preferred, compare the
> following outputs:
> % mtn up
> monotone: gi`a aggiornato a '1848d7dfabfbed09fe53856da038e31eed0f42dc'
> % OUTPUT_CHARSET=CP1252 monotone up
> monotone: già aggiornato a ÿÿ1848d7dfabfbed09fe53856da038e31eed0f42dcÿÿ
> % OUTPUT_CHARSET=ISO8859-1 monotone up
> monotone: già aggiornato a `1848d7dfabfbed09fe53856da038e31eed0f42dc´
> % OUTPUT_CHARSET=ISO8859-15 mtn up
> monotone: già aggiornato a '1848d7dfabfbed09fe53856da038e31eed0f42dc'
> In order to really "check" it some gettext with an euro symbol should be
> used, but I'm not aware of any that does and I don't have the time to
> create one right now 0=)
> Instead of putting it simply in some FAQ couldn't Cygwin define that env
> var correctly "by default"? (after all the system *knows* which charset
> it is using, I guess?)
The system has no idea what charset it's using, because it depends on the
font you set for your terminal, which is outside of the terminal's
control. Even if you use a Unicode font with charset conversion, the
charset is specified outside of the console.
Incidentally, since this subject came up: ls has a "--show-control-chars"
option, but rm, mv, cp, and a bunch of other tools don't. So, if you run
rm in interactive mode, it doesn't display filenames properly. For
$ touch é a
$ ls é
$ mv -i a é
mv: overwrite `\351'? n
$ rm -i é
rm: remove regular empty file `\351'? y
Is there any way to tell mv, rm &co to display non-ASCII characters in
filenames? I know this isn't Cygwin-specific, but I'm not even sure what
to Google for. Eric?
|\ _,,,---,,_ email@example.com | firstname.lastname@example.org
ZZZzz /,`.-'`' -. ;-;;,_ Igor Peshansky, Ph.D. (name changed!)
|,4- ) )-,_. ,\ ( `'-' old name: Igor Pechtchanski
'---''(_/--' `-'\_) fL a.k.a JaguaR-R-R-r-r-r-.-.-. Meow!
"Las! je suis sot... -Mais non, tu ne l'es pas, puisque tu t'en rends compte."
"But no -- you are no fool; you call yourself a fool, there's proof enough in
that!" -- Rostand, "Cyrano de Bergerac"
Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple
Problem reports: http://cygwin.com/problems.html