Non-trivial Cyrillic question
Roman Belenov
rbelenov@yandex.ru
Tue Jan 20 15:41:00 GMT 2004
Alexander Shopov <ash@contact.bg> writes:
> However when I pipe it to less
>
> cat myfile | less
>
> it gets broken
> I get things like: <CF><D0><C5><CF><C8><D1><CA><C0>
You should tell less what characters are supposed to be printable
using LESSCHARSET or LESSCHARDEF environment variable (man less for
details).
> ls *
>
> it is displayed like
> ????.txt
ls uses current locale to determine which characters are
printable. AFAIK cygwin isn't locale-aware (I can be mistaken); the
possible workaround is to use --show-control-chars switch to print all
characters literally.
BTW Is cp1251 the only 8-bit encoding used by Bulgarian locale in MS
Windows ? In Russia two encoding are used - cp866 as OEM (used, e.g.,
in console), cp1251 as ANSI.
--
With regards, Roman.
--
Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple
Problem reports: http://cygwin.com/problems.html
Documentation: http://cygwin.com/docs.html
FAQ: http://cygwin.com/faq/
More information about the Cygwin
mailing list