Non-trivial Cyrillic question

Roman Belenov rbelenov@yandex.ru
Tue Jan 20 15:41:00 GMT 2004


Alexander Shopov <ash@contact.bg> writes:

> However when I pipe it to less
>
> cat myfile | less
>
> it gets broken
> I get things like: <CF><D0><C5><CF><C8><D1><CA><C0>

You should tell less what characters are supposed to be printable
using LESSCHARSET or LESSCHARDEF environment variable (man less for
details).

> ls *
>
> it is displayed like
> ????.txt

ls uses current locale to determine which characters are
printable. AFAIK cygwin isn't locale-aware (I can be mistaken); the
possible workaround is to use --show-control-chars switch to print all
characters literally.

BTW Is cp1251 the only 8-bit encoding used by Bulgarian locale in MS
Windows ? In Russia two encoding are used - cp866 as OEM (used, e.g.,
in console), cp1251 as ANSI.

-- 
 							With regards, Roman.


--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/



More information about the Cygwin mailing list