Need help with multibyte UTF-8 characters
Thomas Wolff
towo@towo.net
Sat Dec 16 01:50:00 GMT 2017
On 15.12.2017 at 01:32, Brian Inglis wrote:
> On 2017-12-12 12:42, Thomas Taylor wrote:
>> I believe that Cygwin displays certain UTF-8 characters incorrectly. To see the
>> problem, first save the attached "utf-8_test.sed" text file to your desktop.
>> Then run "mintty," and set its options by right clicking in its title bar,
>> selecting "Options" and then "Text." On the Text page set "Locale" to "en_US"
>> and "Character set" to "UTF-8," and then "Save." Now exit and restart mintty.
>> Change directory to your desktop and run the editor "vim" on the utf-8_test.sed
>> file. Once inside vim do a ":set fileencoding=utf-8". You should now see that
>> vim displays correctly a sample of one-, two-, and three-byte UTF-8 character
>> encodings in the test file. Vim fails, however, on the three-byte encodings for
>> the "en" dash, the "em" dash, and the ellipsis, each of which displays
>> incorrectly as a filled-in rectangle. Now exit vim and do a "less" or "cat" on
>> the utf-8_test.sed file. You should see most of the sample UTF-8 encoded
>> characters displayed correctly, except once again for the en dash, em dash, and
>> ellipsis. So it looks like a problem in the underlying Cygwin run-time
>> libraries rather than in vim, less, or cat. I haven't tested this on four-byte
>> UTF-8 character encodings, but assume Cygwin will have similar problems.
> Like many others -- no problems visible -- all UTF-8 characters displayed
> correctly in gvim/X, vim, less, cat from mintty.
It seems nobody has asked you yet which font you use, so please report that.
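
To separate an encoding problem from a font problem, it may help to compare
the bytes in the test file against the expected UTF-8 sequences for the three
characters named above. The short Python sketch below prints those sequences;
the names SAMPLES and utf8_hex are made up for this illustration and are not
part of Cygwin or mintty. If the file contains the same bytes, the encoding is
fine and the filled-in rectangles would more plausibly come from a font that
lacks those glyphs, though that is an assumption and not something confirmed
in this thread.

```python
# Characters reported as displaying as filled-in rectangles.
# SAMPLES and utf8_hex are hypothetical names used only for this sketch.
SAMPLES = {
    "en dash": "\u2013",
    "em dash": "\u2014",
    "ellipsis": "\u2026",
}


def utf8_hex(ch: str) -> str:
    """Return the UTF-8 encoding of ch as space-separated hex bytes."""
    return " ".join(f"{b:02X}" for b in ch.encode("utf-8"))


for name, ch in SAMPLES.items():
    # Each of these code points needs three bytes in UTF-8.
    print(f"{name}: U+{ord(ch):04X} -> {utf8_hex(ch)}")
```

The printed byte values can then be compared with a hex dump of
utf-8_test.sed, for example from "od -c" or "xxd" if those are installed.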