This is the mail archive of the
glibc-bugs@sourceware.org
mailing list for the glibc project.
[Bug libc/12830] New: ISO-2022-JP-2 maps C1 control characters incorrectly
- From: "glibcbugz at ghalkes dot nl" <sourceware-bugzilla at sourceware dot org>
- To: glibc-bugs at sources dot redhat dot com
- Date: Wed, 1 Jun 2011 07:41:46 +0000
- Subject: [Bug libc/12830] New: ISO-2022-JP-2 maps C1 control characters incorrectly
- Auto-submitted: auto-generated
http://sourceware.org/bugzilla/show_bug.cgi?id=12830
Summary: ISO-2022-JP-2 maps C1 control characters incorrectly
Product: glibc
Version: 2.13
Status: NEW
Severity: normal
Priority: P2
Component: libc
AssignedTo: drepper.fsp@gmail.com
ReportedBy: glibcbugz@ghalkes.nl
In the ISO-2022-JP-2 converter, the C1 control codes (U0080-U009F) are encoded
as 1B 2E 41 1B 4E [00 - 1F] (i.e., load ISO-8859-1 in the G2 graphics set, use
single shift to set G2 and encode the byte [00 - 1F]). However, if I understand
the standard correctly, switching to the G2 set _only_ changes the mapping of
the 96 characters in the range 20-7F (or the 94 charaacters in the range 21-7E
if a smaller set is used). The control characters are unaffected. To access the
C1 control set, one should use 1B [40 - 5F]. This is actually done for the
encoding of the "single shift 2" control (U+008E) in the sequence above, which
is encoded as 1B 4E.
--
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.