This is the mail archive of the cygwin mailing list for the Cygwin project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: Date field of ls -l command is garbled in latest cygwin1.dll snapshot.


Am 24.06.2018 um 00:32 schrieb Thomas Wolff:
Am 23.06.2018 um 20:46 schrieb Brian Inglis:
On 2018-06-22 17:06, Takashi Yano wrote:
On Sat, 23 Jun 2018 05:39:27 +0900
Takashi Yano wrote:
I looked into this problem, and found this is caused by incorrect
return value of iswprint().
I have found the cause. That is, file categories.t is not correct.

For example, http://ftp.unicode.org/Public/UNIDATA/UnicodeData.txt says:

3400;<CJK Ideograph Extension A, First>;Lo;0;L;;;;;N;;;;;
4DB5;<CJK Ideograph Extension A, Last>;Lo;0;L;;;;;N;;;;;
...
4E00;<CJK Ideograph, First>;Lo;0;L;;;;;N;;;;;
9FEF;<CJK Ideograph, Last>;Lo;0;L;;;;;N;;;;;

However, categories.t is:
     {CAT_Lo, 0x3400, 0},
     {CAT_Lo, 0x4DB5, 0},
...
     {CAT_Lo, 0x4E00, 0},
     {CAT_Lo, 0x9FEA, 0},

Therefore, the script mkcategories which generates categories.t should be fixed.
Obviously. I will check why the script was failing here and thanks for the patch already.
Yeah, I totally failed to consider the First...Last ranges in this script, how embarassing. I would have chosen other markers than 0/1 for the two cases (maybe "firstlast" for the new one), and not put it in the middle, but you fixed it anyway; except for the Private Use ranges E000..F8FF, F0000..FFFFD, 100000..10FFFD, but their category Co isn't used anyway. I suggest you submit your patch directly to cygwin-patches@cygwin.com. One issue, though: Apparently, you used a Unicode version other than 10.0 as a reference, probably 11.0. Maybe the fixing patch should be based on the same version, for better clarity and to reduce the diff (fewer changes in categories.t and none in caseconv.t).
We can update to 11.0 in addition, but:
Brian wrote:
These entries change with every Unicode release, and a new one came out a few
weeks ago, updated here yesterday.
The library updated to 11.0 is probably icu, which isn't used here. The libc generation scripts use the original Unicode files which are packaged in unicode-ucd which is still at 10.0.

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]