This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: [PATCH] Fix Charset comment in fi_FI, fi_FI at euro
- From: Siddhesh Poyarekar <siddhesh at redhat dot com>
- To: Marko Myllynen <myllynen at redhat dot com>
- Cc: GNU C Library <libc-alpha at sourceware dot org>, libc-locales at sourceware dot org
- Date: Thu, 12 Dec 2013 09:26:17 +0530
- Subject: Re: [PATCH] Fix Charset comment in fi_FI, fi_FI at euro
- Authentication-results: sourceware.org; auth=none
- References: <5294D4F3 dot 40006 at redhat dot com> <20131126172545 dot GA11818 at www5 dot open-std dot org> <5294DC9E dot 201 at redhat dot com> <20131126181126 dot GA13640 at www5 dot open-std dot org> <5294E723 dot 3000907 at redhat dot com> <20131126193931 dot GA17538 at www5 dot open-std dot org> <52A70B3A dot 3080406 at redhat dot com>
Thanks, I've pushed this now.
Siddhesh
On Tue, Dec 10, 2013 at 02:38:18PM +0200, Marko Myllynen wrote:
> Ping / could someone commit this one, please?
>
> https://sourceware.org/ml/libc-alpha/2013-11/msg00787.html
>
> Thanks,
>
> On 2013-11-26 21:39, keld@keldix.com wrote:
> > Hi Marko
> >
> > The patch is OK with me
> >
> > Kiitos,
> > Keld
> >
> > On Tue, Nov 26, 2013 at 08:23:31PM +0200, Marko Myllynen wrote:
> >> Hi,
> >>
> >> correct, no non-ISO-8859-15 characters are used in the locale itself
> >> elsewhere.
> >>
> >> Updated patch below, ok now? Thanks.
> >>
> >> 2013-11-26 Marko Myllynen <myllynen@redhat.com>
> >>
> >> * locales/fi_FI: Fix Charset comment.
> >> * locales/fi_FI@euro: Likewise.
> >>
> >> diff --git a/localedata/locales/fi_FI b/localedata/locales/fi_FI
> >> index f779964..6ea9301 100644
> >> --- a/localedata/locales/fi_FI
> >> +++ b/localedata/locales/fi_FI
> >> @@ -16,7 +16,7 @@ comment_char %
> >> % Date: 1996-10-15
> >> % Application: general
> >> % Users: general
> >> -% Charset: ISO-8859-1
> >> +% Charset: UTF-8, ISO-8859-15
> >> % Distribution and use is free, also
> >> % for commercial purposes.
> >> %
> >> diff --git a/localedata/locales/fi_FI@euro b/localedata/locales/fi_FI@euro
> >> index 76f78ee..ef7f8cd 100644
> >> --- a/localedata/locales/fi_FI@euro
> >> +++ b/localedata/locales/fi_FI@euro
> >> @@ -8,7 +8,7 @@ comment_char %
> >> % Date: 2000-08-20
> >> % Application: general
> >> % Users: general
> >> -% Charset: ISO-8859-15
> >> +% Charset: UTF-8, ISO-8859-15
> >> % Distribution and use is free, also
> >> % for commercial purposes.
> >>
> >>
> >>
> >> On 2013-11-26 20:11, keld@keldix.com wrote:
> >>> Hi again
> >>>
> >>> Well, libc locales are able to sort the full ucs charset, so that is not an argument to
> >>> exclude iso-8859-15. Finland uses Euro, so that excludes iso-8859-1.
> >>> I do not think we have specs that specifies use of citation marks etc, which could be
> >>> characters outside 8859-15 for Finnish.
> >>>
> >>> The Charset comment is meant to give charsets for which a locale would compile
> >>> with localedef without fatal errors. It can be used to generate
> >>> all the locale/charset combinations that a locale is valid in.
> >>>
> >>> Best regards
> >>> keld
> >>>
> >>> On Tue, Nov 26, 2013 at 07:38:38PM +0200, Marko Myllynen wrote:
> >>>> Hi,
> >>>>
> >>>> ANGSTROM SIGN is specified in the standard to sort it alongside with å/Å.
> >>>>
> >>>> There is also a recent proposal to extend the auxiliary character set
> >>>> (that's the closest translation I came up with, it's
> >>>> "lisämerkkivalikoima" in Finnish in the proposal) with characters that
> >>>> might be needed when processing texts in the Finnish context and the
> >>>> proposal contains several non-ISO-8859-15 characters, see
> >>>> http://www.kotoistus.fi/kannanotot_html/avoimet/lisamerkisto-2013.
> >>>>
> >>>> So I think it's better not to suggest using ISO-8859-15 any more.
> >>>>
> >>>> Thanks,
> >>>>
> >>>> On 2013-11-26 19:25, keld@keldix.com wrote:
> >>>>> It is not necessary to specify UTF-8, all locales support UTF-8.
> >>>>> The question is whether the locale can be used with iso-8859-15.
> >>>>> More than one charset may be specified with the charset comment.
> >>>>> If you like you could specify both UTF-8 and iso-8859-15,
> >>>>> with utf-8 first, if that is the preference.
> >>>>>
> >>>>> What is ANGSTROM used for? As an alternative to Å in the sorting?
> >>>>> And other places?
> >>>>>
> >>>>> Best regards
> >>>>> keld
> >>>>>
> >>>>> On Tue, Nov 26, 2013 at 07:05:55PM +0200, Marko Myllynen wrote:
> >>>>>> Hi,
> >>>>>>
> >>>>>> please see the patch below to fix the Charset comment in fi_FI and
> >>>>>> fi_FI@euro. ISO-8859-1 is certainly incorrect with Euro and since SFS-EN
> >>>>>> 13710 (which we will hopefully support later) specifies a
> >>>>>> non-ISO-8859-15 character (ANGSTROM SIGN) it seems best to use UTF-8.
> >>>>>>
> >>>>>> 2013-11-26 Marko Myllynen <myllynen@redhat.com>
> >>>>>>
> >>>>>> * locales/fi_FI: Fix Charset comment.
> >>>>>> * locales/fi_FI@euro: Likewise.
> >>>>>>
> >>>>>> diff --git a/localedata/locales/fi_FI b/localedata/locales/fi_FI
> >>>>>> index f779964..6ea9301 100644
> >>>>>> --- a/localedata/locales/fi_FI
> >>>>>> +++ b/localedata/locales/fi_FI
> >>>>>> @@ -16,7 +16,7 @@ comment_char %
> >>>>>> % Date: 1996-10-15
> >>>>>> % Application: general
> >>>>>> % Users: general
> >>>>>> -% Charset: ISO-8859-1
> >>>>>> +% Charset: UTF-8
> >>>>>> % Distribution and use is free, also
> >>>>>> % for commercial purposes.
> >>>>>> %
> >>>>>> diff --git a/localedata/locales/fi_FI@euro b/localedata/locales/fi_FI@euro
> >>>>>> index 76f78ee..ef7f8cd 100644
> >>>>>> --- a/localedata/locales/fi_FI@euro
> >>>>>> +++ b/localedata/locales/fi_FI@euro
> >>>>>> @@ -8,7 +8,7 @@ comment_char %
> >>>>>> % Date: 2000-08-20
> >>>>>> % Application: general
> >>>>>> % Users: general
> >>>>>> -% Charset: ISO-8859-15
> >>>>>> +% Charset: UTF-8
> >>>>>> % Distribution and use is free, also
> >>>>>> % for commercial purposes.
> >>>>>>
> >>>>>>
> >>>>>> --
> >>>>>> Marko Myllynen
> >>>>
> >>>>
> >>>> --
> >>>> Marko Myllynen
> >>
> >>
> >> --
> >> Marko Myllynen
>
>
> --
> Marko Myllynen