Possible issue with gawk 5.0.1-1: Getting new warnings

Brian Inglis Brian.Inglis@SystematicSw.ab.ca
Fri Jul 12 14:43:00 GMT 2019


On 2019-07-12 04:47, Houder wrote:
> On Thu, 11 Jul 2019 11:51:09, Vipul P  wrote:
>> Here is a sample script to invoke awk:
>> $ cat ./gawk_error.sh
>> #!/bin/sh
>> echo "This:is:a:colon:separated:line:%%%:" | awk '{
>>   gsub("\\%", "%25", $0);
>>   gsub("\\:", "%3A", $0);
>>   print
>>   }'
> gsub(regexp, replacement[, target])
> echo "This:is:a:colon:separated:line:%%%:" | awk '{
>   gsub(/%/, "%25");
>   gsub(/:/, "%3A");
>   print
>   }'
>> If I invoke it with gawk 5.0.1-1, I see warnings as below:
>> $ uname -a
>> CYGWIN_NT-10.0 MyPC 3.0.7(0.338/5/3) 2019-04-30 18:08 x86_64 Cygwin
>> $ awk -V
>> GNU Awk 5.0.1, API: 2.0 (GNU MPFR 4.0.2, GNU MP 6.1.2)
>> Copyright (C) 1989, 1991-2019 Free Software Foundation.
>> $ ./gawk_error.sh
>> awk: cmd. line:2: (FILENAME=- FNR=1) warning: regexp escape sequence `\%'
>> is not a known regexp operator
>> awk: cmd. line:3: (FILENAME=- FNR=1) warning: regexp escape sequence `\:'
>> is not a known regexp operator
>> This%3Ais%3Aa%3Acolon%3Aseparated%3Aline%3A%25%25%25%3A
> I am not surprised to see these specific warnings ...
> See 3.2 (escape sequences) of GAWK: Effective AWK programming.
>> This script runs without any warnings with older versions of gawk:
>     https://cygwin.com/ml/cygwin/2019-04/msg00095.html
>     ( [ANNOUNCEMENT] gawk 5.0.0-1 )
>> The script runs without any warnings on Linux and other Unix variants.
> Using version 5 of gawk?
>> Is this a regression in gawk 5.0.1-1 on Cygwin ?
> Not very likely, as Corinna V. uses the tarball provided by Arnold R.

The first gsub argument is a regexp, so using a quoted string it must follow the
rules for a computed regexp: either "%"/":" to match characters alone or
"\\\\%"/"\\\\:" for characters preceded by backslash "\", as you wish.
Specifying only a single backslash regexp /\x/ or double backslash string "\\x",
the character must be a regexp special or it generates a warning message.

$ awk '{gsub("%", "%25", $0);gsub(":", "%3A", $0);print}' \
	<<< "This:is:a:colon:separated:line:%%%:"
This%3Ais%3Aa%3Acolon%3Aseparated%3Aline%3A%25%25%25%3A
$ awk '{gsub("\\\\%", "%25", $0);gsub("\\\\:", "%3A", $0);print}' \
	<<< "This:is:a:colon:separated:line:%%%:"
This:is:a:colon:separated:line:%%%:
$ awk '{gsub(/%/, "%25", $0);gsub(/:/, "%3A", $0);print}' \
	<<< "This:is:a:colon:separated:line:%%%:"
This%3Ais%3Aa%3Acolon%3Aseparated%3Aline%3A%25%25%25%3A
$ awk '{gsub(/\\%/, "%25", $0);gsub(/\\:/, "%3A", $0);print}' \
	<<< "This:is:a:colon:separated:line:%%%:"
This:is:a:colon:separated:line:%%%:

-- 
Take care. Thanks, Brian Inglis, Calgary, Alberta, Canada

This email may be disturbing to some readers as it contains
too much technical detail. Reader discretion is advised.

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple



More information about the Cygwin mailing list