[Improvement] Cache user auth and converted paths for child-processes

Brian Inglis Brian.Inglis@SystematicSW.ab.ca
Sun Jan 19 17:04:20 GMT 2025


On 2025-01-19 05:56, Devste Devste via Cygwin wrote:
> strace -o strace.log dirname -- /some/path/here
> There are 2 points that could make it significantly faster and
> shouldn't be too hard to implement?

Criticisms are expected to be accompanied by code, and all Patches are 
Thoughtfully Considered (PTC - see https://cygwin.com/acronyms/): use git 
format-patch/send-email against the repo main branch to the cygwin-patches@... ML.

All contributors are volunteers working when they have spare time available.

> 1) A significant amount of time is spent on user auth (as seen in
> various github issues the infamous /etc/passwd nsswitch.conf fix)
> Wouldn't it be possible to just reuse the auth data from the current
> shell for the subshell, e.g. for
> basename -- $(dirname -- /some/path/here)

Some of this is done for each child in a Cygwin process tree, and for all Cygwin 
processes if you run cygserver.
You can also limit which sources are consulted by editing nsswitch.conf.
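
As a rough illustration of limiting the sources, a minimal /etc/nsswitch.conf 
fragment might look like the one below. It assumes you have already generated 
local /etc/passwd and /etc/group files (for example with mkpasswd -c and 
mkgroup -c) and that you do not need lookups against the Windows account 
database; check the Cygwin User's Guide before relying on it.

```
# /etc/nsswitch.conf (sketch: consult only the local files, skip the "db" source)
# Assumes /etc/passwd and /etc/group exist, e.g. created with:
#   mkpasswd -c > /etc/passwd
#   mkgroup -c > /etc/group
passwd: files
group:  files
```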

> 2) why are unix/dos path conversions of environment variables not
> cached? A significant amount of time (15-30%, depending on the number
> of environment variables) is spent on the conversion for every
> invocation.
> However, this would be extremely simple to cache and reusable even on
> completely unrelated subshells.
> cache key = original path
> cache value = converted value
> e.g.
> 
> ```
>     44   12839 [main] dirname 16929 mount_info::conv_to_posix_path:
> conv_to_posix_path (C:\Users\User123\bin, 0x10000100, no-add-slash)
>     44   12883 [main] dirname 16929 normalize_win32_path:
> C:\Users\User123\bin = normalize_win32_path (C:\Users\User123\bin)
>     44   12927 [main] dirname 16929 mount_info::conv_to_posix_path:
> mount[0] .. checking / -> C:\Program Files\Git
>     44   12971 [main] dirname 16929 mount_info::conv_to_posix_path:
> mount[1] .. checking /bin -> C:\Program Files\Git\usr\bin
>     44   13015 [main] dirname 16929 mount_info::conv_to_posix_path:
> mount[2] .. checking /tmp -> C:\Users\User123\AppData\Local\Temp
>     44   13059 [main] dirname 16929 mount_info::conv_to_posix_path:
> /c/Users/User123/bin = conv_to_posix_path (C:\Users\User123\bin)
> ```
> could be cached with key
> C:\Users\User123\bin
> and value
> /c/Users/User123/bin
> at least for the current process (e.g. a bash script and its
> children) without risking any noticeable outdated cache issues
> (probably even longer, however we want to keep it simple and don't
> want to worry about cache invalidation too much)

Any process can change the environment, so its contents cannot be assumed, and 
everything has to be rechecked each time.
It is probably quicker (44µs) to just check for paths and convert them than to 
look up and validate a cache.

Drop all the Windows paths and variables from your Cygwin environment, unless 
you intend to run Windows programs from there, and you will not have so much 
overhead.
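
A minimal sketch of that idea, in Python only for illustration: it builds a 
reduced environment by dropping variables whose value looks like a Windows 
path. The drive-letter regex is a heuristic of mine, not the test Cygwin 
uses, and the function name and sample variables are hypothetical.

```python
import re

# Heuristic (an assumption, not Cygwin's own test): a value "looks like"
# a Windows path if it starts with a drive letter followed by a slash.
_WIN_PATH = re.compile(r"^[A-Za-z]:[\\/]")


def strip_windows_vars(env, keep=("PATH", "HOME")):
    """Return a copy of env without variables whose value looks like a Windows path.

    Names listed in keep are always retained, whatever their value.
    """
    return {
        name: value
        for name, value in env.items()
        if name in keep or not _WIN_PATH.match(value)
    }


# Hypothetical sample environment, for demonstration only.
sample = {
    "HOME": "/home/User123",
    "PATH": "/usr/bin:/bin",
    "APPDATA": r"C:\Users\User123\AppData\Roaming",
    "ProgramFiles": r"C:\Program Files",
    "LANG": "en_US.UTF-8",
}
clean = strip_windows_vars(sample)
print(sorted(clean))
```

The result can be passed as the env argument of subprocess.run when starting 
a child such as dirname. Shortening PATH so it holds only the Cygwin 
directories you need is a separate step that this sketch does not attempt.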

For a cache, you would want to key on the env var name, have a path flag, and 
keep both the Windows and Cygwin alternatives, but that only helps child 
processes whose environment stays the same; any child whose environment has 
changed would still have to pay the conversion penalty.
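
To make that layout concrete, here is a minimal sketch in Python (not Cygwin 
code). EnvPathCache, CachedVar, toy_convert and the variable name MYBIN are 
all hypothetical, and toy_convert is only a stand-in for the real mount table 
lookup. Note that the stored Windows value is still compared on every lookup, 
because the environment may have changed.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class CachedVar:
    is_path: bool  # the "path flag": does this variable hold a path?
    windows: str   # value as seen on the Windows side
    posix: str     # converted Cygwin (POSIX) value


class EnvPathCache:
    """Cache keyed by environment variable name (hypothetical design sketch)."""

    def __init__(self, convert: Callable[[str], str]):
        self._convert = convert
        self._entries: Dict[str, CachedVar] = {}
        self.conversions = 0  # counts how often the converter really ran

    def posix_value(self, name: str, windows_value: str, is_path: bool) -> str:
        entry = self._entries.get(name)
        # Reuse the entry only if neither the value nor the flag has changed.
        if (entry is not None and entry.windows == windows_value
                and entry.is_path == is_path):
            return entry.posix
        if is_path:
            posix = self._convert(windows_value)
            self.conversions += 1
        else:
            posix = windows_value  # non-path variables pass through unchanged
        self._entries[name] = CachedVar(is_path, windows_value, posix)
        return posix


def toy_convert(win_path: str) -> str:
    # Stand-in converter (assumption): handles only "X:\..." style paths and
    # ignores the mount table entirely.
    drive, rest = win_path[0], win_path[2:]
    return "/" + drive.lower() + rest.replace("\\", "/")


cache = EnvPathCache(toy_convert)
first = cache.posix_value("MYBIN", r"C:\Users\User123\bin", True)
second = cache.posix_value("MYBIN", r"C:\Users\User123\bin", True)  # cache hit
changed = cache.posix_value("MYBIN", r"D:\tools", True)             # reconverted
print(first, second, changed, cache.conversions)
```

Even on a hit the cache has to compare the full value, so the saving over 
simply converting again is limited to the mount table walk.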

-- 
Take care. Thanks, Brian Inglis              Calgary, Alberta, Canada

La perfection est atteinte                   Perfection is achieved
non pas lorsqu'il n'y a plus rien à ajouter  not when there is no more to add
mais lorsqu'il n'y a plus rien à retrancher  but when there is no more to cut
                                 -- Antoine de Saint-Exupéry

