This is the mail archive of the glibc-bugs-regex@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[Bug regex/5028] New: segfault in regex engine when using C collation and multibyte chars


glibc's regular expression engine crashes with segmentation fault whenever 
trying to compile a regular expression with non-ASCII range expressions if
LC_CTYPE is a UTF-8 locale but LC_COLLATE is set to C or POSIX. This
is a common setup among users who want traditional codepoint-order
sorting but need UTF-8 encoding.

$ LANG=en_US.UTF-8 LC_COLLATE=C grep '[Á-ä]'
Segmentation fault

This is also reproducible with a simple C program.

-- 
           Summary: segfault in regex engine when using C collation and
                    multibyte chars
           Product: glibc
           Version: unspecified
            Status: NEW
          Severity: normal
          Priority: P2
         Component: regex
        AssignedTo: drepper at redhat dot com
        ReportedBy: aurelien at aurel32 dot net
                CC: glibc-bugs-regex at sources dot redhat dot com,glibc-
                    bugs at sources dot redhat dot com
 GCC build triplet: x86_64-unknown-linux-gnu
  GCC host triplet: x86_64-unknown-linux-gnu
GCC target triplet: x86_64-unknown-linux-gnu


http://sourceware.org/bugzilla/show_bug.cgi?id=5028

------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]