sed converts 8-bit input text to 16-bit (Unicode-16?) characters - how to suppress that?
Corinna Vinschen
corinna-cygwin@cygwin.com
Mon Mar 30 12:36:00 GMT 2009
On Mar 30 13:48, Michael Moser wrote:
> I need to mangle a file containing "8-bit ASCII" characters (i.e. the
> file contains also characters in the upper 8-bit range, namely a few
> umlauts as well as some french accented characters).
>
> Strange enough, the SED version that came as part of cygwin emits the
> result of the mangling using 16-bit characters (I believe those are
> Unicode-16 characters, but not sure. The Hexeditor shows each second
> byte as always 00, execpt for the first two bytes which read FF FE).
This is very likely not Cygwin's sed. Do you have another sed in $PATH
by any chance? I tried with input files containing german umlauts and
sed does not convert to wide char and it does not produce a BOM marker
at the start of the file.
Corinna
--
Corinna Vinschen Please, send mails regarding Cygwin to
Cygwin Project Co-Leader cygwin AT cygwin DOT com
Red Hat
--
Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple
Problem reports: http://cygwin.com/problems.html
Documentation: http://cygwin.com/docs.html
FAQ: http://cygwin.com/faq/
More information about the Cygwin
mailing list