sed converts 8-bit input text to 16-bit (Unicode-16?) characters - how to suppress that?

Michael Moser michael.moser@sunrise.ch
Mon Mar 30 17:41:00 GMT 2009


> -----Original Message-----
> From: cygwin-owner@cygwin.com 
> [mailto:cygwin-owner@cygwin.com] On Behalf Of Dave Korn
> Sent: Montag, 30. März 2009 14:46
> To: cygwin@cygwin.com
> Subject: Re: sed converts 8-bit input text to 16-bit 
> (Unicode-16?) characters - how to suppress that?
> > ...
> Another possibility is that wordpad or notepad has tried to 
> be clever and gone and unexpectedly saved the original source 
> file in UTF16.  Did you verify the original source file in a 
> hexeditor too, Michael?

Yes - I did. The input is stricly one byte/octet per character and
starts with 
"4e 61 6d 65 09 ..."  (= "Name<TAB>...")

The output starts with:
"ff fe 4e 00 61 00 6d 00 65 00 09 00 ... "


Michael


--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/



More information about the Cygwin mailing list