This is the mail archive of the xsl-list@mulberrytech.com mailing list .


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]

Re: How to read the encoding of an XML document


At 08:30 PM 10/25/2001 +0100, David Carlisle wrote:
 >> If I no longer know what my original XML document was encoded as, how do I
> > know the appropriate encoding set to specify for the output?
>
>every xml application is mandated to support at least the utf8 and utf16
>encodings, so either of those is always appropriate  (or at least
>acceptable) whatever the original encoding of the file.

Ok.  If you recall, I started this discussion by mentioning that I am 
receiving XML documents from several European countries.  So the pertinent 
question for me is "if UTF-8 and/or UTF-16 will be the output encoding set 
I must use, will they handle charcters from the languages I care about?"

I found this statement on unicode.org:

"What Characters Does the Unicode Standard Include?  The Unicode Standard 
defines codes for characters used in the major languages written today. 
Scripts include the European alphabetic scripts, Middle Eastern 
right-to-left scripts, and scripts of Asia."

So it seems to me that I should be safe outputing my data to UTF-16.  That 
make sense?

Thanks much, David and Mike.

--James Garriss


 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]