This is the mail archive of the docbook-apps@lists.oasis-open.org mailing list .


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: Choosing a characterset for DocBook


On Fri, 15 Mar 2002, Jirka Kosek wrote:

> Jens Stavnstrup wrote:
> 
> > Now I am going to release my colleague on the document. They are going to
> > use a myriad of windows editors (Word, Notepad, etc in different language
> > versions), and I predict this is going to cause a lot of problems.
> > 
> > Does anybody have any experience with this issue ?
> 
> In Czech we have much more problems with encoding than you, believe me.
> ;)
>  

I have no doubt, you have my deepest sympaty (But we are writing in 
English, so normally we shouldn't have any problems. !


> > Should  I chose another encoding UTF-8 or what ? And then why ?
> 
> If your documents will contain a lot of character outside of ISO Latin 1
> or ASCII using UTF-8 is best choice, assuming that all editors used can
> deal with UTF-8.
> 

Not really, the problem is basically, that Word, which might be used to 
to edit the XML sources, kindly add invinsible characters to my document. 
And this might cause problems for my colleagues.



> If your documents contain only ISO Latin 1 characters, you can encode
> your documents in iso-8859-1 and add following line as a first line of
> your XML document:
> 
> <?xml version="1.0" encoding="iso-8859-1"?>
> 
> After that change all common processing tools (including Saxon) will be
> able to cope with your documents. Support for ISO Latin 1 is still
> better than support for UTF-8 in many editors.
> 


Yes, that was my impression as well. However lately,  I am beginning to 
doubt it. 

E.g. see


   http://www.cl.cam.ac.uk/~mgk25/uicode.html


Regards


Jens


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]