This is the mail archive of the docbook@lists.oasis-open.org mailing list for the DocBook project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: re: converting to DocBook


* jonathon; <jblake@eskimo.com> on 14 Aug, 2002 wrote:

If I had that problem, I would convert as many of them as I could to HTML, run 'tidy' to clean up the HTML,
and then run the DocParse tool from www.commmandprompt.com to convert them to DocBook.
	That sounds like the easiest option, since the majority of them are
	in HTML format.   Tidy can get everything to the same version of HTML.

	Somewhere I have a perl script that converts plain ASCII to HTML
	2.0.  Tidy can clean up and upgrade the results to 4.01.

	Most of the formatted non-HTML, non-plain ASCII documents can be
	converted to HTML using whatever created them in that format.

	There is the irony of converting to HTML, then to DocBook, then back
	to HTML so that it can be seen on the web.

http://docbook.org/wiki/moin.cgi/Html2DocBook explains that HTML should
be converted to XHTML (which is xml) and then applying a stylesheet
given on the webpage xonvert XHTML to Docbook.

HTH
--

Togan Muftuoglu


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]