This is the mail archive of the docbook-apps@lists.oasis-open.org mailing list.



Re: [docbook-apps] Re: xml/groff: comments in text=>whitespace stripped?!


> * In message <20030606142741.GA24261@redhat.com>
> * On the subject of "Re: [docbook-apps] Re: xml/groff: comments in text=>whitespace stripped?!"
> * Sent on Fri, 6 Jun 2003 15:27:41 +0100
> * Honorable Tim Waugh <twaugh@redhat.com> writes:
>
> On Fri, Jun 06, 2003 at 09:56:17AM -0400, Sam Steingold wrote:
> 
> > I would argue that this is incorrect.
> > Since newline is syntactically significant in groff,
> > arbitrarily removing it is a bug.
> 
> But then the following would have all spaces preserved:
> 
>      <para>Lorem ipsum dolor sit amet.  Lorem ipsum dolor sit amet.
>        Lorem ipsum dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem
>        ipsum dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem ipsum
>        dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem ipsum dolor
>        sit amet.</para>
> 
> And that would create:
> 
> Lorem ipsum dolor sit amet.  Lorem ipsum dolor sit amet.
>        Lorem ipsum dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem
>        ipsum dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem ipsum
>        dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem ipsum dolor
>        sit amet.
> 
> rather than:
> 
> Lorem ipsum dolor sit amet.  Lorem ipsum dolor sit amet.
> Lorem ipsum dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem
> ipsum dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem ipsum
> dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem ipsum dolor
> sit amet.
> 
> which is actually what you want.

Good point.

There are two ways around this:

1. remove only the leading spaces on each line (but keep the NLs), or

2. do the space normalization _before_ the lower-level processing,
   i.e., _first_ call normalize-space on the raw XML text, and only
   _then_ call apply-templates.
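
To make the difference between the two options concrete, here is a
rough Python sketch (not stylesheet code; the function names are made
up for this illustration, and the second one only imitates what XPath
normalize-space() does to a string):

```python
import re

def strip_leading_spaces(text):
    """Option 1: drop the indentation of every line, keep the newlines."""
    return re.sub(r"(?m)^[ \t]+", "", text)

def normalize_space(text):
    """Option 2: trim the ends and collapse each whitespace run to one space."""
    return re.sub(r"[ \t\r\n]+", " ", text.strip(" \t\r\n"))

sample = "Lorem ipsum dolor sit amet.  Lorem\n       ipsum dolor sit amet."

print(repr(strip_leading_spaces(sample)))
print(repr(normalize_space(sample)))
```

Option 1 leaves the line structure (and the double space after the
period) alone, which matters if newlines are significant to groff;
option 2 flattens everything into a single line.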

PS. "Lorem ipsum dolor sit amet" sounds Latin (and is Greek to me).
    What does it mean?

-- 
Sam Steingold (http://www.podval.org/~sds) running RedHat9 GNU/Linux
<http://www.camera.org> <http://www.iris.org.il> <http://www.memri.org/>
<http://www.mideasttruth.com/> <http://www.palestine-central.com/links.html>
Whom computers would destroy, they must first drive mad.


