[XSL-LIST Mailing List Archive Home] [By Thread] [By Date]

Re: Formatting Objects considered harmful

Subject: Re: Formatting Objects considered harmful
From: Håkon Wium Lie <howcome@xxxxxxxxxxxxxxxxx>
Date: Sun, 25 Apr 1999 14:49:42 +0200 (MET DST)

Stephen Deach wrote:

 > This whole debate seems to miss several key points:
 >  1.) Stripping the original semantics is NOT ALWAYS harmful:
 >      a.) Storing any derived form of original content
 >          that strips the semantic meaning MAY be harmful. If one
 >          actually needs the original semantics, it probably is harmful.


 >      b.) Conversely, almost all books contain significant derived information.
 >          The editing and filtering (authoring) process often ADDS value,
 >          yet it clearly strips all the original semantics.

The loss of semantics during a book's editing/proof-reading process is
a bug, not a feature. I have recently completed such a process and
insisted that the publisher retained semantics in all steps. As a
result, the book will probably be available in enriched HTML before
it's on paper. (An Adobe product -- FrameMaker -- formed the backbone
of this process.)

 >  I can't unconditionally side with a) or with b), it depends on what I am
 > looking for in producing or in using the document.

So, you are saying that publishing formatting objects is the right
solution in some cases?

 >  2.) You can't determine the semantics from a DTD alone, you can only
 > derive the syntax and the allowed organization (of both elements (records,
 > fields, objects) and attributes (properties, qualifiers, modifiers,
 > constraints, values).

Correct. That's why my document says:

 "Publishing semantically rich XML should be encouraged when the
  semantics is globally known, e.g. MathML. Publishing arbitrary XML
  should be discouraged."

[1] http://www.operasoftware.com/people/howcome/1999/foch.html

 >  3.) A tagset (such as XSL's FOs) which is public, stable, and well
 > understood (with published semantics) is FAR better than a proprietary
 > encoding or a proprietary tagset (without published semantics).

Perhaps, but they're both bad when used on the Web. The right solution
is to publish documents in tagsets that are:

 - globally understood
 - semantic, rather than presentational

 >  4.) XML allows the definition of application specific tagsets. XSL's FO
 > tagset is designed solely for describing paginated and non-paginated
 > presentations, it is no more dangerous than any other tagset.

For reasons outlined in the paper, I believe XFO is much more
dangerous than other, more abstract tagsets developed within W3C.

I also think it's a significant departure from SGML's tradition of
capturing semantics, not presentation.


Håkon Wium Lie             http://www.operasoftware.com/people/howcome
howcome@xxxxxxxxxxxxxxxxx                      simply a better browser

 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list

Current Thread