xml-xindice-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Murray Altheim <m.alth...@open.ac.uk>
Subject Re: DTD declarations in XML docs
Date Thu, 04 Sep 2003 17:01:00 GMT
Perry Molendijk wrote:
> Hello
>  
> Recently there has been a thread in regards to expanding DTD comments. 
> There were quite a few suggestions as to what could be done to resolve 
> the problem and I think the original "poster" did get it to work. 
> However, I think I tried every combination of suggested solutions and 
> Xindice versions and I still can't get it to work.
>  
> What I would like to achieve is to add a document to Xindice with for 
> example the Simple DocBook DTD declaration in the header:
>  
> <!DOCTYPE article PUBLIC "-//OASIS//DTD Simplified DocBook XML V1.0//EN"
> "http://www.oasis-open.org/docbook/xml/simple/1.0/sdocbook.dtd">
>  
> And then to to retrieve it just like that.
>  
> Because the DTD I want to use is not 100% stable I don't want to go the 
> way of normalising the DTD yet.
>  
> I am not too fussed about which version of Xindice it will work in 1.0 
> or CVS.

Someone should correct me if I'm wrong, but my impression of having used
and developed using Xindice 1.0 is that it stores XML document instances,
and only XML document instances. While that may seem a rather curious
statement, it's not in the context of XML, which has both the concept of
a prolog and and a document instance (something sadly lost on much of the
current output from the W3C). The prolog is the "pre-processor" information
given to the processor/parser prior to encountering the document instance,
such as entity declarations, and of course the document element for the
given document and the schema information, i.e., the external subset or
DTD.

Xindice is only storing the document instance, not the prolog, nor the
information about the external subset, which is what you want. IOW, you
want to store the public and system identifiers as well as the specified
document element, these are the three things found in a typical DOCTYPE
declaration.

What I've done in my own application is use the XNode API to store this
as metadata with the document instance. XNode acts as a SOAP-like wrapper
around the instance, allowing a location for such metadata without
polluting the document instance.

On the subject of XNode, Brian Behlendorf has suggested I take the XNode
API to the Incubator group to see if we can bring it into Apache properly.
I've made some substantial modifications to XNode to allow for embedded
XML metadata (e.g., RDF) as well as a listener support, which I've found
quite valuable in inter-app communication.

Anyway, that's the short and long of it.

Murray

...........................................................................
Murray Altheim                         http://kmi.open.ac.uk/people/murray/
Knowledge Media Institute
The Open University, Milton Keynes, Bucks, MK7 6AA, UK                    .

   "There was no [sign of] shooting, no bullet inside her body, no
   stab wound - only road traffic accident. [...] "It was like a
   Hollywood film. They cried 'go, go, go', with guns and blanks
   without bullets, blanks and the sound of explosions. They made
   a show for the American attack on the hospital - action movies
   like Sylvester Stallone or Jackie Chan."
   http://news.bbc.co.uk/1/hi/programmes/correspondent/3028585.stm

   The American soldier Jessica Lynch, who was rescued by US special
   forces after being taken prisoner in Iraq, has signed a $1m book
   deal with publisher Alfred A Knopf.
   http://news.bbc.co.uk/1/hi/world/americas/3076624.stm

   "This story is Mission: Impossible, but it's real," an official told
   the paper. "It's uplifting, heroic, compelling and dramatic. "You see
   this sort of thing in spy movies and wonder if it's really true.
   Now we know it is true."
   http://news.bbc.co.uk/1/hi/entertainment/film/2938589.stm


Mime
View raw message