[XSL-LIST Mailing List Archive Home] [By Thread] [By Date]

Re: [xsl] use XSLT or XQuery in Saxon?

Subject: Re: [xsl] use XSLT or XQuery in Saxon?
From: Raffaele Sena <raff@xxxxxxxxxxxx>
Date: Wed, 5 Jan 2005 20:51:13 -0800

Are you trying to do all this using the standalone XSLT or XQuery processor or are you thinking of writing a Java application that queries the directory content to get the small files ? (otherwise if the big file has no reference to the small files you can't "load" them).

In any case if memory is not a problem (both the XSLT and the XQuery processor operate to the in-memory representation of the full file) I would say use the language you are more familiar with. You should be able to do what you want in both.

If memory is a problem you may want to look at the streaming APIs in "nux" (http://dsd.lbl.gov/nux/). You could use XPath to scan the big file and XQuery to match the single node with the small files. It uses Saxon as the XPath/XQuery processor. It may not be powerful enough (in order to "stream" it limits the XPath functionalities that you can use) but it's worth a look.

-- Raffaele

On Jan 5, 2005, at 2:53 PM, alan m wrote:

I did a post of this in another forum but did not yet
get a reply.

I have extremely large (over 300 MB) XML file and tens
of thousands of small xml files generated after
applying various XSLT on the one big XML file.

I am using Saxon for XSLT and will be using it also
for XQuery.

Is Xquery or XSLT is better solution for this problem?
Query each text node in the big xml file and verify
that this content is present in one of the results xml
files. Based on this information generate a report
that shows which content is present and in which file
and in a separate section which content was not found
in result xml files and also show this content parent
element or other markup to indicate its position in
the big xml file.

All the small xml files are stored as flat files in
various directories on Windows File system although
most files are in one directory. The big XML file is
fairly complex with multiple levels of nesting

Any comments or suggestions?
Thank you

__________________________________ Do you Yahoo!? Yahoo! Mail - now with 250MB free storage. Learn more. http://info.mail.yahoo.com/mail_250


Current Thread