[XSL-LIST Mailing List Archive Home] [By Thread] [By Date]

Re: [xsl] Recursive grouping - simple, XSLT 1.0, fast non-Muenchian grouping method

Subject: Re: [xsl] Recursive grouping - simple, XSLT 1.0, fast non-Muenchian grouping method
From: "Sergiu Ignat" <sergiu.ignat@xxxxxxxx>
Date: Tue, 21 Dec 2004 19:54:26 +0200

Hello everyone.
Many thanks to all the people that reviewed this algorithm. Special thanks
to Dimitre Novachev for performed tests.

From: "Michael Kay" <mike@xxxxxxxxxxxx>
> The use of xsl:key is essential to Muenchian grouping, so there seems to
be
> some misunderstanding here.
Muenchian grouping without key is presented in
http://www.jenitennison.com/xslt/grouping/muenchian.html and the example is
contact[not(surname = preceding-sibling::contact/surname)]
It is of course a very slow method.

The recursion grouping algorithm was born from ignorance. I didn't use
xsl:key because I was not aware of its implementation. I thought it is
evaluated every time key() function is called. I did not know that it
creates an index, so I did not expect any performance improvment. I just
tried to find a faster algorithm, and it works faster than Muenchian without
xsl:key .

Howether, later M. David Peterson found it usefull to group data from
external data sources. This is indeed a problem hard to solve using
Muenchian method (or maybe I am wrong). Wendell Piez found it easy to
understand for the newbie and fast enough for many real life situations.
There may be also cases when grouping rules are too difficult to described
using xsl:key, and recursion grouping will be an easyer solution.

So I hope it will find its niche and will be useful for many people.

Best regards,
Sergiu Ignat

>
> >
> > The advantage of recursive grouping is that it is possible to define
> > grouping rules that are much more complex than one accepted by the use
> > attribute of xsl:key or by generate-id() function. For
> > example if in a list
> > of items that have attributes price and quantity the grouping
> > should be done
> > by price*quantity.
>
> Muenchian grouping makes it easy to group on the result of any path
> expression, e.g.
>
> <xsl:key name="m" match="order-line" group="@price * @qty"/>
>
> If there is a limitation, it is that Muenchian grouping is inconvenient
when
> the node-set to be grouped is anything other than "all nodes in a single
> document that match pattern P": that is, it's inconvenient when the
> population spans multiple documents, or when it is scoped to a particular
> subtree within a document.
>
> >
> > A recoursive grouping with xsl:key is presented below.
> >
>
> The downside here is:
>
> >     <xsl:with-param name="list"
> > select="$list[not(@author=$group-identifier)]"/>
>
> which performs a serial search of the grouping population (actually, on
> average, half the population) once for each distinct value of the grouping
> key. The algorithm therefore has order (at least) O(P*G) where P is the
size
> of the population and G the number of groups. In applications where the
> number of groups increases with the population (e.g. grouping employees by
> surname) this is effectively O(N^2).
>
> I agree that the algorithm is viable in cases where the number of groups
is
> small and almost fixed, e.g. grouping sales by continent.
>
> But of course if the number of groups is completely fixed, the simplest
> approach is:
>
> for-each continent
>   for-each P[continent=current()]
>
>
> Michael Kay
> http://www.saxonica.com/

Current Thread
Re: [xsl] Is there equivalence of max(node-set) of XForm in XSLT 1.0, (continued) Mukul Gandhi - Mon, 27 Dec 2004 00:39:56 -0800 (PST) Michael Kay - Mon, 20 Dec 2004 21:03:25 -0000 M. David Peterson - Mon, 20 Dec 2004 14:27:35 -0700 Wendell Piez - Mon, 20 Dec 2004 16:41:38 -0500 Sergiu Ignat - Tue, 21 Dec 2004 19:54:26 +0200 <= M. David Peterson - Tue, 21 Dec 2004 13:30:18 -0700 Dimtre Novatchev - Tue, 21 Dec 2004 20:09:22 +1100 Dimtre Novatchev - Tue, 21 Dec 2004 22:42:06 +1100

Current Thread

Re: [xsl] Is there equivalence of max(node-set) of XForm in XSLT 1.0, (continued)

<- Previous	Index	Next ->
RE: [xsl] Recursive grouping - simp, Wendell Piez	Thread	Re: [xsl] Recursive grouping - simp, M. David Peterson
Re: [xsl] nested output, David Carlisle	Date	Re: [xsl] nested output, Bruce D'Arcus
	Month

Re: [xsl] Recursive grouping - simple, XSLT 1.0, fast non-Muenchian grouping method

Products

Features

Shop

Resources

Support

Company