[XSL-LIST Mailing List Archive Home] [By Thread] [By Date]

=?windows-1252?Q?Re=3A_=5Bxsl=5D_Does_=27Lec=9Cur=27_occur_in_=24text=3F_Do_you_have?= =?windows-1252?Q?_a_multi=2Dfactor_XPath_solution=3F?=


Subject: Re: [xsl] Does 'Lecœur' occur in $text? Do you have a multi-factor XPath solution?
From: Wolfgang Laun <wolfgang.laun@xxxxxxxxx>
Date: Sat, 19 Jan 2013 09:06:40 +0100

If the "slighly misspelled" is due to writing after hearing,
something like the Soundex algorithm is one possible solution.
The algorithm depends on the speaker's language. There
are a few variants for English. Other languages might be
found on the internet.
-W

On 18/01/2013, Costello, Roger L. <costello@xxxxxxxxx> wrote:
> Hi Folks,
>
> I want to determine if 'Lecur' occurs in $text.
>
> A naove solution is this XPath expression:
>
>       contains($text, 'Lecur')
>
> However, that does not take into account many important factors:
>
> 1. Perhaps 'Lecur' occurs, but in $text it is in uppercase
>
> 2. Perhaps instead of the '' ligature, $text uses 'oe'
>
> 3. Perhaps in $text 'Lecur' is split over two lines and thus is hyphenated
>
> 4. Perhaps 'Lecur' is slightly misspelled in $text and therefore requires
> fuzzy matching
>
> And there are many other important factors.
>
> Do you have an XPath solution to this problem that takes into account the
> many important factors?
>
> /Roger


Current Thread
Keywords