Extend stop-words-list (for Webhelp search)

Posted: Thu Mar 31, 2016 2:58 pm
by Silke
Hi,

Is there a way to extend the stop-word list for WebHelp output (index-1.js)?

I searched for an example from the stop list: "anderr".
- my content does not contain that string.
- the oxygen folder contains "anderr" in the de-DE.dic in words like "Wanderrucksack".
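
A substring check of this kind can be sketched as follows. This is only an illustration: the function name `words_containing` and the sample words other than "Wanderrucksack" are made up, and it does not read the actual de-DE.dic file.

```python
def words_containing(fragment, words):
    """Return the words that contain `fragment`, ignoring case."""
    needle = fragment.lower()
    return [word for word in words if needle in word.lower()]


# Hypothetical sample list; only "Wanderrucksack" is taken from the post.
sample_words = ["Wanderrucksack", "Rucksack", "Wanderung"]

print(words_containing("anderr", sample_words))
```

With this sample list, only "Wanderrucksack" matches, which fits the observation that "anderr" appears as a fragment of longer dictionary words rather than as a word of its own.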

In the webhelp-plugin folder there is a file, de_words.properties, which looks like the place the stop words come from, but "anderr" is not in there.

Thank you in advance
Silke

Re: Extend stop-words-list (for Webhelp search)

Posted: Fri Apr 01, 2016 4:34 pm
by bogdan_cercelaru
Hello,

Unfortunately, you cannot change the stop-word list in the current implementation of WebHelp.
I have registered your request in our issue tracking system so that it can be analyzed.

Regards,
Bogdan

Re: Extend stop-words-list (for Webhelp search)

Posted: Tue May 16, 2017 3:13 pm
by ckabstein
Hi,

Given that the new WebHelp search now does the following:

"Always search for words containing three or more characters (shorter words, such as to or of are ignored). This rule does not apply to CJK (Chinese, Japanese, Korean) languages."

and

"To improve performance, the Search feature excludes certain stop words. For example, the English version of such stop words include: a, an, and, are, as, at, be, but, by, for, if, in, into, is, it, no, not, of, on, or, such, that, the, their, then, there, these, they, this, to, was, will, with."
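
To make the two quoted rules concrete, here is a minimal sketch of how I read them. It is not the actual WebHelp implementation: the function names, the Unicode ranges used to detect CJK text, and the choice to still apply the stop-word list to CJK terms are all assumptions for illustration.

```python
import re

# English stop words as quoted from the documentation above.
ENGLISH_STOP_WORDS = {
    "a", "an", "and", "are", "as", "at", "be", "but", "by", "for", "if",
    "in", "into", "is", "it", "no", "not", "of", "on", "or", "such", "that",
    "the", "their", "then", "there", "these", "they", "this", "to", "was",
    "will", "with",
}

# Assumption: a rough CJK test covering kana, common Han ideographs and Hangul.
_CJK_PATTERN = re.compile(r"[\u3040-\u30ff\u3400-\u4dbf\u4e00-\u9fff\uac00-\ud7af]")


def contains_cjk(word):
    """Return True if the word contains at least one CJK character."""
    return bool(_CJK_PATTERN.search(word))


def filter_query_terms(query, stop_words=ENGLISH_STOP_WORDS, min_length=3):
    """Keep only the query terms that the quoted rules would search for."""
    kept = []
    for word in query.lower().split():
        # Rule 1: ignore words shorter than three characters, except CJK.
        if not contains_cjk(word) and len(word) < min_length:
            continue
        # Rule 2: ignore stop words (assumed here to apply to all languages).
        if word in stop_words:
            continue
        kept.append(word)
    return kept


print(filter_query_terms("How to extend the stop words list"))
```

In this sketch, "to" is dropped by the length rule and "the" by the stop-word list, while a two-character CJK term would pass the length check.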

I would be interested to know if it's now possible to
a) extend or reduce the list of stop words and where we could do this.
b) add more lists for other languages.

And finally, which rules apply to CJK languages exactly? And where are these set?

Thanks,
Christina

Re: Extend stop-words-list (for Webhelp search)

Posted: Wed May 17, 2017 12:56 pm
by ionela
Hi Christina,

Unfortunately, the stop-words-list is not configurable.
The stop words are computed dynamically depending on the language you have chosen when you publish your documentation. They are computed by the search indexer and written in the out/webhelp-responsive/oxygen-webhelp/search/index-1.js file:

We also discussed this in the following topic on our forum:
post42687.html#p42687

Regards,
Ionela

Re: Extend stop-words-list (for Webhelp search)

Posted: Wed May 17, 2017 1:01 pm
by ckabstein
Hi Ionela,

Sorry, I didn't see that post. Thanks for pointing me towards that one.

Best,
Christina