Searching algorithms in WebHelp
Searching algorithms in WebHelp
Post by ann.jensen »
Hello,
I am trying to get a better understanding of how to use the Search in Oxygen WebHelp and what kind of results I can expect to get.
If I type in the words "This task was prevented from being added" (without the quotes, obviously), I see groupings of results where the first grouping is titled:
Results for: from, prevented, task
Does the algorithm remove the words 'this', 'was', 'being' and 'added' and, if so, what is the logic for doing this?
Any advice appreciated,
Regards,
Ann
			
			
									
									
Re: Searching algorithms in WebHelp
Post by bogdan_cercelaru »
Hello,
First of all, the WebHelp search does not support exact match searching.
Secondly, results depend on the content being searched. The words listed after the "Results for:" label are the words that were found in the index of the content, not the words you searched for.
In addition, there is a set of words, called "stop words", that the content indexer filters out. You can find the list in the OUTPUT/oxygen-webhelp/search/index-1.js file.
Regards,
Bogdan
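
To make the behaviour described above more concrete, here is a minimal Python sketch of an index that drops stop words and a "Results for:" label built only from query words present in that index. It is not the WebHelp implementation: the stop-word list, the sample topics, and the function names (tokenize, build_index, results_for) are all hypothetical. In particular, it assumes that 'added' is missing from the label simply because no topic contains it, which is one reading consistent with the reply above.

```python
import re

# Hypothetical stop-word list for illustration only. The real list is in
# OUTPUT/oxygen-webhelp/search/index-1.js and may differ.
STOP_WORDS = {"this", "was", "being", "the", "a", "is"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


def build_index(topics):
    """Map each non-stop word to the set of topic names that contain it."""
    index = {}
    for name, text in topics.items():
        for word in tokenize(text):
            if word not in STOP_WORDS:
                index.setdefault(word, set()).add(name)
    return index


def results_for(query, index):
    """Return the query words that are present in the index, sorted."""
    return sorted({word for word in tokenize(query) if word in index})


# Made-up sample content standing in for the published topics.
topics = {
    "topic1": "The task was prevented from starting.",
    "topic2": "Remove a task from the queue.",
}
index = build_index(topics)

query = "This task was prevented from being added"
print("Results for:", ", ".join(results_for(query, index)))
```

With these sample topics, 'this', 'was' and 'being' never reach the index because they are in the assumed stop-word list, and 'added' is dropped because it does not occur in any topic, so only 'from', 'prevented' and 'task' appear in the label.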
			
			
									
Bogdan Cercelaru
<oXygen/> XML Editor, Schema Editor and XSLT Editor/Debugger
http://www.oxygenxml.com