Endeca Stop words
Stop words are words that are ignored by the MDEX Engine when the words are part of a keyword search.
Where can I find the OOTB stop words?
Endeca 3.1.X
/opt/apps/endeca/apps/CRS/config/CRS.stop_words.xml
Endeca 11.X
/opt/app/endeca/apps/CRS/config/mdex/CRS.stop_words.xml
<?xml version="1.0" encoding="UTF-8" standalone="no" ?>
<!DOCTYPE STOP_WORDS SYSTEM "stop_words.dtd">
<STOP_WORDS>
<STOP_WORD>a</STOP_WORD>
<STOP_WORD>do</STOP_WORD>
<STOP_WORD>me</STOP_WORD>
<STOP_WORD>when</STOP_WORD>
<STOP_WORD>about</STOP_WORD>
<STOP_WORD>find</STOP_WORD>
<STOP_WORD>not</STOP_WORD>
<STOP_WORD>where</STOP_WORD>
<STOP_WORD>above</STOP_WORD>
<STOP_WORD>for</STOP_WORD>
<STOP_WORD>or</STOP_WORD>
<STOP_WORD>why</STOP_WORD>
<STOP_WORD>an</STOP_WORD>
<STOP_WORD>from</STOP_WORD>
<STOP_WORD>over</STOP_WORD>
<STOP_WORD>with</STOP_WORD>
<STOP_WORD>and</STOP_WORD>
<STOP_WORD>have</STOP_WORD>
<STOP_WORD>show</STOP_WORD>
<STOP_WORD>you</STOP_WORD>
<STOP_WORD>any</STOP_WORD>
<STOP_WORD>how</STOP_WORD>
<STOP_WORD>the</STOP_WORD>
<STOP_WORD>your</STOP_WORD>
<STOP_WORD>are</STOP_WORD>
<STOP_WORD>I</STOP_WORD>
<STOP_WORD>under</STOP_WORD>
<STOP_WORD>can</STOP_WORD>
<STOP_WORD>is</STOP_WORD>
<STOP_WORD>what</STOP_WORD>
</STOP_WORDS>
You can add application-specific stop words to the file mentioned above and then run an Endeca baseline update so the change is applied to the index.
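As a rough sketch, a customized file might look like the one below. The words "buy" and "please" are hypothetical examples chosen only for illustration, not OOTB entries; which words make sense depends on your catalog and on what your users actually search for. Only a few of the OOTB entries are repeated here to keep the sketch short.

```xml
<?xml version="1.0" encoding="UTF-8" standalone="no" ?>
<!DOCTYPE STOP_WORDS SYSTEM "stop_words.dtd">
<STOP_WORDS>
<!-- OOTB entries (subset shown; keep the full shipped list in the real file) -->
<STOP_WORD>a</STOP_WORD>
<STOP_WORD>the</STOP_WORD>
<STOP_WORD>what</STOP_WORD>
<!-- Application-specific entries: hypothetical examples, not part of the OOTB list -->
<STOP_WORD>buy</STOP_WORD>
<STOP_WORD>please</STOP_WORD>
</STOP_WORDS>
```

Each new word goes in its own STOP_WORD element inside the existing STOP_WORDS root. The change takes effect only after the next baseline update.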
Hello Ravi,
Can we configure stop words and stemming without Dev Studio? If yes, could you please help with the steps?
Hi Mukesh,
You can edit the file directly on the index box. Example locations are below:
Endeca 3.1.X
/opt/apps/endeca/apps/CRS/config/CRS.stop_words.xml
Endeca 11.X
/opt/app/endeca/apps/CRS/config/mdex/CRS.stop_words.xml
After adding the stop words, run a baseline update to apply them to the index.
Regards,
Ravi
Hi Ravi,
Thank you very much. It works perfectly.
Mukesh
Hi Ravi,
We need to do a dimension search, but a few of the words in it are configured as stop words. Is there a way to make Endeca ignore stop words for a specific query?
Thanks
Hi Ravi,
We have implemented both wildcard search and stop words in our Endeca search, but when wildcard search is enabled the stop words functionality does not work. Is there a solution that lets both work together?
Thanks