Endeca - Stop words

Endeca Stop words
Stop words are words that are ignored by the MDEX Engine when the words are part of a keyword search.

Where I can find OOTB Stop words?
Endeca 3.1.X
/opt/apps/endeca/apps/CRS/config/CRS.stop_words.xml
Endeca 11.X
/opt/app/endeca/apps/CRS/config/mdex/CRS.stop_words.xml

<?xml version="1.0" encoding="UTF-8" standalone="no" ?>
<!DOCTYPE STOP_WORDS SYSTEM "stop_words.dtd">
<STOP_WORDS>

  <STOP_WORD>a</STOP_WORD>
  <STOP_WORD>do</STOP_WORD>
  <STOP_WORD>me</STOP_WORD>
  <STOP_WORD>when</STOP_WORD>
  <STOP_WORD>about</STOP_WORD>

  <STOP_WORD>find</STOP_WORD> 
 <STOP_WORD>not</STOP_WORD>
  <STOP_WORD>where</STOP_WORD>
  <STOP_WORD>above</STOP_WORD>

  <STOP_WORD>for</STOP_WORD>
  <STOP_WORD>or</STOP_WORD>
  <STOP_WORD>why</STOP_WORD>
  <STOP_WORD>an</STOP_WORD>
  <STOP_WORD>from</STOP_WORD>
  <STOP_WORD>over</STOP_WORD>
  <STOP_WORD>with</STOP_WORD>
  <STOP_WORD>and</STOP_WORD>
  <STOP_WORD>have</STOP_WORD>
  <STOP_WORD>show</STOP_WORD>
  <STOP_WORD>you</STOP_WORD>
  <STOP_WORD>any</STOP_WORD>
  <STOP_WORD>how</STOP_WORD>
  <STOP_WORD>the</STOP_WORD>
  <STOP_WORD>your</STOP_WORD>
  <STOP_WORD>are</STOP_WORD>
  <STOP_WORD>I</STOP_WORD>
  <STOP_WORD>under</STOP_WORD>
  <STOP_WORD>can</STOP_WORD>
  <STOP_WORD>is</STOP_WORD>
  <STOP_WORD>what</STOP_WORD>
</STOP_WORDS>

 You can add application specific stop words in above mentioned file and run Endeca baseline update.

5 comments:

  1. Hello Ravi,

    can we configure stop word and stemming without having Dev Studio.If yes, could you please help with steps.

    ReplyDelete
    Replies
    1. Hi Mukesh,

      You can edit file directly on the index box and example location would be as below:
      Endeca 3.1.X
      /opt/apps/endeca/apps/CRS/config/CRS.stop_words.xml
      Endeca 11.X
      /opt/app/endeca/apps/CRS/config/mdex/CRS.stop_words.xml

      After adding stop words run baseline update to apply it on index.

      Regards,
      Ravi


      Delete
  2. Hi Ravi,

    Thank you very much. It works perfect.

    Mukesh

    ReplyDelete
  3. Hi Ravi,

    We need to do dimension search but a few words in the dimension search are configured as stop words. Is there a way to make endeca ignore stop words for a specific query?

    Thanks

    ReplyDelete
  4. Hi Ravi,

    We have implemented both wild card search and stop words functionality in endeca search,But if we have wildcard search then stop word functionality is not working.Is there any solution to achieve both functionality?


    Thanks

    ReplyDelete