Endeca Pipeline - Source Data manipulation

An Endeca pipeline supports three types of manipulators:
1) Record Manipulator
- Uses XML expressions
- Very fast and highly scalable, but limited in functionality

2) Java Manipulator
- Uses the Java language
- Slower than the Record Manipulator, but the most flexible of the three, and Java is a widely known language

3) Perl Manipulator
- Uses the Perl language
- Slowest of the three, but very flexible and convenient for string manipulation
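
To give a feel for the kind of per-record logic that usually ends up in a Java manipulator, here is a minimal sketch in plain Java. It does not use the Endeca API: the record is modeled as a simple map of property name to value, and the class name `RecordSketch`, the method `manipulate`, and the property names `title`, `path` and `extension` are all hypothetical, chosen only for illustration.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical stand-in for a pipeline record: property name -> value.
// This is NOT the Endeca record API; it only illustrates per-record logic.
final class RecordSketch {

    // Returns a cleaned copy of the record: trims and collapses whitespace in
    // every value, and adds a derived "extension" property taken from "path".
    // Assumes all property values are non-null.
    static Map<String, String> manipulate(Map<String, String> record) {
        Map<String, String> out = new LinkedHashMap<>();
        for (Map.Entry<String, String> e : record.entrySet()) {
            out.put(e.getKey(), e.getValue().trim().replaceAll("\\s+", " "));
        }
        String path = out.get("path");
        if (path != null) {
            int slash = path.lastIndexOf('/');
            int dot = path.lastIndexOf('.');
            // Only accept a dot that sits in the last path segment
            // and is followed by at least one character.
            if (dot > slash && dot < path.length() - 1) {
                out.put("extension", path.substring(dot + 1).toLowerCase(Locale.ROOT));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, String> record = new LinkedHashMap<>();
        record.put("title", "  Endeca   pipeline  ");
        record.put("path", "/styles/site.CSS");
        System.out.println(manipulate(record));
    }
}
```

In a real pipeline this logic would sit inside whatever class the manipulator component is configured to call. The same cleanup may be possible with Record Manipulator expressions or in Perl, with the speed and flexibility trade-offs listed above.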

2 comments:

  1. Hi Ravi,

    Can you please let me know where to add manipulators in a forgeless process? My requirement is to filter out web crawler output such as .css and .png files, not while parsing but after the records are generated.

    Thanks,
    daliya
