The Search And Replace Processor replaces all substrings in the given columns matching the given regular expression.
The processor can operate on any dataset.
The input dataset is forwarded to the output with the replaced values that have been matched by the regular expression. If no column is selected, the regex replacement will be applied to all textual columns. If numeric or date columns are chosen for replacement these columns have type Text after this transformation. Null-values in to-be-replaced cells are set to be an empty string.
In the following example we use a dataset with information about certain books (authors, language code, number of pages..)
The Column Selection Processor is used to extract the column "title":
The previous configuration is used to "trim" the title, and that's by removing what's inside the parentheses. Therefore, only the main book title remains.
Extract Regex Processor
Column Selection Processor