Generate new columns based on the respective values in an existing column.
This processor works on any kind of input dataset.
The names of the columns created by the second field in the configuration will be according to this schema: columizedColumnName_columizedColumnValue_duplicatedColumnName where duplicatedColumnName are the columns which are aggregated.
Warning: Note that in order for the workflow to be executed, at least one of the three aggregation methods (in the blue boxes) should be configured.
The result table contains the columns selected in the first configuration field, along with the created columns with values from the feature selected in the second configuration field.
In the following example, we would like to output the the cheapest accommodation in different locations for each accommodation type.
Minimum values are selected for each location and missing values are replaced by the chosen default value.
If a second aggregation method is to be chosen, new columns are created.
Here the average price is selected as a second aggregation method. The used default value is 250.