Transformation Processors

Grouped Peak Elimination Processor
Overview The Grouped Peak Elimination Processor corrects peaks in the dataset by evaluating the deviation of observations from a calculated correction valu...
Tue, 30 Nov, 2021 at 2:57 PM
Manual Column Specification Processor
Overview The Manual Column Specification Processor is an auxiliary processor that gives the user control over the column information that is currently pres...
Tue, 30 Nov, 2021 at 2:58 PM
Lag/Lead Generation Processor
Overview The Lag Generation Processor creates a lag on a selected column based on a time interval. The generation of lags is often necessary for time serie...
Tue, 30 Nov, 2021 at 2:59 PM
Versioning Processor
Overview The Versioning Processor appends versioning information to an input dataset (version number and date and time on which the version was created). I...
Tue, 30 Nov, 2021 at 3:00 PM
Columnization Processor
Overview Generate new columns based on the respective values in an existing column.  Input This processor works on any kind of input dataset. Conf...
Tue, 30 Nov, 2021 at 3:02 PM
Train Model Processor
Overview This processor allows to configure a Machine Learning model and build a Spark/mleap pipeline to train then save this model. Note: Changed mach...
Tue, 30 Nov, 2021 at 3:13 PM
Rounding Processor
Overview The Rounding Processor rounds numeric columns to a specific number of decimal places and significant numbers based on a selected rounding strategy...
Tue, 30 Nov, 2021 at 3:14 PM
Caching Processor
Overview The Caching Processor caches an input dataset and forwards the cached dataset. This can improve performance on big iterative and complex calculati...
Tue, 30 Nov, 2021 at 3:15 PM
Multiple Intify Processor
Overview The Multiple Intify processor changes the data type of the selected columns to integer. Input All kinds of data types are possible, though th...
Tue, 30 Nov, 2021 at 3:16 PM
Sample Data Subsetting Processor
Overview The Sample Data Subsetting Processor draws a random subset of the input data set. It may be used for splitting the data randomly in training and t...
Tue, 30 Nov, 2021 at 3:17 PM