Overview

The KPI (key performance indicator) Alternatives for Projects Processor collects KPI Alternatives from the dataset input and groups them by projects. Alternatives come with information about the column location (schema, table, column name), distinct values and formats, fill rate of the column and total amount of entries. 


Input

The processor has two input ports. The left port is for a certain dataset suitable for a KPI analysis. The right input port is for a table containing the project information. Please read through the example for more detailed information. 


Configuration

Configuration interface (part 1) 


Configuration interface (part 2)


  • KPI Column: Select the column containing the KPI name.
  • Schema Column: Select the column containing the schema name.
  • Table Column: Select the column containing the table name.
  • Column name Column: Select the column containing the column name.
  • Filter Column: Select the column containing the SQL WHERE statement.
  • Probability Column: Select the column containing the probability. How probable is the current alternative?
  • Total Count Column: Select the column containing the total value count. How many entries does the current alternative have?
  • Fill Rate Column: Select the column containing the fill rate. What is the percentage of valid entries in the current alternative?
  • Column Formats Column: Select the column containing information about the most common column format. What are the most common formats of the entries in the current alternative?
  • Most frequent Values Column: Select the column containing information about the most common values. What are the most frequent distinct values in the current alternative?
  • Separator: Specify a separator for the values in the most frequent formats and most frequent distinct values column. Default is set to \|.
  • Probability Threshold for Preselection: Choose a threshold for the probability value. The most probable alternative for a KPI will be preselected automatically when its probability is over the given threshold. Must be a value between 0.0 and 1.0.. Defaults to 0.5.
  • Preselections (leave empty): After executing the processor you can store its result inside its configuration and run it again obtaining all selected alternatives in the output, when switched ON. Default is OFF.
  • Project Column: Select the column containing project name.


Output

The processor itself contains results: An interactive table with the initial data and additional information on distinct values and data location, all grouped by projects.

The output port delivers a table with the collected KPI Alternatives.


Additional Information

The KPI Alternatives Processor is very similar to this processor but does not group by project, hence only has one input port. 


Example

Workflow


Input

Right Custom Input Table:




Left Custom Input Table:


Configuration

configuration of processor (part 1) 


configuration of processor (part 2)


  • KPI Column: KPI
  • Schema Column: SCHEMA
  • Table Column: TABLE
  • Column name Column: COLUMN
  • Filter Column: FILTER
  • Probability Column: PROBABILITY
  • Total Count Column: TOTAL_COUNT
  • Fill Rate Column: FILL_RATE
  • Column Formats Column: COLUMN_FORMAT
  • Most frequent Values Column: MOST_FREQUENT_FORMATS
  • Separator: \|
  • Probability Threshold for Preselection: 0.6
  • Preselections (leave empty): [object Object],[object Object],[object Object],[object Object]
  • Project Column: A


Results

The Processor itself return following results: 


 

The attached Result Table Processor delivers the following table: 


KPI Alternatives delivered by the Result Table Processor

 

Related Articles

KPI Alternatives Processor