Overview
This processor computes the Inner join on two input Datasets by one column match. Columns selected as join partners must have same data types. All other columns in the two Datasets may not have identical names.
INNER JOIN selects records that have matching values in both tables.
Inner join syntax:
SELECT column_name(s) FROM table1 INNER JOIN table2 ON table1.column_name = table2.column_name;
Input
The processor requires two input Datasets having one attribute column in common.The join operation will be based on this common column.
Configuration
Output
The result contains columns from both the first and the second input Datasets, the join partner columns are replaced by the new column specified in the configuration. The new data set includes all the matching entries from both Datasets.
Example
In the following example we join two custom input tables using the inner join processor.
Workflow


Input tables
First Table
OrderID | CustomerID | OrderDate |
10308 | 2 | 1996-09-18 |
10309 | 37 | 1996-09-19 |
10310 | 77 | 1996-09-20 |
10311 | 2 | 1996-09-21 |
Second Table
CustomerID | CustomerName | Country |
1 | Alfred | Germany |
2 | Ana Trujillo | Mexico |
3 | Antonio Moreno | Mexico |
Example Configuration
Result
The result contains columns from both data sets, with the common customer IDs presented as InnerJoin (the chosen new column name).