Join Metrics
When you create a join, Dataprep by Trifacta attempts to match up columns as the keys in your join. For each set of join keys, you can review the following metrics related to the join.
Match percentage:
When you hover over the percentage of matches between key values, you can see the details that make up the calculation:
Metric | Description |
---|---|
All Rows | Total count of rows in the dataset |
Matches | Total count of values in the join key of the selected column with matching values in the join key of the other dataset. |
Non-Matches | Total count of values in the join key of the selected column with values that do not have a match in the join key of the other dataset. |
The percentage is calculated by summing the count of matches for both datasets and dividing that by the total count of rows across both datasets:
(Matches_Current_Dataset + Matches_Joined-in-Dataset) / (All_Rows_Current_Dataset + All_Rows_Joined-in_Dataset)
Rows in output:
When you hover over the Rows in Output metric, you can see the following values:
Metric | Description |
---|---|
All Rows | Total count of rows in each dataset. |
Included | Count of rows from each dataset that are included in the output. |
Excluded | Count of rows from each dataset that are excluded from the output. |