Data Health Tool Icon

Data Health

Version:
2021.3
Last modified: August 11, 2021

Use Data Health to check on the health of your data. You can use the tool with the feature-engineering tools, like Build Features and Feature Types, to improve the health of your data. To determine how healthy your data is, the tool analyzes missing values, outliers, and sparsity.

Tool Components

The Data Health tool has 4 anchors.

  1. Input anchor: The input anchor connects to the data whose health you want to check.
  2. S output anchor: The S output anchor passes each column's associated the data-health scores downstream.
  3. R output anchor: The R output anchor passes a comprehensive report about the data's health downstream. You can view the report using a Browse tool.
  4. O output anchor:​​​​ The O output anchor passes the outliers from the data downstream.

Configure the Tool

To use the Data Health tool, you have to configure options for what scale to use for scoring the health of the data and whether to include recommendations for how to improve the health of the data in the output.

To use this tool, the upstream data has to have at least 30 rows.

1. Scale

From the dropdown, choose whether you want to output scores on a percentage scale (0–100%) or a normalized scale (0–1). The tool generates the score by assessing missing values, outliers, and sparsity.

2. Output Recommendations Based On Score

Check the box if you want the tool to give you recommendations for how to improve the health of the data, based on the score it receives. The recommendation appears as an additional column in the dataset.

Was This Page Helpful?

Running into problems or issues with your Alteryx product? Visit the Alteryx Community or contact support. Can't submit this form? Email us.