Field Summary Tool
The Field Summary tool analyzes data and creates a summary report containing descriptive statistics of data in selected columns.
Use the Field Summary Tool to gain insight into data and receive recommendations for managing data.
Numeric, string, spatial, and date/time are supported in this tool. A unique set of descriptive statistics are provided for each data type. See Data Types for more on types of data.
Configure the tool
Select all, some, or a random sample of columns to include in the output.
Columns containing data that is not supported in this tool, such as boolean data, can be selected during tool configuration; however, the column is not analyzed or included in the tool output.
- Select the fields to product summary info: Select the columns to analyze and include in the output.
- All: Click to select all columns in the data.
- None: Click to deselect previously selected columns.
- Sample input data: Select to include a sample of columns in the output instead of select all, or specific columns.
- Random N Records: Select the number of records to include in the output.
- Random N% of Records: Select the percentage of incoming records to include in the output.
View the output
The Field Summary tool has three outputs:
- O anchor: An Alteryx data stream with descriptive statistics for selected columns along with recommendations, in the Remarks column, for managing data.
- R anchor: A static report with a scatterplot and descriptive statistics for selected columns along with recommendations (see Remarks) for managing data in a column. To view the report, add a Browse tool and connect it to the R output. See Browse Tool.
- I anchor: An interactive dashboard consisting of expandable panels for each column. To view the report, add a Browse tool and connect it to the I output. See Browse Tool. Each panel consists of a histogram or column chart and summary statistics:
- Hover over a panel to display additional icons.
- Click the information icon to view additional information
- Click the expand icon in to open the report in a detailed view.
- Hover over a bar in the plot to display details.
- Select variable to view to focus on a smaller set of columns.
- Sort the panels alphabetically or by percentage of missing values.
- Hover over a panel to display additional icons.
The descriptive statistics available in the output depend on the type of data in the columns selected for analysis. Results are listed horizontally. Scroll left to right to see statistics for each data type.
If a column contains numeric data, the following statistics are provided:
- Min: The minimum value in the data.
- Max: The maximum value in the data.
- Median: The median value in the data.
- Std. Dev.: The measure of how dispersed the values are in the data.
- Percent Missing: The percentage of values in the data that are null.
- Unique Values: The number of unique values in the data.
- Mean: The average of the data.
- Layout: Add a Browse tool and connect it to the R output to view the statistics in a visual format. See Browse Tool.
- Remarks: Recommendations for managing data, if available.
If a column contains string data, the following statistics are provided:
- Percent Missing: The percentage of values in the data that are null.
- Unique Values: The number of unique values in the data.
- Shortest Value: The short (length) string value in the data.
- Longest Value: The longest (length) string value in the data.
- Min Value Count: The number of values that equal the minimum value.
- Max Value Count: The number of values that equal the maximum value.
- Remarks: Recommendations for managing data, if available.
If a column contains spatial data, the following statistics are provided:
- Percent Missing: The percentage of values in the data that are null.
- Object Type: The type of spatial object (for example, Point or Polygon) in the data.
- Avg Area (Sq Miles): The average area, in square miles, of the values in the data.
- Avg Length (Miles): The average length, in miles, of the values in the data.
- Avg Num Points: The average number of values in the data that are Points.
- Remarks: Recommendations for managing data, if available.
If a column contains date/time data, the following statistics are provided:
- Percent Missing: The percentage of values in the data that are null.
- Unique Values: The number of unique values in the data.
- Latest Date: The latest, or most future, date in the data.
- Earliest Date: The earliest date in the data.
- Interval: The interval of dates (for example, Monthly) in the data.
- Remarks: Recommendations for managing data, if available.