The Plot of Means tool takes a numeric or binary categorical field (with the binary categorical field converted into a set of zero and one values) as a response field along with a categorical field and plots the mean of the response field for each of the categories (levels) of the categorical field. In addition, error bars, based on the standard error, standard deviation, or a confidence interval for each mean can be placed on the plot. The tool is an excellent choice for visualizing the effect of a possible categorical predictor field on a numerical or binary target field in the process of developing predictive models.
This tool uses the R programming language. Go to Options > Download Predictive Tools to install R and the packages used by the R Tool.
An Alteryx data stream.
Response field: The numeric or binary categorical field for which the means will be calculated.
The response field is binary categorical... : A check-box to indicate that the response field is a binary categorical. When checked, the user is asked to indicate the field value that will correspond to a value of one (the "target"), with entries with the other field value taking the value zero.
Select a categorical field to group by: The categorical field each level of which will correspond to a response field mean. If a level of the categorical field consists of only a single record in the data stream, the error bars on the plot will be suppressed and a warning message to the user will be issued.
Error bars: Select one of standard error, standard deviation, confidence interval, or none. If none is selected, error bars will be suppressed.
Confidence level (%): The confidence interval band (e.g., the 95% confidence interval) to use if "confidence interval" is the selection for the error bars.
Graph resolution: Select the resolution of the graph in dots per inch: 1x (96 dpi); 2x (192 dpi); or 3x (288 dpi). Lower resolution creates a smaller file and is best for viewing on a monitor. Higher resolution creates a larger file with better print quality.
Base font size (points): The point size of the base font used to produce the title and labels of the plot(s) to be produced. The plotting functions will expand the size of the plot title to be larger than the base font automatically.
©2018 Alteryx, Inc., all rights reserved. Allocate®, Alteryx®, Guzzler®, and Solocast® are registered trademarks of Alteryx, Inc.