Warning
This method of creating custom data types is likely to be deprecated in a future release. Please consider switching to other types of custom data validation. For more information, see Validate Your Data.
A custom data type can be created when you create and upload a CSV dictionary file. This dictionary file includes all accepted values for the custom data type.
Note
The method described in this section validates against a fixed set of values. If you would like to validate your custom type against a pattern, you can specify the pattern using RegEx. See Create Custom Data Types Using RegEx.
A dictionary represents one or more columns of reference data, which can be used for data validation. For defining custom types, if a value is included in the dictionary, it is a valid member of the custom type. For example, you may wish to create a custom type calledstoreId
, which contains a valid identifiers for stores in your enterprise.
Warning
After a custom type has been added, it cannot be removed or disabled.
For your custom data type, you must create a dictionary of values in your local environment. This file is then uploaded to Designer Cloud Powered by Trifacta Enterprise Edition.
File characteristics:
CSV file format
File can be multi-column, but data validation only uses one of the columns.
File can contain up to 250,000 values. If your data type contains more values than this limit, you might see values in your dataset identified as members of the data type, when they are not.
Remove any header row from your file.
Values are case-insensitive during matching.
Special restrictions on the newline (
\n
) character are described below.
Notes and Limitations:
While you can use newline as a delimiter, dictionaries do not support using the newline ( \n
) character within a cell value. if your dictionary includes this character in cell values, it is dropped from use in the generated dictionary. In the following CSV example data, the first row is acceptable, while the second is not:
"Arizona"\n"Alaska"\n
"Arizona\n"\n"Alaska"
For example, your data contains size information from Extra Small (XS) to Extra Extra Large (XXL). You can create a one-column dictionary file with values for these sizes on separate lines. This dictionary file could be use to validate the custom type Sizes
. Your data might look like the following. Note that the column has no header.
Extra Small |
Small |
Medium |
Large |
Extra Large |
Extra Extra Large |
XS |
S |
M |
L |
XL |
XXL |
Extra-Small |
Extra-Large |
Extra-Extra-Large |
You can download this source file: Dict-Sizes.csv.
To begin, you must enable the use of custom data types:
You can apply this change through the Admin Settings Page (recommended) or
trifacta-conf.json
. For more information, see Platform Configuration Methods.Locate the following property:
"feature.enableCustomTypes": true,
Save your changes and restart the platform.
Please complete the following steps to create the custom type.
Note
After a custom type has been created, a platform restart is required. Please contact your Alteryx administrator.
Steps:
Click the data type drop-down in a column where you want to apply the custom type.
Click More. Scroll down and click Custom Type.
The Custom Type dialog is displayed. Click the Create New Custom Type tab.
Click Upload Dictionary. Select the CSV file you created. Click Open.
Note
After you upload a dictionary file, it cannot be removed. If necessary, upload a new version with a different filename.
The file is uploaded:
Figure: Custom Data Type dialog
Click the caret next to the filename to review the contents of the dictionary.
Select one of the column headers in the left side of the dialog. On the right side, you can review values in the selected column.
Note
You must expand the custom dictionary to see values before you can save the custom type. This is a known issue.
Select the column you want to use for validating the custom type.
Enter a name for the data type.
Note
This name appears in the data type drop-down for each column. Also, it can be referenced explicitly in transforms that utilize a named data type as a parameter.
Click Save.
Restart the platform. See Start and Stop the Platform.
For more information, see Custom Type Dialog.
Steps:
Select Custom Type from the column drop-down.
In the Custom Type dialog, click the Use Existing Custom Type tab.
Note
If you cannot see a recently created custom data type, you may need to logout and login again.
Select the name of the custom type. ClickSave.
When the data type is saved, the values in the column are validated against this custom type.
Make sure you review the missing and mismatched values for the column.
Tip
You can also reference the data type by name in your transforms.
For more information, see Custom Type Dialog.