Column Naming Requirements
These requirements should be followed for column names in your datasets used in the Dataprep by Trifacta platform.
Note
In some cases, names that violate these requirements may be accepted by an individual application. However, these column names may be rejected by the underlying platform or running environment where the job is executed. For best results, you should follow these naming conventions.
Column names are case-insensitive and cannot begin with whitespace.
Column names cannot contain escaped characters, such as
\n
.
Note
When publishing to Avro,Parquet, or database tables, column names support alphanumeric characters and the underscore (_) character only. Column names cannot begin with a numeral. Other characters cause an error to occur.
Note
Column names with spaces or special characters in a transformation must be wrapped by curly braces. Example:
column1,{Column 2 with space},column3
Tip
To prevent potential issues with downstream systems, you should limit your column lengths to no more than 128 characters.
Reserved keywords
The following keywords should not be used as column names, as they may conflict with underlying requirements of the platform or the running environments with which it integrates:
Note
This list may not be complete. If your job fails with a duplicate column error, please review your column names to identify potential reserved keywords among them.
TRIFACTA__LINEAGE_INFO
TRIFACTA__FILE_LINEAGE_INFO
Note
There are two underscore characters in a row (__) after
TRIFACTA
in each of the above entries.