Parquet Data Type Conversions
This section covers data type conversions between the Cloud Portal and the Parquet file format.
Note
The Alteryx data types listed on this page reflect the raw data type of the converted column. Depending on the contents of the column, the Transformer (Trifacta Classic) or Data (Designer Experience) page might re-infer a different data type when you load a dataset using this type of source.
Import
Note
Designer Cloud does not support ingestion of Parquet files that contain nested values, which can occur in columns of the Map or Object data types.
Parquet Data Type | Alteryx Data Type | Notes |
---|---|---|
STRING | String | |
INT | Integer | |
DECIMAL | Decimal | |
DATE | Datetime | |
TIME | Datetime | |
TIMESTAMP | Datetime | |
LIST | Array | |
MAP | Object | |
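The import mapping above can be expressed as a simple lookup. The following is a minimal sketch in Python, not part of the product: the names `PARQUET_TO_ALTERYX` and `alteryx_type` are hypothetical and only transcribe the table. It returns the raw converted type; the Transformer or Data page might still re-infer a different type from the column contents.

```python
from typing import Optional

# Import mapping transcribed from the table above (Parquet type -> Alteryx type).
# The dictionary and function names are hypothetical, for illustration only.
PARQUET_TO_ALTERYX = {
    "STRING": "String",
    "INT": "Integer",
    "DECIMAL": "Decimal",
    "DATE": "Datetime",
    "TIME": "Datetime",
    "TIMESTAMP": "Datetime",
    "LIST": "Array",
    "MAP": "Object",
}


def alteryx_type(parquet_type: str) -> Optional[str]:
    """Return the Alteryx type for a Parquet type name.

    Returns None when the type is not listed in the table.
    """
    return PARQUET_TO_ALTERYX.get(parquet_type.strip().upper())
```

For example, `alteryx_type("timestamp")` returns `"Datetime"`, and a type that the table does not list, such as `"FLOAT"`, returns `None`.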
Limitations on import:
The Parquet data format supports the use of row groups for organizing chunks of data. This row grouping is helpful for processing across distributed systems.
Designer Cloud limits the volume of data that displays in the browser. By default, this limit is 10 MB.
If a Parquet row group is larger than the 10 MB size limit:
You can't preview data from the file before import.
When a Parquet-based dataset is loaded on the Transformer or Data page, the screen might be blank.
Tip
You can create a new sample from inside the Transformer page. The sample is displayed normally.
Other product functions work as expected with the Parquet format.
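To check whether a file is likely to hit the row group limitation before you import it, you can inspect the row group sizes recorded in the Parquet footer. The following Python sketch is one possible approach and is not a Designer Cloud feature. It assumes the third-party `pyarrow` library is installed, that 10 MB means 10 * 1024 * 1024 bytes, and that the uncompressed size reported by `total_byte_size` is a reasonable proxy for the size the product measures. The function names are hypothetical.

```python
from typing import List

# Assumption: the 10 MB default is interpreted as 10 * 1024 * 1024 bytes.
DEFAULT_LIMIT_BYTES = 10 * 1024 * 1024


def row_group_sizes(path: str) -> List[int]:
    """Read the byte size of each row group from a Parquet file's metadata."""
    # pyarrow is a third-party dependency; it is imported here so that the
    # size check below can be used without it.
    import pyarrow.parquet as pq

    metadata = pq.ParquetFile(path).metadata
    return [
        metadata.row_group(i).total_byte_size
        for i in range(metadata.num_row_groups)
    ]


def oversized_row_groups(
    sizes: List[int], limit_bytes: int = DEFAULT_LIMIT_BYTES
) -> List[int]:
    """Return the indexes of row groups that are larger than the limit."""
    return [i for i, size in enumerate(sizes) if size > limit_bytes]
```

Calling `oversized_row_groups(row_group_sizes("example.parquet"))` with a hypothetical file path returns the indexes of any row groups above the limit. An empty list suggests that preview and initial display should not be affected by this limitation.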