System Requirements
This section contains hardware and software requirements for successful installation of Designer Cloud Powered by Trifacta Enterprise Edition.
Platform Node Requirements
Node Installation Requirements
If the Designer Cloud Powered by Trifacta platform is installed in a Hadoop environment, the software must be installed on an edge node of the cluster.
If it is integrated with a Cloudera cluster, it must be installed on a gateway node that is managed by Cloudera Manager.
If it is integrated with Cloudera Data Platform, it must be installed on an edge node of the cluster.
Customers who originally installed an earlier version on a non-edge node are still supported. If the software is not installed on an edge node, you may be required to copy files over from the cluster and to synchronize these files after upgrades. In that case, the cluster upgrade process is also more complicated.
This requirement does not apply to the following cluster integrations:
AWS EMR
Azure Databricks
Note
If you are installing the Designer Cloud Powered by Trifacta platform into a Docker container, a different set of requirements applies. For more information, see Install for Docker in the Install Guide.
Hardware Requirements
Tip
For in-place upgrades, there should be at least twice as much available disk space as listed below.
Minimum hardware:
Item | Required |
---|---|
Number of cores | 8 cores, x86_64 |
RAM | 64 GB. The platform requires 24 GB of dedicated RAM to start and perform basic operations. |
Disk space to install software | 16 GB |
Total free disk space | 24 GB |
Recommended hardware:
Item | Recommended |
---|---|
Number of cores | 16 cores, x86_64 |
RAM | 128 GB. The platform requires 24 GB of dedicated RAM to start and perform basic operations. |
Disk space to install software | 24 GB |
Total free disk space | 100 GB |
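The minimum values in the hardware tables above can be checked on a candidate node before installation. The following is a minimal sketch, not part of the product: the function names `hardware_shortfalls` and `probe_local_node` are hypothetical, and measuring free space on `/` is an assumption, since the volumes used by your installation may differ.

```python
import os
import shutil

# Minimum values copied from the "Minimum hardware" table.
MIN_CORES = 8
MIN_RAM_GB = 64
MIN_FREE_DISK_GB = 24


def hardware_shortfalls(cores, ram_gb, free_disk_gb):
    """Return a list of problems; an empty list means the minimums are met."""
    problems = []
    if cores < MIN_CORES:
        problems.append(f"cores: found {cores}, need {MIN_CORES}")
    if ram_gb < MIN_RAM_GB:
        problems.append(f"RAM: found {ram_gb:.0f} GB, need {MIN_RAM_GB} GB")
    if free_disk_gb < MIN_FREE_DISK_GB:
        problems.append(
            f"free disk: found {free_disk_gb:.0f} GB, need {MIN_FREE_DISK_GB} GB"
        )
    return problems


def probe_local_node(path="/"):
    """Read core count, physical RAM, and free disk space on a Linux node.

    The path argument is an assumption; point it at the volume you
    actually plan to install on.
    """
    cores = os.cpu_count() or 0
    ram_bytes = os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES")
    free_bytes = shutil.disk_usage(path).free
    gib = 1024 ** 3
    return cores, ram_bytes / gib, free_bytes / gib
```

Calling `hardware_shortfalls(*probe_local_node())` on the node returns the list of shortfalls. The operating system usually reports slightly less physical memory than the nominal size, so a near-miss on RAM is worth reviewing manually rather than treating as a hard failure.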
Operating System Requirements
The following operating systems are supported for the Trifacta node. The Designer Cloud Powered by Trifacta platform requires 64-bit versions of any supported operating system.
CentOS/RHEL versions:
CentOS 7.4 - 7.9, 8.1, 8.4
Note
MySQL 5.7 Community is not supported on CentOS/RHEL 8.x.
RHEL 7.4 - 7.9, 8.1, 8.4
Notes on CentOS/RHEL installation:
Installation on CentOS/RHEL versions 7.4 or earlier requires an upgrade of the RPM software on the Trifacta node. Details are provided during the installation process.
Disabling SELinux on the Trifacta node is recommended. However, if security policies require SELinux to remain enabled, you may need to apply some changes to the environment.
Ubuntu versions:
Ubuntu 18.04 (codename Bionic Beaver)
Ubuntu 20.04 (codename Focal Fossa)
Notes on Ubuntu installation:
For Ubuntu installations, some packages must be manually installed. Instructions are provided later in the process.
For more information on RPM dependencies, see System Dependencies.
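The supported releases listed in this section can be expressed as a small lookup for use in a pre-install check. This is an illustrative sketch only: `is_supported_os` is a hypothetical helper, and it reads "7.4 - 7.9" as every minor release from 7.4 through 7.9. The caller is assumed to supply a version string that includes the minor release, since some distributions report only the major version in `/etc/os-release`.

```python
# Supported releases copied from the lists in this section.
_EL_VERSIONS = {"7.4", "7.5", "7.6", "7.7", "7.8", "7.9", "8.1", "8.4"}

SUPPORTED_OS = {
    "centos": _EL_VERSIONS,
    "rhel": _EL_VERSIONS,
    "ubuntu": {"18.04", "20.04"},
}


def is_supported_os(distro, version):
    """Return True if the distro and major.minor version are listed as supported."""
    # Keep only major.minor, so "7.9.2009" is compared as "7.9".
    major_minor = ".".join(version.split(".")[:2])
    return major_minor in SUPPORTED_OS.get(distro.lower(), set())
```

For example, `is_supported_os("rhel", "8.4")` is true, while `is_supported_os("rhel", "8.2")` is false because 8.2 is not in the list.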
Database Requirements
The following database versions are supported by the Designer Cloud Powered by Trifacta platform for storing metadata and the user's Wrangle recipes.
Supported database versions:
PostgreSQL 12.X
Note
The latest stable release of PostgreSQL 12 is installed with the Designer Cloud Powered by Trifacta platform. If you need to install an earlier version of PostgreSQL 12, you can perform those installations manually.
PostgreSQL 11.X
Note
PostgreSQL 11 is supported for Azure installs only. Installation on Azure requires PostgreSQL 11. Please follow the database installation instructions for PostgreSQL 12, modifying them for version 11.
MySQL 5.7 Community
Note
MySQL 5.7 Community is not supported on CentOS/RHEL 8.x.
Notes on database versions:
MySQL 5.7 is not supported for installation in Amazon RDS.
Note
If you are installing or upgrading a deployment of Designer Cloud Powered by Trifacta Enterprise Edition that uses or will use a remote database service, such as Amazon RDS, for hosting the Alteryx databases, please contact Alteryx Customer Success and Services. For this release, additional configuration may be required.
If you are installing the databases into MySQL, you must download and install the MySQL Java driver onto the Trifacta node. For more information, see Install Databases for MySQL in the Databases Guide.
H2 database type is used for internal testing. It is not a supported database.
For more information on installing and configuring the database, see Install Databases in the Databases Guide.
Other Software Requirements
The following software components must be present.
Java
Where possible, you should install the same version of Java on the Trifacta node and on the cluster with which you are integrating.
Java 11 (runtime) (recommended)
Java 8
Notes on Java versions:
OpenJDK 8 and 11 are supported.
Note
If you are using Azure Databricks as a datasource, please verify that OpenJDK v1.8.0_302 or earlier is installed on the Trifacta node. Java 8 is required. There is a known issue with TLS v1.3.
There are additional requirements related to the Java JDK in the Hadoop Components section below.
If you are integrating your Alteryx instance with S3, you must install the Oracle JRE 1.8 onto the Trifacta node. No other version of Java is supported for S3 integration. For more information, see S3 Access in the Configuration Guide.
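To confirm which Java major version is present on a node, the banner printed by `java -version` can be parsed. The sketch below is illustrative and not part of the product: `java_major_version` and `local_java_version_output` are hypothetical helper names. It accounts for the legacy `1.8.x` numbering used by Java 8 and the `11.x` numbering used by Java 11.

```python
import re
import subprocess

# Java major versions listed as supported in this section.
SUPPORTED_JAVA_MAJORS = {8, 11}


def java_major_version(version_output):
    """Extract the Java major version from `java -version` output.

    Returns None if no version string is found.
    """
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        return None
    first = int(match.group(1))
    # Java 8 and earlier report themselves as 1.x (for example, 1.8.0_302).
    if first == 1 and match.group(2) is not None:
        return int(match.group(2))
    return first


def local_java_version_output():
    """Run `java -version`; the JVM writes this banner to stderr."""
    result = subprocess.run(
        ["java", "-version"], capture_output=True, text=True, check=False
    )
    return result.stderr
```

A node passes this check when `java_major_version(local_java_version_output())` is in `SUPPORTED_JAVA_MAJORS`. Update-level constraints, such as the Azure Databricks note above, still need to be verified separately.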
Other Software
For Ubuntu installations, the following packages must be manually installed using Ubuntu-specific versions:
Nginx: 1.22.1
NodeJS 16.19 LTS
Instructions and version numbers are provided later in the process.
Root User Access
Installation must be executed as the root user on the Trifacta node.
SSL Access
(Optional) If users are connecting to the Designer Cloud Powered by Trifacta platform, an SSL certificate must be created and deployed. See Install SSL Certificate in the Install Guide.
Internet Access
(Optional) Internet access is not required for installation or operation of the platform. However, if the server does not have Internet access, you must acquire additional software as part of the disconnected install. For more information, see Install Dependencies without Internet Access in the Install Guide.
Hadoop Cluster Requirements
The following requirements apply if you are integrating the Designer Cloud Powered by Trifacta platform with an enterprise Hadoop cluster.
For general guidelines on sizing the cluster, see Sizing Guidelines.
If you have upgrades to the Hadoop cluster planned for the next year, you should review those plans with Support prior to installation. For more information, please contact Alteryx Support.
Supported Hadoop Distributions
The Designer Cloud Powered by Trifacta platform supports the following minimum Hadoop distributions.
The Designer Cloud Powered by Trifacta platform only supports the latest major release and its minor releases of each distribution.
The Designer Cloud Powered by Trifacta platform only supports the versions of any required components included in a supported distribution. Use of non-default versions of required components is not supported, even if they are upgraded versions.
Cloudera supported distributions
CDH 6.3 Recommended
CDH 6.2
Note
CDH 6.x requires that you use the native Spark libraries provided by the cluster. Additional configuration is required. For more information, see Configure for Spark in the Configuration Guide.
Cloudera Data Platform 7.1
See Supported Deployment Scenarios for Cloudera in the Install Guide.
EMR supported distributions
See Configure for EMR in the Configuration Guide.
AWS Databricks supported distributions
See Configure for AWS Databricks in the Configuration Guide.
Azure Databricks supported distributions
See Configure for Azure Databricks in the Configuration Guide.
Node Requirements
Each cluster node must have the following software:
Java JDK 8 (some exceptions may be listed below)
Hadoop Component Access
The Alteryx deployment must have access to the following.
Java and Spark version requirements
The following matrix identifies the supported versions of Java and Spark on the Hadoop cluster. Where possible, you should install the same version of Java on the Trifacta node and on the cluster with which you are integrating.
Notes:
Java must be installed on each node of the cluster. For more information, see https://www.cloudera.com/documentation/enterprise/latest/topics/cdh_ig_jdk_installation.html.
The versions of Java on the Alteryx node and the Hadoop cluster do not have to match.
Java version | Spark 2.3 | Spark 2.4 | Spark 3.0.1 |
---|---|---|---|
Java 8 | Required | Required | Required |
Support for Spark 3.0.1 has limitations. See Configure for Spark in the Configuration Guide.
If you are integrating with an EMR cluster, there are specific version requirements for EMR. See Configure for Spark in the Configuration Guide.
Other components
HDFS Namenode
WebHDFS
In HDFS, Append Mode must be enabled. See Prepare Hadoop for Integration with the Platform.
If you are enabling high availability failover, you must use HttpFS instead of WebHDFS. See Enable Integration with Cluster High Availability in the Configuration Guide.
For YARN:
ResourceManager is running.
ApplicationMaster's range of ephemeral ports is open to the Trifacta node.
HiveServer2:
HiveServer2 is supported for metadata publishing.
WebHCat is not supported.
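Reachability of WebHDFS (or HttpFS, which exposes the same REST path) from the Trifacta node can be tested with a simple REST request. The sketch below only builds the request URL. The helper name `webhdfs_url` is hypothetical, and the host, port, and path in the example are assumptions that must be replaced with your cluster's values. The `user.name` parameter applies to clusters without Kerberos.

```python
from urllib.parse import quote, urlencode


def webhdfs_url(host, port, hdfs_path, op="GETFILESTATUS", user=None, scheme="http"):
    """Build a WebHDFS REST URL for the given HDFS path and operation."""
    params = {"op": op}
    if user:
        # Simple (non-Kerberos) authentication passes the user as a query parameter.
        params["user.name"] = user
    return f"{scheme}://{host}:{port}/webhdfs/v1{quote(hdfs_path)}?{urlencode(params)}"


# Example values are placeholders, not defaults taken from this guide.
example = webhdfs_url("namenode.example.com", 9870, "/trifacta", user="trifacta")
```

Requesting the resulting URL from the Trifacta node, for example with `curl`, should return a JSON file status if the service is reachable and the path exists.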
Hadoop System Ports
For more information, see System Ports.
Site Configuration Files
Hadoop cluster configuration files must be copied into the Alteryx deployment. See Configure for Hadoop in the Configuration Guide.
Security Requirements
Kerberos supported:
If Kerberos is enabled, a keytab file must be accessible to the Designer Cloud Powered by Trifacta platform.
See Configure for Kerberos Integration in the Configuration Guide.
If Kerberos and secure impersonation are not enabled:
A user [hadoop.user (default=trifacta)] must be created on each node of the Hadoop cluster.
A directory [hadoop.dir (default=trifacta)] must be created on the cluster.
The user [hadoop.user] must have full access to the directory, which enables storage of the transformation recipe back into HDFS.
See Configure for Hadoop in the Configuration Guide.
Cluster Configuration
For more information on integration with Hadoop, see Prepare Hadoop for Integration with the Platform.
User Requirements
Users must access the Designer Cloud Powered by Trifacta platform through one of the supported browser versions. For more information on user system requirements, see Browser Requirements.