Install on CentOS and RHEL
This guide takes you through the steps for installing Designer Cloud Powered by Trifacta Enterprise Edition software on CentOS or Red Hat.
For more information on supported operating system versions, see Product Support Matrix in the Planning Guide.
Preparation
Before you install software, please review and verify the following.
Note
Except for database installation and configuration, all install commands should be run as the root user or a user with similar privileges. For database installation, you will be asked to switch to the database user account.
Steps:
Review key sections of the Planning Guide:
Review the System Requirements and verify that all required components have been installed.
Verify that all required System Ports are opened on the node.
Review the System Dependencies in the Planning Guide.
Cluster Configuration: Additional steps are required to integrate the Designer Cloud Powered by Trifacta platform with the cluster. See Prepare Hadoop for Integration with the Platform in the Planning Guide.
Acquire your License Key.
Install and verify operations of the datastore, if used.
Note
Access to the Spark cluster is required.
Verify access to the server where the Designer Cloud Powered by Trifacta platform is to be installed.
Required version of RPM for CentOS
The installer for the Designer Cloud Powered by Trifacta platform on CentOS/RHEL requires RPM version 4.11.3-40. Please upgrade if necessary.
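The requirement above can be checked before you start. The following is a minimal sketch, assuming that 4.11.3-40 is a minimum version rather than an exact match and that GNU sort (with the -V option) is available. The helper name version_at_least is illustrative only and is not part of the product.

```shell
# Hypothetical helper: succeeds when version $1 is equal to or newer than $2.
# Relies on the version ordering of GNU sort (-V).
version_at_least() {
  [ "$(printf '%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

REQUIRED="4.11.3-40"   # value taken from this guide; assumed to be a minimum

if command -v rpm >/dev/null 2>&1; then
  # Version and release of the installed rpm package itself.
  installed="$(rpm -q --queryformat '%{VERSION}-%{RELEASE}\n' rpm 2>/dev/null | head -n 1 || true)"
  if version_at_least "$installed" "$REQUIRED"; then
    echo "rpm $installed satisfies $REQUIRED"
  else
    echo "rpm $installed is older than $REQUIRED; consider: sudo yum update rpm"
  fi
else
  echo "rpm command not found"
fi
```

The release string reported by rpm usually carries a distribution suffix, so treat the comparison as a guide and confirm the result by eye.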
Installation
Python setup tools
The Python setup tools can be useful for debugging startup issues.
Tip
These tools are helpful but not required.
To install:
CentOS/RHEL 9.x, CentOS/RHEL 8.x:
yum install python3-setuptools
1. Install Dependencies
Without Internet access
If you have not done so already, you may download the dependency bundle with your release directly from Alteryx. For more information, see Install Dependencies without Internet Access.
With Internet access
Use the following to add the hosted package repository for CentOS/RHEL, which will automatically install the proper packages for your environment.
# If the client has curl installed ...
curl https://packagecloud.io/install/repositories/trifacta/dependencies/script.rpm.sh | sudo bash
# Otherwise, you can also use wget ...
wget -qO- https://packagecloud.io/install/repositories/trifacta/dependencies/script.rpm.sh | sudo bash
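After the repository script has run, you may want to confirm that the repository is visible to yum. The following is a minimal sketch, assuming that the repository id is trifacta_dependencies, which is the id used with --enablerepo elsewhere in this guide. The helper name repo_listed is illustrative only.

```shell
# Hypothetical helper: reads `yum repolist` output on stdin and succeeds
# when the given repository id appears in it.
repo_listed() {
  grep -q -- "$1"
}

REPO_ID="trifacta_dependencies"   # assumed id, as used with --enablerepo in this guide

if command -v yum >/dev/null 2>&1; then
  if yum repolist enabled 2>/dev/null | repo_listed "$REPO_ID"; then
    echo "Repository $REPO_ID is enabled"
  else
    echo "Repository $REPO_ID was not found; re-run the repository script"
  fi
else
  echo "yum not found; skipping repository check"
fi
```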
Additional dependencies for CentOS 8.x
If you are installing on CentOS 8.x, you must complete the following manual dependency installs.
NodeJS:
yum -y --disablerepo="*" --enablerepo="trifacta_dependencies" install nodejs
PostgreSQL:
Note
This step is required only if you are installing the Designer Cloud Powered by Trifacta platform onto CentOS 8.x and are using PostgreSQL to host the Alteryx databases. Otherwise, you may skip this step.
yum -y --disablerepo="*" --enablerepo="trifacta_dependencies" install postgresql96-server
2. Install JDK
By default, the Trifacta node uses OpenJDK for accessing Java libraries and components. In some environments, basic setup of the node may include installation of a JDK. Please review your environment to verify that an appropriate JDK version has been installed on the node.
Note
Use of Java Development Kits other than OpenJDK is not currently supported. However, the platform may work with the Java Development Kit of your choice, as long as it is compatible with the supported version(s) of Java. For more information, see System Requirements in the Planning Guide.
Tip
OpenJDK is included in the offline dependencies, which can be used to install the platform without Internet access. For more information, see Install Dependencies without Internet Access.
The following commands can be used to install OpenJDK. These commands can be modified to install a separate compatible version of the JDK.
Note
If this package is not included, the batch job runner service, which is required, fails to start.
Java 11:
sudo yum install java-11-openjdk-11 java-11-openjdk-devel
Java 8:
sudo yum install java-1.8.0-openjdk-1.8.0 java-1.8.0-openjdk-devel
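To confirm which Java version is active on the node after installation, something like the following sketch may help. It assumes that java is on the PATH and that the first quoted field in the java -version output is the version string. The helper name java_major is illustrative only.

```shell
# Hypothetical helper: prints the major Java version for a version string,
# handling both the legacy "1.8.0_x" scheme and the newer "11.0.x" scheme.
java_major() {
  v="$1"
  case "$v" in
    1.*) v="${v#1.}" ;;   # drop the leading "1." used by Java 8 and earlier
  esac
  echo "${v%%.*}"
}

if command -v java >/dev/null 2>&1; then
  # `java -version` writes to stderr; the version is the first quoted field.
  raw="$(java -version 2>&1 | awk -F'"' '/version/ {print $2; exit}')"
  echo "Detected Java major version: $(java_major "$raw")"
else
  echo "java not found on PATH; install OpenJDK before starting the platform"
fi
```

Compare the reported major version against the supported versions listed in System Requirements in the Planning Guide.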
3. Set JAVA_HOME
The platform must be made aware of the location of Java. By default, the JAVA_HOME environment variable is configured to point to a default install location for the OpenJDK package.
When to change:
Note
If you have installed a JDK other than the OpenJDK version provided with the software, you must perform the following updates.
Note
If you have installed Java in a non-default location, you must perform the following updates.
Steps:
Edit the following file:
trifacta-conf.json
Update the following parameter value. Please note the Java version number (1.11) below, which can be modified for other supported versions of Java.
"env": { "JAVA_HOME": "/usr/lib/jvm/java-1.11-openjdk.x86_64" },
Save changes.
Edit the following file:
/opt/trifacta/conf/env.sh
Modify the JAVA_HOME value as needed.
Save changes.
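Before you write a path into either file, it can help to confirm that the path is a usable Java home. The following is a minimal sketch, assuming GNU readlink (with the -f option). The helper name is_java_home is illustrative only. On some Java 8 layouts the resolved directory may be the jre subdirectory rather than the JDK root, so treat the candidate as a starting point.

```shell
# Hypothetical helper: succeeds when the directory looks like a usable
# Java home, that is, it contains an executable bin/java.
is_java_home() {
  [ -n "$1" ] && [ -x "$1/bin/java" ]
}

# Derive a candidate from the java binary on PATH by resolving symlinks
# and stripping the trailing bin/java.
if command -v java >/dev/null 2>&1; then
  candidate="$(dirname "$(dirname "$(readlink -f "$(command -v java)")")")"
  if is_java_home "$candidate"; then
    echo "Candidate JAVA_HOME: $candidate"
  fi
fi

# Validate the value currently exported in the shell, if any.
if is_java_home "${JAVA_HOME:-}"; then
  echo "JAVA_HOME is valid: $JAVA_HOME"
else
  echo "JAVA_HOME is unset or does not contain bin/java"
fi
```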
4. Install Alteryx package
Note
If you are installing without Internet access, you must reference the local repository. The command to execute the installer is slightly different. See Install Dependencies without Internet Access.
Note
Installing the Designer Cloud Powered by Trifacta platform in a directory other than the default one is not supported or recommended.
Install the package with yum, using root:
sudo yum install <rpm file>
5. Verify Install
The product is installed in the following directory:
/opt/trifacta
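A quick check that the directory was actually populated can be sketched as follows. The helper name install_dir_present is illustrative only, and the check does not validate the contents of the installation.

```shell
# Hypothetical helper: succeeds when the directory exists and is not empty.
install_dir_present() {
  [ -d "$1" ] && [ -n "$(ls -A "$1" 2>/dev/null)" ]
}

INSTALL_DIR="/opt/trifacta"   # default install location from this guide

if install_dir_present "$INSTALL_DIR"; then
  echo "Found installation in $INSTALL_DIR"
  ls -ld "$INSTALL_DIR"
else
  echo "Nothing found in $INSTALL_DIR; check the yum output for errors"
fi
```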
6. Install License Key
Please install the license key provided to you by Alteryx. See License Key.
7. Install Hadoop dependencies
If you are integrating with a supported Hadoop cluster, you must install the dependencies for the Hadoop cluster on the Trifacta node. See Install Hadoop Dependencies below.
8. Set File Ownership
Warning
All files in the Alteryx install directory and sub-directories must be owned by the same user that is used to run the Designer Cloud Powered by Trifacta platform. Mismatches in ownership and execution permissions can cause services to fail to start.
Steps:
Please complete the following:
Log in to the Trifacta node as the root user.
Execute the following command. The user that is being granted ownership of the install directory is trifacta, which is the default user that runs the platform. If you are using a different user to run your Alteryx deployment, please substitute that name.
chown -R trifacta:trifacta /opt/trifacta
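To find ownership mismatches before or after running chown, a sketch like the following may be useful. It assumes the default trifacta user and group. The helper name list_foreign_owned is illustrative only.

```shell
# Hypothetical helper: lists every path under directory $1 that is not
# owned by user $2 and group $3 (names or numeric ids).
list_foreign_owned() {
  find "$1" \( ! -user "$2" -o ! -group "$3" \) -print
}

INSTALL_DIR="/opt/trifacta"
RUN_USER="trifacta"     # default user from this guide; substitute your own
RUN_GROUP="trifacta"

if [ -d "$INSTALL_DIR" ] && id "$RUN_USER" >/dev/null 2>&1 \
   && getent group "$RUN_GROUP" >/dev/null 2>&1; then
  mismatched="$(list_foreign_owned "$INSTALL_DIR" "$RUN_USER" "$RUN_GROUP" 2>/dev/null || true)"
  if [ -z "$mismatched" ]; then
    echo "All files under $INSTALL_DIR are owned by $RUN_USER:$RUN_GROUP"
  else
    echo "Paths with unexpected ownership:"
    printf '%s\n' "$mismatched"
  fi
else
  echo "Install directory, user, or group not found; nothing checked"
fi
```

An empty result means that every path under the install directory already belongs to the expected user and group.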
9. Store install packages
For safekeeping, you should retain all install packages that have been installed with this Alteryx deployment.
Install Hadoop Dependencies
If you are integrating with a Hadoop cluster, the associated Hadoop dependencies must be installed on the Trifacta node.
Included Dependencies
The Hadoop dependencies for the latest supported version of each Hadoop distribution are included in the Alteryx software distribution.
Supported Versions:
Configure for EMR in the Configuration Guide
Not required for:
Note
If you are integrating with one of the following running environments, please skip installing Hadoop dependencies.
Azure running environments:
Azure Databricks
Acquire Other Dependencies
Hadoop dependencies for other versions of the Hadoop distribution can be acquired from the Alteryx FTP site using one of the following methods.
Via a web browser
Log in: https://ftp.trifacta.com/login
Browse to the following directory:
Releases/Trifacta_x.y/hadoop/
where: x.y corresponds to the release number that you are installing (e.g. Release 6.8).
Download the following file:
hadoop-deps.tar.gz
Via WGET
Example is for Release 6.8:
wget --user CustomerUsername --ask-password ftps://ftp.trifacta.com/Releases/Trifacta_6.8/hadoop/hadoop-deps.tar.gz
Via SFTP
Example is for Release 6.8:
sftp CustomerUsername@ftp.trifacta.com:Releases/Trifacta_6.8/hadoop/hadoop-deps.tar.gz .
Via CURL
Example is for Release 6.8:
curl -O -C - -u CustomerUsername:CustomerPassword ftps://ftp.trifacta.com/Releases/Trifacta_6.8/hadoop/hadoop-deps.tar.gz
Via FTP/FTPS
Access the FTP server via your preferred FTP client.
Browse to the following directory:
Releases/Trifacta_x.y/hadoop/
where: x.y corresponds to the release number that you are installing (e.g. Release 6.8).
Download the following file:
hadoop-deps.tar.gz
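Whichever download method you use, it may be worth confirming that the archive arrived intact before you extract it. The following is a minimal sketch, assuming that the file is a gzip-compressed tar archive named as in the download commands above and that it sits in the current directory. The helper name archive_ok is illustrative only.

```shell
# Hypothetical helper: succeeds when the file can be read end to end
# as a gzip-compressed tar archive.
archive_ok() {
  tar -tzf "$1" >/dev/null 2>&1
}

ARCHIVE="hadoop-deps.tar.gz"   # file name as used in the download commands

if [ -f "$ARCHIVE" ]; then
  if archive_ok "$ARCHIVE"; then
    echo "$ARCHIVE looks intact"
  else
    echo "$ARCHIVE is incomplete or corrupt; download it again"
  fi
else
  echo "$ARCHIVE not found in $(pwd)"
fi
```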
Install Dependencies
If needed, transfer the download to the Trifacta node.
Extract it to the following directory:
sudo tar -vxf hadoop-deps.tar.gz --directory /opt/trifacta/
Note
After you extract the files to the target directory, verify that the ownership of the new directory (/opt/trifacta/hadoop-deps/) and its subfolders match the ownership settings for the rest of the Alteryx installation in /opt/trifacta.
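The ownership comparison described in the note can be sketched as follows, assuming GNU stat and GNU chown. The helper names owner_of and same_owner are illustrative only. The check compares only the top-level directories. If ownership differs, the suggested command applies the install directory's ownership to the whole hadoop-deps tree.

```shell
# Hypothetical helper: prints "user:group" for a path (GNU stat syntax).
owner_of() {
  stat -c '%U:%G' "$1"
}

# Hypothetical helper: succeeds when both paths have the same owner and group.
same_owner() {
  [ "$(owner_of "$1")" = "$(owner_of "$2")" ]
}

INSTALL_DIR="/opt/trifacta"
DEPS_DIR="/opt/trifacta/hadoop-deps"

if [ -d "$DEPS_DIR" ]; then
  if same_owner "$INSTALL_DIR" "$DEPS_DIR"; then
    echo "Ownership matches: $(owner_of "$DEPS_DIR")"
  else
    echo "Ownership differs; to align it, run:"
    echo "  sudo chown -R --reference=$INSTALL_DIR $DEPS_DIR"
  fi
else
  echo "$DEPS_DIR not found; extract the archive first"
fi
```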
Next Steps
Install and configure Alteryx databases
The Designer Cloud Powered by Trifacta platform requires installation of several databases. If you have not done so already, you must install and configure the databases used to store Alteryx metadata. See Install Databases in the Databases Guide.
Install configuration
After installation is complete, additional configuration is required to make the platform operational. See Install Configuration.