Skip to main content

Configure Storage Environment

When you sign up to use the Alteryx Analytics Cloud, you are provided a default storage environment which you can use to immediately get started using the product. A storage environment is used to store your data and metadata assets.

Default Storage Environment

The Alteryx Analytics Cloud supports multiple storage environments, one of which is the default one. The default storage environment provides all of the storage capabilities of other storage environment, as well as storage for data assets generated by use of the product. In the following table, you can see the types of data assets that are stored in each type of storage environment:

Asset Type

Description

Default Storage

Non-Default Storage

imported datasets

When you upload data to the product, it is stored in the default storage environment. From these uploaded assets, you create imported datasets, which are sources of data in the Alteryx Analytics Cloud.

You can also import data that is stored in other storage environments.

Yes

Yes

job results

When you run a job to transform your data, the results of the job execution are stored in a storage environment.

Yes

Yes

samples

When you transform your data, you are working on a sample of the source data.

Yes

No

temporary files

During ingestion of data and job execution, the Alteryx Analytics Cloud requires storage space in the default storage environment to store temporary files.

Yes

No

Storage Environment Options

You can use either of the following storage environments as the default or secondary storage environment.

Defaults

Warning

You should choose your default storage environment when you first start using the product.

When the product is first launched:

  • TFS is automatically configured to be the default storage environment.

  • S3 connectivity is also enabled as secondary storage.

    • You must configure access to your S3 buckets and assets.

Storage options

One of the following must be set as the default storage environment.

Default Storage Environment

Description

Secondary Storage Environments

TFS

Short for TFS, this S3-backed storage environment is managed by Alteryx and requires no additional configuration to manage. When the Alteryx Analytics Cloud is first launched, TFS is defined as the default storage environment. Details are below.

This storage environment provides storage for the above data asset types. TFS is backed by AWS S3 buckets hosted by Alteryx and secured by IAM policies.

Using TFS is very similar to navigating S3 buckets to find and select assets to import or to locate job results that you have published. For more information, see Using TFS.

When TFS is your default storage environment:

  • You can enable general connectivity to S3.

    • You can create connections to specific S3 buckets.

  • You can also choose to disable S3 connection for the entire workspace.

S3

Your S3 buckets and their assets.

Note

To access your S3 assets, you must provide authentication credentials, policies, and other configuration information to the Trifacta Application. Additional information is provided below.

See Using S3.

When S3 is your default storage environment:

  • You can use TFS as secondary storage.

    • You can create connections to specific S3 buckets.

  • You can choose to disable access to TFS for the entire workspace.

Configure Access to S3

If you are using S3 as your default storage environment or as a secondary environment, additional configuration is required. These steps include configuration that must be performed within AWS by an S3 administrator.

Note

Please review and complete any necessary configuration in AWS before changing your default storage environment to S3.

Required information for S3

If you are enabling access to S3, you must choose the method of access and acquire information from your S3 administrator to insert into the application to enable access.

Tip

The following section should be shared with your S3 administrator, who can provide the required information.

For more information, see Enable Access to S3 and AWS Resources.

Configure access mode

To enable access to AWS and S3 settings, you must decide your access mode:

  • Workspace mode: All workspace users use the same credentials to access S3.

    • Pro: Easy to set up.

    • Con: All users have access to the same resources.

  • User mode: All workspace users must configure individual credentials to access S3.

    Note

    This feature may not be available in all product editions. For more information on available features, see Compare Editions.

    • Pro: Finer-grained security.

    • Con: Requires configuration for each workspace user.

Steps:

  1. Login as a workspace administrator.

  2. In the Trifacta Application, select User menu > Admin console > AWS Settings.

  3. For Mode, select your preferred access mode:

    1. All users in the workspace share the same AWS credentials: workspace mode

    2. Each user in the workspace can use their own AWS credentials: user mode

  4. For Authentication Method, you must determine your choice:

    Note

    The authentication method requires information from S3. Specific requirements and configuration are covered in later steps.

    1. Use a cross-account role (IAM role): An IAM role is an AWS object that contains policies defining permissions and access level to AWS and S3 resources. This object must be created by an AWS or S3 administrator. This method of access is recommended.

    2. Use access keys: These key-secret combinations can be used to provide access to S3 buckets.

Configure access by mode

Workspace mode

Continue completing the specification listed in the above page.

For more information, see AWS Account Page.

User mode

If you have enabled per-user access to the workspace:

Note

This feature may not be available in all product editions. For more information on available features, see Compare Editions.

  1. Workspace administrators who have chosen to use IAM roles must ensure that any IAM role includes the proper trust relationship for the Trifacta Application. For more information, see Insert Trust Relationship in AWS IAM Role.

  2. Workspace users must configure their access.

    1. For more information on configuring access, see Configure Your Access to S3.

    2. Individual users can also configure directories to use in their S3 bucket for storing assets. See Storage Page.

Configure Storage

Set Default Storage Environment

You can configure S3 to be the default storage environment, instead of TFS.

Note

Before you begin, you should verify that you have the appropriate credentials to access S3.

Please complete the following.

Steps:

  1. You apply this change through the Workspace Settings Page. For more information, see Platform Configuration Methods.

  2. Locate the following setting and set it to your preferred storage environment:

    Default storage environment

After completion of the above:

  • The selected environment is now used as the default storage environment.

    • The assets listed above are now stored in this environment.

  • The other storage environment type can be disabled if preferred. See below.

Disable S3

By default, S3 is enabled for access as a storage environment. If needed, you can disable access to S3 when TFS is the default storage environment.

Tip

You can add still use S3 as a secondary storage environment.

Note

If S3 has been enabled previously, all access to assets stored on S3 is cut off for the Trifacta Application.

To disable S3 access, please complete the following.

Steps:

  1. Login as a workspace administrator.

  2. You apply this change through the Workspace Settings Page. For more information, see Platform Configuration Methods.

  3. Locate the following setting and set it to Enabled:

    Trifacta File System
  4. Set the following to TFS:

    Default storage environment
  5. Locate the following setting and set it to Disabled:

    Enable S3 connectivity
  6. Access to S3 is closed. TFS is used as the default storage environment.

Disable TFS

If you wish to disable all access to TFS, please do the following.

Tip

You can add still use TFS as a secondary storage environment.

Note

If TFS has been enabled previously, all access to assets stored on TFS is cut off for the Trifacta Application.

Steps:

  1. Login as a workspace administrator.

  2. You apply this change through the Workspace Settings Page. For more information, see Platform Configuration Methods.

  3. Locate the following setting and set it to Enabled:

    Enable S3 connectivity
  4. Set the following to S3:

    Default storage environment
  5. Locate the following setting and set it to Disabled:

    Trifacta File System
  6. Access to TFS is closed. S3 is used as the default storage environment.