Skip to main content

Blue icon with database being plugged in. Connect In-DB Tool

Use Connect In-DB to create an in-database connection in a workflow. Use the tool to connect to a new or existing connection.

In-Database enables blending and analysis against large sets of data without moving the data out of a database and can provide significant performance improvements over traditional analysis methods. For more about the In-Database tool category, visit In-Database Overview.

Configure the Tool

  1. In the Configuration window, select the ConnectionName dropdown and select an option:

    • ManageConnections: Create a new connection or use an existing connection.

    • Open File Connection: Browse to a saved database connection file.

  2. Once the connection is configured, Table or Query displays the name of the selected database table.

  3. (Optional) Select Query Builder to select tables and construct queries. Visit Choose Table or Specify Query Window for more information about this topic.

Add a New In-DB Connection

  1. Select the Connection Name dropdown arrow and select Manage Connections.

  2. Select Data Source and select a source. Visit In-Database Overview for more information.

    Note

    From the Connect In-DB tool, you can select the Generic ODBC option to attempt a connection to an unsupported data source. This option does not guarantee a successful connection to unsupported data sources; however, data sources that are similar to Microsoft SQL Server have the best chance of success.

  3. Select Connection Type and select type.

    • User: Create a connection that only you can use.

    • System: Create a connection that can be shared. Open Alteryx Designer as an administrator on your computer. This option is only for Designer Admin.

    • File: Saves a database connection as an .indbc file so it can be packaged with a workflow. If this option is selected, a Connection File path location must be specified in order to save the file.

  4. Select Connections and select an existing connection from the list or select New.

  5. In ConnectionName, enter a name for the connection.

  6. Select PasswordEncryption and select an encryption option:

    • Hide: Hide the password using minimal encryption. If you plan to schedule this workflow to run on any machine other than your computer, select Hide. Visit Schedule Workflows for more information.

    • Encrypt for Machine: Any user on the computer will be able to fully use the workflow.

    • Encrypt for User: The logged-in user can use the workflow on any computer.

    • Allow Decryption of Password: Decrypts the password and passes it in the metadata. This option is only used in conjunction with In-DB predictive tools.

  7. On the Read tab, select Driver and select an option or leave as default.

  8. Select the ConnectionString dropdown arrow and select New database connection. For Oracle OCI and SQL Server ODBC connections, you can alternately select a saved or recent data connection.

  9. Select the Write tab.

  10. Select Driver and select a driver or leave as default.

  11. In ConnectionString, enter or paste a connection string. For Oracle OCI and SQL Server ODBC connections, you can alternately select a saved or recent data connection. HDFS Connections To connect to HDFS:

    1. Select the Connection String dropdown arrow and select New HDFS Connection.

    2. Select HTTPFS, WebHDFS, or Knox Gateway server configuration. If you are using Knox Gateway with Spark, select Override default Namenode URL.

    3. In Host, enter the Hadoop server URL or IP address.

    4. In Port, leave the default port number which is based on your server configuration selection, or enter a port number.

    5. By default, URL is based on Host. Enter a different URL, if desired.

    6. By default, the TempDirectory is /tmp. Enter a different location for the temporary directory to write to, if desired.

    7. Enter a user name in UserName and a password in Password. Required credentials vary based on the cluster setup.

      • httpfs: A user name is required, but it can be anything.

      • webhdfs: A user name is not required.

      • KnoxGateway: A user name and password are required. Use a trusted certificate when configuring Knox authentication. Alteryx does not support self-signed certificates.

    8. Click Kerberos and select an authentication option for reading and writing to HDFS.

      • None: No authentication is used.

      • KerberosMIT: Alteryx will use the default MIT ticket to authenticate with the server. You must first acquire a valid ticket using the MIT Kerberos Ticket Manager.

      • KerberosSSPI: Alteryx will use Windows Kerberos keys for authentication, which are obtained when logging in to Windows with your Windows credentials. The User Name and Password fields are therefore not available. The option you choose depends on how your IT admin configured the HDFS server.

    9. (Spark-only) Select Override default Namenode URL to override the Namenode URL and enter a host and port number if using Knox Gateway, or if the namenode server is running on a different computer than the httpfs or webhdfs server.

    10. (Recommended) Select Test to test the connection.

    11. Select OK.

      Note

      Visit Hadoop Distributed File System for more information. Visit Manual Connection Setup for more information on authentication requirements.

  12. Select OK.

  13. If you are connecting to a database with multiple tables the Choose Table or Specify Query window opens. Select the Tables tab. Visit Choose Table or Specify Query Window for more information.

  14. Select a table and select OK.

Use an Existing In-DB Connection

  1. Select the Connection Name dropdown arrow and select Manage Connections.

  2. Select Data Source and select a source.

    Note

    From the Connect In-DB tool, you can select the Generic ODBC option to attempt a connection to an unsupported data source. This option does not guarantee a successful connection to unsupported data sources; however, data sources that are similar to Microsoft SQL Server have the best chance of success.

  3. Select Connections and select an existing connection from the list.

  4. Select OK.

You can also edit connection details and select Apply. Visit Manage In-DB Connections for more on managing existing In-DB connections.

Note

Visit Manage In-DB Connections to learn how to manage In-DB connections.