Databricks Azure OAuth Authentication

The following steps help you set up OAuth authentication for a Databricks connection so that you can read and write data.

Authentication Set Up

In Designer, place the Input tool on the canvas.

  1. Select the Use Data Connection Manager (DCM) check box.

  2. Select Set Up a Connection.

  3. In the Data Connections window, go to Data sources and choose Databricks ODBC.

  4. Select + Add Data Source.

  5. When adding a Data Source, you can choose a connection with or without a DSN.

Connection with DSN

  1. Choose the DSN Technology, fill in the Data Source Name and choose the prepared OAuth DSN.

  2. Select Save.

  3. To link the credentials, select + Connect Credential.

  4. You can choose User name and password or OAuth as an Authentication Method.

  5. You can choose AzureOAuth with Simba, AzureOAuth with Simba(Alteryx Application), No Credentials, Username, or Username and password as an Authentication Method.

    1. For AzureOAuth with Simba, you need:

      1. AzureOAuth Application Credentials. You can choose an existing credential or Create New Credential. For a new credential, fill in the Credential Name, Tenant ID, Client ID, and Client Secret. You can find the Tenant ID, Client ID, and Client Secret on the Microsoft Azure application registration page.

      2. AzureOAuthTokens. You can choose an existing token or Create New Credential for OAuth Tokens. For a new credential, fill in the Credential Name.

    2. For AzureOAuth with Simba(Alteryx Application), you can Create New Credential or choose an existing one.

      1. For a new credential, fill in the Credential Name.

  6. Select Create and Link.

  7. Select the Connect button.

  8. Sign in on the external browser window.

After successful authentication, you can choose a table or specify a query with the Visual Query Builder, and then run the workflow.
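Designer handles the browser sign-in for you. Purely as an illustration of how the Tenant ID and Client ID from the Azure application registration relate to that sign-in, the following minimal sketch builds a Microsoft identity platform authorization URL. The function name `build_authorize_url`, the redirect URI, and the use of the Azure Databricks resource ID as the scope are assumptions made for this example and are not taken from Designer; verify them against the Microsoft and Databricks documentation. The Client Secret is not part of this URL; it would only be used later, when an authorization code is exchanged for tokens.

```python
from urllib.parse import urlencode

# Assumption: well-known application ID of the Azure Databricks resource.
# Verify this value in the Databricks documentation before relying on it.
AZURE_DATABRICKS_RESOURCE_ID = "2ff814a6-3304-4ab8-85cb-cd0e6f879c1d"


def build_authorize_url(tenant_id: str, client_id: str, redirect_uri: str) -> str:
    """Build an authorization-code sign-in URL (hypothetical helper)."""
    if not tenant_id or not client_id:
        raise ValueError("tenant_id and client_id are required")
    query = urlencode(
        {
            "client_id": client_id,
            "response_type": "code",
            "redirect_uri": redirect_uri,  # hypothetical value, set per app registration
            "scope": f"{AZURE_DATABRICKS_RESOURCE_ID}/.default",
        }
    )
    return (
        f"https://login.microsoftonline.com/{tenant_id}"
        f"/oauth2/v2.0/authorize?{query}"
    )


if __name__ == "__main__":
    # Placeholder identifiers; replace with values from your app registration.
    print(build_authorize_url("my-tenant", "my-client", "http://localhost:8080/callback"))
```

If the Tenant ID or Client ID stored in the credential is wrong, a sign-in URL of this kind fails in the browser, which is a useful first thing to check when the external sign-in window reports an error.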

Connection without DSN

  1. Choose the DSN-less Technology from the drop-down menu.

  2. Fill in the information:

    1. Data Source Name,

    2. Server (address of your cluster),

    3. Port,

    4. HttpPath (listed in your cluster Configuration - Advanced options).

  3. Select Save.

  4. To link the credentials, select + Connect Credential.

  5. You can choose AzureOAuth with Simba or AzureOAuth with Simba(Alteryx Application) as the Authentication Method.

    1. For AzureOAuth with Simba, you need:

      1. AzureOAuth Application Credentials. You can choose an existing credential or Create New Credential. For a new credential, fill in the Credential Name, Tenant ID, Client ID, and Client Secret. You can find the Tenant ID, Client ID, and Client Secret on the Microsoft Azure application registration page.

      2. AzureOAuthTokens. You can choose an existing token or Create New Credential for OAuth Tokens. For a new credential, fill in the Credential Name.

  6. Select Create and Link.

  7. Select the Connect button.

  8. Sign in on the external browser window.

After successful authentication, you can choose a table or specify a query with the Visual Query Builder, and then run the workflow.
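The Server, Port, and HttpPath fields of a DSN-less data source correspond to the values an ODBC driver normally receives in a connection string. The sketch below shows one way those three values could be assembled. The helper name `build_dsnless_connection_string`, the default driver name, and the key names (`Host`, `Port`, `HTTPPath`, `SSL`, `ThriftTransport`) are assumptions based on common Simba Spark ODBC conventions, not something Designer exposes; check them against your driver documentation. Authentication keys are deliberately left out because DCM supplies the credentials.

```python
def build_dsnless_connection_string(
    server: str,
    port: int,
    http_path: str,
    driver: str = "Simba Spark ODBC Driver",  # assumed driver name
) -> str:
    """Assemble a DSN-less ODBC connection string (hypothetical helper)."""
    if not server or "://" in server:
        raise ValueError("server must be a bare host name without a scheme")
    if not http_path:
        raise ValueError("http_path is required")
    if not 0 < port < 65536:
        raise ValueError("port must be between 1 and 65535")

    parts = {
        "Driver": "{" + driver + "}",
        "Host": server,            # Server field in the data source
        "Port": str(port),         # Port field in the data source
        "HTTPPath": http_path,     # HttpPath field in the data source
        "SSL": "1",                # assumed: encrypted transport
        "ThriftTransport": "2",    # assumed: HTTP transport
    }
    return ";".join(f"{key}={value}" for key, value in parts.items())


if __name__ == "__main__":
    # Placeholder values; use the ones listed for your own cluster.
    print(
        build_dsnless_connection_string(
            "adb-123.azuredatabricks.net", 443, "sql/protocolv1/o/0/example"
        )
    )
```

A common mistake this check catches is pasting the full workspace URL, including `https://`, into the Server field instead of the bare host name.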

Bulk loading using OAuth Authentication

Loading data from Designer is supported through In-Database bulk loading.

Connection Configuration

In Designer, place the Data Stream In tool on the canvas.

  1. Select Manage Connections in the Data Stream In Configuration - Connection Name drop-down menu.

  2. Select the Databricks data source in the In-DB manager pop-up window.

  3. You can select New to create a new connection or choose to use an existing one.

  4. For the new connection, select the Use Data Connection Manager (DCM) check box.

  5. Fill in the Connection Name.

  6. To configure the Read connection:

    1. Select Setup Connection.

    2. Select + Add Data Source on the Connection Manager window.

    3. When adding a Data Source, you can choose a connection with or without a DSN.

    4. Follow the steps in the Connection with DSN or Connection without DSN section.

  7. To configure the Write connection:

    1. Choose the staging method from the drop-down menu.

    2. Select Setup Connection.

    3. Select + Add Data Source on the Connection Manager window.

    4. Follow the steps in the Connection with DSN or Connection without DSN section.

      Note

      For bulk loading with a DSN, you need to manually specify the Databricks server address when you add a new data source.

  8. Select Apply and OK.

  9. Sign in on the external browser window (you may be prompted to authenticate twice, once for the Read connection and once for the Write connection).

  10. Select Create New Table in the Data Stream In Configuration - Creation Mode drop-down menu.

  11. Fill in the Table Name and select the canvas.

You can now run the workflow.
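The requirements in the steps above can be summarized as a small pre-flight checklist. The sketch below is only a way to reason about the configuration: the classes `DataSource` and `InDbConnection`, the function `check_bulk_connection`, and the staging method value are all hypothetical and do not correspond to any Designer API.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class DataSource:
    """Hypothetical model of a data source added in the Connection Manager."""
    name: str
    uses_dsn: bool
    server: Optional[str] = None  # Databricks server address, if specified


@dataclass
class InDbConnection:
    """Hypothetical model of an In-DB connection with Read and Write parts."""
    connection_name: str
    read: DataSource
    write: DataSource
    staging_method: Optional[str] = None


def check_bulk_connection(conn: InDbConnection) -> List[str]:
    """Return a list of problems; an empty list means the checklist passes."""
    problems = []
    if not conn.connection_name:
        problems.append("Connection Name is empty")
    if not conn.staging_method:
        problems.append("Write connection has no staging method selected")
    # Per the note above: a DSN data source used for bulk loading needs the
    # server address entered manually.
    if conn.write.uses_dsn and not conn.write.server:
        problems.append("DSN Write data source is missing the Databricks server address")
    return problems


if __name__ == "__main__":
    conn = InDbConnection(
        connection_name="databricks-bulk",
        read=DataSource("read-source", uses_dsn=False, server="adb-123.azuredatabricks.net"),
        write=DataSource("write-source", uses_dsn=True),  # server left out on purpose
        staging_method="example-staging-method",  # placeholder value
    )
    for problem in check_bulk_connection(conn):
        print(problem)
```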