Apache Impala Connections
Note
This feature may not be available in all product editions. For more information on available features, see Compare Editions.
Apache Impala is an MPP (Massively Parallel Processing) SQL query engine for processing large volumes of data stored in a Hadoop cluster. For more information, see https://impala.apache.org/.
Tip
This connection is in early preview. It is read-only and available only in SaaS product editions. For more information on early previews, see Early Preview Connection Types.
Limitations and Requirements
Note
When you select or import an entire table, you may encounter an error indicating a problem with a specific column. Some tables require filtering on a particular column, so their data can be ingested only through a custom SQL statement. In this case, use the problematic column as a filter in the WHERE clause of a custom SQL statement to ingest the table.
For more information, please consult the CData driver documentation for the specific table.
For more information on using custom SQL, see Create Dataset with SQL.
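As a rough illustration of the workaround described above, the following Python sketch assembles a custom SQL statement that filters on the column named in the error. The helper `build_filtered_query`, the table `sales_orders`, and the column `region` are hypothetical names chosen for this example; they do not come from the product or the driver. The resulting string is what you would paste into the custom SQL dialog.

```python
def build_filtered_query(table, filter_column, filter_value):
    """Return a SELECT statement that filters on the column named in the error."""
    # Double any single quotes so the string literal stays well-formed.
    escaped = str(filter_value).replace("'", "''")
    return f"SELECT * FROM {table} WHERE {filter_column} = '{escaped}'"


# Hypothetical table and column names, for illustration only.
query = build_filtered_query("sales_orders", "region", "EMEA")
print(query)
```

Which column a given table requires, and which comparison operators it accepts, depends on the table, so check the driver documentation for that table before writing the filter.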
Note
For filtering date columns, this connection type supports a set of literal functions on dates. You can use these in a custom SQL query to reduce the volume of data extracted from the database. For more information, see the pg_dateliteralfunctions.htm page in the driver documentation for this connection type.
This connection type supports LDAP authentication with a username and password.
Create Connection
via Designer Cloud application
When you create the connection, please review the following properties and specify them accordingly:
Connection Property | Description |
---|---|
Connect String Options | The following default value sets the connection timeout in seconds: Timeout=0; Setting this value to |
Server | The name of the server running Apache Impala. |
Port | The port for the connection to the Apache Impala Server instance. The default value is 21050. |
User Name | The username used to authenticate with Apache Impala. |
Password | The password used to authenticate with Apache Impala. |
Default Column Data Type Inference | Leave this value as |
For more information, see the driver documentation https://cdn.cdata.com/help/GIG/jdbc/default.htm.
via API
Depending on your product edition, you can create connections of this type through the API. Key information:
"vendor": "Impala", "vendorName": "Impala", "type": "jdbc"
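The following Python sketch shows how these key values might sit inside a request body for creating the connection. Only `vendor`, `vendorName`, and `type` are taken from this page; the function `build_impala_connection_payload` and the `name`, `host`, and `port` field names are assumptions made for illustration, so confirm the actual schema in the API reference before using it.

```python
import json


def build_impala_connection_payload(name, host, port=21050):
    """Assemble a hypothetical request body for an Impala connection."""
    return {
        # These three values are the key information listed on this page.
        "vendor": "Impala",
        "vendorName": "Impala",
        "type": "jdbc",
        # The field names below are assumptions, not the documented schema.
        "name": name,
        "host": host,
        "port": port,  # 21050 is the default port given in the table above
    }


# Hypothetical connection name and server, for illustration only.
payload = build_impala_connection_payload("impala-preview", "impala.example.com")
print(json.dumps(payload, indent=2))
```

The sketch stops at building the body and deliberately omits credentials and the HTTP call, since the endpoint and authentication scheme are not described here.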
For more information, see Designer Cloud Powered by Trifacta: API Reference docs.
Data Type Conversions
For more information, see the driver documentation https://cdn.cdata.com/help/GIG/jdbc/default.htm.