Skip to main content
Version: Next

Dremio

Testing

Important Capabilities

CapabilityStatusNotes
Asset ContainersEnabled by default
Data ProfilingOptionally enabled via configuration
DomainsSupported via the domain config field
Platform InstanceEnabled by default
Schema MetadataEnabled by default

CLI based Ingestion

Install the Plugin

The dremio source works out of the box with acryl-datahub.

Starter Recipe

Check out the following recipe to get started with ingestion! See below for full configuration options.

For general pointers on writing and running a recipe, see our main recipe guide.

source:
type: dremio
config:
# Coordinates
hostname: localhost
port: 26257

# Credentials
authentication_method: password
username: user
password: pass

sink:
# sink configs

Config Details

Note that a . is used to denote nested fields in the YAML recipe.

FieldDescription
authentication_method
string
Is a Personal Access Token or a password used for authentication
Default: password
collect_pds
boolean
Default: False
collect_system_tables
boolean
Ingest Dremio system tables in catalog
Default: False
data_product_pattern
object
data_product_specs
object
disable_certificate_verification
boolean
Disable TLS certificate verification
Default: False
hostname
string
Hostname or IP Address
include_copy_lineage
boolean
Default: True
include_table_rename_lineage
boolean
Default: True
match_fully_qualified_names
boolean
Whether schema_pattern is matched against fully qualified schema name <database>.<schema>.
Default: False
password
string
Dremio REST API port
Default: 9047
path_to_certificates
string
Default: /vercel/path0/metadata-ingestion/venv/lib/python3....
platform_instance
string
The instance of the platform that all assets produced by this recipe belong to.
Default:
port
integer
REST API port
Default: 9047
table_allow
boolean
Default: True
tls
boolean
Whether the Dremio REST API port is encrypted
Default: True
username
string
Dremio username
env
string
Environment to use in namespace when constructing URNs.
Default: PROD
schema_pattern
AllowDenyPattern
Default: {'allow': ['.*'], 'deny': [], 'ignoreCase': True}
schema_pattern.ignoreCase
boolean
Whether to ignore case sensitivity during pattern matching.
Default: True
schema_pattern.allow
array
List of regex patterns to include in ingestion
Default: ['.*']
schema_pattern.allow.string
string
schema_pattern.deny
array
List of regex patterns to exclude from ingestion.
Default: []
schema_pattern.deny.string
string

Code Coordinates

  • Class Name: datahub.ingestion.source.dremio.dremio_source.DremioSource
  • Browse on GitHub

Questions

If you've got any questions on configuring ingestion for Dremio, feel free to ping us on our Slack.