v0.56

Google BigQuery

To add a database connection, click on the gear icon in the top right, and navigate to Admin settings > Databases > Add a database.

Prerequisites

You’ll need to have a Google Cloud Platform account with a project you would like to use in Metabase. Consult the Google Cloud Platform documentation for how to create and manage a project. This project should have a BigQuery dataset for Metabase to connect to.

Google Cloud Platform: creating a service account and JSON file

You’ll first need a service account JSON file that Metabase can use to access your BigQuery dataset. Service accounts are intended for non-human users (such as applications like Metabase) to authenticate (who am I?) and authorize (what can I do?) their API calls.

To create the service account JSON file, follow Google’s documentation on setting up a service account for your BigQuery dataset. Here’s the basic flow:

Create service account. From your Google Cloud Platform project console, open the main sidebar menu on the left, go to the IAM & Admin section, and select Service account. The console will list existing service accounts, if any. At the top of the screen, click on + CREATE SERVICE ACCOUNT.
Fill out the service account details. Name the service account, and add a description (the service account ID will populate once you add a name). Then click the Create button.
Grant the service account access to this project. You’ll need to add roles to the service account so that Metabase will have permission to view and run queries against your dataset. Make sure you add the following roles to the service account:
- BigQuery Data Viewer
- BigQuery Metadata Viewer
- BigQuery Job User (distinct from BigQuery User)

For more information on roles in BigQuery, see Google Cloud Platform’s documentation.

Create key. Once you have assigned roles to the service account, click on the Create Key button, and select JSON for the key type. The JSON file will download to your computer.

You can only download the key once. If you delete the key, you’ll need to create another service account with the same roles.

Connection and sync

After connecting to a database, you’ll see the “Connection and sync” section that displays the current connection status and options to manage your database connection.

Here you can sync the database schema and rescan field values, and edit connection details.

Edit connection details

You can edit these settings at any time. Just remember to save your changes.

Connection string

Paste a connection string here to pre-fill the remaining fields below.

Display name

The display name for the database in the Metabase interface.

Project ID

Each BigQuery dataset will have a Project ID. You can find this ID via the Google Cloud Console. If you’re not sure where to find the Project ID, see Google’s documentation on getting information on datasets.

When entering the Project ID, omit the Project ID prefix. For example, if your ID is project_name:project_id, only enter project_id.

Service account JSON file

The JSON file contains the credentials your Metabase application will need to access BigQuery datasets, as defined by the roles you added to the service account. If you need to add additional roles, you have to create another service account, download the JSON file, and upload the file to Metabase.

Datasets

You can specify which BigQuery datasets you want to sync and scan. Options are:

All
Only these…
All except…

A BigQuery dataset is similar to a schema. Make sure to enter your dataset names (like marketing), not your table names (marketing.campaigns).

Let’s say you have three datasets: foo, bar, and baz.

To sync all three datasets, select Only these… and enter:

foo,bar,baz

To sync datasets based on a string match, use the * wildcard:

To sync bar and baz, select Only these… and enter the string b*.
To sync foo only, select All except… and enter the string b*.

Note that only the * wildcard is supported; you can’t use other special characters or regexes.

Use the Java Virtual Machine (JVM) timezone

We suggest you leave this off unless you’re doing manual timezone casting in many or most of your queries with this data.

Include User ID and query hash in queries

This can be useful for auditing and debugging, but prevents BigQuery from caching results and may increase your costs.

Alternate hostname

If you want to use a different hostname to connect to BigQuery. Format: https://<hostname>:<port>. If you’re using a proxy service to connect to BigQuery (e.g. a privacy proxy that anonymizes PII), you should configure this field to the proxy hostname or IP. Remember to set the complete URI with protocol and port number.

Re-run queries for simple explorations

Turn this option OFF if people want to click Run (the play button) before applying any Summarize or filter selections.

By default, Metabase will execute a query as soon as you choose an grouping option from the Summarize menu or a filter condition from the drill-through menu. If your database is slow, you may want to disable re-running to avoid loading data on each click.

Choose when syncs and scans happen

See syncs and scans.

Periodically refingerprint tables

Periodic refingerprinting will increase the load on your database.

Turn this option ON to scan a sample of values every time Metabase runs a sync.

A fingerprinting query examines the first 10,000 rows from each column and uses that data to guesstimate how many unique values each column has, what the minimum and maximum values are for numeric and timestamp columns, and so on. If you leave this option OFF, Metabase will only fingerprint your columns once during setup.

Connecting Metabase to Google Drive data sources

You can connect Metabase to Google Drive data sources via BigQuery. There is some setup involved, but basically what you’ll be doing is creating a dataset in BigQuery and adding an external table to that dataset that points to a Google Sheet. Useful for uploading CSVs to Google Sheets, and then analyzing and visualizing the data with Metabase.

To connect to a data source stored in Google Drive (like a Google Sheet), first make sure you’ve completed the steps above, including:

creating a project in Google Cloud Platform,
adding a BigQuery dataset, and
creating a service account.

While viewing your Drive file, (e.g., a Google Sheet with an uploaded CSV file), click the Share button in the top right. In the text box labeled Add people or groups, paste in the email of your service account, which you can find on the Service Accounts page in the Google Cloud Console.

That email address will look something like service-account-name@your-project-name.iam.gserviceaccount.com, with the your service account and project names filled in accordingly.

Choose Viewer from the dropdown, uncheck the Notify people option, and click Share.

Create an external table in BigQuery that points to your Google Drive source

If you don’t already have a BigQuery dataset, create one.

Next, using the Google Cloud Console, create an external table within your BigQuery dataset that points to your Google Sheet.

Be sure to specify the correct Drive URI and file format.

If you haven’t already, connect your Metabase to your BigQuery.

Once you’ve completed these steps, you’ll be able to ask questions and create dashboards in Metabase using a Google Drive source as your data.

Using Legacy SQL

As of version 0.30.0, Metabase tells BigQuery to interpret SQL queries as Standard SQL (GoogleSQL). If you prefer using Legacy SQL instead, you can tell Metabase to do so by including a #legacySQL directive at the beginning of your query, for example:

#legacySQL
SELECT *
FROM [my_dataset.my_table]

Troubleshooting

If you’re having trouble with your BigQuery connection, you can check out this troubleshooting guide that covers BigQuery issues, this one on data warehouse connections, or visit Metabase’s discussion forum to see if someone has encountered and resolved a similar issue.

Model features

There aren’t (yet) any model features available for BigQuery.

Database routing

With database routing, an admin can build a question once using one database, and the question will run its query against a different database with the same schema depending on who is viewing the question.

Database routing for BigQuery works between BigQuery projects with identical schemas.

See Database routing.

Danger zone

See Danger zone.

Business Intelligence

Embedded Analytics

Documentation

Learn

Analytics

Embedding

Administration

Other resources

Google BigQuery

Prerequisites

Google Cloud Platform: creating a service account and JSON file

Connection and sync

Edit connection details

Connection string

Display name

Project ID

Service account JSON file

Datasets

Use the Java Virtual Machine (JVM) timezone

Include User ID and query hash in queries

Alternate hostname

Re-run queries for simple explorations

Choose when syncs and scans happen

Periodically refingerprint tables

Connecting Metabase to Google Drive data sources

Create an external table in BigQuery that points to your Google Drive source

Using Legacy SQL

Troubleshooting

Model features

Database routing

Danger zone

Further reading

Was this helpful?

Thanks for your feedback!

Business Intelligence

Embedded Analytics

Documentation

Learn

Analytics

Embedding

Administration

Other resources

Google BigQuery

Prerequisites

Google Cloud Platform: creating a service account and JSON file

Connection and sync

Edit connection details

Connection string

Display name

Project ID

Service account JSON file

Datasets

Use the Java Virtual Machine (JVM) timezone

Include User ID and query hash in queries

Alternate hostname

Re-run queries for simple explorations

Choose when syncs and scans happen

Periodically refingerprint tables

Connecting Metabase to Google Drive data sources

Share your Google Drive source with the service account

Create an external table in BigQuery that points to your Google Drive source

Using Legacy SQL

Troubleshooting

Model features

Database routing

Danger zone

Further reading

Was this helpful?

Thanks for your feedback!