Data and Business Intelligence Glossary Terms

A/B test: A test that compares two different versions of a product or interface (typically the existing one and a new variant) by presenting different options at random to different people and gathering data on their behavior.
API: A program’s defined set of endpoints that other programs can interact with.
ARR: The annualized value of all currently active recurring subscriptions, usually calculated as MRR multiplied by 12.
Aggregation: The act of summarizing data with a mathematical function, such as averaging the values in a column, or counting the number of rows in a table.
Alert: A notification about a question that Metabase delivers by email or Slack.
Analytical database: An analytical database is a database optimized for a few large analysis operations rather than many small transactions.
Application database: The database in which Metabase stores operation data about questions, dashboards, and other Metabase information.
Area chart: A type of line chart where the space between plotted values and the x-axis is filled in with a solid color.
Attribute: An attribute is a property that describes or identifies some entity. In some Metabase plans, user attributes are used to restrict which data people can access.
Audit log: A record of activity and performance in a Metabase, available to admins in some plans.
BI tool: An app designed for people to look at data without relying on code.
Bar chart: A data visualization that uses rectangles that are proportional in size to the values they measure.
Bin: A single range of continuous values used to group values in a chart.
Breakout: The grouping of aggregated results by one or more dimensions.
Bubble chart: A data visualization that plots data with three variables.
CAC: The total sales and marketing cost of acquiring a new customer, divided by the number of new customers acquired in the same period.
CTE: A named result set in a SQL query, equivalent to a subquery. CTEs are evaluated each time the main query is run.
Card: A component of a dashboard that displays data or text.
Change data capture: Change data capture (CDC) is a technique for detecting row-level changes in a source database and streaming just those changes to another system.
Churn: The share of customers, subscriptions, or recurring revenue you lose over a given period.
Cohort analysis: An analysis that groups records by a shared starting event — usually the month they signed up — and tracks each group separately over time.
Collection: A set of items in Metabase, including questions, models, dashboards, and other collections.
Column: A list of values, usually belonging to a particular field, displayed vertically in a table.
Columnar storage: Columnar storage is a way of laying data out on disk column by column instead of row by row, so analytical queries only read the columns they need.
Combo chart: A chart that combines bars and lines.
Compaction: A maintenance process that rewrites scattered data — many small files or fragmented pages — into fewer, larger, contiguous units so queries read less and run faster.
Concat: A function that combines multiple strings into a single string.
Cross-filtering: Cross-filtering is clicking on a chart or table in a dashboard to filter everything else in the dashboard.
Cumulative value chart: A chart that shows how numbers accumulate over time and how they are progressing towards a goal.
Custom column: Sometimes called a “calculated” column. You can use custom expressions to add columns to the results of your query in the notebook editor. For example, adding a column that calculates the difference between the total and subtotal.
Custom destination: A setting on a dashboard card that determines what happens when people click on the card.
Custom expression: A formula in Metabase that uses functions and mathematical operators to allow for more complicated questions, similar to a formula in spreadsheet software.
DAG: A directed acyclic graph (DAG) is a graph in which it’s impossible to get back to any starting node by following the links between nodes.
Dashboard: A data visualization tool that holds important charts and text, collected and arranged on a single screen.
Dashboard subscription: Dashboard subscriptions allow you to schedule the sending of results of the questions on a dashboard via email or Slack.
Data dictionary: A document that describes the tables, fields, and other elements in a database and explains their meaning and origin.
Data lake: A data lake is a place to store both structured and unstructured information, typically as files or blobs.
Data lakehouse: A storage architecture that adds warehouse-style tables, schemas, and transactions on top of cheap object storage, so one copy of your data serves both engineering and analytics.
Data mart: A data mart is a subset of a data warehouse curated for one team or subject area, along with the tables, models, and dashboards that team uses.
Data model: Any pattern that organizes and labels information.
Data pipeline: A data pipeline is the set of automated steps that move data from where it’s produced to where it’s analyzed, transforming it along the way.
Data reference: Metabase’s data reference section lets you add and look up metadata about your databases and their tables and fields.
Data sandbox: A data sandbox is set of boundaries that define access to a table down to its columns and rows. You can coordinate data sandboxes with your SSO setup.
Data type: A field’s classification as implemented directly by a database, such as integer, text, or timestamp.
Data warehouse: A database used for analytics.
Dimension: A descriptive, qualitative attribute of your data, like a name, category, or URL field.
Drill-through: To explore a pre-selected subset of data, e.g., by clicking on a bar in a histogram to see the raw values for that bar.
ERD: An ERD, or entity relationship diagram, is a graphical representation of how tables in a database connect to each other.
ETL: Extract, transform, load: an ETL is a common operation in data processing systems that reads data from sources, massages it, and stores it in another system.
Embedded analytics: Using third-party software to include charts and dashboards in your application that customers can query.
Embedding: Placing some functionality of one app inside another. Metabase uses iframes to embed questions, dashboards, or (in some plans) the full Metabase application.
Entity key: Metabase’s field type for the column that uniquely identifies each row in a table — the equivalent of a database primary key.
Field: Sometimes called a column, a field is an attribute of a database table.
Field filter: Field filters are special variables you can use in SQL queries that let you create “smart” filter widgets.
Field formula: The equation that powers a custom expression, like the formula you’d enter in the cell of a spreadsheet.
Field type: The meaning of a field (or column) in a table.
Filter: A filter is a predicate expression that limits the results of a query based on some stated criteria.
Filter widget: A UI element on a dashboard or SQL query where you can input a value or select one from a menu to filter results.
Foreign key: A column in one database table that holds the primary key of a row in another table, creating a link between the two.
Full outer join: A join that keeps every row from both tables regardless of whether keys matched.
GUI question: Queries built using Metabase’s graphical query builder.
GeoJSON: A popular format for representing maps.
Goal line: A constant line added as a marker to a chart showing how high (or low) values must be to be considered noteworthy.
Grid map: A map with values overlaid graphically in a regular grid, e.g., as squared 10km by 10km colored to show values.
Histogram: A chart that displays continuous data using vertical bars that each represent a discrete, equal-sized range.
Horizontal scaling: Adding more machines to a system to increase capacity — scaling out rather than up — so work is spread across many servers instead of one bigger one.
Inner join: A join that only keeps rows where the keys from the input tables matched.
JSON: A way to represent data by combining basic values in arrays and key/value structures using the syntactic conventions of JavaScript.
JWT: A standard for sharing signed authentication claims between web services.
Join: The combination of results from two tables in a relational database.
KPI: A metric that shows progress toward a personal or company goal, and that a team has agreed to be measured by.
LDAP: A protocol for querying and updating directory services, commonly used as an organization’s single source of truth for user accounts, groups, and authentication.
Leading indicator: A metric that changes before the outcome it predicts, giving you time to act — as opposed to a lagging indicator, which confirms a result after the fact.
Left outer join: A join that keeps every row from the first (left-hand) table regardless of whether that row matched any in the second (right-hand) table.
Line chart: A type of visualization that connects discrete values connected by lines to show changes and trends.
Linear axis: An axis in a chart that spaces marks according to their values.
Linked filter: A filter widget that only displays a subset of values depending on the value of another filter.
Load balancer: Hardware or software that sits in front of a group of servers and distributes incoming traffic across them, so no single server is overloaded and failed servers stop receiving requests.
Measure: A numerical attribute of your data that can be broken out by one or more dimensions.
Metadata: Information that describes data to make it easier to find, manipulate, and make use of that data.
Metric: A metric is a calculation performed on a measure. In Metabase, a Metric is a saved, reusable definition of an aggregation that anyone on your team can use as a starting point for questions.
Model: A fundamental building block in Metabase that you can use as a starting point for new questions.
Multitenancy: A software architecture where a single instance or deployment of an application serves multiple, distinct customers, each of which can only see its own data.
Native query: A query written as text in the database’s query language (usually SQL).
Native query editor: The editor in Metabase used to write queries as text. It includes a sidebar with three tabs: data reference, SQL variables, and Snippets.
Normalization: The process of structuring information in a relational database to reduce redundancy.
North star metric: The single metric a company or team aligns around, chosen because it captures the value customers get from the product.
Notebook editor: The GUI interface for asking questions in Metabase, where you can filter, summarize, join tables, and create custom columns.
OLAP: Online analytical processing: processing that involves a small number of large operations, such as creating quarterly reports.
OLTP: Online transaction processing: processing that involves a large number of small operations, such as logging user activity on a website.
Ordinal axis: An axis in a chart that displays values according to the discrete groups those values are in.
Our Analytics: The root collection in Metabase.
Parameter: A special type of variable that specifies an input to a query.
Personal collection: Your semi-private collection in Metabase, viewable only by you and Admins. Your personal collection is a great place to draft questions and dashboards.
Pin map: A map that displays a single discrete marker for each point of interest.
Pivot table: A data visualization that summarizes rows and columns of a table and lets you rotate (pivot) the columns.
Predicate: An expression that evaluates to either true or false, like quantity > 0. True and false values are known as Boolean values.
Primary key: A column (or set of columns) whose value uniquely identifies every row in a database table.
Production database: A database that stores information that Metabase manages (as opposed to the application database that stores Metabase’s own management data).
Public embedding: Putting an iframe to a publicly-visible question in another web page.
Pulse: Now deprecated in favor of Dashboard Subscriptions, pulses were questions that ran according to a specific schedule and would be sent out via email or Slack.
Query builder: Learn about Query builder, the graphical interface for asking questions in Metabase.
Question: In Metabase, a question is a query, its results, and its visualization.
Read replica: A copy of a database that receives a continuous stream of changes from the primary and serves read-only queries, used to spread load and to keep heavy reads away from production writes.
Record: A group of related data with the same structure. A relational database stores each record as a row in a table.
Region map: A map that shows information for geographic regions (such as countries or states), e.g., by coloring each region to show a value.
Relational database: A collection of tabular data, or the application that manages the storage and retrieval of tabular data.
Retention: The share of customers, users, or recurring revenue from a starting group that you still have at the end of a period.
Reverse ETL: Reverse ETL is the process of copying data out of your data warehouse and into the operational tools your teams work in every day, like a CRM or a support desk.
Right outer join: A join that keeps every row from the second (right-hand) table, regardless of whether that row matched any in the first (left-hand) table.
Row: A single group of related data within a table.
Row chart: A variation on a bar chart, where data is visualized with horizontal bars rather than vertical.
SAML: An XML-based standard for exchanging authentication and authorization data.
SKU: A scannable bar code used to uniquely identify some type of product.
SQL: A standardized and widely-used language for accessing and manipulating data in a relational database.
SQL snippet: A named bit of SQL code that you can insert in a SQL query.
SSO: An authentication (auth) setup that lets people use one login to access independent apps.
Sample Database: A small database that ships with Metabase so you can start asking questions and creating dashboards before you connect to your database.
Saved question: A Metabase question that has been saved and given a name. The results of saved questions can be used as the input to other questions.
Schema: The design or structure that defines the organization of a dataset, including its tables, columns, relationships, data types, and integrity constraints.
Search filter: A filter that hides any rows in a table that don’t match some criteria.
Search widget: A widget that can be added to a dashboard that applies a search filter to the values used in one or more cards.
Secure embed: A method for embedding an iframe displaying a question or dashboard that uses signed tokens for authentication.
Segment: A specific subset of a larger group of items, like a certain grouping of customers.
Semantic layer: A semantic layer is a shared set of definitions for the metrics and dimensions in your data, sitting between the raw warehouse tables and the tools people use to ask questions.
Serialization: The process of storing the state of an application or other object as a file so that it can be reloaded (or deserialized) later.
Session metric: A measure of how someone interacted with a system, such as the number of pages they viewed in succession.
Session token: A random value generated by a server to identify a specific user for the duration of one interaction session.
Soft delete: Marking a record as no longer active or valid without actually removing it from the database, usually by setting a deleted_at timestamp or an is_deleted flag.
Stacked bar chart: A bar chart in which several values are shown for each category by stacking bars sized according to each value.
Star schema: A star schema is a way of organizing analytical data into one central fact table surrounded by dimension tables that describe it.
Summary table: The result of an aggregation that gets saved in a database or data warehouse so that people can work with those precomputed metrics.
Table: Data’s natural habitat. In a database, a table is a series of fields, with the values of those fields arranged in rows.
Text card: A card on a dashboard where you can enter Markdown text to give context for your dashboard. Text cards can also be used as dividers on a dashboard.
Time series: A time series is a sequence of points charted over time.
Transactional database: A database optimized for a large number of small operations rather than a few large transactions. OLTP usually relies on transactional databases.
Variable: Any value in a program or query that can change. In Metabase, variables in SQL queries get enclosed in double braces.
Vertical scaling: Making a single machine more powerful — more CPU, more memory, faster disks — to increase performance, rather than adding more machines.
View: A query and its results that function like a virtual table in your database.
White labeling: The business practice of applying branding to someone else’s product.
X-ray: A summary or visualization of a table or column generated automatically to give users a quick insight into their data.

Business Intelligence

Embedded Analytics

Documentation

Learn

Business Intelligence pricing

Embedded Analytics pricing

A

A/B test

API

ARR

Aggregation

Alert

Analytical database

Application database

Area chart

Attribute

Audit log

B

BI tool

Bar chart

Bin

Breakout

Bubble chart

C

CAC

CTE

Card

Change data capture

Churn

Cohort analysis

Collection

Column

Columnar storage

Combo chart

Compaction

Concat

Cross-filtering

Cumulative value chart

Custom column

Custom destination

Custom expression

D

DAG

Dashboard

Dashboard subscription

Data dictionary

Data lake

Data lakehouse

Data mart

Data model

Data pipeline

Data reference

Data sandbox

Data type

Data warehouse

Dimension

Drill-through

E

ERD

ETL

Embedded analytics

Embedding

Entity key

F

Field

Field filter

Field formula

Field type

Filter

Filter widget

Foreign key

Full outer join

G

GUI question

GeoJSON

Goal line

Grid map

H

Histogram

Horizontal scaling