Dataplex Universal Catalog has evolved into the Knowledge Catalog. Pricing SKUs will remain named as Dataplex. Dataplex pricing is based on pay-as-you-go usage. Dataplex currently charges based on the following SKUs:
The following is a high-level overview of how each key Dataplex Universal Catalog capability is billed:
Capability | ||
|---|---|---|
Standard | N/A | |
Premium | Yes | |
Premium | Yes - if published to Catalog | |
Premium | Yes - if published to Catalog | |
Enrich metadata in Dataplex Universal Catalog | N/A | Yes |
Gemini-powered features in Dataplex, including data insights and automated metadata generation features, are billed as part of Gemini in BigQuery or Gemini Code Assist.
Other usage
Data organization features in Knowledge Catalog (lake, zone, or asset setup) and security policy application and propagation, are provided free of charge.
In addition, some Knowledge Catalog functionalities (including discovery scans, scheduled data quality and data ingestion tasks, and managed connectors for ingesting metadata from CloudSQL and Looker) trigger job execution using GCS, Managed Service for Apache Spark, BigQuery, Dataflow, and Cloud Scheduler. Those usages are charged according to the GCS, Managed Service for Apache Spark, BigQuery, Dataflow, and Cloud Scheduler pricing models respectively, and charges will show up under GCS, Managed Service for Apache Spark, BigQuery, and Dataflow instead of Dataplex Universal Catalog.
Dataplex Universal Catalog standard and premium processing are metered by the Data Compute Unit (DCU). DCU-hour is an abstract billing unit for Dataplex Universal Catalog and the actual metering depends on the individual features you use.
Dataplex Universal Catalog standard processing pricing for Knowledge Catalog
Dataplex Universal Catalog standard tier covers the data discovery functionality that discovers metadata across Dataplex Universal Catalog managed data. The following are the prices as per the region of your choice.
Item | Meter | Default* (USD) | BigQuery CUD - 1 Year* (USD) | BigQuery CUD - 3 Year* (USD) |
|---|---|---|---|---|
Dataplex processing | per DCU per unit time | $0.06 | $0.054 | $0.048 |
Dataplex Universal Catalog free tier for Knowledge Catalog
As part of the Google Cloud Free Tier, Knowledge Catalog offers some resources free of charge up to a specific limit. These free usage limits are available during and after the free trial period. If you go over these usage limits and are no longer in the free trial period, you will be charged according to the pricing as described in the sections above.
Note: The Knowledge Catalog free tier is only available for the Dataplex Universal Catalog Standard Processing SKU, and is not available for the Dataplex Universal Catalog Premium Processing SKU.
Resource | Monthly free usage limits |
|---|---|
Dataplex Universal Catalog Processing | 100 DCU-hour |
Item | Meter | Default* (USD) | BigQuery CUD - 1 Year* (USD) | BigQuery CUD - 3 Year* (USD) |
|---|---|---|---|---|
Dataplex premium processing pricing | per DCU per unit time | $0.089 | $0.0801 | $0.0712 |
Calculation of DCU charges
DCU charges for each feature are calculated as follows:
1. Auto data quality scans:
2. There are several options to reduce the cost of auto data quality scans:
3. To filter aggregate charges, use the following labels available in billing export in BigQuery:
4. Data profiling scans:
5. There are several options to reduce the cost of data profiling scans:
6. To filter aggregate charges, use the following labels available in billing export in BigQuery:
You pay for the compute power consumed by automatic lineage reporting from supported Google Cloud systems. The billing appears in the project where you ran the job.
To separate data lineage charges from other charges in the Dataplex premium processing SKU, use the label goog-dataplex-workload-type with the value LINEAGE on the Cloud Billing report.
Billable and non-billable usage
The following activities incur charges:
You incur these costs when the Data Lineage API Origin sourceType is set to any value other than CUSTOM. Values such as BIGQUERY, DATAPROC, or COMPOSER indicate that the lineage is being automatically extracted by Google Cloud from the service's logs and metadata. This extraction process requires compute resources (DCUs), which is why it is billable. In contrast, CUSTOM indicates that you have performed the parsing yourself and are only storing the result, which is free.
The following activities are free:
Pricing example
In this example, you enable data lineage to track lineage for BigQuery in your project. The project is in the us-central1 location, where one DCU-hour costs $0.089.
Reporting lineage to or from one EntityReference costs approximately 0.0056 DCU-hours. An EntityReference uniquely identifies a data resource within Google Cloud, such as a BigQuery table or a Cloud Storage bucket.
CREATE TABLE tableB1 AS SELECT * FROM tableA JOIN tableC USING (column);
Result: This operation generates lineage links connecting two source tables to one target table (3 distinct entities).
CREATE TABLE tableB2 AS SELECT * FROM tableA, tableC, tableD;
Result: This operation generates lineage links connecting three source tables to one target table (4 distinct entities).
Cost calculation
For this month, assume 1 GiB of data lineage metadata was generated. Costs are calculated as follows:
Total: ~$0.045 + $0.030 + $1.998 = ~$2.07
Metadata storage pricing
Knowledge Catalog uses the metadata storage SKU to charge for metadata storage. Metadata storage is measured in gibibytes (GiB), where 1 GiB is 1,073,741,824 bytes. Knowledge Catalog measures the average amount of the stored metadata during a short time interval. For billing, these measurements are combined into a one-month average, which is multiplied by the monthly rate.
Note: Metadata storage for automatically ingested technical metadata from Google Cloud services, such as BigQuery, is offered at no charge.
If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.
Knowledge Catalog storage pricing
Metadata storage charges (including those for entries and aspects) are billed to the project where the respective resource was created.
Monthly average storage | Price (USD) |
|---|---|
Any | $0.002739726 / 1 gibibyte hour |
When a resource in Data Catalog is made simultaneously available in the Dataplex Universal Catalog, you are charged for only one active instance of such resource.
Dataplex Universal Catalog API charges for Knowledge Catalog
As users interact with the Knowledge Catalog, API calls are free of charge, including:
Pricing example
This section provides examples of how to calculate the Dataplex Universal Catalog cost.
Small aspects
Large aspects
Data Catalog pricing (Deprecated)
This section describes the pricing for Data Catalog, which is in deprecation phase (transition to the Dataplex Universal Catalog in avoid service disruption).
Data Catalog charges apply to metadata storage for Data Catalog and API calls made to the Data Catalog API.
Metadata storage and API call charges accrue daily. You can view unbilled usage on the Google Cloud console.
Note: Pricing models apply to accounts, not projects, unless specified otherwise.
Data Catalog storage pricing
Monthly average storage | Price (USD) |
|---|---|
Up to 1 MiB | No charge |
Over 1 MiB | $0.002739726 / 1 gibibyte hour |
Data Catalog API charges
Data Catalog API calls are billed as described in the following table:
Note: Search queries performed on the Data Catalog page in the Google Cloud console are offered at no charge.
Item | Price (USD) |
|---|---|
API calls | 0 count to 1,000,000 count $0.00 (Free) / 100,000 count, per 1 month / account 1,000,000 count and above $10.00 / 100,000 count, per 1 month / account |
If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.