Google BigQuery - Download Query Results to Google Cloud Storage
Overview
Turn the results of your SQL SELECT statement into CSV files that get stored in Google Cloud Storage (GCS).
When exporting data from BigQuery, a single file cannot contain more than 1GB of data. As a result, this Blueprint automatically splits results larger than 1GB into multiple files. Each file is named after the Bucket File Name provided, with _# (a zero-based index) appended before the file extension. This is the default behavior provided by Google.
Example: if you provide a file name of data.csv and your data is 2.4GB in total, 3 files will be generated, named data_0.csv, data_1.csv, and data_2.csv.
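The naming rule above can be sketched in a few lines of Python. This is an illustration only, not the Blueprint's actual code: the helper name `enumerated_names` is hypothetical, and it assumes the number of files is the total size divided by the 1GB limit, rounded up, whereas the real split points are decided by BigQuery.

```python
import math
import os


def enumerated_names(file_name: str, total_gb: float, max_gb: float = 1.0) -> list:
    """Return the file names expected for an export of `total_gb` gigabytes.

    Hypothetical helper: assumes file count = ceil(total_gb / max_gb).
    """
    count = max(1, math.ceil(total_gb / max_gb))
    if count == 1:
        # Results that fit in one file keep the name as provided.
        return [file_name]
    # Insert the zero-based index before the file extension.
    stem, ext = os.path.splitext(file_name)
    return [f"{stem}_{i}{ext}" for i in range(count)]


print(enumerated_names("data.csv", 2.4))
# ['data_0.csv', 'data_1.csv', 'data_2.csv']
```

Downstream steps that read these files can use the same rule to predict which names to look for in the bucket.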
Variables
Name | Reference | Type | Required | Default | Options | Description |
---|---|---|---|---|---|---|
Query | BIGQUERY_QUERY | Alphanumeric | Yes | - | - | Standard SQL query to be executed against BigQuery. Legacy SQL is not supported. |
Bucket Name | BIGQUERY_BUCKET_NAME | Alphanumeric | Yes | - | - | Name of the GCS bucket to store the results file(s) in. |
Bucket File Name | BIGQUERY_DESTINATION_FILE_NAME | Alphanumeric | Yes | - | - | Name of the file to be generated with the results. Should have a .csv extension. If the results exceed 1GB, the file name will be enumerated with _# before the extension. |
Bucket Folder Name | BIGQUERY_DESTINATION_FOLDER_NAME | Alphanumeric | No | - | - | Folder where the file(s) should be uploaded. Leaving this blank places the file(s) in the root directory of the bucket. |
Service Account | GOOGLE_APPLICATION_CREDENTIALS | Password | Yes | - | - | JSON from a Google Cloud service account key. |
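As a rough illustration of how Bucket Folder Name and Bucket File Name combine, the sketch below builds the object path inside the bucket. The function name `destination_object` is hypothetical and the joining logic is an assumption based on the descriptions in the table, not the Blueprint's source.

```python
import posixpath
from typing import Optional


def destination_object(folder_name: Optional[str], file_name: str) -> str:
    """Build the object path inside the bucket (hypothetical helper)."""
    # Treat a missing or blank folder as the bucket root.
    folder = (folder_name or "").strip("/")
    # GCS object names use forward slashes regardless of the local OS.
    return posixpath.join(folder, file_name) if folder else file_name


print(destination_object("", "data.csv"))               # data.csv
print(destination_object("exports/daily", "data.csv"))  # exports/daily/data.csv
```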
YAML
Below is the YAML template for this Blueprint, which can be used in the Fleet YAML Editor.
source:
  blueprint: Google BigQuery - Download Query Results to Google Cloud Storage
  inputs:
    BIGQUERY_QUERY: null ## REQUIRED
    BIGQUERY_BUCKET_NAME: null ## REQUIRED
    BIGQUERY_DESTINATION_FILE_NAME: null ## REQUIRED
    BIGQUERY_DESTINATION_FOLDER_NAME: null
    GOOGLE_APPLICATION_CREDENTIALS: null ## REQUIRED
  type: BLUEPRINT
guardrails:
  retry_count: 1
  retry_wait: 0h0m0s
  runtime_cutoff: 1h0m0s
  exclude_exit_code_ranges:
    - '101'
    - '102'
    - '200'
    - '203'
    - '205'
    - '206'