Redshift projects require the following configuration settings:
[name].[id].[region].redshift.amazonaws.com
5439
)Redshift
in your AWS console.Clusters
.The Redshift user should have permissions to CREATE
schemas and SELECT
from INFORMATION_SCHEMAS.TABLES
and INFORMATION_SCHEMAS.COLUMNS
. Please contact our team via slack if you need help.
Redshift specific options can be applied to tables using the redshift
configuration parameter.
You can configure how Redshift distributes data in your cluster by configuring the distStyle
and distKey
properties.
1config { 2 type: "table", 3 redshift: { 4 distKey: "user_id", 5 distStyle: "key" 6 } 7} 8SELECT user_id FROM ...
This query compiles to the following statement:
1CREATE TABLE "dataform"."example" 2DISTKEY(user_id) 3DISTSTYLE even 4AS SELECT user_id FROM ...
You can also configure how Redshift sorts data in your datasets with the sortKeys
and sortStyle
properties.
1config { 2 redshift: { 3 sortKeys: [ "ts" ], 4 sortStyle: "compound" 5 } 6} 7SELECT 1 AS ts
The blog post offers a walkthrough to load data from S3 to Redshift.
If you are using Dataform web and are having trouble connecting to Redshift, please reach out to us by using the intercom messenger icon at the bottom right of the app.
If you have other questions related to Redshift, you can join our slack community and ask question on the #Redshift channel.