Dataform uses the Git version control system to maintain a record of each change made to project files and to manage file versions. Each Dataform project has an associated Git repository.
When you first use Dataform we will automatically create a personal development branch for you called name_dev
. Only you can see this branch and commit to it. All the changes you've made so far should be on this branch.
Press the Commit Changes
button:
Give your commits a description and press Commit
.
Click the Push to Master
button to push your changes to the master branch:
production
and used for future schedule runs.For more information on how version control works in Dataform, see our docs.
You’ve now created your two tables; order stats
and customers
. customers
depends on order_stats
and will not be run until order_stats
is updated. You have added data quality tests so that you know the data is reliable and you have documented your datasets so other team members know what the tables contain. A schedule will run every hour to keep the tables up to date, and you will be automatically alerted if the pipeline fails or there are data quality issues.
Follow the steps in this tutorial to get started with your own project. Connect to your own warehouse and create your first dataset using SQL.
Learn more about Dataform & SQLX in 5 minutes and our best practices.
Join our Slack channel and read our blog to learn more about Dataform use cases, best practices and to meet other people solving similar problems!