0% found this document useful (0 votes)
49 views3 pages

NovelVista - Course Content - DBT - PDF

The document outlines a training course on the data build tool dbt. The 7 modules cover database and data warehouse concepts, the dbt workflow, environment setup, projects and models, sources and tests, documentation, YAML, Jinja, macros, materializations, analyses, seeds, packages, deployment, and best practices.

Uploaded by

Anjani Janyavula
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
49 views3 pages

NovelVista - Course Content - DBT - PDF

The document outlines a training course on the data build tool dbt. The 7 modules cover database and data warehouse concepts, the dbt workflow, environment setup, projects and models, sources and tests, documentation, YAML, Jinja, macros, materializations, analyses, seeds, packages, deployment, and best practices.

Uploaded by

Anjani Janyavula
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Data Build Tool

Module 1
Databases and Data Warehouse Concepts
OLTP Databases
Need of OLAP databases
Dimensional Modelling
Data Warehouse Architecture

Methods of getting data into DW


Types of Extraction
Data Loading Types
Transformation
Methods ETL vs ELT

Module 2
Data Build Tool – An Introduction
What is dbt?
The workflow in dbt
Data Analytics Engineer
Role Database Connections
Basic dbt Commands

Environment Setup
Create a dbt account in cloud
Create a GitHub repository
Connections for repository and
Warehouse dbt Cloud IDE – Overview
dbt CLI – Introduction
Basic CLI Commands
Querying and Transforming data

Module 3
Projects
The project file .yml
Creating new projects
Using existing projects
Structuring the Project

Models
Modularity Ref
Functions
Naming Conventions
Module 4
Sources
Importance of Sources
Configuration
Freshness of the Source

Tests
Configuring model file for testing
Singular & Generic Tests
Tests on Sources
Methods of Running tests on
Models Storing Failed Records

Documentation
Writing documentation
Doc Blocks
Documenting Sources

Module 5
YAML
General Syntax
Data Structure Definitions
Utilizing in dbt

Jinja
Basics of Jinja
Applications in dbt

Macros
Macros – Overview
Usage of Macros
Executing Macros independently

Module 6
Materializations
Types of Materializations
Incremental Models
Ephemeral Models
Implementing Snapshots
Advanced Configurations
Pre &Post Hooks

Analyses
Overview
Implementing Analyses
The usage of Analyses

Seeds
Purpose of Seeds
Limitations on usage
Functioning of dbt Seeds command
Module 7
Packages
Introduction to Packages
Installing Packages
Packages with Macros
Packages with Models

Deployment
Create Deployment Environment
Commit and Pull Request with
Repository Creating and Running Jobs
Continuous Integration with GitHub

Best Practices
Environment Setup
Configuring Models Using Tags
Limiting Data in Development
Reduce complexity with CTEs
Handling Merging Conflicts
Manage documentation with assets
Enabling Continuous Integration

You might also like