Agile for Data Warehouse and ETL Projects

Course:  AGLDWP
Duration:  5 Days
Level:  I
Course Summary

Dimensional modeling is a proven technique for developing understandable, high-performance data warehouses. This course will teach Agile dimensional modeling techniques and practical detailed steps of the data warehouse ETL system including extracting, cleaning, conforming, and delivering data and its associated metadata. Students will learn to design, obtain, prepare and publish data in a dimensional data warehouse. They will also learn how to precisely design and build reusable processes to deliver the foundation for an efficient, reduced cost, successful data warehouse implementation.

« Hide The Details
Topics Covered In This Course

Dimensional Modeling Fundamentals

  • Modeling for measurement - the case for dimensional modeling
  • Fundamentals of stars, snowflakes, facts and dimensions
  • Slowly changing dimensions - accurately reflecting history, supporting current, historically correct and alternative views
  • Common dimensional modeling techniques - Time, multi-role and degenerate dimensions, surrogate keys, value chains and other common multi-star design patterns
  • The Data Warehouse Bus Architecture of conformed dimensions and facts, how data marts can enable incremental data warehouse development

Dimensional Analysis

  • Gathering Analytical Requirements - how to ask the right questions
  • Identifying business events and processes that must be measured
  • Identifying business dimensions by classification (Who, What, Where, When and Why or People, Things, Places, Timestamps and Reasons)
  • Identifying and documenting the relationships between business events and dimensions - the dimensional matrix (reloaded)
  • Identifying Key Performance Indicators (KPIs) and Metrics - aggregates, comparisons and exceptions
  • Defining granular facts - additive, semi-additive and non-additive measures
  • Identifying and classifying dimensional attributes and hierarchies - completeness checks

ETL Functional Practices

  • Planning and designing your ETL system
  • Choosing the appropriate architecture
  • Managing the implementation
  • Managing the day to day operations
  • Building the development/test/production suite of ETL processes
  • Building a data cleaning subsystem
  • Understanding the tradeoffs of various staging data structures, including flat files, normalized schemas, XML, and dimensional schemas
  • Analyzing and extracting source data
  • Creating the logical data mapping

ETL Technical Practices

  • Structuring the data into dimensional schemas for the most effective delivery to end users
  • Conforming heterogeneous data from multiple sources into standardized dimension tables and fact tables
  • Building ETL modules for handling the three distinct types of slowly changing dimensions (SCDs)
  • Building ETL modules for multi-valued dimensions and hierarchical dimensions
  • Running high-performance surrogate key pipelines
  • Loading the three fundamental fact table grains - transaction, periodic snapshot and accumulating snapshot
  • Handling late arriving dimensions and facts
  • Optimizing ETL processes to fit into highly constrained load windows
  • Structuring and presenting metadata
  • Converting batch and file-oriented processes into continuously streaming real-time ETL systems
What You Can Expect

At the end of this course, students will be able to:

  • Plan, design and incrementally develop agile data warehouse solutions.
  • Model data requirements directly with stakeholders.
  • Translate data analysis requirements into efficient, flexible dimensional models.
  • Maximize the usability and performance of their data warehouse design.
Who Should Take This Course

This course is designed for anyone involved or interested in learning the latest techniques for planning, designing, and managing dimensional data warehouses and ETL processes.

Training Style

Instructor led with 50% lecture and 50% lab

« Hide The Details
Related Courses
Code Course Title Duration Level
Agile Principles
2 Days
Agile Bootcamp
3 Days
2 Days
Introduction to Data Warehouse Concepts
1 Day
Agile Project Management
3 Days
Dimensional Modeling for Data Warehouse Projects
5 Days

Every student attending a Verhoef Training class will receive a certificate good for $100 toward their next public class taken within a year.

You can also buy "Verhoef Vouchers" to get a discounted rate for a single student in any of our public or web-based classes. Contact your account manager or our sales office for details.

Schedule For This Course
There are currently no public sessions scheduled for this course. We can schedule a private class for your organization just a couple of weeks from now. Or we can let you know the next time we do schedule a public session.
Notify me the next time this course is confirmed!
Can't find the course you want?
Call us at 800.533.3893, or
email us at [email protected]