Advanced SQL Server 2005 Integration Services (SSIS) for Developers

Course:  SSISD
Duration:  4 Days
Level:  III
Course Summary

As SSIS is mainly an Extract, Transform and Load (ETL) tool and since ETL has been accepted as being between 70-80% of the data warehouse maintenance challenge, this course is accordingly dedicated. After discussing advanced features of the SSIS control and data flow engines, delegates will be taught how to design, develop and implement an SSIS-based ETL system, accounting for some of the more common and complex requirements, including:

  • Source-to-destination mapping
  • Extraction (e.g., incremental)
  • Staging tables
  • Slowly Changing Dimensions
  • Late arriving (inferred) dimensions
  • Late arriving facts
  • Fact aggregation
  • Data profiling
  • Audit
  • Etc.
« Hide The Details
Topics Covered In This Course

REVIEW OF BI TERMS & CONCEPTS

  • The data warehousing (DW) environment
  • The DW computing context
  • What is a data warehouse?
  • What is a data mart?
  • What is Business Intelligence (BI)?
  • How do OLTP & OLAP differ?
  • What is data mining?
  • Operational vs. historical data
  • Dimensional modelling
  • Some dimensional modelling characteristics
  • What is a star schema?
  • What is a snowflake schema?
  • What is metadata?

REVIEW OF SSIS BASICS

  • Architecture
  • Tools
  • Wizards
  • Packages
  • Control Flows
  • Data Flows
  • SSIS in operational/data warehousing environments
  • SSIS data types
  • SSIS transformation types
  • Containers
  • Precedence constraints
  • Variables
  • Expressions

REVIEW OF PACKAGE OBJECTS

  • Tasks
  • Connection Managers
  • Log providers
  • Containers
  • Precedence constraints

SSIS DATA TYPES & OPTIMISATION

  • Types
  • How to alter and map
  • Parsing
  • Optimisation

SSIS VARIABLES

  • Definition
  • Properties
  • Scope
  • Type
  • Namespaces
  • System
  • User
  • Execution behaviour

SSIS EXPRESSIONS

  • Definition
  • Elements
  • Property expressions
  • Variable expressions
  • Precedence constraint expressions
  • For Loop expressions

CONTAINERS

  • Concepts
  • Types (e.g., Foreach Loop)
  • Scoping
  • Properties

ADVANCED CONTROL FLOW TASKS

  • File System
  • Send Mail
  • Message Queue
  • WMI
  • Execute Package
  • Execute Process
  • Bulk Insert
  • Analysis Services
  • Data Mining
  • Execute SQL Agent Job
  • Update Statistics
  • Script Task

ADVANCED DATA FLOW COMPONENTS

  • Source adapters (e.g., Raw Files)
  • Destination adapters (e.g., Data Reader)
  • Conditional Split
  • Multicast
  • Fuzzy Lookup/Fuzzy Grouping
  • Script component
  • OLE DB command
  • Union All
  • Sort
  • Merge/Merge Join
  • Lookup
  • Pivot/Unpivot (Excel worksheet)
  • Aggregate
  • Slowly Changing Dimension

OPTIMISING DATA FLOW TRANSFORMATIONS

  • Buffer architecture
  • Pipeline considerations
  • Synchronous vs. asynchronous
  • Blocking vs. semi-blocking vs. non-blocking
  • Execution trees interpretation and usage

CHECKPOINT RESTART (WITH/WITHOUT TRANSACTIONS)

  • Definition
  • When to consider
  • Checkpoints – how to implement and manage
  • Checkpoint files
  • Single Container and checkpoints
  • Multiple Containers and checkpoints
  • Single vs. multiple transactions
  • How to configure
  • How to set properties
  • Distributive transaction considerations
  • Scoping per container
  • Isolation levels
  • Snapshot isolation

ADVANCED DEPLOYMENT CONSIDERATIONS

  • Package configuration
  • What does the Deployment Wizard do?
  • When should the deployment utility be used vs. import/Export, XCopy, etc?
  • How to override configurations
  • How to deploy and account for security

COMPREHENSIVE ETL DESIGN & DEVELOPMENT WORKSHOP

  • Top-level planning
  • Source-to-destination mapping
  • Data profiling
  • Load frequency
  • Historical vs. incremental loads
  • Extraction design
  • Staging area
  • Recovery considerations
  • Dimension table processing
  • Slowly Changing Dimensions (SCD)
  • Late arriving (inferred) dimensions
  • Fact table processing
  • Extracting fact updates and deletes
  • Transforming dimension/fact data
  • Fact data aggregation
  • Confirmed dimensions/facts
  • Surrogate key maintenance
  • Fact table load
  • Periodic vs. accumulating snapshot fact considerations
  • Audit considerations
  • Parent package/child package considerations
  • Event handling
  • Unit testing

SSIS PERFORMANCE & TUNING

  • How to optimise sources
  • How to optimise logging
  • How to optimise data flows
  • How to optimise error handling
  • How to optimise insert performance
  • Useful naming conventions
What You Can Expect

Upon completion of this course, the delegate should be able to use a variety of SSIS control and data flow facilities to design, develop and maintain simple-to-complex ETL solutions.

Who Should Take This Course

SQL Server 2005 developers in particular and DBAs in general.

Recommended Prerequisites

Completion of our SQL Server 2005 Integration Services Fundamentals or equivalent knowledge.

Training Style

Lecture and hands-on.

« Hide The Details
Related Courses
Code Course Title Duration Level
SSASMDX
SQL Server Analysis Services with MDX
5 Days
II
Details
SSISF
SQL Server 2005 Integration Services Fundamentals
3 Days
II
Details
SSISA
Advanced SQL Server 2005 Integration Services (SSIS) for DBAs
4 Days
III
Details

Every student attending a Verhoef Training class will receive a certificate good for $100 toward their next public class taken within a year.

You can also buy "Verhoef Vouchers" to get a discounted rate for a single student in any of our public or web-based classes. Contact your account manager or our sales office for details.

Schedule For This Course
3/19/2012
Topeka, KS
Notify me the next time this course is confirmed!
Can't find the course you want?
Call us at 800.533.3893, or
email us at info@verhoef.com