DataStage

    Module 1: Introduction to Data Warehouse Concepts

    • What is Data Warehouse?
    • Data Mart
    • OLAP VS OLTP
    • Data Warehouse Architecture
    • What is Data Modelling?
    • Exploring Dimensional Modelling
    • Exploring the Star Schema
    • Explaining the Snowflake Schema
    • Understanding Dimensions
    • Understanding Facts
    • Slowly Changing Dimensions
    • Lifecycle of a Data Warehouse

    Module 2: Understanding ETL (Extraction, Transformation, Load)

    • Overview of ETL
    • Features and Benefits for Business
    • Different SCD Types
    • ETL Tools on the Market
    • Staging Tables Explained
    • Transformation Explained
    • Loading Data into Tables at Different Stages

    Module 3: Overview of InfoSphere DataStage

    • What is InfoSphere DataStage?
    • Architecture of DataStage
    • Topologies Explained
    • Components in DataStage
    • Runtime Architecture
    • OSH Script and Execution Flow

    Module 4: Installation and Configuration of InfoSphere DataStage

    • Prerequisites for InfoSphere DataStage
    • Install InfoSphere DataStage
    • Verify Installation
    • Set Up Environment Variables
    • Create / Update / Delete projects
    • User Creation and Granting Permissions

    Module 5: Working with DataStage Designer

    • Overview of Designer
    • Exploring DataStage Designer
    • High-level Overview of Commonly Used Stages
    • Schema
    • Pipeline Parallelism
    • Manipulating the Configuration File
    • Repository Palette
    • Passive and Active stages
    • Annotations and Creating Jobs
    • Import and Export Metadata
    • Dataset Management
    • Partitioning Techniques

    Module 6: DataStage Job

    • Overview of Job types
    • Sequence and Parallel Jobs Explained
    • Server Jobs Explained
    • Different stages
    • Understanding Containers

    Module 7: DataStage Director

    • Introduction to DataStage Director
    • Director User Interface
    • Job status and view
    • Compiling Single and Multiple jobs
    • Run, Reset and Restart jobs
    • Scheduling Batches
    • Performance monitor

    Module 8: Creating Parallel Job

    • Overview of Parallel Jobs
    • Design a Parallel Job using Designer
    • Pipeline Parallelism
    • Partition Parallelism
    • Working in NLS Mode
    • Maps in Parallel Jobs
    • Run Parallel Jobs

    Module 9: Handle Files

    • Introduction to file handling
    • Sequential File and Complex Flat File stages
    • Huge File Manipulation
    • Error and Invalid Records Rejection

    Module 10: File Stages

    • File Stages
    • Sequential File stage
    • Data Set stage Explained
    • Complex Flat File stage
    • Create jobs to read from and write to sequential files
    • Read multiple files using file patterns
    • Null handling in File Stage
    • Lookup File Set

    Module 11: Combining and Partitioning Data

    • Overview of combining and partitioning data
    • Combine data using the Lookup stage
    • Combine data using the Merge stage
    • Combine data using the Join stage
    • Combine data using the Funnel stage

    Module 12: Sorting and Aggregating Data

    • Sort data using in-stage sorts and Sort stage
    • Aggregate data using the Aggregator stage
    • Unique data using the Remove Duplicates stage

    Module 13: Transformation on Data

    • Under