DataStage Developer Resume
AZ
SUMMARY
- 8 years of IT experience in software development, designing and implementing high-quality business solutions, with a major focus on Data Warehouse and ETL processing.
- Strong experience in building and developing strategies for Extraction, Transformation and Loading from disparate sources into Data Warehouses and Data Marts using IBM/Ascential DataStage 8.7/8.5/8.1/8.0/7.5 (Designer, Director, Manager and Administrator).
- Extensive experience in DW/ETL requirements gathering, design and testing for many data extractions and data migration projects.
- Hands-on experience with master data management.
- Used DataStage Designer to develop processes for extracting, cleansing, transforming, integrating, and loading data into the data warehouse database.
- Expertise in working with IBM DataStage (Administrator, Manager, Designer, Director), Parallel Extender and QualityStage.
- Implemented complex business rules using different Stages, Jobs, Sequencer, Shared Containers etc.
- In-depth experience with DataStage stages (Hash, FTP, Merge, Sort, Aggregator, Lookup, Transformer, Join, Funnel, Filter, etc.).
- Experience in integration of various data sources like Oracle, DB2, Teradata, SQL Server, MS Access, XML and Flat files into the Staging Area.
- Extensively used TOAD 9.0/8.5 to access Oracle databases and Control Center to access DB2 databases.
- Experience in scheduling DataStage jobs using Control-M and Autosys.
- Excellent knowledge of SQL, PL/SQL, Stored Procedures, Functions, and Triggers.
- Experience in programming using SQL, PL/SQL, and UNIX Shell Scripting.
- Strong Experience in Unit testing and System testing of the DataStage jobs.
- Performed Debugging, Troubleshooting and performance tuning of DataStage Jobs.
- Expertise in OLTP/OLAP system study with a focus on the Oracle Hyperion suite of technologies; developing database schemas such as Star schema and Snowflake schema (Fact Tables, Dimension Tables) used in relational, dimensional and multidimensional modeling; and physical and logical data modeling using Erwin 4.2/4.1/3.x.
- Involved in designing and preparing Functional specification documents, Technical specification documents, Mapping Documents for Source to Target mapping with ETL transformation.
- Strong Experience in Banking, Financial, Medical, Retail and Insurance Domains.
- Excellent communication, interpersonal, teamwork and problem-solving skills; flexible, self-directed and energetic.
TECHNICAL SKILLS
ETL Tools: IBM Information Server 8.7/8.5/8.1/7.5, QualityStage 7.5x (Manager, Administrator, Designer, Director), Parallel Extender
Technology: Master Data Management
Dimensional Data Modeling: Data Modeling, Star Schema Modeling, Snowflake Modeling, Fact and Dimension tables, physical and logical data modeling, Erwin 4.1.2/3.x and Oracle Designer
Programming Languages: C, C++, Java, UNIX shell scripting, SQL, PL/SQL
Operating Systems: Microsoft Windows Server 2003, Windows XP/2000, UNIX (AIX).
Databases: MS SQL Server 2000/2005, Oracle 11g/10g/9i/8i, DB2 UDB, Teradata v2r5, Oracle Hyperion
DB Utilities: Toad, SQL*Loader
PROFESSIONAL EXPERIENCE
Confidential
DataStage Developer
Responsibilities:
- Responsible for implementing proper SDLC from the design document, development, unit test and performance test to production.
- Worked on tuning the DataStage server engine and Parallel Extender to improve existing processes for extracting, cleaning, transforming, integrating, and loading data into Teradata target tables.
- Involved in designing and implementing best practices for ETL processes, including restartability, recovery, parameter standardization, error management and capacity planning.
- Configured the DataStage projects - created projects, enabled project properties and set the user accounts using DataStage Administrator.
- Converted business rules into ETL processes using Parallel Extender/Server jobs and wrote extensive user-defined SQL to override the SQL queries generated by DataStage.
- Developed Master Controlling Sequencer jobs using DataStage Sequencers to maintain the dependency between jobs and restartability.
- Optimized DataStage jobs utilizing partitioning and pipelining features of DataStage.
- Worked extensively on performance tuning by removing performance bottlenecks that occur in the target, source, mapping and session.
- Developed user-defined Routines and transformations to implement business logic, and Shell scripts to automate file manipulation and data loading procedures.
- Extensively used SQL scripts to unit test and validate proper functioning of DataStage code.
- Wrote Shell scripts to run DataStage jobs, pass parameters to them, and automate the FTP process for transferring files between UNIX and Windows.
Environment: IBM InfoSphere Server Suite 8.1.1/8.5.1 (Designer, Director and Administrator), Ascential DataStage (Designer, Director, Manager and Administrator), Test Director, Control-M, Oracle 10g, UNIX AIX 5.3, IBM Cognos 10.1, SQL Server, Teradata, Windows XP, Erwin.
Confidential, AZ
DataStage Developer
Responsibilities:
- Worked closely with Business analysts and Business users to understand the requirements and to build the technical specifications.
- Participated in business meetings to understand retail and institutional customer information and data on stocks, bonds and mutual funds.
- Responsible for creating Source to Target mappings.
- Responsible for designing, developing and building DataStage parallel jobs using DataStage Designer.
- Developed and supported the Extraction, Transformation and Load process (ETL) for a data warehouse from various data sources using DataStage Designer.
- Designed and developed Parallel jobs to extract, clean and transform data and load it into the target tables using DataStage Designer.
- Designed and developed job sequences to run multiple jobs.
- Used DataStage Designer for importing the source and target database schemas, importing and exporting jobs/projects, creating new job categories and table definitions.
- Designed and developed the Routines and Job Sequence for the ETL jobs.
- Prepared Technical Specification Documents for DataStage Jobs.
- Involved in Unit testing and integration testing.
- Responsible for running jobs using Job Sequencers and Job Batches.
- Deployed different partitioning methods like Hash by field, Entire and Range for bulk data loading and for performance boost.
- Developed parallel jobs using various Development/debug stages (Peek stage, Row generator stage, Column generator stage, Sample Stage) and processing stages (Aggregator, Change Capture, Change Apply, Filter, Sort & Merge, Funnel, Remove Duplicate Stage).
- Implemented logical and physical data modeling for the data warehouse with Star and Snowflake techniques using Erwin.
- Extensively worked on Oracle Connector, Oracle Enterprise Stage and Oracle OCI Stage to extract, transform and load data into/from Oracle database.
- Extensively used Reject Link, Job Parameters, and Stage Variables in developing jobs.
- Used DataStage Director to run, schedule, monitor, and test the application in the development environment, and to obtain performance statistics.
- Developed Packages, customized Functions and Triggers based on the business logic.
- Involved in Performance tuning of complex queries.
- Developed Oracle PL/SQL stored procedures, Functions, Packages and SQL scripts to support the functionality of various modules.
Environment: IBM InfoSphere Server Suite 8.1.1 (Designer, Director and Administrator), Ascential DataStage (Designer, Director, Manager and Administrator), Test Director, Control-M, Oracle 10g, UNIX AIX 5.3, SQL Server, IBM Cognos 10.1, Toad for Oracle, Windows XP, Erwin.
Confidential, CA
DataStage Developer
Responsibilities:
- Participated in gathering user requirements and writing technical design documents.
- Involved in the analysis, data modeling, detailed system design, development and technical documentation.
- Worked closely with the Architect, BA and BI staff to get a clear understanding of the business rules, logic and target mappings.
- Developed ETL procedures to ensure conformity, compliance with standards and lack of redundancy, translating business rules and functionality requirements into ETL procedures using DataStage.
- Extensively used DataStage Designer to develop various jobs to extract, cleanse, transform, integrate and load data into data warehouse.
- Designed DataStage Parallel jobs involving complex business logic, update strategies, transformations, filters, lookups and necessary source-to-target data mappings to load the target.
- Extensively used QualityStage stages like Investigate, Match and Survivorship.
- Extensively used QualityStage to convert data from legacy sources into consolidated, high-quality information throughout a complex enterprise warehouse and data mart.
- Extensively worked on Job Sequences to Control the Execution of the job flow using various Activities & Triggers (Conditional and Unconditional) like Job Activity, Wait for file, Email Notification, Sequencer, Exception handler activity and Execute Command.
- Extensively used the Sequential File stage, Complex Flat File Stage, Hashed File Stage, Modify, Dataset, Filter, Funnel, Join, Lookup, Copy, Aggregator, and Change Capture during ETL development.
- Used complex flat file stage for sourcing of data files that contain numerous record formats in a single file.
- Used DataStage Director to Run and Monitor the Jobs for Performance Statistics.
- Worked with DataStage Manager to import/export metadata, jobs, and routines from the repository, and also created data elements.
- Implemented logic for Slowly Changing Dimensions Type II by using Date methodology.
- Involved in Performance Tuning of Parallel Jobs using Performance Statistics.
- Proven track record in troubleshooting DataStage jobs and addressing production issues such as restartability, error and exception handling.
- Created local and shared containers as reusable components, as required, to simplify job design.
- Generated the Data lineage and Impact Analysis reports in Metadata Workbench.
- Extensively worked with database objects including tables, views, indexes, schemas, PL/SQL packages, stored procedures, functions, and triggers.
- Developed UNIX Shell scripts to automate repetitive database processes and maintained shell scripts.
- Involved in solving Production support issues.
- Used Autosys for job scheduling.
Environment: IBM Information Server DataStage 8.1 (Designer, Director, Manager and Administrator), QualityStage 8.1, Metadata Workbench 8.1, Oracle 10g, TOAD 7.3, ERWIN 4.2, Win XP, UNIX, Autosys.
Confidential, Foothill Ranch, CA
DataStage Developer
Responsibilities:
- Designed and customized data models for a Data warehouse supporting data from multiple sources in real time.
- Developed and designed Data marts extensively using Star Schema.
- Participated in the full project life cycle, from analysis to production implementation, with emphasis on identifying sources, validating source data, developing logic and transformations as per the requirements, creating jobs and loading the data into different targets.
- Designed and developed DataStage jobs for the extraction, transport and loading of data into target tables.
- Wrote the Design documents and specifications in accordance with Confidential’s design frameworks and best practices.
- Defined the Mapping and Test Case documents.
- Extensively analyzed the Data Sources to identify data anomalies, patterns and value ranges.
- Used DataStage Enterprise Edition/Parallel Extender stages, namely Dataset, Sort, Lookup, Peek, Standardization, Row Generator, Remove Duplicates, Filter, External Filter, Aggregator, Funnel, Modify, and Column Export, in accomplishing the ETL coding.
- Developed DataStage job sequences using the User Variables Activity, Job Activity, Wait For File, Execute Command, Loop Activity, and Terminate stages.
- Wrote DDL scripts for creating and altering tables.
- Wrote extensive UNIX scripts for running DataStage jobs and validating data.
- Tuned DataStage jobs to enhance their performance.
- Involved in writing, testing and implementation of the triggers at Database level using PL/SQL.
- Wrote Release notes, Deployment documents and scheduled the jobs.
Environment: IBM Information Server DataStage 8.1 (Designer, Director, Manager and Administrator), QualityStage 8.1, Flat files, Oracle 10g/9i, Erwin 4.0, PL/SQL, UNIX Shell Programming.
Confidential, MN
DataStage Developer
Responsibilities:
- Involved in analysis, requirements gathering, functional/technical specification, development, deploying and testing.
- Used IBM DataStage EE/Parallel Extender as an ETL tool to extract data from sources like DB2 and flat files and load it into target tables.
- Developed DataStage Parallel jobs for various components and strings involved in the calculation of quotes and rates.
- Developed parameter-driven ETL processes to map source systems to target data systems, with complete source system profiling in DataStage.
- Created Parallel Shared Containers for reusable calculations in various strings.
- Developed Batches and Sequencers in Designer to run and control sets of jobs.
- Extensively used DataStage Designer stages such as ODBC, Aggregator, Transformer, Join, Dataset, Lookup, Funnel, Peek, Pivot and Modify.
- Extensively used Built-in, Plug-in and Custom Stages for Extraction, transformation and loading of the data.
- Used Director for executing and scheduling jobs and analyzing logs.
- Worked with the Control-M tool for scheduling and monitoring the DataStage job sequences in development and test environments.
- Used Manager to import/export various job components to various projects.
- Used Type 2 mapping to update a slowly changing dimension table so that full history is kept.
- Involved in SQL & PL/SQL programming for procedures, functions, packages to extract data from various source systems.
- Developed UNIX Shell scripts to automate repetitive database processes and maintained shell scripts.
Environment: IBM DataStage EE 8.0 (Designer, Administrator, Director), DB2 UDB, Oracle 9i, SQL, PL/SQL, UNIX Shell scripts, SQL*Loader, TOAD, UNIX AIX 5.2, Control-M.
Confidential
Data Warehouse Developer
Responsibilities:
- Involved in the design and implementation of the Banking Data Warehouse (BDW).
- Involved in creating entity-relationship and dimensional data models using the Erwin data modeling tool.
- Designed the Target Schema definition and the Extraction, Transformation and Loading (ETL) processes using DataStage.
- Performed tuning on database queries, ETL mappings and end-user queries.
- Developed Architecture for building a Data mart by using data modeling.
- Developed Analytical applications that can analyze large amounts of online and offline data.
- Worked with DataStage Manager to import metadata from the repository.
- Used DataStage Designer to develop processes for extracting, cleansing, transforming, integrating, and loading data into the data mart database.
- Worked on programs for scheduling data loading and transformations from legacy systems to the Data Warehouse using DataStage.
- Used DataStage Director and its run-time engine to schedule, run, test, debug and monitor the DataStage jobs.
- Wrote shell scripts to automate file manipulation and data loading procedures.
- Unit tested and deployed the jobs to Production.
Environment: Oracle 8i, PL/SQL, SQL*Plus, Ascential DataStage 7.x (DataStage Designer, DataStage Administrator, DataStage Manager, and DataStage Director), Windows NT, UNIX.