Sr. Etl Data Stage Developer Resume
Columbus, OH
SUMMARY
- Highly competent Sr. ETL IBM Data Stage Developer / Designer with over 7 years of experience in Information technology using IBM WebSphere / InfoSphere Data Stage v8.x and Ascential Data Stage v7.x.
- Extensively worked with Data Stage Designer, Director, Administrator and Manager.
- Complete Software Development Life Cycle (SDLC) experience with system design, development, implementation, testing, support and enhancements etc.
- Extensive ETL tool experience using IBM Infosphere/Websphere Datastage and Ascential Datastage.
- Worked on Datastage client tools like Datastage Designer, Datastage Director and Datastage Administrator.
- Good Knowledge about the principles of DW like Data marts, OLTP, OLAP, Dimensional Modeling, fact tables, dimension tables and star/snowflake schema modeling.
- Excellent in using highly scalable parallel processing infrastructure using parallel jobs with multi - node configuration files.
- Experienced in scheduling sequence, parallel and server jobs using DataStage Director, UNIX scripts and scheduling tools.
- Designed and developed parallel jobs, server and sequence jobs using Datastage Designer.
- Experience in using different types of stages like Transformer, Aggregator, Merge, Join, Lookup, Sort, copy, Remove duplicate, Funnel, Filter, Pivot, Shared containers for developing jobs.
- Worked and extracted data from various data sources such as Oracle, MS-SQL Server, MS- Access, Teradata, DB2, XML and Flat files.
- Extensive experience in Unit Testing, Functional Testing, System Testing, Integration Testing, Regression Testing, User Acceptance Testing and Performance Testing.
- Created local and shared containers to facilitate ease and reuse of jobs.
- Proven track record in addressing production issues like performance tuning, enhancement, data, environment and memory issues.
- Imported the required Metadata from heterogeneous sources at the project level.
- Used the Data Stage Director and its run-time engine to schedule running the solution, testing and debugging its components, and monitoring the resulting executable versions (on an ad hoc or scheduled basis).
- Experience in Production Support, extensively worked on production support issues.
- Extensively used BI Integration Services to design ETL process and BI Analysis Services to create cubes.
- Quick learner and adaptive to new and challenging technological environments.
- Project Management experience with excellent problem-solving, organization and leadership skills.
TECHNICAL SKILLS
ETL Tools: IBM InfoSphere / WebSphere Datastage 8.5/8.0.1/ 7.5.2 (Designer, Director, Administrator, Manager, and Quality Stage)
Operating Systems: Windows 7, Windows XP, UNIX, IBM AIX, HP-UX
Languages: C, UNIX Shell scripting, Java, SQL and PL/SQL
Database: Oracle 11g/10g/9i/8i/, SQL Server 2000/2005/2008 , Teradata, UDB/DB2.
Other: SQL Assistant, TOAD, SQL Developer, WinSQL, SAP BI 7.1, Management console, DataStageTrouble shooting, Data Issue resolution, batch monitoring.
PROFESSIONAL EXPERIENCE
Confidential, Columbus OH
Sr. ETL Data Stage Developer
Responsibilities:
- Actively involved in Design and development of Extraction, Transformation and Loading Process.
- Extensively used Information Analyzer Client to perform Database comparisons and OLTP analysis
- Involved in dimensional modeling of the data mart to design the business process.
- Worked with various Partitioning (Round-Robin, Hash fields, Entire, Same,modulus etc.) and Collection (Round-Robin, Ordered and Sorted Merge) techniques.
- Symmetric Multi-Processing processors were used to accommodate partition parallelism.
- Imported and Exported Jobs, table definitions, created custom transforms and routines.
- Worked with IBM WebSphere Web Console client to configure WebSphere to the Console connection.
- Worked with DataStage Director to run, schedule, monitor and view logs for jobs.
- Developed various DataStage server and parallel jobs and Transformations to load data from relational and flat file sources into the data mart.
- Created shared and local containers to simplify and modularize server job designs by replacing complex areas of the diagram with a single container stage.
- Used various stages like lookup, aggregator, merge on the extracted source data according to the business rules and technical specifications.
- Optimized the jobs and routines using various optimization techniques.
- Involved in creating Technical Specification Document (TSD) for the project.
- Performed debugging of some existing jobs using the Director to test and fix the errors.
Environment: IBM InfoSphere DataStage 8.5 (Administrator, Director, Designer), PL/SQL, Oracle 10g/9i, UNIX Shell scripts, Teradata, Flat files, SQL Developer
Confidential, Bloomfield, CT
ETL Data stage developer
Responsibilities:
- Involved in the Analysis of the functional side of the project by interacting with functional experts to design and write technical specifications.
- Data base design & data model - Logical & Physical design.
- Worked on the Architecture of ETL process.
- Created Data stage jobs (ETL Process) for populating the data into the Data warehouse constantly from different source systems like ODS, flat files, scheduled the same using Data Stage Sequencer for SI testing.
- Obtaining productive gains during vehicle life to meet aggressive cost reduction target.
- Extracted data from sources like Oracle and Flat Files.
- Preparing development timing plans & reporting to senior management about the supplier progress system & ensuring their engineering support for onsite integration & production launch.
- Working in a team with other associate product & component developers.
- Involved in Unit Testing and Integration Testing to test jobs and also the flow.
- Worked on changed requests as per clients and projects technical specification needs.
- Awareness about the functional/business aspects for the components.
- Automated process of job monitoring which helped in minimizing the manual intervention & documenting them perfectly.
- Provide support for monthly/weekly batches in production run.
- Involved in the Documentation of the ETL phase of the project.
- Developed the reusable components, best practices that were later on used in other Data warehouse.
- Documenting business process, lesson learnt & best practices for the project
Environment: IBM InfoSphere DataStage 8.5 (Administrator, Director, Designer), PL/SQL, Oracle 10g/9i, Korn Shell scripts, Flat files, ETL Support, Toad.
Confidential, Nashville, TN
ETL Datastage Developer
Responsibilities:
- Designed and tested the DataStage jobs for extract, transform and load.
- Developed various jobs using stages DB2 enterprise, lookup, Dataset, Join, Sort, Aggregator, Filter, Modify, Funnel etc..
- Involved in performance tuning of the DataStage jobs and queries.
- Extensively worked with Parallel Extender for parallel processing to improve job performance while working with bulk data sources.
- Written SQL in DB2 for using in DataStage and testing the data.
- Worked exclusively with the Teradata SQL Assistant to interface with the Teradata.
- Involved in Extracting, transforming, loading and testing data from XML files, Flat files, Oracle and DB2 using Datastage jobs.
- Used DataStage Designer for importing metadata from repository, new job categories and creating new data elements.
- Involved writing shell scripts for running Datastage jobs, Using CSV (Hierarchy file for Datastage jobs).
- Implemented ETL Processes and developed complex SQL queries involving extremely large tables using DataStage.
- Using DataStage Designer designed and created jobs for Target Warehouse Entity for Oracle.
- Performed the Unit testing for jobs developed to ensure that it meets the requirements.
Environment: IBM WebSphere DataStage 8.0.1 - DataStage Designer, DataStage Director, Data Stage Administrator, Oracle, PL/SQL, UNIX Shell scripting, Teradata SQL Assistant, CSV files.
Confidential, Dallas, TX
DataStage Developer
Responsibilities:
- Involved in the design and development of Data Warehouse
- Worked closely with data modeler and database administrator to understand the business process and participated in gathering of business requirements to build the data warehouse model
- Design the Data Mart model with Erwin using Star Schema methodology
- Worked extensively with Parallel Extender using Parallel Processing (Pipeline and partition parallelism) techniques to improve job performance while working with bulk data sources
- Extensively used Parallel Job Stages like Stored Procedure, Dataset/File set, Lookup File set, Aggregator, Join, Transformer, Sort, Merge, Filter, FTP, Lookup, Head, Tail and Pivot
- Used Job Sequencer stages to link multiple jobs in Series/Parallel based on the requirement
- Used partition key in Parallel Extender job for bulk loading of data and to get better performance
- Used the DataStage Designer to develop processes for extracting, cleansing, transforms, sintegrating and loading data into data ware house database
- Used DataStage Manager for importing metadata from repository, new job categories and creating new data elements
- Used the DataStage Director and its run-time engine to schedule running the solution, testing and debugging its components, and monitoring the resulting executable versions (on an ad hoc or scheduled basis)
- Implemented shared containers for multiple jobs, which have the same business logic
- Extensively used Shell Scripts to trigger the DS jobs through a Master Sequencer
Environment: IBM WebSphere Data Stage 8.0.1, Teradata, Oracle 9i/10g, PL/SQL, IBM DB2/UDB 8.x, UNIX Shell Scripts, SQL Developer, Teradata SQL Assistant.
Confidential, Milwaukee, WI
IBM DataStage Developer
Responsibilities:
- Extensively used DataStage for extracting, transforming and loading databases from sources including Oracle, DB2 and Flat files.
- Generation of Surrogate Keys for the dimensions and fact tables for indexing and faster access of data in Data Warehouse.
- Tuned transformations and jobs for Performance Enhancement.
- Extracted data from flat files and then transformed according to the requirement and Loaded into target tables using various stages like sequential file, Look up, Aggregator, Transformer, Join, Remove Duplicates, Change capture data, Sort, Column generators, Funnel and Oracle Enterprise.
- Created Batches (DS job controls) and Sequences to control set of jobs.
- Extensively used DataStage Change Data Capture for DB2 and Oracle files and employed change capture stage in parallel jobs.
- Executed Pre and Post session commands on Source and Target database using Shell scripting.
- Collaborated in design testing using HP Quality Center.
- Extensively worked on Job Sequences to Control the Execution of the job flow using various Activities & Triggers (Conditional and Unconditional) like Job Activity, Wait for file, Email Notification, Sequencer, Exception handler activity and Execute Command.
- Utilized Parallelism through different partition methods to optimize performance in a large database environment.
- Developed DS jobs to populate the data into staging and Data Mart.
- Executed jobs through sequencer for better performance and easy maintenance.
- Performed the Unit testing for jobs developed to ensure that it meets the requirements.
- Developed UNIX shell scripts to automate file manipulation and data loading procedures.
Environment: IBM WebSphere Data Stage 8.0.1, Oracle 10g, Teradata, PL/SQL, IBM DB2/UDB 8.x, UNIX Shell Scripts, SQL Developer, Teradata SQL Assistant.
Confidential
DataStage Developer
Responsibilities:
- Reviewed Technical Spec’s and determined the design and Recommended options and approaches.
- Translated high level requirements into efficient ETL process.
- Mapped the source and target databases by studying the specifications and analyzing the required transforms.
- Designed and developed DataStage PX ETL jobs (Parallel jobs and job sequencers).
- Developed Parallel jobs using Stages, which includes Join, Transformer, Sort, Merge, Filter and Lookup.
- Extracted merchant data from XML documents using XML input stage.
- Used Remove Duplicates stage in PX (EE) to remove the duplicates in the data.
- Created job sequences. Created job schedules to automate the ETL process.
- Used DS Director to validate, schedule and run production jobs.
Environment: Ascential DataStage 7.5.2, (Director, Designer, Administrator), Oracle, DB2, UNIX, Windows XP, Toad, flat files, SQL Scripting.