Data Stage Developer Resume
Chicago, IL
SUMMARY
- Over 7 years of experience in ETL using Datastage (Ver. 11.5/11.3/9.1/8.5/8.1/7.5 ) client components, Datastage designer, Datastage manager and Datastage director.
- Experienced in building data warehouses and data marts using Datastage8/7.5.2 (datastage enterprise edition).
- Developed efficient jobs for data extraction/transformation/loading (ETL) from different sources to a target data warehouse.
- Experience working with Teradata and Oracle databases.
- Strong knowledge of data warehousing basics and relational database management systems and dimensional modeling (star schema and snowflake schema).
- Proficient at 6developing server jobs, parallel jobs and shared containers.
- Experience with various stages such as Aggregator, Filter, Merge, Hash, Sort, Transformer, Lookup, Change Capture Stage, Pivot, Join and Remove duplicates.
- Experienced in performance tuning.
- Extracted, transformed and loaded data from various heterogeneous data sources and destinations like Oracle, flatfiles, csv files and xml files.
- Strong inclination for finishing jobs before project deadlines. Flexible and versatile enough to adapt to the changing environments.
- Experience in migrating ETL code to upgraded version.
- Experience in programming using SQL, PL/SQL, and UNIX Shell Scripting.
- Extensive experience in Microsoft SQL Server 2005/2000, IBM DB2 UDB 8.0/7.0, Informix, Teradata, Oracle 10g/9i/8i, XML files and Message Queues.
- Strong Experience in Unit testing and System testing of the DataStage jobs.
- Experience in Scheduling the Data Stage jobs using TWS scheduling, Autosys.
- Knowledge on designing and developing reports using BI/OLAP tools like Business Objects and Cognos.
- Experience in troubleshooting of jobs and addressing production issues like data issues, ENV issues, performance tuning and enhancements.
- Knowledge in using Erwin as leading Data modeling tool for logical (LDM) and physical data model (PDM).
- Extensive experience in Unit Testing, Functional Testing, System Testing, Integration Testing, Regression Testing, User Acceptance Testing (UAT) and Performance Testing.
- Extensively handled offshore team for project by assigning the tasks and resolving the issues.
- Excellent communication, interpersonal, technical, analysis and leadership skills. Thrive in both independent and collaborative work environments.
TECHNICAL SKILLS
ETL Tools: Data Stage 11.x/9.x/8.x/7.X/6.0/5.2 (Manager, Designer, Director, Administrator, Parallel Extender, QualityStage), DataStage Plug - In, OLAP and OLTP.
OLAP Tools: Business Objects, Cognos, OBIEE.
Databases: Oracle 10g/9i/8i, Teradata, MS SQL Server 2000/7.0/6.5 , DB2 UDB.
Operating Systems: Windows NT/95/98/2000, UNIX, MS-DOS
Programming Languages: SQL, PL/SQL, Unix Shell Scripting, C, C++, C#, PERL.
PROFESSIONAL EXPERIENCE
Confidential, Chicago, IL
Data stage Developer
Responsibilities:
- Interacted with End user to understand the business requirements and in identifying data sources.
- Helped in preparing the mapping document for source to target.
- Used Parameter files, User variables and Stage variables in parallel and sequence jobs based on the requirements.
- Controlled jobs execution using sequencer, used notification activity to send email alerts.
- Imported table/file definitions into theDataStagerepository.
- Extensively used SQL tuning techniques to improve the performance of Data Stage Jobs.
- Extensively used Teradata utilities like FastExport and MLoad.
- Implemented slowly change dimensions (SCD) type 1, type 2, type 3.
- Implemented incremental extraction and incremental load.
- Extracted data from various sources like Oracle and DB2 and loaded into Teradata.
- Involves in loading data from flat files to ware house landing zone tables using Teradata Utilities.
- Tuned DataStage transformations and jobs to enhance their performance.
- Developed common Jobs, Shared containers and Server Routines which are used across the project in most of the interfaces.
- Prepared Unit Test Cases and executing them.
- Fix the defects raised by testing team and maintain the status in HP Quality Center.
- Used different types of stages like Transformer, CDC, Remove Duplicate, Aggregator, ODBC, Join, Funnel, dataset and Merge for developing different jobs.
- Extensively used Parallel stage like row generator, column generator and Peek Stages for debugging purpose.
- Experienced in scheduling Sequence and Parallel jobs usingDataStageDirector, Unix ScriptsandSchedulingtoolTWS.
Environment: IBM-InfoSphere Data stage 11.5 (Server & EE), TOAD, DB2,Teradata, Teradata SQL Assistant UNIX- AIX, Windows XP, MS Word, Excel, Clearcase, Clear Quest.
Confidential, Austin, TX
Data stage Developer
Responsibilities:
- Involved in design and development of parallel jobs, sequences using the Designer.
- Interacted with Business Analysts to finalize the requirements and documented the technical design document for Coding.
- Designed several parallel jobs using Sequential File, Dataset, Join, Merge, Lookup, Change Apply, Change Capture, remove duplicates, Funnel, Filter, Copy, Column Generator, Peek, Modify, Compare, Oracle Enterprise, Surrogate Key, Aggregator, Transformer, Row Generator stages.
- Encompasses Database designing, ETL phases of Data Warehousing. This is achieved with an emphasis on relational data modeling and dimensional data modeling for OLTP and OLAP systems.
- Tuned Data Stage jobs for better performance by creating Data stage Lookup files for staging the data and lookups.
- Created Shared Containers to increase Object Code Reusability and to increase throughput of the system.
- Experience developing complex transformations, surrogate keys, routines, dimension tables and fact tables.
- Used Environment Variables, Stage Variables and Routines for developing Parameter Driven Jobs and debugging them.
- Data staging of the ETL solutions from different source systems to target database.
- Performed extraction, transformation and loading of data using different types of stages and by performing derivations over the links connecting these stages.
- Enhanced the Job Performance by using proper Partitioning methods and analyzing the resources utilized using Job Monitor.
- Used Data Stage Director for running and monitoring performance statistics.
- Scheduled existing UNIX shell scripts using Crontab.
- Modified UNIX shells according to the latest business requirement.
- Modified UNIX shells to implement new landing path for various feeds with new FTP connections and for proper log maintenance.
- Created SQL statements used for modifying, monitoring and resetting jobs during monitoring process.
- Redesigned few jobs in Datastage Designer to meet the changes in new incoming feeds.
- Created and modified all the jobs and shell scripts with new email Id’s and groups for abort and failure notifications.
- Actively involved in everyday job monitoring and resolved all the issues which are related to job aborts and job failures.
Environment: IBM-InfoSphere Data stage9.1 (Server & EE), Erwin 4.2, Oracle, TOAD, UNIX- AIX, Windows XP, MS Word, Excel, Crontab, Clear Quest.
Confidential, Rochester, MN
Data stage Developer
Responsibilities:
- Analyzed Business Requirements by working closely with Business Area.
- Designed Star/Snowflake schemas, converted pre-existing serial applications to parallel processing using DataStage parallel extender.
- Designed jobs using different parallel job stages such as Join, Merge, Lookup, Remove Duplicates, Copy, Filter, Funnel, Dataset, Change Captured, Modify, and Aggregator.
- Used Debugging stages like Row generator, Column Generator and Peek, DB2 and Teradata.
- Demonstrated proficiency in working with the XML Input and XML Output stages to extract data and create XML documents.
- Used XML parallel Real Time Integration stages to extract and transform XML nodes from OLTP systems.
- Extensively used Teradata utilities like FastExport and MLoad.
- Implemented slowly change dimensions (SCD) type 1, type 2, type 3.
- Implemented incremental extraction and incremental load.
- Designed and developed various jobs for scheduling and running jobs under job sequencer and DataStage Director
- Performed Administrator functions such as creating projects, setting tunables, protecting project, releasing the jobs, and setting environment variables.
- Extensively implemented import/export utilities for migrating code.
- Replaced transformer stages with other stages to improve performance of job.
- Worked with configuration file in parallel extender to take maximum benefit from parallel environment.
Environment: IBM Information Server (DataStage & QualityStage) 8.5/8.7, MS Visio, Tectia SSH, IBM DB2, Teradata, Teradata SQL Assistant, AutoSys, Serena Dimensions (Version Control), Cognos.
Confidential
ETL Data Stage Developer
Responsibilities:
- Gathering requirements and writing specifications for ETL Job modules.
- Involved in Defining and designing the process for Extraction, Transformation and Loading Data from various source systems to ODS & Data warehouse.
- Analyze newly converted data to establish a baseline measurement for data quality in Data Warehouse.
- Designing and developing Master controlling sequencer jobs using the DataStage Job Sequencer.
- Developed Common UTL - ETL Jobs, PLSQL Procedures which will run the ETL-Jobs.
- Performance tuning of DataStage ETL- Jobs.
- Define constraints and derivations to implement business rules in DataStage jobs.
- Developed DS Routines for extract job parameters from files and developed DS routines for job Auditing. Perform data manipulation using BASIC functions and DataStage transforms.
- Design complex job control processes to manage a large job network
- Involved in developing UNIX scripts to automate the Data Load processes.
- Involved in All ETL Testing Process, Bug Fixing.
- Implemented and Maintained ETL jobs in production.
Environment: DataStage 8.0 (Parallel Extender), Oracle 9i, DB2UDB,Mainframe, PVCS, SQL, PL/SQL, TOAD, Control M, Hummingbird, Shell Scripts, Universe Basic, HP UNIX, Windows 2000.
Confidential
Data Stage Developer
Responsibilities:
- Analyzing the mapping documents, understanding the business requirements.
- Understand the Low Level Design Documents and thereby develop the jobs.
- Data from different source systems was extracted, transformed and loaded into data warehouse tables.
- Responsible for using different types of Stages such as Transformer, Join, Sequential File, Copy, Filter and Aggregator to develop different jobs.
- Developed Parallel and Sequence Jobs.
- Created Shared Containers, Local Containers for DS jobs.
- Used Data Stage Director to schedule, monitor, and run jobs.
- Prepare Unit test cases and execute them.
- Maintain the test cases and test results in Quality Center Tool, Update the defect status as on when required.
- Prepare Unix Scripts and Control M jobs flow.
- Implemented Best practices to optimize the performance.
- Designed ETL Process as per the requirements and documented the Flow using Visio.
- Did Unit testing and tuned for better performance with updates on data ware house tables.
- Provided Post Implementation Support.
Environment: DataStage 8.0/8.1, Oracle 9i, HP Quality Centre, UNIX Shell Scripting, SQL, PL/SQL