ETL DataStage Developer Resume
Richardson, TX
SUMMARY
- Over 9 years of experience with ETL tools such as Confidential InfoSphere DataStage and Informatica, with strong business understanding and knowledge of financial, insurance and banking projects. Hands-on experience in all aspects of the Software Development Life Cycle (SDLC) and Agile/Scrum methodologies.
- Expertise in Confidential InfoSphere DataStage v7.5/8.5/8.7/9.1 tools such as DataStage Designer, DataStage Director and DataStage Administrator, and in Informatica PowerCenter v9.5 Designer tools such as Source Analyzer, Warehouse Designer, Mapping Designer, Mapplet Designer, Transformation Developer, Workflow Manager and Workflow Monitor.
- Working knowledge of translating business rules and requirements into logical and physical models using the Erwin tool.
- Practical understanding of data modeling (dimensional and relational) and data warehouse concepts such as OLTP, OLAP, star schema modeling, snowflake schema modeling, and fact and dimension tables.
- Experience with Data Extraction, Transformation, and Loading (ETL) from disparate Data sources.
- Excellent experience in designing, developing, documenting and testing ETL jobs and mappings, as Server and Parallel jobs, using Confidential InfoSphere DataStage to populate tables in data warehouses and data marts.
- Experience in building ETL jobs like EXTRACT, LOADS, BATCHES and SEQUENCERS.
- Expert in designing Parallel jobs using various stages like Join, Merge, Lookup, Remove duplicates, Filter, Dataset, Complex flat file, Modify, Aggregator.
- Extensively worked with Informatica PowerCenter transformations such as Source Qualifier, Lookup, Filter, Expression, Router, Normalizer, Joiner, Update Strategy, Rank, Aggregator, Stored Procedure, Sorter, Union and Sequence Generator.
- Experience in integration of various data sources (DB2 UDB, SQL Server, PL/SQL, Oracle and Teradata) into the data staging area.
- Implemented Slowly Changing Dimensions - Type I & II in Dimension tables as per the requirements.
- Proven track record in troubleshooting DataStage jobs and addressing production issues such as performance tuning and enhancements.
- Proficient in writing, implementing and testing triggers, procedures and functions in PL/SQL and Oracle.
- Expertise in various types of ETL testing such as Integration, Volume, Performance, Structure Validation, Count Validation and Data Validation.
- Experience in UNIX Shell scripting. Knowledge in Perl Scripting.
- Strong skills in coding and debugging Teradata utilities such as FastLoad, FastExport, MultiLoad and TPump for Teradata ETL processing of large data volumes.
- Experience in performance tuning and production support.
- Experience with Autosys and Cronacle (third-party scheduling tools) for scheduling jobs.
- Excellent communication skills, good organizational skills, self-motivated and extremely hardworking with ability to implement business concepts quickly and efficiently.
- Excellent communicator with exceptional team-building skills.
TECHNICAL SKILLS
RDBMS: Oracle 10g, Teradata, SQL Server, DB2
ETL Tools: DataStage 7.5PX/8.1/8.5/9.1/11.0, Informatica 9.1/9.5
OS: Windows and UNIX
Scheduling Tools: Autosys, Cronacle, Control-M
Programming Languages: Oracle SQL, PL/SQL, Perl
Data Modeling Tools: Erwin Data Modeler 7.3.8
Version Control Tools: Subversion
PROFESSIONAL EXPERIENCE
Confidential, Richardson, TX
ETL DataStage Developer
Responsibilities:
- Provided timely resolution of production issues, and developed and implemented enhancements and change requests from the management team.
- Wrote medium-complexity and complex queries in DB2 to analyze data.
- Extracted data from flat files and DB2 and loaded it into the data warehouse.
- Developed application views in DB2.
- Created mappings with transformations such as connected/unconnected Lookup, Expression, Joiner, Sequence Generator, Source Qualifier, Sorter, Aggregator, Normalizer, Update Strategy, Rank and Router.
- Involved in error handling, debugging and troubleshooting sessions using the Session logs, Debugger and Workflow Monitor.
- Used Pre-SQL and Post-SQL scripts for loading the data into targets according to the requirement.
- Provided the template for the Zena jobs that schedule the mapping runs.
- Implemented Slowly Changing Dimensions (SCD Type 2) to update the dimensional schema and used the Confidential CDC tool to capture recent updates.
- Generated UNIX shell scripts for triggering and automating the execution of DataStage jobs, encryption and decryption of data files, secure FTP to vendor sites, pre- and post-processing data validations, and automated email notifications.
- Worked on performance tuning of SQL queries in DataStage jobs.
- Involved in writing test plans and in unit, integration and regression testing of the DataStage jobs and UNIX shell scripts.
- Worked on deployment and production checkout planning of data and schema changes in the database.
- Coordinated with different infrastructure teams to execute deployments.
- Coordinated with users on incident management for claims processing.
Environment: Confidential InfoSphere DataStage v8.5, DB2, Flat files, Unix, Zena, Confidential CDC tool.
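For readers unfamiliar with the SCD Type 2 pattern named in the responsibilities above, the following is a minimal Python sketch of the general technique only. It is not the DataStage job or the CDC tool used on the project; `DimRow`, `apply_scd2` and all field names are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class DimRow:
    surrogate_key: int
    business_key: str
    attributes: Tuple[str, ...]      # tracked attribute values
    valid_from: date
    valid_to: Optional[date] = None  # None marks the current version


def apply_scd2(dimension: List[DimRow],
               changes: List[Tuple[str, Tuple[str, ...]]],
               load_date: date) -> List[DimRow]:
    """Return a new dimension with the changes applied as SCD Type 2."""
    result = list(dimension)
    next_key = max((r.surrogate_key for r in result), default=0) + 1
    # Index of the current (open-ended) row for each business key.
    current = {r.business_key: i for i, r in enumerate(result)
               if r.valid_to is None}
    for business_key, attributes in changes:
        idx = current.get(business_key)
        if idx is not None:
            if result[idx].attributes == attributes:
                continue  # unchanged: keep the current version as is
            # Changed: expire the current version instead of overwriting it.
            result[idx] = replace(result[idx], valid_to=load_date)
        # New key or changed attributes: add a new current version.
        result.append(DimRow(next_key, business_key, attributes, load_date))
        current[business_key] = len(result) - 1
        next_key += 1
    return result
```

The point of the pattern is that history is preserved: a changed record closes the old row with an end date and adds a new row with a fresh surrogate key, whereas Type 1 would simply overwrite the attributes.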
Confidential, Foster City, CA
ETL Informatica Developer
Responsibilities:
- Provided timely resolution of production issues, and developed and implemented enhancements and change requests from the management team.
- Correlated business requirements with technical aspects and produced high-level and detailed designs for the Informatica mappings.
- Developed the Informatica mappings using various transformations, Sessions and Workflows.
- Extracted data from flat files, Oracle and XML and loaded it into the data warehouse.
- Developed application views in Oracle and simplified the data using Expression and Router transformations.
- Created mappings with transformations such as connected/unconnected Lookup, Expression, Joiner, Sequence Generator, Source Qualifier, Sorter, Aggregator, Normalizer, Update Strategy, Rank and Router.
- Involved in error handling, debugging and troubleshooting sessions using the Session logs, Debugger and Workflow Monitor.
- Used Pre-SQL and Post-SQL scripts for loading the data into targets according to the requirement.
- Provided the template for the Autosys jobs that schedule the mapping runs.
- Implemented Slowly Changing Dimensions (SCD Type 2) to update the dimensional schema.
- Performed Informatica code migrations to different Informatica repositories.
- Generated UNIX shell scripts for triggering and automating the execution of Informatica mappings, encryption and decryption of data files, secure FTP to vendor sites, pre- and post-processing data validations, and automated email notifications.
- Worked on performance tuning of SQL queries in Informatica mappings.
- Involved in writing test plans and in unit, integration and regression testing of the Informatica mappings and UNIX shell scripts.
- Worked on deployment and production checkout planning of data and schema changes in the database, as well as Informatica component migration.
- Coordinated with different infrastructure teams to execute deployments.
Environment: Informatica PowerCenter 9.5, Oracle 11g, Flat files, Unix, Autosys
Confidential
DataStage Developer
Responsibilities:
- Analyzed, designed, developed, implemented and maintained Parallel jobs using Confidential InfoSphere DataStage.
- Strong knowledge of and experience in mapping source data to target data using Confidential DataStage 8.x.
- Experience with PX file stages, including the Complex Flat File, Data Set, Lookup File Set and Sequential File stages.
- Implemented shared containers for use across multiple jobs and local containers within the same job, as per requirements.
- Implemented multi-node declaration using configuration files (APT Config file) for performance enhancement.
- Successfully implemented pipeline and partitioning parallelism techniques and ensured load balancing of data.
- Worked on different partitioning methods like Hash by column, Round Robin, Entire, Modulus, and Range for bulk data loading and for performance boost.
- Used DataStage Director to schedule and run jobs, test and debug their components, and monitor performance statistics.
- Created Batches (DS job controls) and Sequences to control sets of jobs. Executed jobs through sequencers and created batch jobs for better performance and easy maintenance.
- Debugged, tested and fixed the transformation logic applied in the parallel jobs.
- Involved in creating UNIX shell scripts for database connectivity and for executing queries in parallel job execution. Knowledge of NDM file transfers.
- Participated in the build and review of BTEQ, FastExport, MultiLoad and FastLoad scripts.
- Performed unit testing and provided the test cases.
- Scheduled the jobs using AutoSys and crontab.
- Responsible for daily verification that all scripts, downloads, and file copies were executed as planned, troubleshooting any steps that failed, and providing both immediate and long-term problem resolution.
- Provided technical assistance and support to IT analysts and business community.
- Provided maintenance support for Confidential month end loads.
Environment: Confidential InfoSphere DataStage v8.5, Cronacle, Teradata, Oracle 10g, Subversion, UNIX, SQL*Plus.
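To illustrate how three of the partitioning methods listed above (Round Robin, Modulus and Hash by column) distribute rows across nodes, here is a minimal Python sketch. It shows the general idea only and does not reproduce DataStage's internal partitioners; the function names are hypothetical, and CRC32 is an assumed stand-in for whatever hash function the engine uses.

```python
import zlib
from typing import Any, Dict, List

Row = Dict[str, Any]


def round_robin(rows: List[Row], nodes: int) -> List[List[Row]]:
    """Deal rows to nodes in turn; gives evenly sized partitions."""
    parts: List[List[Row]] = [[] for _ in range(nodes)]
    for i, row in enumerate(rows):
        parts[i % nodes].append(row)
    return parts


def modulus(rows: List[Row], nodes: int, key: str) -> List[List[Row]]:
    """Assign each row by (integer key value) mod (number of nodes)."""
    parts: List[List[Row]] = [[] for _ in range(nodes)]
    for row in rows:
        parts[row[key] % nodes].append(row)
    return parts


def hash_by_column(rows: List[Row], nodes: int, key: str) -> List[List[Row]]:
    """Assign each row by a hash of the key column, so equal keys co-locate."""
    parts: List[List[Row]] = [[] for _ in range(nodes)]
    for row in rows:
        digest = zlib.crc32(str(row[key]).encode("utf-8"))
        parts[digest % nodes].append(row)
    return parts
```

Round robin balances load regardless of content, while the key-based methods guarantee that rows sharing a key value land on the same node, which is what key-dependent stages such as Join, Aggregator and Remove Duplicates need.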
Confidential, Parsippany, NJ
DataStage Developer
Responsibilities:
- Provided technical support to the team as the ETL developer. Addressed best practices and productivity-enhancing issues.
- Loaded data into load, staging and lookup tables. The staging area was implemented using flat files.
- Created jobs in DataStage to import data from heterogeneous data sources such as Oracle 9i and text files.
- Extensively worked on Job Sequences to control the execution of the job flow using various activities and triggers (conditional and unconditional) such as Job Activity, Wait For File, Email Notification, Sequencer, Exception Handler activity and Execute Command.
- Assisted Mapping team to transform the business requirements into ETL specific mapping rules.
- Enhanced various complex jobs for performance tuning.
- Responsible for version controlling and promoting code to higher environments.
- Performed unit testing, system integration testing and user acceptance testing.
- Involved in ongoing production support and process improvements. Ran the DataStage jobs through the third-party scheduler Autosys.
- Involved with the batch team in preparing the JILs.
- Utilized parallelism through different partition methods to optimize performance.
- Developed DS jobs to populate data into staging and the data mart.
- Performed unit testing of developed jobs to ensure that they meet the requirements.
- Developed UNIX shell scripts to automate file manipulation and data loading procedures.
- Scheduled the jobs using AutoSys.
- Involved in admin tasks such as deleting datasets, restarting servers and maintaining SFTP connections.
- Created and maintained new DataStage projects.
- Set up Autosys for new interfaces.
- Rebooted servers as part of regular maintenance activities and performed health checks.
- Migrated code to different DataStage projects.
- Set up SFTP between servers.
- Monitored data transmission from source to target to ensure it completed without errors or warnings.
- Developed and implemented strategies for performance tuning.
- Performed unit testing and provided the test cases.
Environment: Confidential InfoSphere DataStage v8.5 & v9.1, Autosys, Oracle 10g, Erwin, UNIX, PVCS Serena Dimensions.
Confidential, Foster City, CA
DataStage Developer
Responsibilities:
- Worked on DataStage tools such as DataStage Designer, DataStage Director and DataStage Administrator.
- Strong understanding of the principles of data warehousing using fact tables and dimension tables.
- Extensive ETL tool experience using IBM InfoSphere/WebSphere DataStage and Ascential DataStage.
- Experience in data modeling strategies such as star and snowflake schema modeling.
- Knowledge of Erwin as the leading data modeling tool for logical (LDM) and physical (PDM) data models.
- Developed parallel jobs using different processing stages such as Transformer, Aggregator, Lookup, Join, Sort, Copy, Merge, Funnel, CDC, Change Apply and Filter.
- Clear understanding of the design goals of ER modeling for OLTP and dimensional modeling for OLAP.
- Assisted in development efforts for data marts and reporting.
- Worked with SCDs to populate Type I and Type II slowly changing dimension tables from several operational source files.
- Generated surrogate IDs for the dimensions in the fact table for indexed and faster access to data in server jobs.
- Extensively worked on Job Sequences to control the execution of the job flow using various activities and triggers (conditional and unconditional) such as Job Activity, Wait For File, Email Notification, Sequencer, Exception Handler activity and Execute Command.
- Extensive experience in unit testing, functional testing, system testing, integration testing, regression testing, user acceptance testing (UAT) and performance testing.
- Prepared the test documents.
- Fixing of code and data related bugs in the existing application.
- Internal process review and Peer to Peer review.
- Process documentation and revision as and when required.
Environment: Confidential InfoSphere DataStage v8.5, Erwin Data Modeler, Autosys, DB2, UNIX, SQL*Plus.
Confidential
DataStage Developer
Responsibilities:
- Developed DataStage jobs, based on the business requirements, to extract, transform and load data from source to target and from target to the data distribution area.
- Expert in designing Server jobs using Hashed File, Link Partitioner and Link Collector stages, and Sequencer jobs using Job Activity, Wait For File activity, Terminator, etc.
- Used DataStage Designer to develop processes for extracting, cleansing, transforming, integrating and loading data into the Confidential DB2 data warehouse.
- Extensively used reject links, job parameters and stage variables in developing jobs.
- Involved in job level performance tuning.
- Involved in promoting the jobs by version controlling from development to integration.
- Assisted in production support and fixed production issues as backup.
- Used Transformer, Remove Duplicates, Copy, Funnel, Lookup, and Change Capture Stages in designing jobs.
- Involved in unit testing and integration testing.
- Imported and exported jobs using DataStage Manager.
- Participated in various reviews and meetings, including internal and external code reviews, weekly status calls, issue resolution meetings and onsite code acceptance meetings.
Environment: Confidential InfoSphere DataStage 7.5PX, DB2, UNIX.