Sr. Datastage Developer Resume
Richardson, TexaS
SUMMARY
- Over 8 years of experience in Data Warehousing, ETL design and development in various verticals and industries like Insurance, Banking, Healthcare and Retail Marketing.
- Involved in Software Development life - cycle (SDLC) of various projects, including requirement gathering and analysis, system designing, data modeling, application and enhancement development, migration, maintenance and production support.
- Experience and understanding in Agile methodology, releasing the code on sprint bases and in Waterfall methodology of development life cycle.
- Extensive experience with IBM InfoSphere DataStage 11.3.1, IBM InfoSphere DataStage 9.1, IBM InfoSphere DataStage 8.5, Informatica PowerCenter 9.x.
- Expertise in the Data Analysis, Design, Development, Implementation and Testing of Data Warehousing using Data Extraction, Data Transformation and Data Loading.
- Experience as a Lead developer, ETL developer and Point of Contact for the release and sprint of projects with business and stakeholders.
- Designed deliverables including Functional, Design, Technical and Mapping documentation and Code. Adhering to the solution delivery standards, SLAs and processes.
- Strong understanding of principles of Data Warehousing using fact and dimension tables, Star and Snowflake schema modeling.
- Worked with Informatica Transformations including Designer, Workflow manager and Monitor.
- Worked extensively on different DataStage stages like Filter, Join, Merge, Lookup, Funnel, Aggregator, Transformer, Sort, Change Data Capture stages, Slowly Changing Dimension, Surrogate Key Generator, Modify, Remove Duplicates, Copy, Connector stages for Databases (DB2, Oracle), ODBC Connector, Real Time (XML Input/ Output), Shared Containers, Column Import/ Export for developing jobs.
- Worked extensively on DataStage Sequence Stages - Job Activity, Execute Command, End-Loop Activity, Exception Handler, Nested Condition, Notification Activity, Routine Activity, Sequencer, Start-Loop Activity, Terminator Activity, User Variables Activity etc.
- Extensively used Sequential File, Data set, File set, XML, Web services for data manipulation and storage.
- Experience with SQL query and PL/SQL stored procedures and different databases Oracle 11g/10g, Teradata 12/13, DB2 9/10/11, Netezza 6/7, MS SQL Server 2005/2008.
- Experience in using Unix shell scripts and commands for source/target validation, job triggers, etc.
- Experience with scheduling tools Autosys, Control M for executing DataStage/Informatica jobs and version control tools like TortoiseSVN and Service Now Scheduling.
- Excellent communication, interpersonal, analytical skills and strong ability to manage and motivate the team. Fast learner and efficient developer with the ability to work individually or in a team based environment.
- Experience in handling changing prioritization and solving critical issues to remove blockers in SDLC.
TECHNICAL SKILLS
ETL Tools: IBM InfoSphere DataStage 11.3/9.1/8.5/8.0.1 , Informatica Power Center 10.1.0/9.x/8.x, Ascential DataStage 7.5(Enterprise Edition)/7.0/6.0
Data Modeling and Reporting Tools: MS Visio 2008/2010/2013 , ER studio, Erwin, Business Objects XI
Databases: Oracle 11g/10g, SQL Server 2005/2008, IBM DB2 9/10/11, Netezza version 6/7, Teradata Database 12/13
Operating Systems: Unix, Linux, Windows 10/8/7/XP
Programming Languages: SQL, PL/SQL, Java 7/8, C, C++, Shell scripting, XML
Scheduling/Version control Tools: Autosys, Control M, Git, Service Now Scheduling, TortoiseSVN, TIDAL
PROFESSIONAL EXPERIENCE
Confidential, Richardson, Texas
Sr. DataStage Developer
Responsibilities:
- Worked on all phases of Software Development Life Cycle from Planning to Deployment.
- Performed requirement gathering, analysis and designing decisions based on business analysis and technical design discussions with the stakeholders.
- Applied UML concepts in designing the HLD and LLD using ER studio.
- Coordinated with the Business Analyst to understand and design the explanation of benefits and monthly bill reporting to ease the report generation process from the Enterprise data warehouse.
- Coordinated with different team members to resolve and design the effective solution concerning accuracy of the data in the data mart environment maintaining the HIPAA compliance rules.
- Implemented complex ETL jobs depending on the requirements like member claims, copay, deductible and access to the appropriate benefits available according to the plans and groups.
- Developed parallel jobs using CC and SCD stages to achieve complex Change Capture process to maintain history of customers.
- Used parameters and parameter sets to aide code consistency of modules across environments.
- Prepared and performed unit test cases to validate the data generated.
- Extensively used SQL queries for retrieving and processing data in Databases and DataStage.
- Performed code reviews to validate the deliverables like ETL code, design documents, test cases.
- Developed UNIX shell scripts for scheduling, supplying parameter values and performing business requirements, FTP and audit activities.
- Debugging, Troubleshooting, Monitoring and Performance Tuning of DataStage Jobs.
- Utilized the DataStage director client to monitor, analyze performance and bottlenecks in the job run.
- Performed Production activities like deployment activities and production support in hypercare period.
- Diligently carried out version control for jobs and scripts along with code review and validation across the environments, using Serena migration tool.
- Utilized Control M scheduler handling dependency checks on incoming file and prior job runs.
- Performed knowledge transfer to client team members and mentored new team members.
Environment: IBM InfoSphere DataStage 11.3.1, Oracle 11g, SQL Server 2012, XML, Netezza V7, Flat files, Control M, ER studio, Unix/Windows.
Confidential, Dallas, Texas
Sr. DataStage Developer
Responsibilities:
- Performed Software Development Life Cycle(SDLC) activities including Requirement Analysis, Client Interaction, Design, Coding, Testing, Support and Documentation and implementation.
- Prepared DataStage Execution plan for various processes to extract, transform, integrate and load data from various sources into the Data Warehouse database.
- Involved in modeling review for the designs with the project architect and the cross platform teams.
- Identified and reviewed database DDL conflicts on the enhancement and coordinated with the Database Administrator to resolve conflicts.
- Developed DataStage jobs based on business requirement including payments, claims and billing with efficiency and lossless data loading.
- Worked on Change Capture process to replicate transactional data from one data mart to another to maintain the consistency and integrity of the hierarchical data format.
- Developed jobs to extract XML responses using various stages MQ connector, XML transformation stage to load into Netezza target tables.
- Extracted the data from legacy sources using the different integration tools.
- Extensively developed jobs requiring lookup and referential integrity to validate transactions before updating the new records.
- Performed data integration and balanced loads for the different entities involved like insurer, dealer, contracts, etc. in the transaction update.
- Project used Agile methodology, actively prepared user stories and tasks for each application, and ensured code development followed closely with the requirements and got the business user approvals.
- Developed SQL/PL SQL queries to create analysis and reporting based on the data from different sources into Netezza database.
- Extensively worked on DataStage stages like Sequential File, Copy, Aggregator, Surrogate key, Transformer, ODBC stage, Dataset, look up, Aggregator, Join, Remove Duplicates, Sort, Column generator and Funnel.
- Employed auditability control on the jobs to ensure performance and data accountability.
- Ensured reusability of existing code and documentation. Contributed to new development standards and best practices.
Environment: IBM InfoSphere DataStage 9.1/11.3, Putty, DB2 11, Netezza v7, SQL, PL/SQL, XML files, Legacy sources, Unix/Windows, Unix shell scripts.
Confidential, San Antonio, TX
DataStage Developer
Responsibilities:
- Designed and documented technical specification of existing DataStage jobs.
- Developed DataStage jobs corresponding to technical specifications and requirements based on the policy changes to the members and plans.
- Extracted data from various sources like Oracle, Flat Files, SAP-BW and loaded into staging area.
- Involved in developing and supporting applications using BO universe for daily High volume MicroStrategy reports of monthly change control data.
- Created and enhanced DataStage jobs to ensure member enrollment and eligibility criteria were complaint.
- Coordinated with systems partners to finalize designs and formalize requirements utilized story based on the length of the backlog and priorities.
- Designed Parallel jobs using various stages like Join, Remove Duplicates, Filter, Dataset, Lookup file set, Modify, Transformer and Funnel stages.
- Used Quality Stage stages such as Investigate, Standardize, Match and Survive for data quality and data profiling issues.
- Involved in code and design documents review.
- Designed and performed unit Test cases to validate data and perform load balancing activities.
- Developed UNIX commands / shell scripts to automate the Data Load processes to the target Data warehouse using Autosys Scheduler.
- Agile methodology used, reported daily status in meetings and documented the weekly status to be shared in the weekly sprint meeting with the client.
- Performed code demo and knowledge transfer for the production support team.
Environment: IBM InfoSphere DataStage 9.1, SAP Business objects, Oracle 10g/11g, DB2 10/11, Netezza v6, SQL, PL/SQL, Unix/Windows, UNIX shell scripting, Agile Methodology, Autosys.
Confidential, Arlington, Texas
ETL Developer
Responsibilities:
- Designed and documented technical specification of existing DataStage jobs.
- Developed DataStage jobs and workflows corresponding to existing Informatica mappings and technical specifications.
- Actively involved in resolving data warehouse discrepancies in the new model and suggested improvements based on the business requirements in the design discussion meetings between architects, manager, leads and database administrators.
- Undertook migration activities within the team from ETL tool Informatica Power Center to IBM InfoSphere DataStage.
- Involved in code and design documents review for the converted jobs.
- Designed and performed unit Test cases to validate data and perform load balancing activities.
- Undertook validation tests between the legacy code and new code execution performance.
- Converted the windows commands/ batch scripts to UNIX commands/ shell scripts to perform in conjunction with the new environment and jobs.
- Followed Agile Methodology for SDLC, actively participated in weekly status meetings.
- Coordinated with the other teams involved and resolved the development issues.
Environment: IBM InfoSphere DataStage 8.5, Informatica Power Center 9.x, Oracle 10g/11g, SQL, PL/SQL, Unix/Windows, Microsoft SQL Server 2008/2012, Flat Files, Autosys.
Confidential
Responsibilities:
- Worked with the Business analysts SAS programmers and the DBAs for requirements gathering, analysis, testing, and metrics and project coordination.
- Developed the staging area design for database tables. Involved in development and discussion of SCD implementation and metadata maintenance.
- Developed deliverables like design documents, unit test cases, review documents, and sanity checks for the integrated workings of the jobs.
- Worked with DataStage stages like ODBC, Transformer, Hash file, Sequential file, Aggregator, Sort, Merge, Link practitioner, Link collector, etc.
- Involved in designing various jobs using PX and Parallel jobs using Parallel stages like: Merge, Join, Lookup, Transformer (Parallel), Teradata Enterprise Stage, Funnel, Dataset, etc.
- Used Remove Duplicates stage in PX (EE) to remove the duplicates in the data from the sources.
- Involved in the migration of DataStage jobs from Development to Production environment. Responsible for developing the release plan for the jobs in test, QA and production environments.
- Designed and implemented several wrappers to execute the DataStage jobs, create job reports out of the DataStage job execution results from shell scripts.
- Developed and performance tuned SQL queries to perform activities like joins, aggregation for processing the business requirements in the database.
Environment: IBM WebSphere DataStage 8.0.1, IBM InfoSphere DataStage 8.5, Oracle 10g/11g, SQL, PL/SQL, Unix/Windows, Microsoft SQL Server 2008/2012, Flat Files, Autosys.
Confidential
ETL Developer
Responsibilities:
- Gathering and documenting requirements, requirements analysis, converting requirements into High Level Design Documents.
- Created source and target tables used for staging.
- Utilized Informatica PowerExchange to analyze and design database tables for Mainframe legacy source.
- Utilized Teradata FastLoad, Teradata MultiLoad and Teradata FastExport utilities for getting data in and out of Teradata.
- Wrote SQL and PL/ SQL queries for aggregation and outer joins for better performance.
- Developed ETL mappings as per requirements, implemented SCDs.
- Created Unit test cases and performed Unit testing.
- Involved in system and performance testing. Took part in code review and performance analysis of new jobs.
Environment: Informatica PowerCenter 8.6.x/9.0.x/9.1, Informatica PowerExchange, Oracle 10g/11g, Teradata Database V2R6/12.0/13.0, SQL, PL/SQL, Unix/Windows, Mainframe, Autosys, Business Objects XI.