DataStage Developer Resume

SUMMARY

  • IBM DataStage Developer with over 7 years of experience in the analysis, development and implementation of data warehousing solutions using DataStage.
  • Expertise in designing DataStage jobs, handling high-volume data loads, and SQL and PL/SQL programming.
  • Experience in DataStage 11.5/11.3/9.1/8.5/8.1/7.5 using components like DataStage Designer, Director, QualityStage, Information Analyzer and Metadata Workbench.
  • Designed mapping documents, technical specification documents, code-review checklists and SQL build documents.
  • Prepared test scripts for unit testing and integration testing of DataStage jobs and SQL scripts.
  • Prepared scripts for data model requests submitted to the technical team.
  • Strong understanding of DW concepts of Fact tables, Dimension tables and their relationships.
  • Experience in developing star schema and snowflake schema models.
  • Experience in SQL, PL/SQL and UNIX programming.
  • Experience following ITSM processes when creating Change Requests and Incidents for issues.
  • Analyzed source data from different systems and prepared documentation explaining the raw data using Information Analyzer.
  • Fluent in data modeling using tools like Erwin.
  • Experience in developing test cases for requirements as part of unit testing.
  • Extensively worked in DataStage Designer to create new server and parallel DataStage jobs for extracting, transforming and loading data into Data Mart and Data Warehouse tables.
  • Migrated DataStage jobs from Development to QA and then to Production.
  • Strong background in creating ETL jobs and analyzing data using Information Analyzer.
  • Extensive experience in dimensional modeling techniques, troubleshooting DataStage jobs and performance tuning.
  • Worked extensively on converting business logic into technical specifications and then creating the corresponding DataStage jobs.
  • Resolved all major issues in code migration from one environment to another as part of the development life cycle.
  • Proficient in developing custom DataStage server and parallel routines tailored to business requirements.
  • Good knowledge of the software development life cycle (SDLC), data conversion and data cleansing.
  • Experienced in loading huge volumes of data and in performance tuning to reduce processing time.
  • Strong experience in quality assurance across testing phases such as system integration testing, process scenario testing and user acceptance testing.
  • Participated in discussions with the Project Manager, Business Analysts and team members on technical and business requirement issues and on enhancements to existing requirements.
  • Experienced in drawing flow diagrams for data movement in the Data Warehouse using Microsoft Visio 2010.

TECHNICAL SKILLS

  • IBM DataStage 11.5/11.3/9.1/8.5/8.1/7.5 (Designer, Director)
  • IBM InfoSphere QualityStage
  • Information Analyzer and Metadata Workbench concepts
  • Windows 2000/2003/2008 Server
  • Solaris
  • Linux
  • SQL
  • PL/SQL Programming
  • Shell Scripting
  • Perl
  • Python
  • DB2
  • UDB
  • Sybase
  • Oracle
  • SQL Server
  • Microsoft Visio
  • ERWin
  • SQL Developer

PROFESSIONAL EXPERIENCE

DataStage Developer

Confidential

Responsibilities:

  • Designed and developed Extract, Transform, and Load (ETL) processes for extracting data from various legacy systems and loading it into target tables using SQL and DataStage Enterprise Edition.
  • Understood the entire business flow of the project from beginning to end.
  • Extensively used the Sequential File, DB2, Join, Lookup, Transformer, Data Set, Filter, Merge, Sort and Remove Duplicates stages for designing jobs in DataStage Designer.
  • Worked with data feeds from various source systems, including flat files, XML files and DB2 databases.
  • Prepared mapping documents for designing and developing the DataStage jobs.
  • Worked on claims, provider and membership data coming from various vendors such as Bcbs Association, BcbsSC, ESI, EyeMed, MedTrack, CVS and LDI.
  • Using DataStage Designer, extracted data from various source systems, applied business logic, and loaded the results into the target database and into outbound files sent to vendors.
  • Prepared DDL and DML scripts and provided them to the DBA as part of table deployment changes.
  • Created complex sequence jobs and control jobs and added them to the Zena scheduler.
  • Ran scheduled jobs in request-time trigger mode using the Zena scheduler in the Development, Test and Stage environments.
  • Extensively used file stages such as File Set and Sequential File for extracting and reading data.
  • Created data files in Unix and transferred them via FTP to a LAN location.
  • Created Unix shell scripts to create temp tables and load data into them, and called these ksh scripts from DataStage jobs before querying the tables.
  • Used Azure DevOps for tracking user stories, tasks, hours and deployments.
  • Expert in unit testing, implementation, volume testing and maintenance of database jobs.
  • Implemented API calls in DataStage to populate Latitude and Longitude fields to calculate distance between care centers.
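
As an illustration of the distance calculation mentioned in the last item, the following is a minimal Python sketch, not the DataStage implementation itself. It assumes a great-circle (haversine) distance and a mean Earth radius of 6371 km; the resume does not state which formula was used. The function name and the sample coordinates are hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; an assumption, not from the resume


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    # Haversine formula: a is the squared half-chord length between the points.
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


# Hypothetical care-center coordinates (latitude, longitude), for illustration only.
center_a = (34.0007, -81.0348)
center_b = (32.7765, -79.9311)
distance_km = haversine_km(*center_a, *center_b)
```

In a job flow, the latitude and longitude fields populated by the API calls would feed a calculation of this kind for each pair of care centers.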

Environment: InfoSphere DataStage 11.5 (Administrator, Director, Designer), Flat files, DB2, SQL, Sybase, PuTTY, WinSCP, Zena Job Scheduler, Azure DevOps (User Stories, Tasks, Deployments), Git.

DataStage Developer

Confidential, Wayne, PA.

Responsibilities:

  • Extensively used DataStage Designer to develop processes for extracting, transforming, integrating and loading data from various sources into the Data Warehouse database.
  • Involved in the design phase of the logical and physical data models using Erwin.
  • Extracted data from flat files, transformed it according to requirements, and loaded it into the staging schema using stages such as Sequential File, Aggregator, Transformer, Data Set, Lookup and Join.
  • Built sequencer jobs with activity stages to reduce job complexity.
  • Developed several ETL jobs for Historical data loads and ongoing data loads.
  • Involved in importing and exporting jobs by category and maintaining regular backups.
  • Developed complex mappings using multiple sources and targets in different databases.
  • Involved in the Analysis of the functional side of the project by interacting with functional experts to design and write technical specifications.
  • Developed DataStage jobs for ongoing data acquisition from different OLTP sources, moving data from staging to the ODS and then to the Data Warehouse.
  • Extracted data from sources like flat files, SQL Server 2008 and third-party data sources.
  • Successfully implemented pipeline and partitioning parallelism techniques and ensured load balancing of data.
  • Developed UNIX shell scripts and scheduled the jobs.
  • Wrote scripts to automate DataStage jobs on a daily basis.
  • Debugged, tested and fixed the transformation logic applied in the parallel jobs.
  • Provided estimates for ETL development effort and regularly updated senior management regarding the progress.
  • Designed and developed parallel, server and sequence jobs using DataStage Designer.

Environment: IBM InfoSphere DataStage 11.5, SQL*Loader, Oracle, Erwin, Unix, IBM DB2

DataStage Developer

Confidential, San Ramon, CA

Responsibilities:

  • Extensively used DataStage as the data migration/transformation tool for the Claims Data Warehouse application.
  • Analyzed source systems and created mapping to the target database schema.
  • Extracted data from DB2 and Teradata, applied transformations, and loaded it into the Oracle database.
  • Involved in the design, development and deployment of DataStage Server and PX jobs, using stages such as Sort, Aggregator, Transformer, Link Collector, Link Partitioner, XML Input, XML Output, Pivot and FTP.
  • Involved in extracting sequential files and flat files from different sources.
  • Involved in modifying existing DataStage jobs for better performance.
  • Worked closely with Business Analysts to understand business rules and developed Prototype jobs.
  • Redesigned certain stages in existing jobs for optimization in accordance with framework.
  • Optimized bulk load jobs within a short time frame.
  • Developed shell scripts for database backup and recovery. Performed physical and logical backups.
  • Provided support to various groups in designing and developing ETL job flows using DataStage and shell scripts.
  • Developed server-side functionality using PL/SQL and UNIX shell programming.
  • Constructed SQL scripts to validate the data after the loading process.
  • Worked closely with Data Analysts and Reporting Project team to make sure the data is accurate and consistent for table loads.

Environment: IBM InfoSphere DataStage 9.1, Oracle, SQL*Loader, DB2, Unix

DataStage Developer

Confidential

Responsibilities:

  • Created new DataStage jobs to load actual data for the corresponding tanks into the AspenTech Scheduling tool to generate scheduling data.
  • Modified the existing DataStage jobs, which previously pointed to the M3 tool source, to use the AspenTech tool source instead.
  • Implemented Star schema while loading the data to Fact tables.
  • Created a new configuration file to increase the number of nodes for high-volume data loads.
  • Documented all steps followed while creating DataStage jobs.
  • Prepared test cases for testing the DataStage jobs and for integration testing.
  • Resolved all issues encountered by the testing team.
  • Replaced the complete data-fetching logic of the earlier source with new logic.
  • Conducted performance tuning to optimize the DataStage jobs while fetching data from the new source.
  • Created server, parallel and sequence jobs to process data from the APS source into the Data Warehouse.
  • Prepared SQL queries to verify the record counts of DataStage jobs before processing and for comparison purposes.
  • Designed DataStage jobs to integrate with .NET tool functionality.
  • Created new DataStage jobs to send a report to users once the data load is complete.
  • Implemented error handling and job failure notifications to users.
  • Created a stored procedure in SQL Server to update all the codes of existing Scheduling system to codes of new Scheduling system.
  • Handled exceptions using the try-catch methodology.
  • Used DataStage jobs to notify users that data had been published to the Scheduling tool.
  • Created DataStage jobs to pull XML files from a web service and load the corresponding XML data into SQL Server.

Environment: IBM DataStage (Administrator, Designer, Director), IBM InfoSphere DataStage, DataStage scheduler, QualityStage, M3, AspenTech, SQL Server

DataStage Developer

Confidential

Responsibilities:

  • Worked extensively with parallel stages such as Copy, Join, Merge, Lookup, Row Generator, Column Generator, Modify, Funnel, Filter, Switch, Aggregator, Remove Duplicates and Transformer.
  • Gathered requirements and wrote specifications for ETL Job modules.
  • Worked as SME in providing support to the team in designing the flow of complex jobs.
  • In addition to providing technical support to the team, handled escalations.
  • Worked on production support by selecting and transforming the correct source data.
  • Implemented the Data Warehouse using sequential files from various source systems.
  • Worked closely with Database Administrators and Business Analysts to better understand the business requirements.
  • Developed mappings for Data Warehouse and Data Mart objects.
  • Performed thorough data cleansing using the Investigate stage of QualityStage and by writing PL/SQL queries to identify and analyze data anomalies, patterns and inconsistencies.
  • Used DataStage Manager to import metadata from the repository, create new job categories and create new data elements.
  • Designed and developed ETL jobs using DataStage to load the Data Warehouse and Data Mart.
  • Tuned the performance of ETL jobs.
  • Performed data manipulation using BASIC functions and DataStage transforms.
  • Imported relational metadata for the project.
  • Developed DataStage Routines for job Auditing and for extracting job parameters from files.
  • Created master controlling sequencer jobs using the DataStage Job Sequencer.
  • Created and used DataStage Shared Containers and Local Containers for DS jobs and for retrieving error log information.
  • Designed, built and managed complex data integration and load processes.
  • Developed PL/SQL scripts to perform activities at database level.
  • Developed UNIX scripts to automate the Data Load processes to the target Data warehouse.

Environment: IBM Ascential DataStage (DataStage, QualityStage, Information Analyzer, Metadata Workbench, Business Glossary), Oracle, DB2 UDB, Teradata, Mainframe, PL/SQL, Oracle with 2 node RAC, Autosys, Erwin 4.2, TOAD, SQL Developer, PVCS, Business Objects XI, Shell Scripts, HP Unix, Windows XP.
