Datastage Developer Resume
Mclean, VA
SUMMARY:
- Over 7 years of IT experience in Analyzing, Designing, Developing, Testing and maintaining client/server Business Systems.
- Over 6 years of experience in building data warehouse/data marts with extraction transformation and loading (ETL) processes using IBM InfoSphere Information Server-Data Stage versions 8.1/7.x/6.x.
- Have extensive experience in developing ETL processes to extract, transform and load data from various source and archive systems into data warehouses and data marts using DataStageManager, Designer and Director.
- Developed number of DataStage ETL jobs based on business requirements using various DataStage Stages as Copy, Modify, Aggregator, Filter, Funnel, Join, Lookup, Merge, Change Capture, Datasets, Sequential files and Transformer etc.
- Extensively worked with Parallel Extender /Server to improve job performance while working with data sources.
- Solid experience in using Local Containers, Shared Containers and Job Sequencer.
- Experience in working with large database systems such as Informix, Oracle, DB2, Sybase, Teradata as well as hands-on experience in building data marts and data warehouses.
- Good knowledge in using Slowly Dimension Stage to implement SCD type1 & 2.
- Managing the overall SDLC process and deliver project deliverables within the scheduled time.
- Involved in dimensional data modeling (star schema, snow flake schema) and fact and dimension table design, physical and logical data modeling using Erwin tool.
- Strong technical skills in designing and developing large & complex Jobs.
- Prepare Technical Design, Standards and Specification for enhancements.
- Good experience in writing SQL loader scripts and UNIX Shell Scripts to automate file manipulation and data loading procedures.
- Comfortable in working with version control tool like Clear Case and PVCS andscheduling tools AUTOSYS.
- Extensively worked in analyzing, testing, debugging, identifying problems/issues and giving the appropriate solutions and worked with test Director.
- Excellent communication skills having analytical and documentation skills.
TECHNICAL SKILLS:
ETL Tools : IBM Infosphere DataStage 8.0.1/8.0/7.5/7.0/6.0 (Parallel Extender and Server Edition)
Databases : Oracle 11g/10g/9i/8i, IBM DB2/UDB v9.1, Netezza, MS SQL Server 2005/2000
TeradataV2R6.1/5.1.
Languages : SQL, PL/SQL, UNIX Shell Scripting, C/C++, Java, XML.
Reporting Tools : Business Objects
Data Modeling : Erwin 4.0, Visio
Operating System: Windows NT4.0/2000, MS-DOS, UNIX (Solaris, Linux, AIX)
Web Technologies: HTML, Dreamweaver, FrontPage
QA : Test Director
Version Control : PVCS, Clear Case
WORK EXPERIENCE:
Confidential, Mclean, VA Oct 2010 Till Date
DataStage Developer
The objective of the project is to extract, transform and load into the database from source flat files. The project implemented is to support the (HASP) Homeowner affordability and stability plan program. The project also includes data conversions, updates, reorganizing, creating YEB-year end balances events etc.
Responsibilities:
- Developed both Server and Parallel Jobs in DataStage Designer to Extract data from the Sources Oracle and Complex Flat Files, Cleanse it using Quality Stage, Transform by applying business rules, staging it in Data marts and Load (Initial/Incremental) into Target Teradata.
- Used the DataStage stages Oracle Enterprise, CFF, Copy, Filter, Lookup, Transformer, Sort, Funnel, Shared Containers, Join, Dataset, Aggregator, Sequential file, Remove Duplicates.
- Used Job Compare option to compare the jobs developed for efficient job design.
- Used the Parameter set to define the parameters at project level in 8.1.1 instead of defining the parameters for each and every job as in 7.5.2 thereby lessening the development effort and reducing the redundant task of adding new parameters at job level.
- Improved the server version jobs performance by using Hash file, In Process and Inter Process settings, Link Partitioner, Link Collector.
- For Parallel jobs, configured the multiple nodes and used parallel engine capacity efficiently as well as designed a master sequence to run multiple jobs in parallel.
- Developed Sequences to run Dimensions and Facts and also developed a Master Sequence to run the entire project.
- Used the Slowly Changing Dimension Stage itself to implement SCD Type-1 & 2.
- Exported the project from Development to Test environment using DataStage Manager.
- Wrote the SQL queries & PL/SQL code tables and Data Mart Staging Tables to validate the data results by counting the number of rows in both the tables.
- Developed UNIX shell script to run jobs in multiple instances by using a parameter file.
- Performed Import and Export of DataStage jobs using DataStage Manager to take backups and maintain them in CVS.
- Interacted with Solutions analysts, system staff, ETL, BO developers and the data load staff.
- Involved in Unit, Integration, System and User Acceptance Testing (UAT).
Environment: Ascential DataStage 7.5.2 Parallel Extender, Profile Stage 7.5.1A, Quality Stage 7.5.1A, MetaStage 7.5, PL/SQL, Oracle 10g, Solaris 8.9, Windows NT.
Confidential,Atlanta, GA Nov 2008 Sep 2010
DataStage Developer
The implemented Enterprise Data Warehouse collects and organizes the data from operational systems to provide historical data for the purpose of end-user reporting, analysis, single source of integrated data, provide error ending data, real time data for the existing records and the updated records when the transactions get completed. The project goal is also to provide information to various assets and the respective values, status information.
Responsibilities:
- Gathered requirements and wrote specifications for the ETL Job modules.
- Worked on production support by selecting and transforming the correct source data.
- Data Warehouse was implemented using sequential files from various Source Systems.
- Worked closely with Database Administrators and BA to better understand the business requirement.
- Developed Mapping for Data Warehouse and Data Mart objects.
- Performed through data cleansing by using the Investigate stage of Quality Stage and also by writing PL/SQL queries to identify and analyze data anomalies, patterns and inconsistencies.
- Create and use DataStage Shared Containers, Local Containers for DS jobs and retrieving Error log information.
- Perform data manipulation using BASIC functions and DataStage transforms.
- Worked extensively with Parallel Stages like Copy, Join Merge, Lookup, Row Generator, Column Generator, Modify, Funnel, Filter, Switch, Aggregator, Remove Duplicates and Transformer Stages etc extensively.
- Design and Develop ETL jobs using DataStage tool to load data warehouse and Data Mart.
- Performance tuning of ETL jobs.
- Developed UNIX scripts to automate the Data Load processes to the target Data warehouse.
- Import relational metadata information for project.
- Developed DS Routines for extract job parameters from files and developed DS routines for job Auditing.
- Create master controlling sequencer jobs using the DataStage Job Sequencer.
- Design, build, and manage complex data integration and load process
- Developed PL/SQL scripts to perform activities at database level.
Environment: Ascential DataStage 7.5.2 EE (Parallel Extender, Quality Stage, Designer, Director, Manager, Administrator), Oracle 10g, COBOL, CSV Files, SQL, PL/SQL, Windows XP.
Confidential,Cincinnati, OH Oct 2007 Oct 2008
DataStage Developer
Union Central Life Insurance provides homeowners insurance, life insurance, disability income insurance and investment products. The goal of the project is to implement ETL process to load Enterprise data warehouse and develop customer relationship data mart (CRDM) which supports Business Intelligence.
Responsibilities:
- Extensively used DataStage for extracting, transforming and loading databases from sources including Oracle, Teradata and Flat files.
- Worked onDataStage Designer, Manager, Administrator and Director.
- Created re-usable components using shared containers for local use or shared use. Exported the universe to the Repository to make resources available to the users.
- Loaded data into load, staging and lookup tables.
- Executed Pre and Post session commands on Source and Target database using Shell scripting.
- Developed Parallel jobs using Aggregator, Join, Transformer, Sort, Filter, Funnel, Peak, Surrogate Key Generator, Remove Duplicates, Lookup Stages.
- Used Quality Stage Jobs to cleanse the data, by calling it from DataStage.
- Created Data Model using Erwin from data sources and defined the dimensions, levels and measures in the model.
- Extensively used Parallel Extender to distribute bulk data into subsets to all available nodes to optimize performance in large databases which achieves best job performance.
- Created stored procedures, functions using PL/SQL for implementing business rules and transformations.
- Implemented the underlying logic for Slowly Changing Dimensions.
Environment: Ascential DataStage 7.5.1EE (Parallel Extender, Designer, Manager, Director, Administrator), Oracle 9i, DB2 UDB 7.0, HP-UNIX 10.2, MS SQL Server 2000, ERWIN 3.4/4.0, TOAD, Shell, SQL * Loader, Windows NT 4.0.
Confidential,Rochester NY June 2006 Sep 2007
Datastage Developer
BCBS Information Services Division also known as ISD is part of BCBS Corporation, which handles information Technology operations for BCBS. IBM Information Server is a market-leading data integration platform that helps to understand, cleanse, transform and deliver trusted information to the critical business initiatives. BCBS uses IBM Information Server product for its Data-Integration requirements.
Responsibilities:
- Involved in developing the job naming standards according to the requirements.
- Developed Datastage jobs to parse the dump file to create individual source files for the subsequent jobs.
- Developed parameter driven ETL jobs and defined reference lookups and joins, aggregations, defined constraints and derivations to populate final tables in the target.
- Created master level controlling sequencer for loading and deleting jobs using the Datastage job sequencer.
- Involved in performance tuning of the jobs by carefully monitoring the visual performance statistics in the director.
- Created effective test data by modifying source file and involved in unit testing and end to end testing of the design.
- Developed the Datastage repository by importing source file schemas and the target database schemas using Datastage Manager and created new job categories in the project.
- Extensively used Director to validate and run the jobs by specifying run time parameters and carefully analyzed job log to fix the defects.
Environment: Ascential Datastage, Oracle 9i, Erwin, Windows XP, Solaris 2.x, Putty, SQL Plus.
Confidential,INDIA May 2005 - May 2006
SQL Developer
United Telecom is one of India\'s largest business houses with over 20 companies in areas like telecom, electronics and banking under its stable. Its telecom arm has implemented projects like the Andhra Pradesh State Wide Area Network or the APSWAN, which was the state government\'s e-governance initiative etc. my responsibility was to maintain the data.
Responsibilities:
- Good SQL and PL/SQL skills
- Utilized PL/SQL developer tool in developing all back end database interfaces.
- Involved in analysis, database design, coding, and implementing.
- Generated server side PL/SQL Scripts for data manipulation and validation and created various snapshots and materialized views for remote instances.
- Produced SQL reports, data extraction and data loading Scripts for various schemas.
- Wrote DDL Scripts for Schema, Table space and Cluster creation and alteration.
- Involved in performing extensive Back-End Testing by writing SQL queries to extract the data from the database using Oracle SQL and PL/SQL.
- Developed automated notification of Emails, using UNIX shell script, to the users in case of failure in the process from time to time.
- Interacted with end user for problems faced and providing necessary technical support by fixing the bugs.
- Wrote stored functions and procedures for common update routines.
Environment: MS Windows 95, PL/SQL, Oracle.
Education: Bachelor of Engineering