DataStage Developer Resume
Los Angeles, CA
SUMMARY
- Over 6 years of IT experience with special expertise in Development, Analysis, and Design of ETL methodologies in all the phases of the Data Warehousing life cycle coupled with knowledge of DSS, OLAP, Data Integration and Data Migration.
- Extensive experience in IBM WebSphere DataStage 8.0, Ascential DataStage 7.5/6.0XE using components like DataStage Designer, DataStage Manager, DataStage Director, DataStage Administrator and Parallel Extender with QualityStage and MetaStage.
- Familiar with the highly scalable parallel processing infrastructure of DataStage Enterprise Edition (Parallel and Server).
- Strong Business Analysis experience on Data Analysis, User Requirement Gathering, User Requirement Analysis, Gap Analysis, Data Cleansing, Data Transformations, Data Relationships, Source Systems Analysis and Reporting Analysis.
- Expertise in administering Job Scheduler using Autosys on UNIX machines.
- Assisted in unit testing, implementation, maintenance and performance tuning.
- In-depth knowledge of Life Cycle Development, including requirement analysis, design, development, testing and implementation.
- Expertise in Forward/Reverse Engineering using Erwin.
- Expertise in integrating heterogeneous data sources like Oracle, SQL Server, DB2 UDB, Flat Files and MS Access using Active/Passive stages available in DataStage.
- Expertise in Windows Servers.
- Effective in cross-functional and global environments, managing multiple tasks and assignments concurrently.
- Scheduled jobs using DataStage Director and its run-time engine.
- Used UNIX shell scripts to move and copy data and to run jobs.
- Extensively used TOAD 9.0/8.5 to access Oracle databases and Control Center to access DB2 databases.
- Experience in development and effective implementation of data cleansing, data acquisition and data integration tasks using the ETL tool DataStage and the reporting tool Business Objects.
- Worked on Data Modeling; strong in Data Warehousing concepts and in Dimensional, Star Schema and Snowflake Schema methodologies. Complete understanding of the Ralph Kimball and Inmon approaches to Data Warehousing.
- Excellent communication skills and Strong Analytical skills.
- Strives for excellence and drives continuous improvement.
TECHNICAL SKILLS
ETL Tools: IBM WebSphere DataStage 8.5/8.0.1, QualityStage 8.17.5, Ascential DataStage 7.x Parallel Extender (Designer, Director, Manager, Administrator)
Data Modeling: Dimensional Data Modeling, Star Join Schema Modeling, Snowflake Modeling, FACT and Dimensions Tables, Physical and Logical Data Modeling
RDBMS: Oracle 11g/10g/9i/8i/8.0/7.0, MS SQL Server 6.5/7.0/2000/2005, DB2 UDB, MS Access
Tools: ERwin 4.5.2/4.1/3.5, TOAD, SQL*Loader, SQL*Plus, Microsoft Visio
Languages: PL/SQL, SQL, C, C++
UNIX: Job Schedulers (Autosys), Shell Scripting
OS: UNIX (AIX 4.2/4.3), Sun Solaris 2.6/2.8, Windows 4.0/95/98/2000/XP/7, Windows 2005/2000 Server, Red Hat LINUX.
PROFESSIONAL EXPERIENCE
Confidential, Los Angeles, CA
DataStage Developer
Responsibilities:
- Involved in data extraction and transformation from source databases and loading to the EDW (Enterprise Data Warehouse) with DataStage.
- Worked closely with the Sales team to gather functional requirements, and with the project manager to provide a high-level design and the cost associated with the project.
- Worked with DataStage Designer and Director to design jobs that extract data from external files and tables.
- Developed jobs to aggregate the source data using the Aggregator stage and handled duplicate records using the Remove Duplicates stage.
- Used Parallel Extender to create jobs using stages such as Aggregator, Transformer, Join, Merge, Surrogate Key Generator and vector stages.
- Involved in the design, development and production support of the Data Warehouse.
- Worked closely with ClearCase for secure import and export of DataStage jobs.
- Designed DataStage jobs using stages like ORAOCI, ODBC, Transformer, Join, Merge and Sort to populate and incrementally load the source data into Data Warehouse tables.
- Worked on the project from scratch, from mapping through export of the developed jobs to the SAT and PROD environments.
- Created jobs to produce unit test data using Row Generator, Column Generator, Sequential File and Dataset stages to carry out the unit test plan (UTP).
- Involved in design, planning and implementation of test strategies.
- Created stored procedures for faster data extraction from databases.
- Created data validation scripts to verify the loading process from source to ODS to DW.
- Documented Source-Target mapping and Unit Test plans and scripts for the DS jobs.
Environment: IBM WebSphere DataStage 8.5, Oracle 10g, ERwin, WinSQL, TOAD, SQL*Plus, SQL*Loader, UNIX Shell scripts, Autosys.
Confidential, Princeton, NJ
ETL DataStage Developer
Responsibilities:
- Extensively worked on Parallel Extender to create Parallel Jobs using various stages like Aggregator, Transformer, Dataset, Row Generator, Column Generator, Filter, Funnel, Join, Lookup, Lookup File Set, Copy, Peek, Oracle Enterprise, DB2 UDB, Change Capture, Remove Duplicates, Merge, Pivot, Sequential file and FTP Stages.
- Analyzed, designed and developed ETL jobs for Vendor Feeds.
- Encrypted files before sending to vendors.
- Created shell scripts to integrate the process of Hashing, Encrypting and FTP.
- Created DataStage jobs to load data to internal data feeds.
- Scheduled and monitored the jobs in Production.
- Reviewed the errors in Organizational Manager production jobs and applied fixes.
- Created the logical and physical data model for Data Mart.
- Implemented the Slowly Changing Dimension Type 2 process for the Data Mart.
- Developed jobs to load dimension and fact tables.
- Maintained the Data Mart database objects.
- Documented the entire ETL Process in the ETL Design document.
- Improved the run time for the process that loads the fact table.
Environment: IBM WebSphere DataStage 8.0.1, SQL Server 2005, Oracle 10g, Red Hat LINUX, TOAD 9.5.0, SQL Developer, ERwin.
Confidential, Bloomington, MN
DataStage ETL Developer
Responsibilities:
- Involved in studying the impact and issues of incorporating the new business into Data Warehouse.
- Participated in Business Requirement Analysis and assisted with defining requirements.
- Responsible for designing the schema with a special focus on User Interface Design.
- Responsible for conceptualizing and establishing the need for Data Warehousing solutions as applicable to billing systems.
- Evaluated the consistency and integrity of the model and repository.
- Used Parallel Extender for parallel processing of data extraction and transformation.
- Used Integrity and Parallel Extender for data cleansing and performance improvement.
- Extensively used almost all of the DataStage transforms for various types of date conversions.
- Optimized query performance, session performance and reliability.
- Responsible for tuning the DataStage Repository and jobs for optimum performance.
- Extensively used Integrity's existing wizards to remove duplicates.
- Parsed, matched and removed duplicate records using Integrity's built-in wizards.
- Used MetaBroker with Ascential products such as QualityStage and DataStage for direct tool-to-tool imports/exports.
- Scheduled and monitored automated weekly jobs.
- Performed unit testing of individual modules and their integration testing.
- Debugged and sorted out the errors and problems encountered in the production environment.
Environment: IBM WebSphere DataStage 8.0.1/7.5 (PX) (Designer, Manager, Director, Parallel Extender), QualityStage 7.5, MetaStage (Directory, Explorer), SQL, PL/SQL, Oracle 9i/8i, MS Access, Windows 4.0, Sun Solaris 2.6
Confidential, Richmond, VA
DataStage ETL Developer/Tester
Responsibilities:
- Designed and developed jobs for extracting, transforming, integrating and loading data into the data mart using DataStage Designer. Used DataStage Manager for importing metadata from the repository, creating new job categories and creating new data elements.
- Generated surrogate IDs for the dimensions in the fact table for indexed and faster access of data in server jobs.
- Created surrogate key tables to map the required dimensions.
- Created hash tables used for referential integrity and other lookups while transforming the data representing valid information.
- Selected appropriate hash table design parameters for faster table look-up.
- Created re-usable components using shared containers for local or shared use.
- Created Error Files and Log Tables containing data with discrepancies to analyze and re-process the data.
- Wrote routines for Error Correction and inserting default values for specific errors.
- Troubleshot the designed jobs using the DataStage Debugger.
- Tuned DataStage transformations and jobs to enhance their performance.
- Extensively used Sequential File, Aggregator, ODBC, Transformer, Hashed File, Sort, Link Partitioner, Link Collector and Plug-in stages to develop the server jobs.
- Performed Unit and System testing on the jobs created.
Environment: Ascential DataStage 7.5, Oracle 9i, PL/SQL, SQL*Plus, UNIX Shell Scripts, Windows 2000/NT 4.0, ERwin 4.5.2/4.1, QualityStage, PVCS 6.7.10, Business Objects 5.0, MetaStage and Integrity
Confidential, Lexington, KY
DataStage ETL Developer
Responsibilities:
- Extensively used DataStage Designer to develop jobs for extracting, transforming, integrating and loading data into data warehouse tables.
- Extensively used the Hashed File, Aggregator, Sequential File and Oracle OCI stages.
- Gathered requirements from business users and prepared the corresponding ETL design specifications and test plans.
- Provided project and developer support on the usage of DataStage and QualityStage.
- Participated in reviews of business requirement analysis and assisted with defining requirements.
- Wrote the master Batch Script which calls all the jobs to load the inventory data warehouse.
- Used Job Sequencer to set up the job execution sequence.
- Wrote user-defined DataStage routines and transform functions to carry out complex transformations in the Transformer stage.
- Performed unit and integrated testing of the ETL process.
- Migrated the ETL process from Development to QA and from QA to Production using the DataStage Manager Export/Import utility.
- Wrote SQL scripts to load the manually maintained tables in the staging area.
- Wrote UNIX scripts to automate the ETL process.
- Contributed to performance tuning of the ETL process and to updating the ETL best practices document.
- Wrote user-defined SQL queries to extract the required data from Oracle sources.
Environment: Ascential DataStage 7.x (Server Edition), Oracle 8i, Oracle 9i, TOAD, Windows XP, UNIX and Microsoft Visio.
Confidential
Systems Engineer
Responsibilities:
- Prepared program specifications and Unit Test Plans based on the Functional Spec derived from the business requirements.
- Developed code based on the business requirements.
- Tested the code against various test conditions.
- Planned independent unit testing and user acceptance testing for sign-off before moving code to the production environment.
- Monitored post-production runs and resolved issues.
- Interacted with customers and clarified user queries using the Tech10 tool.
- Responsible for explaining program functional flows and troubleshooting technical queries.
- Fixed program aborts.
- Resolved production issues.
- Prepared one-time PL/SQL scripts.
- Coordinated with Data Center, DBA, SE and Business OPS units.
Environment: Oracle 9i, Pro*C, UNIX, PL/SQL, Tech10 Tool.