Sr. Etl Datastage Developer Resume
Detroit, MI
OBJECTIVE
- Over 7 years of experience in Data Warehousing, ETL design, development, deployment, migration, production support and maintenance using IBM InfoSphere / WebSphere DataStage. Seeking a Data stage Developer position, specializing in ETL architecture, design, and development offering technical and analytical growth to my career.
SUMMARY
- 7+ years of experience in IT in software design and development of Data Migrations, Data warehouses and Data marts and Testing experience with large CDW, data mart systems having a distinctive competence in designing complex jobs using DataStage
- Experience in writing stored procedures, packages, functions, triggers and cursors.
- Hands on Experience with DDL’s, DML’s, partitions, database links.
- Experience and expertise in DataStage 7.x/8.x, IBM DataStage Information Server and IBM WebSphere Information Analyzer
- Strong experience of Extraction, Transformation and Loading (ETL) processes using DataStage, PL/SQL, SQL - Loader and Unix Shell scripting.
- Extensively used Datastage processing category stages to implement various transformation logics like Join, Merge, Lookup, Sort, Copy, Aggregator, Change Data Capture (CDC), Remove Duplicate, Filter, SK Generator, Transformer etc.
- Strong knowledge and background in analysis of business process, metadata documentation and development of business architecture as-is model.
- Strong experience with design and deployment ETL jobs, document technical processes and procedures, test documentation
- Strong experience in Configuration Management process and End User/Customer Interaction
- Experience of physical and logical data warehouses/mart.
- Solid experience both in Development and Production support.
- Excellent experience in working with Oracle 11g and SQL-Server 2008
- Experience in integration of various data sources like SQL-Server and Oracle into staging area.
- Experience in conducting Unit Testing, System testing and Integration testing.
- Experience with UNIX shell scripts.
- Knowledge in using PL/SQL to write stored procedures, functions, and triggers.
- Involved in providing support in Dev, QA and Prod environments.
- Ability to analyze, diagnose and resolve complex programming and system problems independently.
- Technical and analytical skills with clear understanding of design goals of ER modeling for OLTP and dimension modeling for OLAP.
- Experience using rational toolset like Clearcase for source code migration and version management.
- Excellent communication, interpersonal, analytical skills and strong ability to manage and motivate the team.
TECHNICAL SKILLS
OS: Windows 7, XP,2000/NT/98/95/3.11. Linux
Languages: PL/SQL, Unix Shell Scripting, JAVA, VB Script, JAVA Script, Unix Shell Scripting, UML, PL/SQL
ETL Tools: IBM InfoSphere DataStage 8.5, IBM WebSphere DataStage 8.0.1, Ascential Data Stage 7.x
Databases: Oracle 11g/10g/9i, SQL Server2005/2008, DB2
Data Modeling: Erwin, Snowflake Modeling, Fact and Dimensions, Physical and Logical data Modeling, Star Schema Modeling.
Other tools: MySQL, Toad, SQL Navigator, and SQL*Loader, WinCVS, Control-M
PROFESSIONAL EXPERIENCE
Confidential, Detroit, MI
Sr. ETL Datastage Developer
Responsibilities:
- Designed the ETL jobs using IBM Infosphere DataStage 8.5 to Extract, Transform and load the data into staging, ODS and EDW.
- Designed and developed the ETL jobs using Parallel Edition which distributed the incoming data concurrently across all the processors, to achieve the best performance.
- Designed parallel jobs using stages such as Join, Merge, Lookup, Remove Duplicates, Copy, Filter, Funnel, Dataset, Lookup, Pivot, and Sort, Surrogate key Generator, Change Data Capture (CDC), Modify, Row Generator and Aggregator.
- Responsible for generation of DDL statements which are executed for database creation.
- Responsible for preparing Physical/logical data models .
- Responsible for data analysis, requirements gathering, report analysis, source-to-target mapping, frequency analysis, process flow diagrams, and documentation.
- Handled Performance Tuning of Jobs to ensure faster Data Loads.
- Designed sequence jobs using the activities such as Job Activity, Nested Condition, Notification Activity, Sequencer Activity, Terminator Activity and Execute Command.
- Performed the Integration and System testing on the ETL jobs.
- Responsible for preparing ad hoc jobs.
- Translated business rules and functionality requirements into ETL derivations.
- Scheduled jobs using Autosys scheduler utility based on the requirements and monitored the production processes closely for any possible errors.
- Migrated projects from development to QA to Production environments.
- Assisted operation support team for transactional data loads in developing SQL & UNIX scripts.
- Imported the required Metadata from heterogeneous sources at the process level.
- Created Job Parameters and Environment variables to run the same job for different sources and targets.
- Used Multi-job-compiler during deployment of projects.
- Created Batches (DS job controls) and Sequences to control set of jobs.
- Used the Director to schedule running the job, testing and debugging its components, and monitoring.
- Created Shared Containers for Re-using the Business functionality.
- Collaborated with BO team to design Crystal reporting and reports for enterprise reporting applications.
- Worked with Developers to troubleshoot and resolve issues in job logic as well as performance.
Environment: IBM WebSphere DataStage 8.5(Administrator, Designer, Director), IBM Information Analyzer 8.1.1a, Microsoft SQL 2005/2008, IBM DB2 9.1, AIX6.0, Oracle 11g, Toad 9.5, MS-Access, shell scripts, PUTTY, WinSCP, ERwin R8.1.
Confidential, Naperville, IL
ETL DataStage Developer
Responsibilities:
- Designed the ETL jobs using IBM Infosphere DataStage 8.5 to Extract, Transform and load the data into staging, ODS and EDW.
- Responsible for data analysis, requirements gathering, source-to-target mapping, process flow diagrams, and documentation.
- Responsible for generation of DDL statements which are executed for database creation.
- Designed a dimensional database with Snow Flake schema and creation of physical tables in Oracle 10g
- Designed parallel jobs using stages such as Join, Merge, Lookup, Remove Duplicates, Copy, Filter, Funnel, Dataset, Lookup, Pivot, and Sort, Surrogate key Generator, Change Data Capture (CDC), Modify, Row Generator and Aggregator.
- Designed sequence jobs using the activities such as Job Activity, Nested Condition, Notification Activity, Sequencer Activity, Terminator Activity and Execute Command.
- Worked on pipeline and partitioning parallelism techniques and ensured load balancing of data.
- Deployed different partitioning methods like Hash by field, Round Robin, Entire, Modulus, and Range for bulk data loading.
- Implemented various shared containers for Re-using the business functionality.
- Worked within a team to populate Type I and Type II slowly changing dimension tables from several operational source files
- Created UNIX shell scripts for database connectivity and executing queries in parallel job execution.
- Used the ETL Data Stage Director to schedule running the solution, testing and debugging its components & monitor the resulting executable versions.
- Experienced in using SQL Loader and import utility in TOAD to populate dataset in the data warehouse tables.
- Responsible for importing the required Metadata from heterogeneous sources at the project level.
- Working with BI team to apply the business rules for OLAP, designing the Frame Work models.
- Used QualityStage to ensure consistency, removing data anomalies and spelling errors of the source.
- Used Multi-job-compiler during deployment of projects.
- Created Shared Containers for Re-using the Business functionality.
- Scheduled jobs using Autosys scheduler utility based on the requirements and monitored the production processes closely for any possible errors.
Environment: IBM Info sphere DataStage 8.5, Oracle 11g, UNIX, IBM AIX, Oracle, DB2, Flat files, Sequential files, Fixed Width Files, Autosys, TOAD 9.6.
Confidential, Deerfield, IL
Technical Datastage Developer
Responsibilities:
- Involved in various roles of Administrator and Developer throughout the project.
- Managed analysis, design, coding and testing of ETL jobs for 7 Source Systems.
- Involved in implementing the Best practices and design standards. The Best practices include Restart-ability, Recovery, Parameter standardization and Capacity planning, etc.
- Participated in the review of Technical, Business Transformation Requirements Document.
- Prepared documentation to describe process development, logic, coding, testing, changes and corrections.
- Used Partition methods and collection methods for implementing parallel processing.
- Developed complex DataStage jobs according to the business requirements / mapping documents.
- Performed Unit Testing, System Integration Testing and User acceptance testing.
- Extensively Designed local containers and shared containers to simplify and modularize job design by replacing complex logics with single container stage and also to promote reusability of job designs.
- Involved in importing and exporting jobs category wise and maintaining the backup regularly.
- Used designer and director to schedules and monitor jobs and to collect the performance statistics.
- Worked within a team to populate Type I and Type II slowly changing dimension tables from several operational source files Created some routines (Before-After, Transform function) used across the project.
- Involved in creating UNIX shell scripts for database connectivity and executing queries in parallel job execution.
- Responsible to tune ETL processes to optimize load and query performance.
- Created standards document, best practices guide and performance tuning techniques documents.
Environment: IBM Web Sphere Data stage and Quality Stage 8.0.1a (Administrator, Designer, Director), Oracle 10g, Teradata, Cognos, Toad 8.0, Microsoft SQL 2005, IBM 2094-System z9, AIX, Java, Star Team, WinSCP(FTP), SSIS 2005, Putty, Windows 2003, Zeke, MS Visio, SAP MDM, Quality Center.
Confidential, Providence, RI
Datastage Developer
Responsibilities:
- Provided Technical support to the team as the ETL developer. Addressed best practices and productivity enhancing issues.
- Worked on designing and developing the Quality stage.
- Assisted in understanding the infrastructure requirements at offshore including sizing; played key role in smooth operation of development environment.
- Loaded data into load, staging and lookup tables. Staging area was implemented using flat files.
- Created jobs in DataStage to import data from heterogeneous data sources like Oracle 9i, Text files and SQL Server.
- Generation of Surrogate IDs for the dimensions in the fact table for indexed and faster access of data in server jobs.
- Extensively worked on Job Sequences to Control the Execution of the job flow using various Activities & Triggers (Conditional and Unconditional) like Job Activity, Wait for file, Email Notification, Sequencer, Exception handler activity and Execute Command.
- Dicing and Slicing of the input data for the Business feedback. Testing of the system.
- Designing Data masking techniques to mask sensitive information when working with offshore
- Assisted Mapping team to transform the business requirements into ETL specific mapping rules.
- Enhanced various complex jobs for performance tuning.
- Responsible for version controlling and promoting code to higher environments.
- Worked on Teradata optimization and performance tuning.
- Performed Unit Testing, System Integration Testing and User acceptance testing
- Involved in ongoing production support and process improvements. Ran the DataStage jobs through third party schedulers
Environment: IBM Web Sphere Data Stage 8.0.1, IBM AIX 5, IBM DB2 8.0, Oracle 10g, Teradata, MS Visio, MS Project, MS Office Suite, Teradata, Windows XP, Flat Files, SQL Developer, Unix Shell Scripting
Confidential, Westchester, IL
ETL Developer
Responsibilities:
- Involved in the analysis, data modeling, detailed system design, development and technical documentation
- Designed an ETL process using Business Intelligence Integrated Services to Extract, Tranform and Load from .csv files to SQL server Database.
- Used SSAS to build cubes and analyze the data by MDX querying.
- Maintained a separate Database for each department.
- Used Star schema to design the ETL process.
- Used multiple connection managers for source and destinations.
- Used IBM Web sphere DataStage 7.5.1 to extract data from the individual department databases and used transformations to load into an Oracle Warehouse.
- Deployed different partitioning methods like Hash by field, Entire, Modulus, and Range for bulk data loading.
- Used Type II slowly changing dimension tables from several operational source files.
- Defined Measures and Measure Groups, Dimension Relationships and key Performance Indicators.
- Written Multi Dimensional Queries to analyze the data.
- Participated in the development and maintenance of a Date Warehouse / Data Mart Routine Load Schedule.
- Worked with heterogeneous sources including relational sources and flat files
- Used the Director to schedule running the job, testing and debugging its components, and monitoring.
Environment: IBM Web sphere DataStage 8.0.1, IBM AIX 5, Oracle 9i, MS Visio, MS Project, SQL Server Management Studio, Business Intelligence Integration Services and Analysis Services.